The default English stopwords
These stopwords can usually be filtered out before indexing with little negative impact on retrieval.
But is is a good idea to do so?
Pros and Cons of Stopwords
the primary advantage
Using Stopwords
Stopwords and the Standard Analyzer
Maintaining Positions
The output from the analyze API is quite interesting:
Specifying Stopwords
Using the stop Token Filter
Updating Stopwords
Stopwords and Performance
minimum_should_match
Divide and Conquer
The terms in a query string can be divided into more important(low-frequency) and less-important(high-frequency) terms.
Documents that match only the less important terms are probably of very little interest. Really, we want document that match as many of the important terms as possible.
Controlling Precision
Only High-Frequency Terms
Index Options