Recommendation for using stop words


(sulemanmubarik) #1

Hi
What are good recommendations for using stop words?
Is the default stop word list provided by Elasticsearch good enough,
or should I add some custom stop words too?
More than 60% of my data is from Twitter.
Thanks


(Ivan Brusic) #2

Tuning stop words can be as long a process as you want it to be. Saving
your queries and results and doing some search analytics can help you
fine-tune the stop words. In general, the default stop word list is very
good for English, but Twitterspeak is not really English. :)
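
As a rough sketch of what a custom list could look like: the index settings below chain the built-in `_english_` stop list with a second `stop` filter holding extra words. The words in `TWITTER_STOPWORDS`, the filter names, and the analyzer name `tweet_text` are all made up for illustration; the real list should come out of your own analytics.

```python
import json

# Hypothetical extra stop words for tweet-heavy data (an assumption, not a
# recommended list); replace these with terms found in your own analytics.
TWITTER_STOPWORDS = ["rt", "via", "amp", "lol"]


def build_index_settings(extra_stopwords):
    """Build index settings whose analyzer applies the built-in English
    stop list first and a custom stop list second."""
    return {
        "settings": {
            "analysis": {
                "filter": {
                    # Built-in English list, referenced by its predefined name.
                    "english_stop": {"type": "stop", "stopwords": "_english_"},
                    # Custom list layered on top of the default one.
                    "twitter_stop": {"type": "stop", "stopwords": list(extra_stopwords)},
                },
                "analyzer": {
                    # "tweet_text" is an illustrative analyzer name.
                    "tweet_text": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "english_stop", "twitter_stop"],
                    }
                },
            }
        }
    }


if __name__ == "__main__":
    # Print the JSON body you would send when creating the index.
    print(json.dumps(build_index_settings(TWITTER_STOPWORDS), indent=2))
```

Keeping the custom words in a separate filter means you can change them without touching the default English list.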

You can list all the terms in your inverted index, for example with the
elasticsearch-index-termlist plugin (
https://github.com/jprante/elasticsearch-index-termlist), to see which
terms are the most frequent and whether they are relevant.
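
If you just want a quick look before installing anything, the same idea can be approximated outside Elasticsearch. The sketch below does not read the inverted index; it counts document frequencies over a sample of raw texts with a naive tokenizer, which is only an approximation of what your analyzer would produce. The function names and sample tweets are made up.

```python
import re
from collections import Counter

# Naive tokenizer: lowercase runs of letters, digits and apostrophes.
# This is an assumption and will not match your real analyzer exactly.
TOKEN_RE = re.compile(r"[a-z0-9']+")


def document_frequencies(docs):
    """Count, for each term, how many documents contain it at least once."""
    counts = Counter()
    for doc in docs:
        counts.update(set(TOKEN_RE.findall(doc.lower())))
    return counts


def frequent_terms(docs, min_fraction):
    """Return terms that occur in at least min_fraction of the documents."""
    counts = document_frequencies(docs)
    threshold = min_fraction * len(docs)
    return sorted(term for term, n in counts.items() if n >= threshold)


if __name__ == "__main__":
    sample = [
        "RT this is great",
        "rt the new release",
        "the release is out",
    ]
    # Candidates only: a frequent term like "release" may still be relevant.
    print(frequent_terms(sample, 0.6))
```

The output is a candidate list to review by hand, not a stop word list: frequent terms are sometimes exactly what people search for.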

Have you looked at the common terms query?

http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-common-terms-query.html
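
For reference, a request body for that query might look roughly like the sketch below. The parameter names `cutoff_frequency` and `low_freq_operator` are as I remember them from the page linked above, so check them against the docs for your version. The field name `text`, the sample query string, and the cutoff value are placeholders.

```python
import json


def common_terms_query(field, text, cutoff_frequency=0.001):
    """Build a search body using the common terms query.

    Terms more frequent than cutoff_frequency are treated as common and
    only influence scoring; the remaining terms must all match.
    """
    return {
        "query": {
            "common": {
                field: {
                    "query": text,
                    # Illustrative value; tune it against your own data.
                    "cutoff_frequency": cutoff_frequency,
                    "low_freq_operator": "and",
                }
            }
        }
    }


if __name__ == "__main__":
    # "text" is a hypothetical field name for the tweet body.
    print(json.dumps(common_terms_query("text", "rt the new release"), indent=2))
```

The appeal for Twitter data is that frequent terms are handled at query time, so you do not have to drop them from the index up front.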

Cheers,

Ivan

On Wed, Jul 30, 2014 at 4:35 PM, sulemanmubarik sulemanmubarik@gmail.com
wrote:

Hi
What are good recommendation to use the stop words.
Using the default stop words provided by elastic search is good
Or should I use some custom stop words too
More than 60% data is from twitter.
Thanks

--
View this message in context:
http://elasticsearch-users.115913.n3.nabble.com/Recommendation-for-using-stop-words-tp4060924.html
Sent from the ElasticSearch Users mailing list archive at Nabble.com.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/1406763306312-4060924.post%40n3.nabble.com
.
For more options, visit https://groups.google.com/d/optout.

