Stop types for tokenizer filter?


(David Weinstein) #1

Hi group,

I want to be able to specify what types to use as stop words. So:

curl -XGET '192.168.42.43:9200/_analyze?tokenizer=uax_url_email&char_filter=html_strip&filter=keyword' -d 'http://www.google.com hello'
{"tokens":[{"token":"http://www.google.com","start_offset":0,"end_offset":21,"type":"","position":1},{"token":"hello","start_offset":22,"end_offset":27,"type":"","position":2}]}%

I'd like to filter out the ALPHANUM tokens and keep the URL token.

I didn't see a token filter that would do this... did I miss something?

Regards,

David

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/3934b9de-c111-47d6-a204-d918c6beab7f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(system) #2