Is it possible to write my own filter?

Hi everyone,

I need a filter to split in two words a word containing a suffix that
belongs to a list (Maybe a text file containing all the suffix) but I can't
find an existing filter doing that.

Does anyone have a solution to this?
If not, is there a way to write my own filter in Java and add it to
ElasticSearch ? : )

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/ef12f3ec-1210-4890-8f52-49cb5d7243d1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Off the top of my head, I cannot think of an existing filter that
accomplishes that task.

Creating a custom filter is easy. Simply creating a Lucene filter and
create a plug-in around it. Take a look at existing analysis plug-ins for
inspiration.

http://www.elastic.co/guide/en/elasticsearch/reference/current/modules-plugins.html#analysis-plugins

Cheers,

Ivan
On Mar 12, 2015 11:43 AM, cornet.remi@gmail.com wrote:

Hi everyone,

I need a filter to split in two words a word containing a suffix that
belongs to a list (Maybe a text file containing all the suffix) but I can't
find an existing filter doing that.

Does anyone have a solution to this?
If not, is there a way to write my own filter in Java and add it to
ElasticSearch ? : )

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/ef12f3ec-1210-4890-8f52-49cb5d7243d1%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/ef12f3ec-1210-4890-8f52-49cb5d7243d1%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CALY%3DcQB9QcgC-d%3DkE36U04k9_S1QrzdZbEj_%3Dk2UCtrOSz8b3A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

I wonder if you could use a Pattern Tokenizer in that case???

http://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-pattern-tokenizer.html

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 12 mars 2015 à 04:32, Ivan Brusic ivan@brusic.com a écrit :

Off the top of my head, I cannot think of an existing filter that accomplishes that task.

Creating a custom filter is easy. Simply creating a Lucene filter and create a plug-in around it. Take a look at existing analysis plug-ins for inspiration.

http://www.elastic.co/guide/en/elasticsearch/reference/current/modules-plugins.html#analysis-plugins

Cheers,

Ivan

On Mar 12, 2015 11:43 AM, cornet.remi@gmail.com wrote:
Hi everyone,

I need a filter to split in two words a word containing a suffix that belongs to a list (Maybe a text file containing all the suffix) but I can't find an existing filter doing that.

Does anyone have a solution to this?
If not, is there a way to write my own filter in Java and add it to ElasticSearch ? : )

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/ef12f3ec-1210-4890-8f52-49cb5d7243d1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CALY%3DcQB9QcgC-d%3DkE36U04k9_S1QrzdZbEj_%3Dk2UCtrOSz8b3A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/5AF81E5A-912E-4CD0-9E06-C3730C62433E%40pilato.fr.
For more options, visit https://groups.google.com/d/optout.

Yes, that's an idea : )
Pattern Tokenizer seems to give the results I want but I don't know if it's
possible to define patterns based on a list of specific words.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f160d9eb-d3ef-48f8-899c-259a8fa90406%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.