Bug in official document sample

slion · June 1, 2016, 2:59am

On the page https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-pattern-analyzer.html, there is the following sample:

DELETE test

PUT /test
{
  "settings": {
    "analysis": {
      "analyzer": {
        "whitespace": {
          "type": "pattern",
          "pattern": "\s+"
        }
      }
    }
  }
}

GET /test/_analyze?analyzer=whitespace&text=foo,bar baz

But it actually outputs "foo", "bar", "baz". How can I get "foo,bar", "baz" instead?
Any help is welcome.

warkolm · June 1, 2016, 3:02am

It seems that your post formatting may have lost some of the code. Use the </> button to format code.

slion · June 1, 2016, 3:10am

Thanks for your instant reply. Please see the URL: https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-pattern-analyzer.html
What I need is an analyzer that outputs the terms "foo,bar" and "baz" for the input "foo,bar;baz". Could you help me?

slion · June 1, 2016, 3:13am

My data looks like this: "Jack,Ma;Mary Wu;". I just want it to be tokenized on semicolons, so it should produce "Jack,Ma" and "Mary Wu".
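
For reference, a minimal sketch of one way this might be set up, not a confirmed answer from the thread. It reuses the `test` index from the sample above. The analyzer name `semicolon` is made up for illustration. The pattern analyzer lowercases terms by default, so `lowercase` is set to `false` here on the assumption that the original casing should be kept. The request-body form of `_analyze` is also an assumption and may depend on the Elasticsearch version in use.

```
# Hypothetical analyzer name "semicolon"; splits only on ";" and keeps case.
PUT /test
{
  "settings": {
    "analysis": {
      "analyzer": {
        "semicolon": {
          "type": "pattern",
          "pattern": ";",
          "lowercase": false
        }
      }
    }
  }
}

# Check the resulting terms for the sample data from this post.
GET /test/_analyze
{
  "analyzer": "semicolon",
  "text": "Jack,Ma;Mary Wu;"
}
```

With the separator pattern reduced to a single semicolon, commas and spaces are no longer split points, so the intent is for "Jack,Ma" and "Mary Wu" to come back as two terms. It is worth confirming the actual output with the `_analyze` call before relying on it.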
