How is data stored

bkamiche · January 1, 2015, 3:36am

Hello, I'm new to Elasticsearch, so maybe this is a basic question...

Is there any way to see how the text is "stored", or at least how it would
look once the filters defined for a field (in the analyzer) are applied?

I know that the actual field is stored "as is" and that the filters only
matter for indexing, but I want to see the result of that operation.

The reason I ask is that I'm applying filters to a field (for example,
removing URLs) and running aggregations on that field, but the
aggregations return "http" as a used word. I guess that is not correct,
since it should have been removed...

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/841f7899-a45f-4b19-bc05-6e156f76777a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

nik9000 · January 1, 2015, 4:26am

Use the analyze API to see how your analysis chain (tokenizer and
filters) affects text.

The index itself is all jumbled together with all the documents and there
isn't a good way to dig the data for a single document out of it.
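
For readers following along, here is a minimal sketch of calling the analyze API from Python using only the standard library. It assumes a recent Elasticsearch version that accepts a JSON body with `field` and `text` on `POST /<index>/_analyze` (older 1.x releases took these as query-string parameters instead). The host, the index name `tweets`, and the field name `message` are hypothetical placeholders, not details from this thread.

```python
import json
import urllib.request


def build_analyze_request(base_url, index, field, text):
    """Build (but do not send) an _analyze request for one mapped field.

    Assumption: the cluster accepts a JSON body with "field" and "text",
    so the analyzer configured for that field in the mapping is applied.
    """
    url = f"{base_url.rstrip('/')}/{index}/_analyze"
    body = json.dumps({"field": field, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def analyze(base_url, index, field, text):
    """Send the request and return only the token strings from the response."""
    request = build_analyze_request(base_url, index, field, text)
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Each entry in "tokens" describes one term the analysis chain emitted.
    return [entry["token"] for entry in payload["tokens"]]
```

Calling `analyze("http://localhost:9200", "tweets", "message", "see http://example.com")` against a running cluster returns the terms that would be indexed for that text. If "http" appears in that list, the URL-removing filter is not doing what was intended for that field.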

jprante · January 1, 2015, 4:09pm

Besides the analyze API, the explain API can tell you why a document was
included in the result set, which lets you draw conclusions about how the
terms are stored in the index.

Jörg
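
As a rough illustration of the explain approach, the sketch below asks whether one specific term matches one specific document. It assumes a recent Elasticsearch version where the endpoint is `/<index>/_explain/<id>` (1.x releases used `/<index>/<type>/<id>/_explain`) and that the response carries a boolean `matched` field. The index, document id, and field names are hypothetical.

```python
import json
import urllib.request


def build_explain_request(base_url, index, doc_id, field, term):
    """Build (but do not send) an _explain request using a term query.

    A term query is not analyzed, so it matches only if the exact term
    is present in the index for that field of the document.
    """
    url = f"{base_url.rstrip('/')}/{index}/_explain/{doc_id}"
    body = json.dumps({"query": {"term": {field: term}}}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def term_is_indexed(base_url, index, doc_id, field, term):
    """Return True if the explain response reports that the term matched."""
    request = build_explain_request(base_url, index, doc_id, field, term)
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    return bool(payload.get("matched"))
```

For example, `term_is_indexed("http://localhost:9200", "tweets", "1", "message", "http")` returning `True` would indicate that the term "http" survived analysis for that document.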

bkamiche · January 1, 2015, 5:05pm

Thanks for your replies, that gave me the clue for what I was looking for,
and now it is solved!

Peter_Johnson_2 · January 5, 2015, 3:23pm

Have a look at polyfractal/elasticsearch-inquisitor on GitHub, a site
plugin for Elasticsearch that helps you understand and debug queries.
