Prefix of special character on the id

stexxen (Steven) July 13, 2021, 3:28pm 1

I inadvertently loaded some documents without sanitising the field that was meant to be the id.

As a result, some records now have an id that begins with ASCII 09, i.e. the tab character.

I can retrieve an individual record with
GET /nyy_uprn/_doc/%09100040171995
which returns the following:

{
  "_index" : "xxxxxxx",
  "_type" : "_doc",
  "_id" : """	100040171995""",
  "_version" : 1,
  "_seq_no" : 56576,
  "_primary_term" : 3,
  "found" : true,
  "_source" : {
    "id" : """	100040171995""",

You can see that both _id and id contain the character 09, and that the value is displayed wrapped in triple double-quotes.
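The %09 in the GET path above is the percent-encoded form of the tab character. As a minimal sketch (the helper name `doc_path` is hypothetical, and only the Python standard library is used), the same path can be built for any id like this:

```python
from urllib.parse import quote, unquote

def doc_path(index: str, doc_id: str) -> str:
    """Build a /<index>/_doc/<id> path with the id percent-encoded."""
    # safe="" percent-encodes every reserved or control character,
    # including "/" and the leading tab in the id.
    return f"/{quote(index, safe='')}/_doc/{quote(doc_id, safe='')}"

# The id from the post: a tab followed by the digits.
path = doc_path("nyy_uprn", "\t100040171995")
print(path)
```

Decoding the last path segment with `unquote` gives back the original id, tab included.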

However, I cannot seem to retrieve these records using a prefix query.
I have tried the following:

GET /nyy_uprn/_search
{
  "query": {
    "prefix": {
      "id": "\t"
    }
  }
}

I have also tried many variations along these lines:

""" """
"\\t"
"//t"
"/t"

I always get 0 records back. I'm not sure what the problem is here: since \t is the valid JSON escape for the tab character, shouldn't it work?
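For reference, here is a small sketch of what the standard JSON variants listed above decode to, using Python's `json` module as a stand-in for the server's JSON parser (an assumption; the triple-quoted form is Console-specific syntax and is not covered). It only shows which literal actually carries a tab character. Whether the prefix query then matches also depends on how the id field is mapped, which the thread does not show.

```python
import json

# Each entry is the JSON string literal exactly as typed in the request body.
literals = [r'"\t"', r'"\\t"', '"//t"', '"/t"']

# Decode each literal the way a JSON parser would.
decoded = {lit: json.loads(lit) for lit in literals}

for lit, value in decoded.items():
    # repr() makes the decoded characters visible.
    print(f"{lit:8} -> {value!r:8} starts with tab: {value.startswith(chr(9))}")
```

Only the first literal, "\t", decodes to a real tab. "\\t" decodes to a backslash followed by the letter t, and the two slash forms are plain text.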

stexxen (Steven) July 19, 2021, 9:31am 2

Hi, does anyone have any ideas here?

spinscale (Alexander Reelsen) July 19, 2021, 9:58am 3

You would need to specify the _id field instead of id. Maybe a script filter searching for ids starting with a tab helps...

GET test/_search
{
  "query": {
    "script": {
      "script": "doc['_id'].value.startsWith('\t')"
    }
  }
}

Note: that one might be super slow depending on the number of documents.
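If the request is sent from code rather than typed into the console, the same body can be generated so that the tab is escaped correctly. This is only a sketch: the function name `tab_prefix_script_query` is hypothetical, and it just builds the JSON shown above without contacting a cluster.

```python
import json

def tab_prefix_script_query(field: str = "_id") -> dict:
    """Build the script query from the reply above for the given field."""
    # The script source holds a real tab between the single quotes;
    # json.dumps writes it out as the two-character escape \t.
    source = f"doc['{field}'].value.startsWith('\t')"
    return {"query": {"script": {"script": source}}}

body = json.dumps(tab_prefix_script_query(), indent=2)
print(body)
```

The printed body can be sent to `GET test/_search` with any HTTP client; the performance caveat above still applies.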

system (system) Closed August 16, 2021, 9:59am 4

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
