Searching words within a Sentence/paragraph

Alok_Tripathi · May 23, 2024, 6:58am

I want to search for words that occur in the same sentence/paragraph.
Example: If I search for the words "Bill" and "Steve", then only document 2 should be returned, because both words exist in the same sentence. But if I search for "Bill" and "computer", then no document should be returned, because these two words lie in different sentences.

{
    "_index": "test",
    "_type": "_doc",
    "_id": "1",
    "_score": 1.0,
    "_source": {
        "content": "Bill Gates founded Microsoft. Steve Jobs founded Apple. They were both influential in the tech industry."
    }
},
{
    "_index": "test",
    "_type": "_doc",
    "_id": "2",
    "_score": 1.0,
    "_source": {
        "content": "Bill Gates and Steve Jobs were both pioneers in technology. They revolutionized the personal computer industry."
    }
}

dadoonet · May 23, 2024, 7:22am

Welcome!

You could use a proximity search, but it is not limited to a single sentence. I mean that if you have something like: "I'm Bill. Steve is here." that will match as well.

Have a look at: Query string query | Elasticsearch Guide [8.13] | Elastic
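For illustration, here is a minimal sketch of what such a proximity search could look like as a request body, built in Python. It assumes the text lives in a field called `content` (as in the documents above); the helper name `build_proximity_query` and the distance of 5 are made up for the example. The `"..."~N` form is the proximity syntax of the query string query.

```python
def build_proximity_query(words, max_distance=5, field="content"):
    """Build a query_string body matching `words` within `max_distance` positions.

    Hypothetical helper: it only assembles the request body and does not
    talk to a cluster. The proximity check counts token positions, so it
    can still match across a sentence boundary.
    """
    phrase = " ".join(words)
    return {
        "query": {
            "query_string": {
                "default_field": field,
                # "Bill Steve"~5 means: both terms within 5 positions of each other
                "query": f'"{phrase}"~{max_distance}',
            }
        }
    }


body = build_proximity_query(["Bill", "Steve"], max_distance=5)
```

The resulting `body` could be sent to the `_search` endpoint of the `test` index. With "I'm Bill. Steve is here." it would still match, which is the limitation mentioned above.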

Alok_Tripathi · May 23, 2024, 7:39am

What if I store each sentence in a nested dictionary format with a unique key, like:

{
    "_index": "test",
    "_type": "_doc",
    "_id": "1",
    "_score": 1.0,
    "_source": {
        "content" : {
            "sentence1" : "Bill Gates founded Microsoft. Steve Jobs founded Apple.",
            "sentence2" : "They were both influential in the tech industry."
        }
    }
},
{
    "_index": "test",
    "_type": "_doc",
    "_id": "2",
    "_score": 1.0,
    "_source": {
        "content" : {
            "sentence1" : "Bill Gates and Steve Jobs were both pioneers in technology.",
            "sentence2" : "They revolutionized the personal computer industry."
        }
    }
}

Can we perform a search within a single sentence now? Or would it be better to store each document as an array of sentences?

Would a custom sentence tokenizer help to achieve this?
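To make the array-of-sentences idea concrete, here is a rough sketch in Python, not a tested solution. It assumes a text field named `sentences` (hypothetical) that holds one sentence per array element, and it relies on `position_increment_gap`, the mapping parameter that puts a gap of token positions between array elements (100 by default for text fields). A `match_phrase` query with a `slop` smaller than that gap should then not match across two elements. The regex splitter is a naive stand-in for a real sentence tokenizer, and `words_share_sentence` is only a local check of the intended behaviour.

```python
import re

# Assumed gap between array elements; 100 is the default for text fields.
POSITION_GAP = 100


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def build_mapping():
    """Index body with an explicit position gap on the (hypothetical) field."""
    return {
        "mappings": {
            "properties": {
                "sentences": {
                    "type": "text",
                    "position_increment_gap": POSITION_GAP,
                }
            }
        }
    }


def build_same_sentence_query(words, slop=50):
    """match_phrase with a slop below the gap, so it stays inside one element."""
    if slop >= POSITION_GAP:
        raise ValueError("slop must be smaller than the position gap")
    return {
        "query": {
            "match_phrase": {
                "sentences": {"query": " ".join(words), "slop": slop}
            }
        }
    }


def words_share_sentence(sentences, words):
    """Local check: do all words occur together in at least one sentence?"""
    wanted = [w.lower() for w in words]
    for sentence in sentences:
        tokens = set(re.findall(r"\w+", sentence.lower()))
        if all(w in tokens for w in wanted):
            return True
    return False


doc2 = split_sentences(
    "Bill Gates and Steve Jobs were both pioneers in technology. "
    "They revolutionized the personal computer industry."
)
query = build_same_sentence_query(["Bill", "Steve"])
```

Two caveats with this sketch: sentences longer than the chosen `slop` could be missed, and words given in the opposite order to the text need extra slop, so the value would have to be tuned to the data.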
