Unable to perform hybrid search

harish32 · December 26, 2024, 4:41am

I am having trouble doing hybrid search. to give you guys bit of background I am using semantic search created inference with openai embeddings.

const addIndex = async () => {
  // try {
  //   await client.indices.delete({ index });
  // } catch (err) {
  //   //
  // }
  await client.indices.create({
    index,
    mappings: {
      properties: {
        phrase: {
          type: "semantic_text",
          inference_id: "openai_embeddings",
        },
        copilot: {
          type: "keyword",
        },
        org: {
          type: "keyword",
        },
        topic: {
          type: "keyword",
        },
        mode: {
          type: "keyword",
        },
      },
    },
  });
  console.log("created successfully");
};

const response = await client.transport.request({
    method: "PUT",
    path: "/_inference/text_embedding/openai_embeddings",
    body: {
      service: "openai",
      service_settings: {
        api_key: "xxxxxxxx-xxxxxxxx",
        model_id: "text-embedding-3-small",
      },
    },
  });

this is my search query

const response = await client.search({
    index,
    min_score: 0.8,
    query: {
      bool: {
        must: {
          semantic: {
            field: "phrase",
            query: "can i talk to a real person",
          },
        },
        filter: [
          { term: { copilot: "67108e2e8c0611df3ad71109" } }, // Filter by copilotId
          { term: { mode: "DRAFT" } }, // Filter by mode
        ],
      },
    },
  });

above query randomly works sometimes and doesnt work other times.
i make page to 1000 then it started giving results. why is pagesize altering the behaviour? Is it performing post filter? so that means it doesnt work if i have 2000 phrases? Please help me here.

ashishtiwari1993 · December 26, 2024, 8:01am

Hi @harish32, Welcome to the Elastic community.

The page should not impact to the response.

You defining 1000 in size?
Any error you getting when there is no response?
I am also assuming Async is well handled in your script.

harish32 · December 26, 2024, 10:35am

no its returning empty array
here is the query i have ran

  const response = await client.search({
    index,
    query: {
      bool: {
        must: [
          {
            semantic: {
              field: "phrase",
              query: "talk to a agent",
            },
          },
        ],
        filter: [
          { term: { copilot: "67108e2e8c0611df3ad71109" } }, // Filter by copilotId
          { term: { mode: "DRAFT" } }, // Filter by mode
          { term: { topic: "675a88feb73a5d37e21fc352" } },
        ],
      },
    },
  });
  console.log(response.hits);

i have attached the screenshot where i ran same function 3 times you can see it gave result not completely but returned only one then
3rd time it didnt give anything

xeraa · December 27, 2024, 4:36pm

I would

check the Elasticsearch logs (especially for any errors around inference or search)
log the actual query run to compare it to the successful ones.

If I had to guess, I'd say turning the semantic query into its dense vector representation fails, so you're not running the same Elasticsearch query in the background. Maybe something like a rate limit is kicking in (but that's just a random guess). I'd be surprised if you ran the exact same Elasticsearch query and it would give you different results (assuming the cluster is healthy).

Topic		Replies	Views
Elasticsearch Hybrid Query - No Results Elasticsearch	2	786	February 18, 2021
Implementing Hybrid Seach with AppSearch Elastic Search elastic-app-search	1	256	December 11, 2022
ElasticSearch 7.x exact match Elasticsearch	4	504	February 26, 2020
Unsuccessful Search Elasticsearch	3	349	July 16, 2018
A query is not working, and I think it should Elasticsearch	11	495	July 6, 2017

Unable to perform hybrid search

Related topics