How can i disable content extraction?

Hi @maddy30 ,

Looks like this might be related to your other question here: How can i update the pipeline used for a app search engine?

The configurations to extract content from files (like PDFs) are made at a deployment level, not on an engine-by-engine basis. What you could do is add conditionals to your ingest pipeline to run certain processors only if the URL matches a certain domain or pattern.

Alternatively, you can take the approach I suggest in the other post to use different pipelines per index, and have some pipelines remove the body_content from your documents before indexing it.

2 Likes