Can ElasticSearch be used to search through Google Books?
There are some issues with how the search results are organised. They are best explained over at Stack Exchange, although they didn't initially appear to be clear enough there.
Suppose we search for a near-contiguous set of terms known to belong to a sentence or phrase, but the OCR has garbled some of the characters in the text of the books that may contain them. Let's take "Hello My Elastic World". Searching for the exact phrase fails if a single character was misread, so to reduce the chance of missing a match we search for the separate terms "Hello" "My" "Elastic" "World" instead. That works well: if the "r" in "World" comes out as "?", courtesy of OCR, Google will still return results for "Hello" "My" "Elastic".
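For comparison, here is a rough sketch of how the same idea might look as an Elasticsearch query, assuming the OCR text had been loaded into an index of your own (Google Books itself is not queried this way). The field name `page_text`, the helper name `proximity_query` and the default values are made up for illustration. It combines `span_near` with `span_multi` and `fuzzy` so that each term may differ by a character or two and the terms must sit within `slop` positions of each other.

```python
def proximity_query(terms, field="page_text", slop=5, fuzziness="AUTO"):
    """Build a search body: fuzzy terms that must appear close together, in order.

    `field` is a hypothetical text field holding the OCR output.
    Terms are lowercased on the assumption that the field uses an
    analyzer that lowercases tokens (the standard analyzer does).
    """
    clauses = [
        {
            "span_multi": {
                "match": {
                    "fuzzy": {
                        field: {"value": term.lower(), "fuzziness": fuzziness}
                    }
                }
            }
        }
        for term in terms
    ]
    return {
        "query": {
            "span_near": {"clauses": clauses, "slop": slop, "in_order": True}
        }
    }


body = proximity_query(["Hello", "My", "Elastic", "World"])
```

The resulting `body` is only a dictionary; it would still have to be sent to the search endpoint of an index that actually contains the text. Fuzzy matching helps with substituted characters, while `slop` allows for a few stray or missing tokens between the terms.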
Unfortunately, in Google's search results there is no rhyme or reason as to how close together those terms are. I have found results where "Elastic" is 30 pages after "My", or even comes before it.
Yet go to page 100 of the search results and you will find a result where "Hello" "My" "Elastic" are on the same page of text and close together.
Given the above, does anyone here have any idea why Google orders its results the way it does?
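To make "close together" concrete, here is a minimal sketch of one possible measure: the length, in words, of the shortest stretch of text that contains every search term. This is only an illustration of the ordering I would expect, not a description of how Google ranks anything. The function names are my own, and the tokenizer is a deliberately crude assumption (lowercase runs of letters and digits, single-word terms only).

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (a crude, assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def smallest_window(text, terms):
    """Return the length in tokens of the shortest span containing every
    term at least once, or None if some term never appears."""
    tokens = tokenize(text)
    wanted = {t.lower() for t in terms}
    counts = {}   # occurrences of each wanted term inside the current window
    best = None
    left = 0
    for right, tok in enumerate(tokens):
        if tok not in wanted:
            continue
        counts[tok] = counts.get(tok, 0) + 1
        # While the window covers every term, record it and shrink from the left.
        while len(counts) == len(wanted):
            width = right - left + 1
            if best is None or width < best:
                best = width
            left_tok = tokens[left]
            if left_tok in counts:
                counts[left_tok] -= 1
                if counts[left_tok] == 0:
                    del counts[left_tok]
            left += 1
    return best


def rank_by_proximity(pages, terms):
    """Sort page texts so the tightest matches come first; pages missing a term go last."""
    def key(page):
        width = smallest_window(page, terms)
        return (width is None, width if width is not None else 0)
    return sorted(pages, key=key)
```

With a measure like this, a page where the three terms sit side by side would sort ahead of one where they are spread across a long passage, which is the opposite of what I am seeing.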
Thanks for reading!