How to source search queries from a file

username123 · July 31, 2018, 8:54pm

Let's say I have a huge file with the queries that I have recorded over time:

"GET /primary/_analyze?analyzer=foldedlower&text=yomu
"GET /primary/_analyze?analyzer=foldedlower&text=yonde
"GET /primary/_analyze?analyzer=foldedlower&text=Yo+oigo+con+mis+orejas
"GET /primary/_analyze?analyzer=foldedlower&text=Yo+tengo+dos+ojos
"GET /primary/_analyze?analyzer=foldedlower&text=You%27re+eligible

I want to point my query operation type at this file so that the queries it issues are sourced from the file. I can clean up the data and use only what follows &text=, but I am not sure how to plug this into the configuration. What would be the right approach?
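For the clean-up step, a minimal sketch using only the Python standard library could look like the following. The helper name extract_text is hypothetical, and the sketch assumes every recorded line has the shape shown above (an optional leading quote, the HTTP method, then the request target).

```python
from urllib.parse import urlsplit, parse_qs


def extract_text(line):
    """Return the decoded value of the `text` query parameter, or None."""
    # Drop surrounding whitespace and any stray quotes from the recording.
    request = line.strip().strip('"')
    # Split "GET /path?query" into the method and the request target.
    parts = request.split(None, 1)
    if len(parts) != 2:
        return None
    query_string = urlsplit(parts[1]).query
    # parse_qs decodes "+" to a space and percent-escapes such as %27.
    values = parse_qs(query_string).get("text")
    return values[0] if values else None


if __name__ == "__main__":
    sample = '"GET /primary/_analyze?analyzer=foldedlower&text=You%27re+eligible'
    print(extract_text(sample))
```

Running this once over the recorded file and writing one decoded query per line gives a plain-text file that is cheap to load later.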

danielmitterdorfer · August 1, 2018, 5:39am

Hi,

I think this is best solved by implementing a so-called parameter source. There are two ways to implement one: either as a (Python) function or as a class. In your case I'd go for a class. In the constructor you can read the file and prepare a suitable data structure (e.g. store the queries in a list), and in the params() method you just pick a query randomly or iterate through them, whatever suits your use case. In our official tracks, we do something very similar in the geonames track, which you can use as a starting point for your own parameter source.

Note that the params() method of the parameter source is called on a performance-critical path so you should do any preprocessing of the file already in the constructor, otherwise you might introduce an accidental bottleneck in the load generator. You can use Rally's profiling support to profile it and double-check that there are no problems.
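As a rough sketch of the class-based approach (not the geonames implementation), something along these lines could go into the track's track.py. The parameter names "queries-file", "index" and "field", the source name "query-file-source", and the shape of the returned dictionary are assumptions for illustration; the keys the operation actually expects, and any additional methods the parameter source must provide, should be checked against the Rally documentation for the version in use.

```python
import random


class QueryFileParamSource:
    def __init__(self, track, params, **kwargs):
        # "queries-file" is a hypothetical parameter: one cleaned query per line.
        path = params["queries-file"]
        # All file I/O happens here, once, so params() stays cheap.
        with open(path, encoding="utf-8") as f:
            self._queries = [line.strip() for line in f if line.strip()]
        if not self._queries:
            raise ValueError("no queries found in %s" % path)
        # Hypothetical defaults; adjust to the index and field being tested.
        self._index = params.get("index", "primary")
        self._field = params.get("field", "text")

    def partition(self, partition_index, total_partitions):
        # Every client draws from the same in-memory list.
        return self

    def params(self):
        # Called on the performance-critical path: only a list lookup here.
        text = random.choice(self._queries)
        return {
            "index": self._index,
            "body": {"query": {"match": {self._field: text}}},
        }


def register(registry):
    # Makes the source available to operations under this (assumed) name.
    registry.register_param_source("query-file-source", QueryFileParamSource)
```

Picking with random.choice keeps each call constant-time; to replay the queries in recorded order instead, an index counter that wraps around at the end of the list would work just as well.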

Daniel

username123 · August 2, 2018, 7:28pm

I see!

Thank you Daniel! That was pretty much the direction I was thinking of going.

