Hi,
I'm trying to get IDs for all our 300k+ documents. The AppSearch API has list_documents which only lists 10k documents. Is there an argument I can use with it to get the next 10k documents?
I have searched and found that lots of other users are struggling with how to access all their documents. I even considered exporting/dumping all our data to my laptop and working outside elastic.co. It seems like our data is held hostage with no apparent way to do this.
Could this be done with the python Elasticsearch or AppSearch packages?
I successfully did a get() with a 200 return but it is just html with this comment at the end: "This Elastic installation has strict security requirements enabled that your current browser does not meet".
Once again, isn't there an easy way to get all our ids for 300k+ documents via Elasticsearch or AppSearch?
If that doesn't work and there is a package, probably but I am not familiar with the module(s) and tend to just use the API. The API's are well build and it removes a component to just use it directly.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.