[ANNOUNCEMENT] - FSCrawler 2.9 released

The FSCrawler team is pleased to announce the FSCrawler 2.9 release!

FSCrawler

FS Crawler offers a simple way to index binary files into Elasticsearch.

Usage

Download FSCrawler 2.9:

wget https://repo1.maven.org/maven2/fr/pilato/elasticsearch/crawler/fscrawler-es7/2.9/fscrawler-es7-2.9.zip

Start FS crawler with:

bin/fscrawler job_name

FS crawler will read a local file (default to ~/.fscrawler/{job_name}/_settings.json).
If the file does not exist, FS crawler will propose to create your first job.

$ bin/fscrawler job_name
18:28:58,174 WARN  [f.p.e.c.f.FsCrawler] job [job_name] does not exist
18:28:58,177 INFO  [f.p.e.c.f.FsCrawler] Do you want to create it (Y/N)?
y
18:29:05,711 INFO  [f.p.e.c.f.FsCrawler] Settings have been created in [~/.fscrawler/job_name/_settings.json]. Please review and edit before relaunch

Create a directory named /tmp/es or c:\tmp\es, add some files you want to index in it and start again:

$ bin/fscrawler job_name
18:30:34,330 INFO  [f.p.e.c.f.FsCrawlerImpl] Starting FS crawler
18:30:34,332 INFO  [f.p.e.c.f.FsCrawlerImpl] FS crawler started in watch mode. It will run unless you stop it with CTRL+C.
18:30:34,682 INFO  [f.p.e.c.f.FsCrawlerImpl] FS crawler started for [job_name] for [/tmp/es] every [15m]

More details in the documentation.

New features

  • #1345: Improve documentation for settings . Thanks to cbb-colab.
  • #1298: Add more default displayed fields . Thanks to dadoonet.

Changes

  • #1348: Switch to the new sonatype service . Thanks to dadoonet.
  • #1347: Bump maven-jar-plugin from 3.2.0 to 3.2.1 . Thanks to dependabot[bot].
  • #1346: Bump build-helper-maven-plugin from 3.2.0 to 3.3.0 . Thanks to dependabot[bot].
  • #1339: Bump log4j-api from 2.17.0 to 2.17.1 . Thanks to dependabot[bot].
  • #1338: Bump log4j-core from 2.17.0 to 2.17.1 . Thanks to dependabot[bot].
  • #1337: Bump log4j.version from 2.17.0 to 2.17.1 . Thanks to dependabot[bot].
  • #1336: Bump tika.version from 2.1.0 to 2.2.1 . Thanks to dependabot[bot].
  • #1333: Bump Elasticsearch-rest-high-level-client from 7.16.1 to 7.16.2 . Thanks to dependabot[bot].
  • #1332: Bump docker-maven-plugin from 0.38.0 to 0.38.1 . Thanks to dependabot[bot].
  • #1331: Bump jackson.version from 2.13.0 to 2.13.1 . Thanks to dependabot[bot].
  • #1330: Bump tika.version from 2.1.0 to 2.2.0 . Thanks to dependabot[bot].
  • #1329: Bump log4j-core from 2.16.0 to 2.17.0 . Thanks to dependabot[bot].
  • #1328: Bump log4j-api from 2.16.0 to 2.17.0 . Thanks to dependabot[bot].
  • #1325: Bump log4j.version from 2.15.0 to 2.16.0 . Thanks to dependabot[bot].

Have fun!
-FSCrawler team

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.