This is the terminal output:
jorge@ubuntu:~/Escritorio/FSCRAWLER/FSCrawlerWorkplace/fscrawler-es7-2.7-SNAPSHOT/bin$ ./fscrawler /home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba --debug --restart
18:15:26,782 INFO [f.p.e.c.f.c.BootstrapChecks] Memory [Free/Total=Percent]: HEAP [129.9mb/2.8gb=4.45%], RAM [4.1gb/11.7gb=35.27%], Swap [1.9gb/1.9gb=100.0%].
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [6/_settings.json] already exists
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [6/_settings_folder.json] already exists
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [7/_settings.json] already exists
18:15:26,788 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [7/_settings_folder.json] already exists
18:15:26,789 DEBUG [f.p.e.c.f.c.FsCrawlerCli] Cleaning existing status for job [/home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba]...
18:15:26,789 DEBUG [f.p.e.c.f.c.FsCrawlerCli] Starting job [/home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba]...
18:15:27,198 INFO [f.p.e.c.f.c.FsCrawlerCli] Workplace Search integration is an experimental feature. As is it is not fully implemented and settings might change in the future.
18:15:27,199 WARN [f.p.e.c.f.c.FsCrawlerCli] Workplace Search integration does not support yet watching a directory. It will be able to run only once and exit. We manually force from --loop -1 to --loop 1. If you want to remove this message next time, please start FSCrawler with --loop 1
18:15:27,201 DEBUG [f.p.e.c.f.c.ElasticsearchClientUtil] Trying to find a client version 7
18:15:27,208 DEBUG [f.p.e.c.f.c.WorkplaceSearchClientUtil] Trying to find a client version 7
18:15:27,219 INFO [f.p.e.c.f.FsCrawlerImpl] Starting FS crawler
18:15:28,025 INFO [f.p.e.c.f.c.v.ElasticsearchClientV7] Elasticsearch Client for version 7.x connected to a node running version 7.9.2
18:15:28,261 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service started
18:15:28,263 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Starting the WPSearchClient
18:15:28,319 DEBUG [f.p.e.c.f.c.ElasticsearchClientUtil] Trying to find a client version 7
18:15:28,338 INFO [f.p.e.c.f.c.v.ElasticsearchClientV7] Elasticsearch Client for version 7.x connected to a node running version 7.9.2
18:15:28,344 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service started
18:15:28,349 DEBUG [f.p.e.c.f.FsParserAbstract] creating fs crawler thread [prueba] for [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] every [15m]
18:15:28,355 INFO [f.p.e.c.f.FsParserAbstract] FS crawler started for [prueba] for [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] every [15m]
18:15:28,357 DEBUG [f.p.e.c.f.FsParserAbstract] Fs crawler thread [prueba] is now running. Run #1...
18:15:28,377 DEBUG [f.p.e.c.f.FsParserAbstract] indexing [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] content
18:15:28,378 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] Listing local files from //home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf
18:15:28,382 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] Symlink on windows gives null for listFiles(). Skipping [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]
18:15:28,389 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] 0 local files found
18:15:28,389 DEBUG [f.p.e.c.f.FsParserAbstract] Looking for removed files in [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]...
18:15:28,449 DEBUG [f.p.e.c.f.FsParserAbstract] Looking for removed directories in [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]...
18:15:28,461 INFO [f.p.e.c.f.FsParserAbstract] FS crawler is stopping after 1 run
18:15:28,558 DEBUG [f.p.e.c.f.FsCrawlerImpl] Closing FS crawler [prueba]
18:15:28,559 DEBUG [f.p.e.c.f.FsCrawlerImpl] FS crawler thread is now stopped
18:15:28,559 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,561 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service stopped
18:15:28,562 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,563 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Closing the WPSearchClient
18:15:28,563 DEBUG [f.p.e.c.f.f.b.FsCrawlerBulkProcessor] Closing BulkProcessor
18:15:28,563 DEBUG [f.p.e.c.f.f.b.FsCrawlerBulkProcessor] BulkProcessor is now closed
18:15:28,563 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service stopped
18:15:28,563 DEBUG [f.p.e.c.f.FsCrawlerImpl] ES Client Manager stopped
18:15:28,563 INFO [f.p.e.c.f.FsCrawlerImpl] FS crawler [prueba] stopped
18:15:28,569 DEBUG [f.p.e.c.f.FsCrawlerImpl] Closing FS crawler [prueba]
18:15:28,570 DEBUG [f.p.e.c.f.FsCrawlerImpl] FS crawler thread is now stopped
18:15:28,570 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,571 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service stopped
18:15:28,571 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,571 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Closing the WPSearchClient
18:15:28,571 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service stopped
18:15:28,571 DEBUG [f.p.e.c.f.FsCrawlerImpl] ES Client Manager stopped
I think it is detecting Workplace Search but not doing anything with it.
The file I used had already been indexed before this test, so it may not be indexed again.
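One thing I notice in the log is that the crawler lists [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf], which looks like a file rather than a directory, and then reports "0 local files found". I am not sure this is the cause. A quick way to check what a path actually is could look like the sketch below. It is plain Python, not FSCrawler code; the function name `path_kind` and the temporary files are made up for illustration.

```python
import os
import tempfile


def path_kind(path: str) -> str:
    """Return 'missing', 'directory' or 'file' for the given path."""
    if not os.path.exists(path):
        return "missing"
    if os.path.isdir(path):
        # A directory has children that a crawler can list.
        return "directory"
    # Anything else is treated as a plain file: nothing to list inside it.
    return "file"


if __name__ == "__main__":
    # Hypothetical temporary layout standing in for the real folders.
    with tempfile.TemporaryDirectory() as root:
        pdf = os.path.join(root, "ejemplo_esp.pdf")
        with open(pdf, "wb") as handle:
            handle.write(b"%PDF-1.4\n")
        print(root, "->", path_kind(root))
        print(pdf, "->", path_kind(pdf))
```

Running `path_kind` on the path printed in the log would show whether the crawler was pointed at a single file instead of a folder.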
This is my YAML:
---
name: "prueba"
fs:
  url: "//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros"
  update_rate: "15m"
  excludes:
  - "*/~*"
  json_support: false
  filename_as_id: false
  add_filesize: true
  remove_deleted: true
  add_as_inner_object: false
  store_source: false
  index_content: true
  attributes_support: false
  raw_metadata: false
  xml_support: false
  index_folders: true
  lang_detect: false
  continue_on_error: false
  ocr:
    language: "eng"
    enabled: true
    pdf_strategy: "ocr_and_text"
  follow_symlinks: false
elasticsearch:
  username: "elastic"
  password: "L3pfydSSgRtZxfg5gWmX"
  nodes:
  - url: "http://127.0.0.1:9200"
  bulk_size: 100
  flush_interval: "5s"
  byte_size: "10mb"
workplace_search:
  access_token: "489e56799532ca13c49161f82093a41387fca45458617277705b5e8d0e250e77"
  key: "5f959a6e1d41c88afcdc280e"
I have only added the two lines at the end.
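The path in the "FS crawler started" log line ends in ejemplo_esp.pdf, while the fs.url in the YAML ends in ficheros, so the run may have used different settings from the ones pasted here. The sketch below compares the two values. It is only an illustration in Python: the helper names `crawled_path` and `normalize` are made up, and collapsing repeated slashes assumes Linux treats them as a single separator.

```python
import re
from typing import Optional

# Log line copied from the terminal output of this run.
LOG_LINE = (
    "18:15:28,355 INFO [f.p.e.c.f.FsParserAbstract] FS crawler started for "
    "[prueba] for [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros"
    "//ejemplo_esp.pdf] every [15m]"
)
# fs.url value copied from the YAML settings.
CONFIGURED_URL = "//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros"


def crawled_path(log_line: str) -> Optional[str]:
    """Extract the second bracketed value (the crawled path) from the log line."""
    match = re.search(
        r"FS crawler started for \[[^\]]*\] for \[([^\]]*)\]", log_line
    )
    return match.group(1) if match else None


def normalize(path: str) -> str:
    """Collapse repeated slashes and drop a trailing slash."""
    collapsed = re.sub(r"/+", "/", path)
    return collapsed.rstrip("/") or "/"


if __name__ == "__main__":
    logged = crawled_path(LOG_LINE)
    if logged is None:
        print("No crawled path found in the log line")
    else:
        print("logged:    ", normalize(logged))
        print("configured:", normalize(CONFIGURED_URL))
        print("same path: ", normalize(logged) == normalize(CONFIGURED_URL))
```

For the values above the two paths differ, which suggests checking which _settings.yaml the job actually loaded.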
Thank you very much