Create custom source elasticWorkplace

This is the terminal output:

jorge@ubuntu:~/Escritorio/FSCRAWLER/FSCrawlerWorkplace/fscrawler-es7-2.7-SNAPSHOT/bin$ ./fscrawler /home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba --debug --restart

18:15:26,782 INFO  [f.p.e.c.f.c.BootstrapChecks] Memory [Free/Total=Percent]: HEAP [129.9mb/2.8gb=4.45%], RAM [4.1gb/11.7gb=35.27%], Swap [1.9gb/1.9gb=100.0%].
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [6/_settings.json] already exists
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [6/_settings_folder.json] already exists
18:15:26,787 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [7/_settings.json] already exists
18:15:26,788 DEBUG [f.p.e.c.f.f.FsCrawlerUtil] Mapping [7/_settings_folder.json] already exists
18:15:26,789 DEBUG [f.p.e.c.f.c.FsCrawlerCli] Cleaning existing status for job [/home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba]...
18:15:26,789 DEBUG [f.p.e.c.f.c.FsCrawlerCli] Starting job [/home/jorge/Escritorio/FSCRAWLER/fscrawler/prueba]...

18:15:27,198 INFO  [f.p.e.c.f.c.FsCrawlerCli] Workplace Search integration is an experimental feature. As is it is not fully implemented and settings might change in the future.
18:15:27,199 WARN  [f.p.e.c.f.c.FsCrawlerCli] Workplace Search integration does not support yet watching a directory. It will be able to run only once and exit. We manually force from --loop -1 to --loop 1. If you want to remove this message next time, please start FSCrawler with --loop 1
18:15:27,201 DEBUG [f.p.e.c.f.c.ElasticsearchClientUtil] Trying to find a client version 7
18:15:27,208 DEBUG [f.p.e.c.f.c.WorkplaceSearchClientUtil] Trying to find a client version 7

18:15:27,219 INFO  [f.p.e.c.f.FsCrawlerImpl] Starting FS crawler
18:15:28,025 INFO  [f.p.e.c.f.c.v.ElasticsearchClientV7] Elasticsearch Client for version 7.x connected to a node running version 7.9.2
18:15:28,261 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service started
18:15:28,263 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Starting the WPSearchClient
18:15:28,319 DEBUG [f.p.e.c.f.c.ElasticsearchClientUtil] Trying to find a client version 7
18:15:28,338 INFO  [f.p.e.c.f.c.v.ElasticsearchClientV7] Elasticsearch Client for version 7.x connected to a node running version 7.9.2

18:15:28,344 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service started

18:15:28,349 DEBUG [f.p.e.c.f.FsParserAbstract] creating fs crawler thread [prueba] for [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] every [15m]
18:15:28,355 INFO  [f.p.e.c.f.FsParserAbstract] FS crawler started for [prueba] for [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] every [15m]
18:15:28,357 DEBUG [f.p.e.c.f.FsParserAbstract] Fs crawler thread [prueba] is now running. Run #1...
18:15:28,377 DEBUG [f.p.e.c.f.FsParserAbstract] indexing [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf] content
18:15:28,378 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] Listing local files from //home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf
18:15:28,382 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] Symlink on windows gives null for listFiles(). Skipping [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]
18:15:28,389 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] 0 local files found
18:15:28,389 DEBUG [f.p.e.c.f.FsParserAbstract] Looking for removed files in [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]...
18:15:28,449 DEBUG [f.p.e.c.f.FsParserAbstract] Looking for removed directories in [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]...
18:15:28,461 INFO  [f.p.e.c.f.FsParserAbstract] FS crawler is stopping after 1 run
18:15:28,558 DEBUG [f.p.e.c.f.FsCrawlerImpl] Closing FS crawler [prueba]
18:15:28,559 DEBUG [f.p.e.c.f.FsCrawlerImpl] FS crawler thread is now stopped
18:15:28,559 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,561 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service stopped
18:15:28,562 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,563 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Closing the WPSearchClient
18:15:28,563 DEBUG [f.p.e.c.f.f.b.FsCrawlerBulkProcessor] Closing BulkProcessor
18:15:28,563 DEBUG [f.p.e.c.f.f.b.FsCrawlerBulkProcessor] BulkProcessor is now closed
18:15:28,563 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service stopped
18:15:28,563 DEBUG [f.p.e.c.f.FsCrawlerImpl] ES Client Manager stopped
18:15:28,563 INFO  [f.p.e.c.f.FsCrawlerImpl] FS crawler [prueba] stopped
18:15:28,569 DEBUG [f.p.e.c.f.FsCrawlerImpl] Closing FS crawler [prueba]
18:15:28,570 DEBUG [f.p.e.c.f.FsCrawlerImpl] FS crawler thread is now stopped
18:15:28,570 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,571 DEBUG [f.p.e.c.f.s.FsCrawlerManagementServiceElasticsearchImpl] Elasticsearch Management Service stopped
18:15:28,571 DEBUG [f.p.e.c.f.c.v.ElasticsearchClientV7] Closing Elasticsearch client manager
18:15:28,571 DEBUG [f.p.e.c.f.t.w.WPSearchClient] Closing the WPSearchClient

18:15:28,571 DEBUG [f.p.e.c.f.s.FsCrawlerDocumentServiceWorkplaceSearchImpl] Workplace Search Document Service stopped
18:15:28,571 DEBUG [f.p.e.c.f.FsCrawlerImpl] ES Client Manager stopped

I think it is detecting Workplace Search but doing nothing with it.
The file I used had already been indexed before this test, so it is not going to be indexed again.

This is my YAML:

---
name: "prueba"
fs:
  url: "//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros"
  update_rate: "15m"
  excludes:
  - "*/~*"
  json_support: false
  filename_as_id: false
  add_filesize: true
  remove_deleted: true
  add_as_inner_object: false
  store_source: false
  index_content: true
  attributes_support: false
  raw_metadata: false
  xml_support: false
  index_folders: true
  lang_detect: false
  continue_on_error: false
  ocr:
    language: "eng"
    enabled: true
    pdf_strategy: "ocr_and_text"
  follow_symlinks: false
elasticsearch:
  username: "elastic"
  password: "L3pfydSSgRtZxfg5gWmX"
  nodes:
  - url: "http://127.0.0.1:9200"
  bulk_size: 100
  flush_interval: "5s"
  byte_size: "10mb"
workplace_search:
  access_token: "489e56799532ca13c49161f82093a41387fca45458617277705b5e8d0e250e77"
  key: "5f959a6e1d41c88afcdc280e"

I have only added the two lines at the end (access_token and key).

Thank you very much

So this is interesting:

18:15:28,382 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] Symlink on windows gives null for listFiles(). Skipping [//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros//ejemplo_esp.pdf]
18:15:28,389 DEBUG [f.p.e.c.f.c.f.FileAbstractorFile] 0 local files found

Do you have a symbolic link to your files, or is it a copy of the files that you put in //home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros?
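
If it helps to answer that, the path can be inspected outside FSCrawler. Below is a minimal sketch in Python, not part of FSCrawler; `describe_path` is a hypothetical helper name. It reports whether the path is a symlink, a directory, or a regular file, and lists the children only when the path is a directory, since a regular file cannot be listed.

```python
import os


def describe_path(path):
    """Report whether `path` is a symlink, a directory, or a regular file."""
    info = {
        "exists": os.path.exists(path),
        "is_symlink": os.path.islink(path),
        "is_dir": os.path.isdir(path),
        "is_file": os.path.isfile(path),
        # Where the path ends up once any symlinks are resolved.
        "real_path": os.path.realpath(path),
    }
    # Only a directory has children; for anything else report None.
    info["children"] = sorted(os.listdir(path)) if info["is_dir"] else None
    return info


if __name__ == "__main__":
    # Path taken from the fs.url setting in the YAML above.
    target = "/home/jorge/Escritorio/FSCRAWLER/fscrawler/ficheros"
    for key, value in describe_path(target).items():
        print(f"{key}: {value}")
```

Running the same check on the path ending in ejemplo_esp.pdf, which is the one shown in the log, would tell whether the crawler was pointed at a single file rather than at a directory.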

BTW I think that using:

fs:
  url: "/home/jorge/Escritorio/FSCRAWLER/fscrawler/ficheros"

should work better. Could you try?
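
For reference, here is a small sketch of how the doubled slashes could be collapsed before pasting the value into fs.url. It is plain Python, unrelated to FSCrawler's own code, and `collapse_slashes` is a hypothetical helper name. Whether the doubled slashes are what causes the "0 local files found" result is not confirmed here; this only produces the cleaner path to try.

```python
import re


def collapse_slashes(path):
    """Replace every run of two or more slashes with a single slash."""
    return re.sub(r"/{2,}", "/", path)


if __name__ == "__main__":
    original = "//home//jorge//Escritorio//FSCRAWLER//fscrawler//ficheros"
    print(collapse_slashes(original))
    # prints /home/jorge/Escritorio/FSCRAWLER/fscrawler/ficheros
```

A regular expression is used instead of `posixpath.normpath` because `normpath` deliberately keeps a path that starts with exactly two slashes as is.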