Indexing files with file system _river in scrutmydocs won't read any files

Thank you very much for your support.

  1. everything clean in /root/.scrutmydocs (there only was a config doc, no
    data)

  2. glassfish restarted

  3. the docs that were manually uploaded are still there

  4. running processes: shared as a Gist ("fatima running processes")

( I do learn )

  5. when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:

{"ok":true,
 "errors":null,
 "object": [{"id":"tiger",
             "name":"tiger",
             "indexname":"docstiger",
             "typename":"doctiger",
             "start":true,
             "type":"dummy"}]    <--- dummy
}
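For anyone repeating this check, here is a minimal sketch in Python that parses a response body like the one above and lists the rivers reported with type "dummy". The function name `dummy_rivers` is hypothetical, and the body is pasted in as a string rather than fetched, so this shows only how to read the JSON. It is not part of the ScrutMyDocs API.

```python
import json

# Response body as returned by the .../api/1/settings/rivers call in this thread.
RESPONSE = (
    '{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger",'
    '"indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}'
)


def dummy_rivers(body):
    """Return the ids of all rivers whose reported type is "dummy".

    Hypothetical helper: it only inspects the JSON text it is given.
    """
    payload = json.loads(body)
    rivers = payload.get("object") or []  # "object" may be null when no river exists
    return [river.get("id") for river in rivers if river.get("type") == "dummy"]


if __name__ == "__main__":
    print(dummy_rivers(RESPONSE))
```

Run against the body above, the helper reports the "tiger" river, which matches what the raw output shows by eye.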

2013/6/22 David Pilato david@pilato.fr

Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use Gist
instead of pasting your code here.

--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

On 22 June 2013 at 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing something
weird or wrong. I followed the instructions from the web, and did not do
anything special.

  1. folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is the
root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################
# Licensed to scrutmydocs.org (the "Author") under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. Author licenses this
# file to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
#   http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
################################################################

################################################################
# Scrutmydocs configuration file
# This file should be in ~/.scrutmydocs/config/ directory
# under the name scrutsmydocs.properties
# If not present, it will be created the first time you start
# the web application...
################################################################

# Set to false if you want to connect your webapp to an existing
# Elasticsearch cluster, default to true
node.embedded=false

# If false, you have to define your node(s) address(es), default to :
# localhost:9300,localhost:9301
node.addresses=localhost:9300,localhost:9301

# Define the cluster name, default to : scrutmydocs
cluster.name=scrutmydocs

# Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,
# where ~ is the user home dir
path.data=/home/user/.scrutmydocs/esdata
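To make the effect of these settings easier to see, here is a rough Python sketch that parses the four settings and summarises them. It assumes the setting lines are active (not commented out) exactly as they are displayed in this message. The helpers `parse_properties` and `describe` are hypothetical names, and the defaults are the ones stated in the file's own comments.

```python
# The four setting lines as displayed in the message above (assumed active).
SAMPLE = """\
node.embedded=false
node.addresses=localhost:9300,localhost:9301
cluster.name=scrutmydocs
path.data=/home/user/.scrutmydocs/esdata
"""


def parse_properties(text):
    """Parse simple key=value lines, skipping blanks and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def describe(props):
    """Summarise node mode, addresses and data dir, using the documented defaults."""
    embedded = props.get("node.embedded", "true").lower() != "false"
    addresses = props.get("node.addresses", "localhost:9300,localhost:9301")
    return {
        "embedded": embedded,
        "addresses": [a.strip() for a in addresses.split(",") if a.strip()],
        "cluster": props.get("cluster.name", "scrutmydocs"),
        "data_dir": props.get("path.data", "~/.scrutmydocs/esdata"),
    }


if __name__ == "__main__":
    for key, value in describe(parse_properties(SAMPLE)).items():
        print(key, "=", value)
```

If the lines really are active, the summary shows a non-embedded node expected at localhost:9300 and localhost:9301, and a data directory under /home/user rather than /root/.scrutmydocs. Both points may be worth checking against the running setup.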

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with the
following contents:

# Mandatory cluster Name. You should be able to modify it in a future
# release.
cluster.name: scrutmydocs

# If you want to check plugins before starting
plugin.mandatory: mapper-attachments, river-fs

# If you want to disable multicast
discovery.zen.ping.multicast.enabled: false

#cluster:
name: TigerCluster
#network:
host: 127.0.0.1
#discovery:
zen:
multicast.enabled: false
#http:
max_content_length: 100000
#index:
number_of_shards: 1
analysis:
analyzer:
default:
type: standard
lowercase_analyzer:
type: custom
tokenizer: standard
filter: [standard, lowercase]

  2. I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
    re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
    wrong and so you can follow the procedure

  3. un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

 * Stopping Elasticsearch Server                                   [ OK ]
Purging configuration files for elasticsearch ...
Removing user `elasticsearch' ...
Warning: group `elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

  4. went to GlassFish, undeployed ScrutMyDocs

  5. reboot

  6. re-install Elasticsearch (from bash)... I skipped this step, as ScrutMyDocs
    uses an embedded server, so it would be useless

  7. re-deploy ScrutMyDocs
    did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code from
GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr

By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appear within your container logs.
If you run an external elasticsearch node, you should know where you put
logs.

I still don't understand how you get this. Logs would help a lot here I
think.

Could you describe each step, one by one, of what you are doing? And do
it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfr https://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

On 21 June 2013 at 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without problems.

Anyway, I still have two problems left:

  1. did you configure something special in scrutmydocs about the location
    for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

  2. when I re-create the river, everything is the same as before

a. it won't read my files; it just shows the ones which were manually uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is still a dummy river.

  3. after this works, I am planning to add Twitter, Wikipedia and RSS
    capabilities to it. At least that is what my client wants, so sometime in
    the future you will get a nice version, as a way to thank you for all your
    help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded the gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | Elasticsearch.com http://elasticsearch.com/
@dadoonet https://twitter.com/dadoonet | @elasticsearchfr https://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

On 17 June 2013 at 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs# ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

_______|||
<
) .------.
-----------,------.-' ,-' -.

             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \


      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |


      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /

            `.    |  |            `.                    ,'

             |    |  |              `.                ,'

_____,------------------. -._ _,-' <___________________________) ------'
| _| |

               `.____|

=================================

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

I also got the GlassFish logs; they contain some errors related to
Elasticsearch. I guess the problem is there, but I do not yet really know
how to read some of the information in them.

What would be the right way to show them to you?
Gist as well?

(they are 33 KB and 606 KB long)

Well, I uploaded them to Gist; I hope I did not do anything wrong... It took
them without crying anyway.

  1. jvm.log
  2. server.log

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

I also got the GlassFish logs; they contain some errors related to
Elasticsearch... I guess the problem is there, but I do not yet really know
how to read some of the information in them.

What would be the right way to show them to you?
Gist as well?

(They are 33 KB and 606 KB long.)

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

Thank you very much for your support.

  1. everything clean in /root/.scrutmydocs (there only was a config doc,
    no data)

  2. glassfish restarted

  3. the docs that were manually uploaded are still there

  4. running processes:

fatima running processes · GitHub

( I do learn )

  5. when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:

{"ok":true,
"errors":null,
"object": [{"id":"tiger",
"name":"tiger",
"indexname":"docstiger",
"typename":"doctiger",
"start":true,
"type":"dummy"}] <--- dummy
}
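
The check done by eye on that response can be sketched in a few lines of Python. This is only an illustration: the response body is the one quoted in this thread, and `dummy_rivers` is a hypothetical helper name, not part of ScrutMyDocs.

```python
import json

# Response body as quoted in this thread (settings/rivers endpoint).
RESPONSE = (
    '{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger",'
    '"indexname":"docstiger","typename":"doctiger","start":true,'
    '"type":"dummy"}]}'
)


def dummy_rivers(body):
    """Return the ids of rivers whose reported type is "dummy"."""
    payload = json.loads(body)
    rivers = payload.get("object") or []
    return [river["id"] for river in rivers if river.get("type") == "dummy"]


if __name__ == "__main__":
    print(dummy_rivers(RESPONSE))
```

An empty list would mean that no river in the response is reported as a dummy one.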

2013/6/22 David Pilato david@pilato.fr

Clean everything in /root/.scrutmydocs
and restart GlassFish.

Your old docs/rivers should disappear.
If not, could you list the running processes? ps -ef
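
Since the full `ps -ef` listing is long, it may help to keep only the lines that look relevant. A minimal sketch, assuming the interesting processes mention "elasticsearch" or "glassfish" somewhere in their command line (the keyword list and the helper names are assumptions for illustration):

```python
import subprocess

# Assumed search terms; adjust them to whatever the processes are called.
KEYWORDS = ("elasticsearch", "glassfish")


def matching_processes(ps_output, keywords=KEYWORDS):
    """Keep the `ps -ef` lines (header excluded) that mention a keyword."""
    lines = ps_output.splitlines()
    return [
        line for line in lines[1:]
        if any(keyword in line.lower() for keyword in keywords)
    ]


def list_matches():
    """Run `ps -ef` on the local machine and filter its output."""
    result = subprocess.run(
        ["ps", "-ef"], capture_output=True, text=True, check=True
    )
    return matching_processes(result.stdout)
```

Calling `list_matches()` on the server and printing each returned line gives a short listing that is easier to share.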

Note: when sharing content like this on the mailing list, please use
Gist instead of pasting your code here.

--
David ;-)
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

On 22 June 2013 at 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

Thank you very much for your support.
I will redo everything from the beginning, so you can see if I am doing
something weird or wrong. I followed the instructions from the web, and did
not do anything special.

  1. folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch folder,
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start GlassFish while su... maybe this is
the root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################
# Licensed to scrutmydocs.org (the "Author") under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. Author licenses this
# file to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.
################################################################

################################################################
# Scrutmydocs configuration file
# This file should be in ~/.scrutmydocs/config/ directory
# under the name scrutsmydocs.properties
# If not present, it will be created the first time you start
# the web application...
################################################################

# Set to false if you want to connect your webapp to an existing
# Elasticsearch cluster, default to true
node.embedded=false

# If false, you have to define your node(s) address(es), default to :
# localhost:9300,localhost:9301
node.addresses=localhost:9300,localhost:9301

# Define the cluster name, default to : scrutmydocs
cluster.name=scrutmydocs

# Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,
# where ~ is the user home dir
path.data=/home/user/.scrutmydocs/esdata
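
Read according to the file's own comments, these values would mean the webapp connects to an external cluster rather than starting an embedded node. The sketch below only restates that reading; `parse_properties` and `describe` are hypothetical helper names, the defaults are the ones named in the comments above, and the real application may resolve its settings differently.

```python
# The four settings exactly as quoted in the message above.
SETTINGS = """\
node.embedded=false
node.addresses=localhost:9300,localhost:9301
cluster.name=scrutmydocs
path.data=/home/user/.scrutmydocs/esdata
"""


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def describe(props):
    """Summarise where documents would be indexed, per the file's comments."""
    # Defaults below are the ones stated in the configuration file comments.
    embedded = props.get("node.embedded", "true").lower() == "true"
    if embedded:
        return "embedded node, data in " + props.get(
            "path.data", "~/.scrutmydocs/esdata"
        )
    return "external cluster at " + props.get(
        "node.addresses", "localhost:9300,localhost:9301"
    )


if __name__ == "__main__":
    print(describe(parse_properties(SETTINGS)))
```

With the quoted values this reports an external cluster on localhost:9300 and localhost:9301, which may be worth comparing with what is actually running on the machine.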

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with
the following contents:

# Mandatory cluster Name. You should be able to modify it in a future
# release.
cluster.name: scrutmydocs

# If you want to check plugins before starting
plugin.mandatory: mapper-attachments, river-fs

# If you want to disable multicast
discovery.zen.ping.multicast.enabled: false

#cluster:
name: TigerCluster

#network:
host: 127.0.0.1

#discovery:
zen:
multicast.enabled: false

#http:
max_content_length: 100000

#index:
number_of_shards: 1
analysis:
analyzer:
default:
type: standard
lowercase_analyzer:
type: custom
tokenizer: standard
filter: [standard, lowercase]

  2. I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
    re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
    wrong and so you can follow the procedure

  3. un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

 * Stopping Elasticsearch Server                                  [ OK ]
Purging configuration files for elasticsearch ...
Removing user `elasticsearch' ...
Warning: group `elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

  4. went to GlassFish, undeployed ScrutMyDocs

  5. reboot

  6. re-install Elasticsearch (from bash)... skip this step, as
    ScrutMyDocs uses an embedded server, so it will be useless

  7. re-deploy ScrutMyDocs
    did it in GlassFish again.
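
The dpkg warnings in the purge output above name two directories that were left behind because they were not empty. A small sketch to see what is still in them; the two paths come from those warnings, and `non_empty_dirs` is a hypothetical helper name:

```python
import os

# Directories named in the dpkg warnings of the purge output.
LEFTOVER_DIRS = ["/etc/elasticsearch", "/usr/share/elasticsearch"]


def non_empty_dirs(paths):
    """Map each existing, non-empty directory to its sorted entry names."""
    found = {}
    for path in paths:
        if os.path.isdir(path):
            entries = sorted(os.listdir(path))
            if entries:
                found[path] = entries
    return found


if __name__ == "__main__":
    for path, entries in non_empty_dirs(LEFTOVER_DIRS).items():
        print(path, entries)
```

Any configuration file listed there survived the purge, so it could still be picked up by a later re-install.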

When it started, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code from
GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr

By default, Scrutmydocs runs an embedded node, so the Elasticsearch logs
appear within your container logs.
If you run an external Elasticsearch node, you should know where you
put the logs.

I still don't understand how you get this. Logs would help a lot here, I
think.

Could you describe each step, one by one, of what you are doing? And
do it from the start?
Also remove the ~/.scrutmydocs dir.
Perhaps you have strange data/config there?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfr https://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

On 21 June 2013 at 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without
problems.

Anyway, I still have two problems left:

  1. did you configure something special in scrutmydocs about the location
    for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

  2. when I re-create the river, everything is the same as before

a. it won't read my files, it just shows the ones which were manually
uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is still a dummy river.

  3. after this works, I am planning to add Twitter, Wikipedia and RSS
    capabilities to it. At least that is what my client wants, so sometime in
    the future you will get a nice version, as a way to thank you for all your
    help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | Elasticsearch.com http://elasticsearch.com/
@dadoonet https://twitter.com/dadoonet | @elasticsearchfr https://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

On 17 June 2013 at 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> wrote:

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs# ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#


Oh, I could have put less text into the second one (server.log); the
interesting part starts at this string (because I ran a new test today):

Jun 25, 2013 4:52:07 AM com.sun.enterprise.admin.launcher.GFLauncherLogger info

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

Well, I uploaded them to Gist; I hope I did not do anything wrong... It took
them without crying anyway.

  1. jvm.log:

fatima jvm.log · GitHub

  2. server.log:

fatima server.log · GitHub

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

I also got the glassfish logs, they got some errors related to
elasticsearch... I guess the problem is there, but I do not yet really know
how to read some of the information there...

Which would be the right way to show them to you ?
Gist also ?

(they are 33 KB and 606 KB long)

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

Thank you very much for your support.

  1. everything clean in /root/.scrutmydocs (there only was a config doc,
    no data)

  2. glassfish restarted

  3. the docs that were manually uploaded are still there

  4. running processes:

fatima running processes · GitHub

( I do learn )

  1. when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:wa

{"ok":true,

"errors":null,

"object": [{"id":"tiger",

"name":"tiger",

"indexname":"docstiger",

"typename":"doctiger",

"start":true,

"type":"dummy"}] <--- dummy

}

2013/6/22 David Pilato david@pilato.fr

Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use
Gist instead of pasting your code here.

--
David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 22 juin 2013 à 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing
something weird or wrong. I followed the instructions from the web, and did
not do anything special.

  1. folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch
folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is
the root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################

Licensed to scrutmydocs.org (the "Author") under one

or more contributor license agreements. See the NOTICE file

distributed with this work for additional information

regarding copyright ownership. Author licenses this

file to you under the Apache License, Version 2.0 (the

"License"); you may not use this file except in compliance

with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing,

software distributed under the License is distributed on an

"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY

KIND, either express or implied. See the License for the

specific language governing permissions and limitations

under the License.

################################################################

################################################################

Scrutmydocs configuration file

This file should be in ~/.scrutmydocs/config/ directory

under the name scrutsmydocs.properties

If not present, it will be created the first time you start

the web application...

################################################################

Set to false if you want to connect your webapp to an existing

Elasticsearch cluster, default to true

node.embedded=false

If false, you have to define your node(s) address(es), default to :

localhost:9300,localhost:9301

node.addresses=localhost:9300,localhost:9301

Define the cluster name, default to : scrutmydocs

cluster.name=scrutmydocs

Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,

where ~ is the user home dir

path.data=/home/user/.scrutmydocs/esdata

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with
the following contents:

Mandatory cluster Name. You should be able to modify it in a future

release.
cluster.name: scrutmydocs

If you want to check plugins before starting

plugin.mandatory: mapper-attachments, river-fs

If you want to disable multicast

discovery.zen.ping.multicast.enabled: false

#cluster:

name: TigerCluster

#network:

host: 127.0.0.1

#discovery:

zen:

multicast.enabled: false

#http:

max_content_length: 100000

#index:

number_of_shards: 1

analysis:

analyzer:

default:

type: standard

lowercase_analyzer:

type: custom

tokenizer: standard

filter: [standard, lowercase]

  1. I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
    re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
    wrong and so you can follow the procedure

  2. un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

  • Stopping Elasticsearch Server
    [ OK ]
    Purging configuration files for elasticsearch ...
    Removing user elasticsearch' ... Warning: group elasticsearch' has no more members.
    Done.
    The group `elasticsearch' does not exist.
    dpkg: warning: while removing elasticsearch, directory
    '/etc/elasticsearch' not empty so not removed
    dpkg: warning: while removing elasticsearch, directory
    '/usr/share/elasticsearch' not empty so not removed
    Processing triggers for ureadahead ...
    ureadahead will be reprofiled on next reboot
    root@FatiLinux:/home/fatima#
  1. went to GlassFish, undeployed ScrutMyDocs

  2. reboot

  3. re-install Elasticsearch (from bash)... skip this step, as
    ScrutMyDocs uses and embedded server, so it will be useless

  4. re-deploy ScrutMyDocs
    did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code
from GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr

By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appears within your container logs.
If you run an external elasticsearch node, you should know where you
put logs.

I still don't understand how you get this. Logs would help a lot here
I think.

Could you describe each step, one by one, of what you are doing? And
do it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 21 juin 2013 à 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without
problems.

Anyway, I got still two problems left:

  1. did you configure someone special in scrutmydocs about the location
    for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

  1. when I re-create the river, all is the same as before

a. it won't read my files, just shows the one which were manually
uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is yet a dummy river still.

  1. after this works, I am planning to add Twitter, Wikipedia and RSS
    capabilities to it. At least that is what my client wants, so sometime in
    the future you will get a nice version, as a way to thank you for all your
    help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | *Elasticsearch.comhttp://elasticsearch.com/
*
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 17 juin 2013 à 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd
scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#
ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux
:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

_______|||
<
) .------.
-----------,------.-' ,-' -.

             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \





      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |





      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /




            `.    |  |            `.                    ,'

             |    |  |              `.                ,'

_____,------------------. -._ _,-' <___________________________) ------'
| _| |

               `.____|

=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
