Indexing files with file system _river in scrutmydocs won't read any files

Fatima_Castiglione_M · June 25, 2013, 8:37am

Thank you very much for your support.

everything clean in /root/.scrutmydocs (there only was a config doc, no
data)
glassfish restarted
the docs that were manually uploaded are still there
running processes:

gist.github.com

https://gist.github.com/anonymous/26084c596e886c262bd6

procs

fatima@FatiLinux:~$ ps -ef
UID        PID  PPID  C STIME TTY          TIME CMD
root         1     0  0 jun24 ?        00:00:01 /sbin/init
root         2     0  0 jun24 ?        00:00:00 [kthreadd]
root         3     2  0 jun24 ?        00:09:39 [ksoftirqd/0]
root         5     2  0 jun24 ?        00:00:00 [kworker/0:0H]
root         7     2  0 jun24 ?        00:00:00 [kworker/u:0H]
root         8     2  0 jun24 ?        00:00:00 [migration/0]
root         9     2  0 jun24 ?        00:04:02 [rcu_preempt]
root        10     2  0 jun24 ?        00:00:00 [rcu_bh]

This file has been truncated. show original

( I do learn )

when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:wa

{"ok":true,

"errors":null,

"object": [{"id":"tiger",

"name":"tiger",

"indexname":"docstiger",

"typename":"doctiger",

"start":true,

"type":"dummy"}] <--- dummy

}

2013/6/22 David Pilato david@pilato.fr

Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use Gist
instead of pasting your code here.

--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 22 juin 2013 à 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing something
weird or wrong. I followed the instructions from the web, and did not do
anything special.

folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is the
root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################

Licensed to scrutmydocs.org (the "Author") under one

or more contributor license agreements. See the NOTICE file

distributed with this work for additional information

regarding copyright ownership. Author licenses this

file to you under the Apache License, Version 2.0 (the

"License"); you may not use this file except in compliance

with the License. You may obtain a copy of the License at

Apache License, Version 2.0 | Apache Software Foundations

Unless required by applicable law or agreed to in writing,

software distributed under the License is distributed on an

"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY

KIND, either express or implied. See the License for the

specific language governing permissions and limitations

under the License.

################################################################

################################################################

Scrutmydocs configuration file

This file should be in ~/.scrutmydocs/config/ directory

under the name scrutsmydocs.properties

If not present, it will be created the first time you start

the web application...

################################################################

Set to false if you want to connect your webapp to an existing

Elasticsearch cluster, default to true

node.embedded=false

If false, you have to define your node(s) address(es), default to :

localhost:9300,localhost:9301

node.addresses=localhost:9300,localhost:9301

Define the cluster name, default to : scrutmydocs

cluster.name=scrutmydocs

Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,

where ~ is the user home dir

path.data=/home/user/.scrutmydocs/esdata

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with the
following contents:

Mandatory cluster Name. You should be able to modify it in a future

release.
cluster.name: scrutmydocs

If you want to check plugins before starting

plugin.mandatory: mapper-attachments, river-fs

If you want to disable multicast

discovery.zen.ping.multicast.enabled: false

#cluster:

name: TigerCluster

#network:

host: 127.0.0.1

#discovery:

zen:

multicast.enabled: false

#http:

max_content_length: 100000

#index:

number_of_shards: 1

analysis:

analyzer:

default:

type: standard

lowercase_analyzer:

type: custom

tokenizer: standard

filter: [standard, lowercase]

I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
wrong and so you can follow the procedure

un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

Stopping Elasticsearch Server
[ OK ]
Purging configuration files for elasticsearch ...
Removing user elasticsearch' ... Warning: group elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

went to GlassFish, undeployed ScrutMyDocs

reboot

re-install Elasticsearch (from bash)... skip this step, as ScrutMyDocs
uses and embedded server, so it will be useless

re-deploy ScrutMyDocs
did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code from
GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr
By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appears within your container logs.
If you run an external elasticsearch node, you should know where you put
logs.

I still don't understand how you get this. Logs would help a lot here I
think.

Could you describe each step, one by one, of what you are doing? And do
it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 21 juin 2013 à 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without problems.

Anyway, I got still two problems left:

did you configure someone special in scrutmydocs about the location
for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

when I re-create the river, all is the same as before

a. it won't read my files, just shows the one which were manually uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is yet a dummy river still.

after this works, I am planning to add Twitter, Wikipedia and RSS
capabilities to it. At least that is what my client wants, so sometime in
the future you will get a nice version, as a way to thank you for all your
help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | *Elasticsearch.com http://elasticsearch.com/
*
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 17 juin 2013 à 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd
scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#
ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \


      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |


      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /

            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \

      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |

      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /
            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

|||
< ) .------.
-----------,------.-' ,-' -.
| | | ,' . ,' | | ,' .
| ,-' | /
,'-' . ---.|_________
.--' -----. | _____________________ -. ----- | | ___| | | \ ,- \ | | ___| |===========================((|) | | | | | | _____________________/ - / |
--._ -----' | _________________,-' ----- | .-._ ,' __.---' | /
| -. | \ / . | | . ,' | | | . ,'
_____,------------------. -._ _,-' <___________________________) ------'
| | |
`.___|

=================================

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Fatima_Castiglione_M · June 25, 2013, 8:52am

I also got the glassfish logs, they got some errors related to
elasticsearch... I guess the problem is there, but I do not yet really know
how to read some of the information there...

Which would be the right way to show them to you ?
Gist also ?

(they are 33 KB and 606 KB long)

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

Thank you very much for your support.

everything clean in /root/.scrutmydocs (there only was a config doc, no
data)

glassfish restarted

the docs that were manually uploaded are still there

running processes:

fatima running processes · GitHub

( I do learn )

when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:wa

{"ok":true,

"errors":null,

"object": [{"id":"tiger",

"name":"tiger",

"indexname":"docstiger",

"typename":"doctiger",

"start":true,

"type":"dummy"}] <--- dummy

}

2013/6/22 David Pilato david@pilato.fr
Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use Gist
instead of pasting your code here.

--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 22 juin 2013 à 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing
something weird or wrong. I followed the instructions from the web, and did
not do anything special.

folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is the
root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################

Licensed to scrutmydocs.org (the "Author") under one

or more contributor license agreements. See the NOTICE file

distributed with this work for additional information

regarding copyright ownership. Author licenses this

file to you under the Apache License, Version 2.0 (the

"License"); you may not use this file except in compliance

with the License. You may obtain a copy of the License at

Apache License, Version 2.0 | Apache Software Foundations

Unless required by applicable law or agreed to in writing,

software distributed under the License is distributed on an

"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY

KIND, either express or implied. See the License for the

specific language governing permissions and limitations

under the License.

################################################################

################################################################

Scrutmydocs configuration file

This file should be in ~/.scrutmydocs/config/ directory

under the name scrutsmydocs.properties

If not present, it will be created the first time you start

the web application...

################################################################

Set to false if you want to connect your webapp to an existing

Elasticsearch cluster, default to true

node.embedded=false

If false, you have to define your node(s) address(es), default to :

localhost:9300,localhost:9301

node.addresses=localhost:9300,localhost:9301

Define the cluster name, default to : scrutmydocs

cluster.name=scrutmydocs

Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,

where ~ is the user home dir

path.data=/home/user/.scrutmydocs/esdata

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with the
following contents:

Mandatory cluster Name. You should be able to modify it in a future

release.
cluster.name: scrutmydocs

If you want to check plugins before starting

plugin.mandatory: mapper-attachments, river-fs

If you want to disable multicast

discovery.zen.ping.multicast.enabled: false

#cluster:

name: TigerCluster

#network:

host: 127.0.0.1

#discovery:

zen:

multicast.enabled: false

#http:

max_content_length: 100000

#index:

number_of_shards: 1

analysis:

analyzer:

default:

type: standard

lowercase_analyzer:

type: custom

tokenizer: standard

filter: [standard, lowercase]

I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
wrong and so you can follow the procedure

un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

Stopping Elasticsearch Server
[ OK ]
Purging configuration files for elasticsearch ...
Removing user elasticsearch' ... Warning: group elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

went to GlassFish, undeployed ScrutMyDocs

reboot

re-install Elasticsearch (from bash)... skip this step, as ScrutMyDocs
uses and embedded server, so it will be useless

re-deploy ScrutMyDocs
did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code from
GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr
By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appears within your container logs.
If you run an external elasticsearch node, you should know where you put
logs.

I still don't understand how you get this. Logs would help a lot here I
think.

Could you describe each step, one by one, of what you are doing? And do
it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 21 juin 2013 à 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without
problems.

Anyway, I got still two problems left:

did you configure someone special in scrutmydocs about the location
for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

when I re-create the river, all is the same as before

a. it won't read my files, just shows the one which were manually
uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is yet a dummy river still.

after this works, I am planning to add Twitter, Wikipedia and RSS
capabilities to it. At least that is what my client wants, so sometime in
the future you will get a nice version, as a way to thank you for all your
help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | *Elasticsearch.com http://elasticsearch.com/
*
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 17 juin 2013 à 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd
scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#
ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \



      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |



      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /


            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \


      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |


      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /

            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \

      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |

      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /
            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

|||
< ) .------.
-----------,------.-' ,-' -.
| | | ,' . ,' | | ,' .
| ,-' | /
,'-' . ---.|_________
.--' -----. | _____________________ -. ----- | | ___| | | \ ,- \ | | ___| |===========================((|) | | | | | | _____________________/ - / |
--._ -----' | _________________,-' ----- | .-._ ,' __.---' | /
| -. | \ / . | | . ,' | | | . ,'
_____,------------------. -._ _,-' <___________________________) ------'
| | |
`.___|

=================================

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Fatima_Castiglione_M · June 25, 2013, 8:56am

Well, I uploaded them to gist, hope did not do anything wrong... It took
them without crying anyway.

jvm.log :

gist.github.com

https://gist.github.com/anonymous/d8160173305cbc9b084e

jvm.log

<?xml version='1.0' encoding='UTF-8'?>
<hotspot_log version='160 1' process='13287' time_ms='1372146727776'>
<vm_version>
<name>
OpenJDK 64-Bit Server VM
</name>
<release>
20.0-b12
</release>
<info>

This file has been truncated. show original

server.log

gist.github.com

https://gist.github.com/anonymous/d17a86564470ab07c6e8

server.log

[#|2013-06-20T23:08:25.744-0300|INFO|glassfish3.0.1|javax.enterprise.system.std.com.sun.enterprise.v3.services.impl|_ThreadID=32;_ThreadName=Thread-1;|23:08:25,739 DEBUG [RiverService] - Exception stacktrace :
org.elasticsearch.action.NoShardAvailableActionException: [_river][0] No shard available for [[_river][myfirstriver][_status]: routing [null]]
	at org.elasticsearch.action.support.single.shard.TransportShardSingleOperationAction$AsyncSingleAction.perform(TransportShardSingleOperationAction.java:139)
	at org.elasticsearch.action.support.single.shard.TransportShardSingleOperationAction$AsyncSingleAction.start(TransportShardSingleOperationAction.java:124)
	at org.elasticsearch.action.support.single.shard.TransportShardSingleOperationAction.doExecute(TransportShardSingleOperationAction.java:71)
	at org.elasticsearch.action.support.single.shard.TransportShardSingleOperationAction.doExecute(TransportShardSingleOperationAction.java:46)
	at org.elasticsearch.action.support.TransportAction.execute(TransportAction.java:61)
	at org.elasticsearch.client.node.NodeClient.execute(NodeClient.java:92)
	at org.elasticsearch.client.support.AbstractClient.get(AbstractClient.java:179)
	at org.elasticsearch.action.get.GetRequestBuilder.doExecute(GetRequestBuilder.java:112)

This file has been truncated. show original

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

I also got the glassfish logs, they got some errors related to
elasticsearch... I guess the problem is there, but I do not yet really know
how to read some of the information there...

Which would be the right way to show them to you ?
Gist also ?

(they are 33 KB and 606 KB long)

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com
Thank you very much for your support.

everything clean in /root/.scrutmydocs (there only was a config doc,
no data)

glassfish restarted

the docs that were manually uploaded are still there

running processes:

fatima running processes · GitHub

( I do learn )

when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:wa

{"ok":true,

"errors":null,

"object": [{"id":"tiger",

"name":"tiger",

"indexname":"docstiger",

"typename":"doctiger",

"start":true,

"type":"dummy"}] <--- dummy

}

2013/6/22 David Pilato david@pilato.fr
Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use
Gist instead of pasting your code here.

--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 22 juin 2013 à 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing
something weird or wrong. I followed the instructions from the web, and did
not do anything special.

folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is
the root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################

Licensed to scrutmydocs.org (the "Author") under one

or more contributor license agreements. See the NOTICE file

distributed with this work for additional information

regarding copyright ownership. Author licenses this

file to you under the Apache License, Version 2.0 (the

"License"); you may not use this file except in compliance

with the License. You may obtain a copy of the License at

Apache License, Version 2.0 | Apache Software Foundations

Unless required by applicable law or agreed to in writing,

software distributed under the License is distributed on an

"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY

KIND, either express or implied. See the License for the

specific language governing permissions and limitations

under the License.

################################################################

################################################################

Scrutmydocs configuration file

This file should be in ~/.scrutmydocs/config/ directory

under the name scrutsmydocs.properties

If not present, it will be created the first time you start

the web application...

################################################################

Set to false if you want to connect your webapp to an existing

Elasticsearch cluster, default to true

node.embedded=false

If false, you have to define your node(s) address(es), default to :

localhost:9300,localhost:9301

node.addresses=localhost:9300,localhost:9301

Define the cluster name, default to : scrutmydocs

cluster.name=scrutmydocs

Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,

where ~ is the user home dir

path.data=/home/user/.scrutmydocs/esdata

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with
the following contents:

Mandatory cluster Name. You should be able to modify it in a future

release.
cluster.name: scrutmydocs

If you want to check plugins before starting

plugin.mandatory: mapper-attachments, river-fs

If you want to disable multicast

discovery.zen.ping.multicast.enabled: false

#cluster:

name: TigerCluster

#network:

host: 127.0.0.1

#discovery:

zen:

multicast.enabled: false

#http:

max_content_length: 100000

#index:

number_of_shards: 1

analysis:

analyzer:

default:

type: standard

lowercase_analyzer:

type: custom

tokenizer: standard

filter: [standard, lowercase]

I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
wrong and so you can follow the procedure

un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

Stopping Elasticsearch Server
[ OK ]
Purging configuration files for elasticsearch ...
Removing user elasticsearch' ... Warning: group elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

went to GlassFish, undeployed ScrutMyDocs

reboot

re-install Elasticsearch (from bash)... skip this step, as
ScrutMyDocs uses and embedded server, so it will be useless

re-deploy ScrutMyDocs
did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code from
GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr
By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appears within your container logs.
If you run an external elasticsearch node, you should know where you
put logs.

I still don't understand how you get this. Logs would help a lot here I
think.

Could you describe each step, one by one, of what you are doing? And
do it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 21 juin 2013 à 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without
problems.

Anyway, I got still two problems left:

did you configure someone special in scrutmydocs about the location
for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

when I re-create the river, all is the same as before

a. it won't read my files, just shows the one which were manually
uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is yet a dummy river still.

after this works, I am planning to add Twitter, Wikipedia and RSS
capabilities to it. At least that is what my client wants, so sometime in
the future you will get a nice version, as a way to thank you for all your
help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | *Elasticsearch.com http://elasticsearch.com/
*
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 17 juin 2013 à 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd
scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#
ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux
:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \




      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |




      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /



            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \



      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |



      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /


            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \


      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |


      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /

            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \

      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |

      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /
            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

|||
< ) .------.
-----------,------.-' ,-' -.
| | | ,' . ,' | | ,' .
| ,-' | /
,'-' . ---.|_________
.--' -----. | _____________________ -. ----- | | ___| | | \ ,- \ | | ___| |===========================((|) | | | | | | _____________________/ - / |
--._ -----' | _________________,-' ----- | .-._ ,' __.---' | /
| -. | \ / . | | . ,' | | | . ,'
_____,------------------. -._ _,-' <___________________________) ------'
| | |
`.___|

=================================

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Fatima_Castiglione_M · June 25, 2013, 8:59am

Oh, I could have put less text into the second, the interesting part starts
here, at this string:
(because I have run a new test today)

Jun 25, 2013 4:52:07 AM com.sun.enterprise.admin.launcher.GFLauncherLogger
info

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com

Well, I uploaded them to gist, hope did not do anything wrong... It took
them without crying anyway.

jvm.log :

fatima jvm.log · GitHub

server.log

fatima server.log · GitHub

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com
I also got the glassfish logs, they got some errors related to
elasticsearch... I guess the problem is there, but I do not yet really know
how to read some of the information there...

Which would be the right way to show them to you ?
Gist also ?

(they are 33 KB and 606 KB long)

2013/6/25 Fatima Castiglione Maldonado 发 castiglionemaldonado@gmail.com
Thank you very much for your support.

everything clean in /root/.scrutmydocs (there only was a config doc,
no data)

glassfish restarted

the docs that were manually uploaded are still there

running processes:

fatima running processes · GitHub

( I do learn )

when doing

http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

I get the not-so-good old dummy river:wa

{"ok":true,

"errors":null,

"object": [{"id":"tiger",

"name":"tiger",

"indexname":"docstiger",

"typename":"doctiger",

"start":true,

"type":"dummy"}] <--- dummy

}

2013/6/22 David Pilato david@pilato.fr
Clean everything in /root/.scrutmydocs
And restart glassfish.

Your old docs/rivers should disappear.
If not, could you list running process? ps -ef

Note: when sharing content like this on the mailing list, please use
Gist instead of pasting your code here.

--
David
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 22 juin 2013 à 07:32, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Thank you very much for your support.
I will redo all from the beginning, so you can see if I am doing
something weird or wrong. I followed the instructions from the web, and did
not do anything special.

folders and files

a. I do not have a ~/.scrutmydocs folder, nor a ~/.elasticsearch
folder
but there is a /root/.scrutmydocs/config folder with a scrutmydocs.properties
file

(this happens because I start-up GlassFish while su... maybe this is
the root (no pun intended) of my problems?)

scrutmydocs.properties file contents:

################################################################

Licensed to scrutmydocs.org (the "Author") under one

or more contributor license agreements. See the NOTICE file

distributed with this work for additional information

regarding copyright ownership. Author licenses this

file to you under the Apache License, Version 2.0 (the

"License"); you may not use this file except in compliance

with the License. You may obtain a copy of the License at

Apache License, Version 2.0 | Apache Software Foundations

Unless required by applicable law or agreed to in writing,

software distributed under the License is distributed on an

"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY

KIND, either express or implied. See the License for the

specific language governing permissions and limitations

under the License.

################################################################

################################################################

Scrutmydocs configuration file

This file should be in ~/.scrutmydocs/config/ directory

under the name scrutsmydocs.properties

If not present, it will be created the first time you start

the web application...

################################################################

Set to false if you want to connect your webapp to an existing

Elasticsearch cluster, default to true

node.embedded=false

If false, you have to define your node(s) address(es), default to :

localhost:9300,localhost:9301

node.addresses=localhost:9300,localhost:9301

Define the cluster name, default to : scrutmydocs

cluster.name=scrutmydocs

Define the Elasticsearch data dir, default to ~/.scrutmydocs/esdata,

where ~ is the user home dir

path.data=/home/user/.scrutmydocs/esdata

b. I do have a /usr/share/elasticsearch/conf/elasticsearch.yml, with
the following contents:

Mandatory cluster Name. You should be able to modify it in a future

release.
cluster.name: scrutmydocs

If you want to check plugins before starting

plugin.mandatory: mapper-attachments, river-fs

If you want to disable multicast

discovery.zen.ping.multicast.enabled: false

#cluster:

name: TigerCluster

#network:

host: 127.0.0.1

#discovery:

zen:

multicast.enabled: false

#http:

max_content_length: 100000

#index:

number_of_shards: 1

analysis:

analyzer:

default:

type: standard

lowercase_analyzer:

type: custom

tokenizer: standard

filter: [standard, lowercase]

I will un-install Elasticsearch, undeploy ScrutMyDocs, reboot,
re-install Elasticsearch and re-deploy ScrutMyDocs to be sure nothing is
wrong and so you can follow the procedure

un-install Elasticsearch (in bash)

fatima@FatiLinux:~$ sudo su
[sudo] password for fatima:
root@FatiLinux:/home/fatima# apt-get purge elasticsearch
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer
required:
antlr3 cup default-jdk-doc javahelp2 junit4-doc libbeansbinding-java
libbetter-appframework-java libbindex-java
libbytelist-java libcommons-compress-java libcommons-net1-java
libdb-java libdb-je-java libdb5.1-java libdb5.1-java-jni
libfelix-framework-java libfelix-main-java libflute-java libfsplib0
libhamcrest-java-doc libicu4j-java libini4j-java
libjcodings-java libjemmy2-java libjna-java libjoda-convert-java
libjoda-time-java libjvyamlb-java libjzlib-java
liblucene2-java libmysql-java libnb-absolutelayout-java
libnb-apisupport3-java libnb-ide14-java libnb-java5-java
libnb-javaparser-java libnb-org-openide-modules-java
libnb-org-openide-util-java libnb-org-openide-util-lookup-java
libnb-platform-devel-java libnb-platform13-java libnetx-java
libpostgresql-jdbc-java libsac-java libsac-java-gcj
libsequence-library-java libserf1 libsimple-validation-java
libsqljet-java libstringtemplate-java libsvn-java libsvn1
libsvnclientadapter-java libsvnkit-java libswing-layout-java
libswingx1-java libswt-cairo-gtk-3-jni libswt-gnome-gtk-3-jni
libswt-gtk-3-java libswt-gtk-3-jni libswt-webkit-gtk-3-jni libtre5
libtrilead-ssh2-java libxz-java openjdk-7-doc weka
Use 'apt-get autoremove' to remove them.
The following packages will be REMOVED:
elasticsearch*
0 upgraded, 0 newly installed, 1 to remove and 82 not upgraded.
After this operation, 19,9 MB disk space will be freed.
Do you want to continue [Y/n]? y
(Reading database ... 305350 files and directories currently installed.)
Removing elasticsearch ...

Stopping Elasticsearch Server
[ OK ]
Purging configuration files for elasticsearch ...
Removing user elasticsearch' ... Warning: group elasticsearch' has no more members.
Done.
The group `elasticsearch' does not exist.
dpkg: warning: while removing elasticsearch, directory
'/etc/elasticsearch' not empty so not removed
dpkg: warning: while removing elasticsearch, directory
'/usr/share/elasticsearch' not empty so not removed
Processing triggers for ureadahead ...
ureadahead will be reprofiled on next reboot
root@FatiLinux:/home/fatima#

went to GlassFish, undeployed ScrutMyDocs

reboot

re-install Elasticsearch (from bash)... skip this step, as
ScrutMyDocs uses and embedded server, so it will be useless

re-deploy ScrutMyDocs
did it in GlassFish again.

When it starts, I found four things:

a. it still has all the docs that were manually uploaded

b. it still has the river, pointing to the same path as before

c. it still won't read my docs from the file system

d. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

(the change in the name of the app is because I downloaded the code
from GitHub and compiled it locally with Maven)

Thanks in advance.

2013/6/21 David Pilato david@pilato.fr
By default, Scrutmydocs runs an embedded node. So elasticsearch logs
appears within your container logs.
If you run an external elasticsearch node, you should know where you
put logs.

I still don't understand how you get this. Logs would help a lot here
I think.

Could you describe each step, one by one, of what you are doing? And
do it from start?
Also remove ~/.scrutmydocs dir
Perhaps you have strange data/config here???

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 21 juin 2013 à 05:15, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

Yes... That was the problem.
Thank you very much.
Now I can download it from the repository and compile it without
problems.

Anyway, I got still two problems left:

did you configure someone special in scrutmydocs about the location
for the logs?

they are not in the usual places:

/usr/share/elasticsearch/logs/
/var/log/elasticsearch/elasticsearch.log

when I re-create the river, all is the same as before

a. it won't read my files, just shows the one which were manually
uploaded

b. the index is not shown in http://localhost:9200/_plugin/head/

c. when I do
http://localhost:8080/scrutmydocs-0.3.1-SNAPSHOT-test/api/1/settings/rivers
I get:

{"ok":true,"errors":null,"object":[{"id":"tiger","name":"tiger","indexname":"docstiger","typename":"doctiger","start":true,"type":"dummy"}]}

...so it is yet a dummy river still.

after this works, I am planning to add Twitter, Wikipedia and RSS
capabilities to it. At least that is what my client wants, so sometime in
the future you will get a nice version, as a way to thank you for all your
help.

Thanks in advance,
Fatima

2013/6/20 David Pilato david@pilato.fr

You probably downloaded gh-pages branch instead of master.
Try git checkout master

The right repository is this one:
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents

--
David Pilato | Technical Advocate | *Elasticsearch.com http://elasticsearch.com/
*
@dadoonet https://twitter.com/dadoonet | @elasticsearchfrhttps://twitter.com/elasticsearchfr
| @scrutmydocs https://twitter.com/scrutmydocs

Le 17 juin 2013 à 12:25, Fatima Castiglione Maldonado 发 <
castiglionemaldonado@gmail.com> a écrit :

and it is exactly the same no matter what URL I use:

root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# git clone
GitHub - scrutmydocs/scrutmydocs: Search Web Application for hard drive documents
Cloning into 'scrutmydocs'...
remote: Counting objects: 4218, done.
remote: Compressing objects: 100% (1980/1980), done.
remote: Total 4218 (delta 1367), reused 4123 (delta 1277)
Receiving objects: 100% (4218/4218), 1.34 MiB | 278 KiB/s, done.
Resolving deltas: 100% (1367/1367), done.
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# ls -ralh
total 41M
-rw------- 1 fatima fatima 711K jun 17 06:59 scrutmydocs-master.zip
drwx------ 1 fatima fatima 4,0K jun 17 07:21 scrutmydocs-master
-rw------- 1 fatima fatima 40M jun 10 02:22 scrutmydocs-0.2.0.war
drwx------ 1 fatima fatima 352 jun 17 07:24 scrutmydocs
drwx------ 1 fatima fatima 4,0K jun 16 21:27 ..
drwx------ 1 fatima fatima 4,0K jun 17 07:23 .
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch# cd
scrutmydocs
root@FatiLinux:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#
ls -ralh
total 8,5K
-rw------- 1 fatima fatima 3,0K jun 17 07:24 index.html
-rw------- 1 fatima fatima 10 jun 17 07:24 .gitignore
drwx------ 1 fatima fatima 440 jun 17 07:24 .git
drwx------ 1 fatima fatima 4,0K jun 17 07:23 ..
drwx------ 1 fatima fatima 352 jun 17 07:24 .
root@FatiLinux
:/media/fatima/Elements/Tiger/elasticSearch/scrutmydocs#

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \





      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |





      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /




            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \




      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |




      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /



            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \



      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |



      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /


            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \


      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |


      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /

            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================
--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com
                 ____
               ,'_   |
_______|||
< ) .------.
-----------,------.-' ,-' -.
             |    |  |              ,'                `.
            ,'    |  |            ,'                    `.
            |  _,-'  |__         /                        \
          _,'-'    `.   `---.___|_____________             \

      .--'  -----.  | _____________________   `-. -----     |
      |    ___|  |  |                      \  ,- \          |
      |    ___|  |===========================((|) |         |
      |       |  |  | _____________________/  `- /          |

      `--._ -----'  |        _________________,-' -----     |
           `.-._   ,' __.---'   |                          /
            |   `-.  |           \                        /
            `.    |  |            `.                    ,'

             |    |  |              `.                ,'
_____,------------------. -._ _,-' <___________________________) ------'
| _| |
               `.____|
=================================

--

Fátima Castiglione Maldonado
castiglionemaldonado@gmail.com

                 ____
               ,'_   |

|||
< ) .------.
-----------,------.-' ,-' -.
| | | ,' . ,' | | ,' .
| ,-' | /
,'-' . ---.|_________
.--' -----. | _____________________ -. ----- | | ___| | | \ ,- \ | | ___| |===========================((|) | | | | | | _____________________/ - / |
--._ -----' | _________________,-' ----- | .-._ ,' __.---' | /
| -. | \ / . | | . ,' | | | . ,'
_____,------------------. -._ _,-' <___________________________) ------'
| | |
`.___|

=================================

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Topic		Replies	Views
ScrutMyDocs : clearing previous install Elasticsearch	11	422	July 6, 2017
Meet a problem while indexing local json files by fsriver Elasticsearch	8	390	July 6, 2017
Error in elastic search cluster Elasticsearch	9	398	July 6, 2017
Fsriver will not index Elasticsearch	10	616	July 6, 2017
How to create index for a attachment of pdf by using elasticsearch-river-couchdb(1.2.0) (don't have any hits) Elasticsearch	7	580	July 6, 2017