[ANN] New IMAP and POP3 email river for Elasticsearch - elasticsearch-river-imap

Hi,
I'd like to announce an IMAP and POP3 river for Elasticsearch.

It can be found here:
https://github.com/salyh/elasticsearch-river-imap

This river connects to IMAP4 or POP3 servers, polls your mail, and indexes it.
The emails are never modified or removed from the server.
After the initial full load, the river tracks which mails are new or
deleted and then updates the index only for those mails.
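
To illustrate the incremental step, here is a minimal Python sketch of the idea, not the river's actual code. It assumes each mail has a stable ID and that the IDs indexed on the previous run are kept somewhere. The function name `diff_mailbox` and the plain integer IDs are made up for this example.

```python
def diff_mailbox(indexed_ids, server_ids):
    """Return (new, deleted) message IDs relative to the last indexing run.

    indexed_ids: IDs already present in the index (hypothetical bookkeeping).
    server_ids:  IDs currently reported by the mail server.
    """
    indexed, current = set(indexed_ids), set(server_ids)
    new = sorted(current - indexed)        # on the server, not yet indexed
    deleted = sorted(indexed - current)    # indexed, but gone from the server
    return new, deleted


new, deleted = diff_mailbox([1, 2, 3, 4], [2, 3, 4, 5, 6])
print(new, deleted)  # prints [5, 6] [1]
```

Only the mails in `new` would need to be fetched and indexed, and only those in `deleted` removed from the index.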

Features:

  • Incremental indexing of e-mails from an IMAP or POP3 server
  • Indexing of attachments (in conjunction with
    https://github.com/elasticsearch/elasticsearch-mapper-attachments/)
  • Support for UTF-7 encoded e-mails (through jutf7)
  • SSL, STARTTLS and SASL are supported (through the JavaMail API)
  • IMAP only: folders to be indexed can be specified with a regex pattern
  • IMAP only: subfolders can also be indexed (full traversal of all folders)
  • No special server capabilities needed
  • Bulk indexing
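
As a rough illustration of the folder regex feature, the following Python sketch filters a list of folder names with a pattern. The function name `select_folders`, the sample folder names, and the choice of full-match semantics are all assumptions for this example; the river itself may match differently.

```python
import re


def select_folders(folder_names, pattern):
    """Keep only the folder names that fully match the given regex."""
    compiled = re.compile(pattern)
    return [name for name in folder_names if compiled.fullmatch(name)]


folders = ["INBOX", "INBOX/Projects", "Sent", "Trash", "Archive/2013"]
print(select_folders(folders, r"INBOX(/.*)?|Archive/.*"))
# prints ['INBOX', 'INBOX/Projects', 'Archive/2013']
```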
    

The river currently acts as a disconnected client. This means that the
river polls: for every indexing run, new server connections are opened and,
after the work is done, closed. Using the IMAP IDLE feature in addition
(if the server supports it) is planned for a later time.
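
The disconnected-client pattern can be sketched in Python roughly as follows. This is an illustration using the standard `imaplib` module, not the river's implementation (which is built on the JavaMail API). The names `make_imap_connection`, `poll_once`, `poll_forever`, and the 60-second interval are assumptions for the example.

```python
import imaplib
import time


def make_imap_connection(host, user, password):
    """Open and authenticate a fresh IMAP-over-SSL connection."""
    conn = imaplib.IMAP4_SSL(host)
    conn.login(user, password)
    return conn


def poll_once(connect, index_run):
    """One indexing run: open a new connection, do the work, always close."""
    conn = connect()
    try:
        return index_run(conn)
    finally:
        conn.logout()  # closed even if the indexing run raised an error


def poll_forever(connect, index_run, interval_seconds=60):
    """Repeat indexing runs; no connection is held open between runs."""
    while True:
        poll_once(connect, index_run)
        time.sleep(interval_seconds)
```

Here `connect` would be something like `lambda: make_imap_connection(host, user, password)` and `index_run` a function that fetches and indexes the changed mails. With IMAP IDLE the connection would instead stay open and the server would notify the client of changes.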

Suggestions, corrections, improvements are very welcome!
Thanks and best regards
Hendrik

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/6aa11d4d-dce0-4172-b441-794b54c35394%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Hi!

Thanks for this.

I want to use Zarafa with Elasticsearch on ClearOS, in the context of Tiki
Suite.

Tiki Suite is a selection of Free / Libre / Open Source Software (FLOSS)
server, web, mobile and desktop apps with a concerted focus on greater
interoperability, security and adaptability, which is aimed at small &
medium-sized organizations. The Tiki Suite is especially suited to
decentralized and knowledge-centric organizations and offers most (80%+) of
the features all organizations need, such as: Email, Website & Blog,
Shopping Cart, Intranet & Project Management, E-learning, Social
Networking, Knowledge base, File sharing, Issue Tracker,
Video-conferencing, LDAP, VPN, Gateway, Network, etc.

The main server apps are: ClearOS (server), Tiki Wiki CMS Groupware, Zarafa
(Mail), Prosody (XMPP) and Syncthing (file sync).

Tiki is the FLOSS Web Application with the most built-in features
(http://tiki.org/FLOSS+Web+Application+with+the+most+built-in+features), and
its content is searchable via Elasticsearch. Everything is filtered by
permission, and it permits goodies like More Like This, Facets, and Percolator
(stored searches), with more on the way: http://doc.tiki.org/Elasticsearch

This is all well and good. However, I would like to add emails, XMPP logs,
and the files from the shared/sync folders, so that looking at any item
(e-mails, wiki pages, files, tasks, bugs, forum posts, blog posts, XMPP
chat logs, etc.) can show me related entries.

"Introduced with Tiki14, the federated search allows to integrate results
from other sources into the main search results."

I'd like to include emails (and contacts and events) in these search
results, with proper permissions, which is why
https://github.com/salyh/elasticsearch-river-imap is of great interest to
me.

I also know of:
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#emailrepository

Thoughts? Advice?

Thanks!

On Tuesday, April 22, 2014 11:36:11 AM UTC-4, Hendrik wrote:

Hi,
I'd like to announce an IMAP and POP3 river for Elasticsearch.

It can be found here:
https://github.com/salyh/elasticsearch-river-imap

This river connects to IMAP4 or POP3 servers, polls your mail, and indexes it.
The emails are never modified or removed from the server.
After the initial full load, the river tracks which mails are new or
deleted and then updates the index only for those mails.

Features:

  • Incremental indexing of e-mails from an IMAP or POP3 server
  • Indexing of attachments (in conjunction with
    https://github.com/elasticsearch/elasticsearch-mapper-attachments/)
  • Support for UTF-7 encoded e-mails (through jutf7)
  • SSL, STARTTLS and SASL are supported (through the JavaMail API)
  • IMAP only: folders to be indexed can be specified with a regex pattern
  • IMAP only: subfolders can also be indexed (full traversal of all folders)
  • No special server capabilities needed
  • Bulk indexing
    

The river currently acts as a disconnected client. This means that the
river polls: for every indexing run, new server connections are opened and,
after the work is done, closed. Using the IMAP IDLE feature in addition
(if the server supports it) is planned for a later time.

Suggestions, corrections, improvements are very welcome!
Thanks and best regards
Hendrik
