Indexing academic papers in tribute to Aaron Swartz

Hey,

Academics have begun sharing their papers on Twitter for free.

How about making them searchable?

Recipe:

  • modified Twitter River / Attachment Mapper: detect #pdftribute PDF
    URLs, crawl them, extract text and metadata
  • index text and metadata into Elasticsearch
  • nice, simple web UI
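
To make the first two steps of the recipe concrete, here is a minimal Python sketch; it is not the modified river itself. It only shows one way to pick PDF links out of #pdftribute tweets and to serialize the resulting documents in the newline-delimited format that the Elasticsearch bulk API accepts. The function names, the `pdftribute` index name, and the `url`/`text` document fields are hypothetical. It also assumes links end in `.pdf`; shortened links would first have to be resolved, and a real crawler would probably check the Content-Type as well.

```python
import json
import re
from urllib.parse import urlparse

HASHTAG = "#pdftribute"
URL_RE = re.compile(r"https?://\S+")


def pdf_urls(tweet_text):
    """Return PDF-looking URLs from a tweet that carries the hashtag."""
    if HASHTAG not in tweet_text.lower():
        return []
    urls = []
    for raw in URL_RE.findall(tweet_text):
        # Drop punctuation that often trails a URL in running text.
        url = raw.rstrip(".,;:!?)\"'")
        # Assumption: a path ending in ".pdf" marks a PDF document.
        if urlparse(url).path.lower().endswith(".pdf"):
            urls.append(url)
    return urls


def bulk_body(docs, index="pdftribute"):
    """Serialize documents as an Elasticsearch bulk request body (NDJSON)."""
    lines = []
    for doc in docs:
        # One action line, then one source line per document.
        lines.append(json.dumps({"index": {"_index": index, "_id": doc["url"]}}))
        lines.append(json.dumps(doc))
    # The bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    tweet = "My paper, free for all: http://example.org/papers/graphs.pdf #pdftribute"
    # Hypothetical document shape; text extraction is left out of this sketch.
    docs = [{"url": u, "text": ""} for u in pdf_urls(tweet)]
    print(bulk_body(docs), end="")
```

The resulting body could then be sent to the `_bulk` endpoint of an Elasticsearch node. Using the URL as the document ID means a paper tweeted twice is indexed only once.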

Regards,

Jörg
