Galago
Features
Galago includes TupleFlow, which is a distributed computation
framework like MapReduce or Dryad. TupleFlow manages the difficult
parts of processing text: serializing data, sorting it, and
distributing processing. The IndexReader and IndexWriter classes
manage the storage of key/value pairs such as inverted lists, so you
can build your own kinds of index structures without starting from
scratch.
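To make the "sort, then store key/value pairs" idea concrete, here is a minimal single-machine sketch in Java. It does not use TupleFlow, IndexReader, or IndexWriter; every class and method name below is hypothetical and chosen only for illustration. It emits (term, document id) tuples, sorts them, and groups them into in-memory inverted lists keyed by term.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names come from Galago or TupleFlow.
class InvertedListSketch {

    // A (term, document id) tuple, the unit that gets sorted and grouped.
    static final class Posting {
        final String term;
        final int docId;

        Posting(String term, int docId) {
            this.term = term;
            this.docId = docId;
        }
    }

    // Emit one tuple per lowercase, whitespace-separated token.
    static List<Posting> tokenize(int docId, String text) {
        List<Posting> out = new ArrayList<>();
        for (String token : text.toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                out.add(new Posting(token, docId));
            }
        }
        return out;
    }

    // Sort tuples by term, then document id, and group adjacent tuples
    // into one list of document ids per term (key = term, value = list).
    static Map<String, List<Integer>> buildIndex(List<Posting> postings) {
        List<Posting> sorted = new ArrayList<>(postings);
        sorted.sort(Comparator.comparing((Posting p) -> p.term)
                              .thenComparingInt(p -> p.docId));

        Map<String, List<Integer>> index = new LinkedHashMap<>();
        for (Posting p : sorted) {
            List<Integer> docs = index.computeIfAbsent(p.term, k -> new ArrayList<>());
            // Sorting places duplicates next to each other, so one check suffices.
            if (docs.isEmpty() || docs.get(docs.size() - 1) != p.docId) {
                docs.add(p.docId);
            }
        }
        return index;
    }

    public static void main(String[] args) {
        List<Posting> postings = new ArrayList<>();
        postings.addAll(tokenize(1, "the quick brown fox"));
        postings.addAll(tokenize(2, "the lazy dog"));
        System.out.println(buildIndex(postings));
    }
}
```

A framework of the kind described would presumably spread the sort and grouping steps across machines and write the lists to disk; this sketch keeps everything in memory to show only the data flow.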
The retrieval system supports a variant of the Indri query
language, redesigned to be more flexible. You can add your own
query operators without recompiling the core libraries; just put
your new operator in the classpath and reference it in a query.
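The general mechanism that makes this kind of extension possible in Java is loading a class by name at run time. The sketch below shows that mechanism only: the `ScoreOperator` interface, the `MaxOperator` class, and the `load` helper are all hypothetical, and Galago's real operator interface and query syntax are not shown in this text.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical loader: resolves an operator class by name at run time,
// so the code doing the loading never needs to be recompiled.
class OperatorLoaderSketch {

    static ScoreOperator load(String className) throws ReflectiveOperationException {
        Class<?> cls = Class.forName(className);
        // asSubclass fails with ClassCastException if the class is not an operator.
        return cls.asSubclass(ScoreOperator.class)
                  .getDeclaredConstructor()
                  .newInstance();
    }

    public static void main(String[] args) throws Exception {
        // In a real system the class name would come from the query or a config file.
        ScoreOperator op = load(MaxOperator.class.getName());
        System.out.println(op.combine(Arrays.asList(0.2, 0.9, 0.5)));
    }
}

// Hypothetical operator contract; NOT Galago's actual API.
interface ScoreOperator {
    double combine(List<Double> childScores);
}

// Example user-defined operator: returns the best child score.
class MaxOperator implements ScoreOperator {
    @Override
    public double combine(List<Double> childScores) {
        double best = Double.NEGATIVE_INFINITY;
        for (double score : childScores) {
            best = Math.max(best, score);
        }
        return best;
    }
}
```

Because the loader depends only on the interface, a new operator compiled separately and placed on the classpath can be picked up by name without touching the loading code.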
Download
Galago can be obtained from the SourceForge Lemur Project Page.
Release History
The first binary version (3.14159) of Galago was released in December 2011. Release notes
for the current release can be found on SourceForge.