#include <IndriTextHandler.hpp>
Inheritance diagram for lemur::parse::IndriTextHandler:
Public Member Functions | |
IndriTextHandler (const string &name, int memory, const lemur::api::Parser *p) | |
~IndriTextHandler () | |
char * | handleDoc (char *docno) |
Signal the start of a document. | |
void | handleEndDoc () |
Handle end of document. | |
char * | handleWord (char *word, const char *original, PropertyList *list) |
Handles a word to be indexed. | |
char * | handleBeginTag (char *tag, const char *orig, PropertyList *props) |
Handle a begin tag. | |
char * | handleEndTag (char *tag, const char *orig, PropertyList *props) |
Handle an end tag. | |
Protected Attributes | |
indri::api::IndexEnvironment | env |
The indexer. | |
indri::api::ParsedDocument | document |
Document properties object to be reused over repeated calls. | |
indri::parse::MetadataPair | docid |
Handle the document id. | |
char * | curdocno |
the current document id | |
int | docbegin |
the beginning byte offset into file of this documents | |
const lemur::api::Parser * | parser |
We need a handle to the parser to get file byte locations. |
|
Create an IndriIndex (repository) with the given name Pass in the parser being used at the beginning of the chain |
|
|
|
Handle a begin tag.
Reimplemented from lemur::api::TextHandler. |
|
Signal the start of a document.
Reimplemented from lemur::api::TextHandler. |
|
Handle end of document.
Reimplemented from lemur::api::TextHandler. |
|
Handle an end tag.
Reimplemented from lemur::api::TextHandler. |
|
Handles a word to be indexed.
Reimplemented from lemur::api::TextHandler. |
|
the current document id
|
|
the beginning byte offset into file of this documents
|
|
Handle the document id.
|
|
Document properties object to be reused over repeated calls.
|
|
The indexer.
|
|
We need a handle to the parser to get file byte locations.
|