#include <DocFreqIndexer.hpp>
[Inheritance diagram for lemur::distrib::DocFreqIndexer]
Public Member Functions
    DocFreqIndexer (const string &csName, const string &cwName, const string &ssName, int bufferSize, bool countStopWords=false)
    ~DocFreqIndexer ()
    char * handleDoc (char *docno)
        Handle a doc.
    char * handleWord (char *word)
        Handle a word, possibly transforming it.
    void handleEndDoc ()
        Handle the end of the doc.
    void newDb (const string &name)
Private Attributes
    int cw
    int dfCount
    bool first
    lemur::index::PushIndex * collsel
    lemur::parse::DocumentProps * csdp
    lemur::index::InvFPTerm * term
    lemur::utility::WordSet docWords
    FILE * collWords
    FILE * serverSizes
    int numDocs
    bool countStopWds
Member Function Documentation
    char * handleDoc (char *docno)
        Handle a doc.
        Reimplemented from lemur::api::TextHandler.
    void handleEndDoc ()
        Handle the end of the doc.
        Reimplemented from lemur::api::TextHandler.
    char * handleWord (char *word)
        Handle a word, possibly transforming it.
        Reimplemented from lemur::api::TextHandler.
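
The three callbacks above are invoked in the order handleDoc, handleWord (once per token), handleEndDoc. The following is a minimal, self-contained sketch of that callback pattern and does not use the Lemur headers. The class `DfCounterSketch` and its accessors `docFreq` and `numDocs` are hypothetical names introduced for illustration. The behaviour shown (each distinct word counted once per document) is an assumption inferred from the class name and the `docWords` attribute; this page does not document the actual implementation.

```cpp
#include <map>
#include <set>
#include <string>

// Hypothetical stand-in, NOT lemur::distrib::DocFreqIndexer.
// It mirrors only the three callback names documented above.
class DfCounterSketch {
public:
  // Start a new document: forget the words seen in the previous one.
  void handleDoc(const std::string& docno) {
    currentDoc_ = docno;
    seenInDoc_.clear();
  }

  // Record a word. Only its first occurrence in the current document
  // increments the document frequency (assumed behaviour).
  void handleWord(const std::string& word) {
    if (seenInDoc_.insert(word).second) {
      ++df_[word];
    }
  }

  // Close the current document and count it.
  void handleEndDoc() { ++numDocs_; }

  // Number of documents in which `word` appeared at least once.
  int docFreq(const std::string& word) const {
    auto it = df_.find(word);
    return it == df_.end() ? 0 : it->second;
  }

  // Number of documents closed so far.
  int numDocs() const { return numDocs_; }

private:
  std::string currentDoc_;           // identifier of the open document
  std::set<std::string> seenInDoc_;  // distinct words in the open document
  std::map<std::string, int> df_;    // word -> document frequency
  int numDocs_ = 0;
};
```

A caller would drive the sketch once per document: call `handleDoc` with the document identifier, `handleWord` for every token, then `handleEndDoc`. Repeated tokens within one document leave `docFreq` unchanged after the first occurrence.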