Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

lemur::distrib::DocFreqIndexer Class Reference

#include <DocFreqIndexer.hpp>

Inheritance diagram for lemur::distrib::DocFreqIndexer:

lemur::api::TextHandler List of all members.

Public Member Functions

 DocFreqIndexer (const string &csName, const string &cwName, const string &ssName, int bufferSize, bool countStopWords=false)
 ~DocFreqIndexer ()
char * handleDoc (char *docno)
 Handle a doc.

char * handleWord (char *word)
 Handle a word, possibly transforming it.

void handleEndDoc ()
 Handle the end of the doc.

void newDb (const string &name)

Private Attributes

int cw
int dfCount
bool first
lemur::index::PushIndexcollsel
lemur::parse::DocumentPropscsdp
lemur::index::InvFPTermterm
lemur::utility::WordSet docWords
FILE * collWords
FILE * serverSizes
int numDocs
bool countStopWds

Constructor & Destructor Documentation

lemur::distrib::DocFreqIndexer::DocFreqIndexer const string &  csName,
const string &  cwName,
const string &  ssName,
int  bufferSize,
bool  countStopWords = false
 

lemur::distrib::DocFreqIndexer::~DocFreqIndexer  ) 
 


Member Function Documentation

char * lemur::distrib::DocFreqIndexer::handleDoc char *  docno  )  [virtual]
 

Handle a doc.

Reimplemented from lemur::api::TextHandler.

void lemur::distrib::DocFreqIndexer::handleEndDoc  )  [virtual]
 

Handle the end of the doc.

Reimplemented from lemur::api::TextHandler.

char * lemur::distrib::DocFreqIndexer::handleWord char *  word  )  [virtual]
 

Handle a word, possibly transforming it.

Reimplemented from lemur::api::TextHandler.

void lemur::distrib::DocFreqIndexer::newDb const string &  name  ) 
 


Member Data Documentation

lemur::index::PushIndex* lemur::distrib::DocFreqIndexer::collsel [private]
 

FILE* lemur::distrib::DocFreqIndexer::collWords [private]
 

bool lemur::distrib::DocFreqIndexer::countStopWds [private]
 

lemur::parse::DocumentProps* lemur::distrib::DocFreqIndexer::csdp [private]
 

int lemur::distrib::DocFreqIndexer::cw [private]
 

int lemur::distrib::DocFreqIndexer::dfCount [private]
 

lemur::utility::WordSet lemur::distrib::DocFreqIndexer::docWords [private]
 

bool lemur::distrib::DocFreqIndexer::first [private]
 

int lemur::distrib::DocFreqIndexer::numDocs [private]
 

FILE* lemur::distrib::DocFreqIndexer::serverSizes [private]
 

lemur::index::InvFPTerm* lemur::distrib::DocFreqIndexer::term [private]
 


The documentation for this class was generated from the following files:
Generated on Tue Jun 15 11:03:05 2010 for Lemur by doxygen 1.3.4