Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

indri::parse::RawTextParser Class Reference

#include <RawTextParser.hpp>

List of all members.

Public Member Functions

 RawTextParser (int memorySize=1024 *1024)
 ~RawTextParser ()
bool open (const std::string &fileName)
void close ()
bool parseDocument (std::string &docName, indri::utility::greedy_vector< char * > &words)

Private Attributes

std::ifstream _in
char * _buffer
char * _current
int _bufferSize

Constructor & Destructor Documentation

indri::parse::RawTextParser::RawTextParser int  memorySize = 1024*1024  )  [inline]

indri::parse::RawTextParser::~RawTextParser  )  [inline]

Member Function Documentation

void indri::parse::RawTextParser::close  )  [inline]

bool indri::parse::RawTextParser::open const std::string &  fileName  )  [inline]

bool indri::parse::RawTextParser::parseDocument std::string &  docName,
indri::utility::greedy_vector< char * > &  words

parses the next document in the filestream, returning a list of char* to words the words are stored in a character buffer within the parser, which means that if you call parseDocument again, all your old word pointers are invalid.

Member Data Documentation

char* indri::parse::RawTextParser::_buffer [private]

int indri::parse::RawTextParser::_bufferSize [private]

char* indri::parse::RawTextParser::_current [private]

std::ifstream indri::parse::RawTextParser::_in [private]

The documentation for this class was generated from the following file:
Generated on Tue Jun 15 11:03:03 2010 for Lemur by doxygen 1.3.4