
indri::parse::RawTextParser Class Reference

#include <RawTextParser.hpp>


Public Member Functions

 RawTextParser (int memorySize=1024 *1024)
 ~RawTextParser ()
bool open (const std::string &fileName)
void close ()
bool parseDocument (std::string &docName, indri::utility::greedy_vector< char * > &words)

Private Attributes

std::ifstream _in
char * _buffer
char * _current
int _bufferSize


Constructor & Destructor Documentation

indri::parse::RawTextParser::RawTextParser (int memorySize = 1024*1024) [inline]
 

indri::parse::RawTextParser::~RawTextParser () [inline]
 


Member Function Documentation

void indri::parse::RawTextParser::close () [inline]
 

bool indri::parse::RawTextParser::open (const std::string & fileName) [inline]
 

bool indri::parse::RawTextParser::parseDocument (std::string & docName, indri::utility::greedy_vector< char * > & words) [inline]
 

Parses the next document in the file stream and fills words with char* pointers to that document's words. The words are stored in a character buffer inside the parser, so calling parseDocument again invalidates all previously returned word pointers.
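The pointer lifetime rule above is easy to trip over, so here is a minimal, self-contained sketch of one way a caller might cope with it: copy each word into an owning std::string before asking for the next document. SketchParser is a hypothetical stand-in written for this example, not the Indri class. Its input format (one document per line, name first), its use of std::vector instead of indri::utility::greedy_vector, and the helper names OwnedDocument and readAll are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in that follows the same ownership rule as
// RawTextParser: words live in an internal buffer that is reused on
// every call. This is NOT the Indri implementation.
class SketchParser {
public:
  explicit SketchParser(std::istream& in, std::size_t bufferSize = 1024)
      : _in(in), _buffer(bufferSize) {
    assert(bufferSize > 0);
  }

  // Assumed format (for this sketch only): one document per line, the
  // first token is the document name, the remaining tokens are words.
  // Returns false at end of input or on a line with no tokens.
  bool parseDocument(std::string& docName, std::vector<char*>& words) {
    words.clear();
    std::string line;
    if (!std::getline(_in, line)) return false;

    std::istringstream tokens(line);
    if (!(tokens >> docName)) return false;

    std::size_t used = 0;  // writing restarts at the front of the buffer
    std::string word;
    while (tokens >> word) {
      if (used + word.size() + 1 > _buffer.size()) break;  // buffer full
      char* start = _buffer.data() + used;
      word.copy(start, word.size());
      start[word.size()] = '\0';
      words.push_back(start);  // pointer into the parser's own buffer
      used += word.size() + 1;
    }
    return true;
  }

private:
  std::istream& _in;
  std::vector<char> _buffer;
};

// A document whose words are owned copies, safe to keep indefinitely.
struct OwnedDocument {
  std::string name;
  std::vector<std::string> words;
};

// Reads every document, copying the words out before the next
// parseDocument call overwrites the parser's buffer.
std::vector<OwnedDocument> readAll(std::istream& in) {
  SketchParser parser(in);
  std::vector<OwnedDocument> docs;
  std::string name;
  std::vector<char*> words;
  while (parser.parseDocument(name, words)) {
    OwnedDocument doc;
    doc.name = name;
    for (char* w : words) {
      doc.words.push_back(std::string(w));  // deep copy of each word
    }
    docs.push_back(doc);
  }
  return docs;
}
```

With the real class, the same pattern would presumably sit between open() and close(). Copying costs an allocation per word, so code that only needs the words until the next call can use the pointers directly and skip the copy.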


Member Data Documentation

char* indri::parse::RawTextParser::_buffer [private]
 

int indri::parse::RawTextParser::_bufferSize [private]
 

char* indri::parse::RawTextParser::_current [private]
 

std::ifstream indri::parse::RawTextParser::_in [private]
 


The documentation for this class was generated from the following file:
Generated on Tue Jun 15 11:03:03 2010 for Lemur by doxygen 1.3.4