#include <TrecParser.hpp>
Inheritance diagram for lemur::parse::TrecParser:
Public Member Functions | |
TrecParser () | |
void | parseFile (const string &filename) |
Parse a file. | |
void | parseBuffer (char *buf, int len) |
Parse a buffer of len length. | |
long | fileTell () const |
Static Public Attributes | |
const string | identifier = "trec" |
Private Member Functions | |
void | doParse () |
Actual parsing action flow. | |
Private Attributes | |
int | state |
The state of the parser. | |
Property | begelem |
keep a property for being and end of elements | |
Property | endelem |
LinkedPropertyList | proplist |
list |
U.S.A., USA's, and USAs are converted to USA. Does not recognize acronyms with numbers.
The following fields are parsed: TEXT, HL, HEAD, HEADLINE, LP, TTL
|
|
|
Actual parsing action flow.
|
|
Gives current byte position offset into file being parsed. Don't use with parseBuffer Implements lemur::api::Parser. |
|
Parse a buffer of len length.
Implements lemur::api::Parser. |
|
Parse a file.
Implements lemur::api::Parser. |
|
keep a property for being and end of elements
|
|
|
|
Reimplemented from lemur::api::Parser. |
|
list
|
|
The state of the parser.
|