#include <WebParser.hpp>
Inheritance diagram for lemur::parse::WebParser:

Public Member Functions | |
| WebParser () | |
| void | parseFile (const string &filename) |
| Parse a file. | |
| void | parseBuffer (char *buf, int len) |
| Parse a buffer. | |
| long | fileTell () const |
| return the current byte position of the file being parsed | |
Static Public Attributes | |
| const string | identifier = "web" |
Private Member Functions | |
| void | doParse () |
| Actual parsing action flow. | |
Private Attributes | |
| int | state |
| The state of the parser. | |
| Property | begelem |
| keep a property for being and end of elements | |
| Property | endelem |
| LinkedPropertyList | proplist |
| list | |
U.S.A., USA's, and USAs are converted to USA. Does not recognize acronyms with numbers.
The DOCHDR is ignored.
Text in <script> tags is ignored. Text in HTML <!-- --> comments is ignored.
|
|
|
|
|
Actual parsing action flow.
|
|
|
return the current byte position of the file being parsed
Implements lemur::api::Parser. |
|
||||||||||||
|
Parse a buffer.
Implements lemur::api::Parser. |
|
|
Parse a file.
Implements lemur::api::Parser. |
|
|
keep a property for being and end of elements
|
|
|
|
|
|
Reimplemented from lemur::api::Parser. |
|
|
list
|
|
|
The state of the parser.
|
1.3.4