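-memory=100M on the command line.
path -- Specified as <corpus><path>/path/to/file_or_directory</path></corpus> in the parameter file and as -corpus.path=/path/to/file_or_directory on the command line.
class -- Specified as <corpus><class>trecweb</class></corpus> in the parameter file and as -corpus.class=trecweb on the command line. trecweb is one of the known classes.
annotations -- The pathname of an annotations file for the documents specified in path. Specified as <corpus><annotations>/path/to/file</annotations></corpus> in the parameter file and as -corpus.annotations=/path/to/file on the command line.
metadata -- The pathname of a metadata file for the documents specified in path. Specified as <corpus><metadata>/path/to/file</metadata></corpus> in the parameter file and as -corpus.metadata=/path/to/file on the command line.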
Combining the first two of these elements, the parameter file would contain:
<corpus>
<path>/path/to/file_or_directory</path>
<class>trecweb</class>
</corpus>
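As a sketch of how the remaining elements fit in, the same <corpus> element could also carry the annotations and metadata entries. This assumes all four entries may appear together in one <corpus> element; the paths are the placeholders used above, not real files. Each comment gives the equivalent command-line form.

```xml
<corpus>
  <!-- command line: -corpus.path=/path/to/file_or_directory -->
  <path>/path/to/file_or_directory</path>
  <!-- command line: -corpus.class=trecweb -->
  <class>trecweb</class>
  <!-- command line: -corpus.annotations=/path/to/file -->
  <annotations>/path/to/file</annotations>
  <!-- command line: -corpus.metadata=/path/to/file -->
  <metadata>/path/to/file</metadata>
</corpus>
```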
field -- Make the named field available for retrieval as metadata. Specified as <metadata><field>fieldname</field></metadata> in the parameter file and as -metadata.field=fieldname on the command line.
forward -- Make the named field available for retrieval as metadata and build a lookup table to make retrieving the value more efficient. Specified as <metadata><forward>fieldname</forward></metadata> in the parameter file and as -metadata.forward=fieldname on the command line.
backward -- Make the named field available for retrieval as metadata and build a lookup table for inverse lookup of documents based on the value of the field. Specified as <metadata><backward>fieldname</backward></metadata> in the parameter file and as -metadata.backward=fieldname on the command line.
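A minimal sketch of a <metadata> element that uses all three options. The field names title, docno, and url are hypothetical, and it is an assumption here that the three entry types can be combined in a single <metadata> element.

```xml
<metadata>
  <!-- title (hypothetical name): retrievable as metadata, no lookup table -->
  <field>title</field>
  <!-- docno (hypothetical name): lookup table for faster value retrieval -->
  <forward>docno</forward>
  <!-- url (hypothetical name): lookup table for finding documents by value -->
  <backward>url</backward>
</metadata>
```

Under this reading, forward suits fields whose values are fetched often for a known document, while backward suits fields used to find a document from a known value.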
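name -- The field name, specified as <field><name>fieldname</name></field> in the parameter file and as -field.name=fieldname on the command line.
numeric -- The symbol true if the field contains numeric data, otherwise the symbol false, specified as <field><numeric>true</numeric></field> in the parameter file and as -field.numeric=true on the command line. This is an optional parameter, defaulting to false. Note that 0 can be used for false and 1 can be used for true.
stemmer -- Specified as <stemmer><name>stemmername</name></stemmer> in the parameter file and as -stemmer.name=stemmername on the command line. This is an optional parameter with the default of no stemming.
normalize -- true to perform case normalization when indexing, false to index with mixed case. Default true.
stopper -- Specified as <stopper><word>stopword</word></stopper> in the parameter file and as -stopper.word=stopword on the command line. This is an optional parameter with the default of no stopping.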
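Put together, a parameter file fragment for these last parameters might look like the sketch below. The nesting mirrors the dotted command-line names, in the same way that <corpus><path> mirrors -corpus.path. The values fieldname, stemmername, and stopword are the placeholders from the text, not real settings.

```xml
<!-- command line: -field.name=fieldname -field.numeric=false -->
<field>
  <name>fieldname</name>
  <!-- optional; false is the default, and 0 may be written instead -->
  <numeric>false</numeric>
</field>
<!-- command line: -stemmer.name=stemmername (omit for no stemming) -->
<stemmer>
  <name>stemmername</name>
</stemmer>
<!-- command line: -stopper.word=stopword (omit for no stopping) -->
<stopper>
  <word>stopword</word>
</stopper>
```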