Main Page | Namespace List | Class Hierarchy | Class List | File List | Namespace Members | Class Members | File Members | Related Pages

Distributed Retrieval

This is a sample application that does distributed retrieval, using a resource selection index and individual indexes. resource selection is done using the CORI_CS (the only resource selection implemented thus far). results merging uses either CORIMergeMethod, SingleRegrMergeMethod, or MultiRegrMergeMethod. (If using CORIMerge, should use INQUERY for the retrieval method of each individual database.)

Parameters should be set as follows: index = the collection selection database collCounts = collection counts file for the csindex (needed by CORI) ranksFile = file to write rankings results to (optional) resultFile = file to write final results resultCount = maximum number of results to output for each query (default to 1000) textQuery = file of text queries in docstream format cutoff = maximum number of databases to search (default to 10) "dbids" = "db's param file" - required for each database in the collection selection index. key should be the db's string id as it was named/indexed in the collection selection index, and value should be name of file that has parameters for that database: index = the individual database retModel = the retrieval model to use mergeMethod = the merging method to use "modelvals" - whatever parameters are required for that retModel CSTF_factor The TFfactor parameter in the CORI_CS resource selection method.

CSTF_baseline The TFbaseline parameter in the CORI_CS resource selection method.

mergeMethod resource merging method (0 for CORI results merging method, 1 for single regression results merging method, 2 for multiple regression merge)

Merginge Method-specific parameters:

For CORI merging Method: None

For Single and Multiple regression merging methods:

1. csDbDataBaseIndex the centralized sampling database index

2. DOCTF_factor The TFfactor parameter in the INQUERY retrieval method for the centralized sampling database.

3. DOCTF_baseline The TFbaseline parameter in the INQUERY retrieval method for the centralized sampling database.


Generated on Tue Jun 15 11:02:58 2010 for Lemur by doxygen 1.3.4