#include <SimpleKLDocModel.hpp>
Inheritance diagram for lemur::retrieval::SimpleKLDocModel:
Public Member Functions | |
SimpleKLDocModel (lemur::api::DOCID_T docID, const lemur::langmod::UnigramLM &collectLM, int dl=1, const double *prMass=NULL, SimpleKLParameter::SmoothStrategy strat=SimpleKLParameter::INTERPOLATE) | |
~SimpleKLDocModel () | |
virtual double | termWeight (lemur::api::TERMID_T termID, const lemur::api::DocInfo *info) const |
term weighting function, weight(w) = p_seen(w)/p_unseen(w) | |
virtual double | scoreConstant () const |
doc-specific constant term in the scoring formula | |
virtual double | unseenCoeff () const =0 |
a(d) | |
virtual double | seenProb (double termFreq, lemur::api::TERMID_T termID) const =0 |
p(w|d), w seen | |
Protected Attributes | |
const lemur::langmod::UnigramLM & | refLM |
const double * | docPrMass |
SimpleKLParameter::SmoothStrategy | strategy |
abstract interface of doc representation for smoothed document unigram model
adapt a smoothed document language model interface to a DocumentRep interface
p(w|d) = q(w|d) if w seen = a(d) * Pc(w) if w unseen where, a(d) controls the probability mass allocated to all unseen words and Pc(w) is the collection language model
Constructor & Destructor Documentation
|
|
|
|
|
doc-specific constant term in the scoring formula
Implements lemur::api::DocumentRep. |
|
p(w|d), w seen
Implemented in lemur::retrieval::JelinekMercerDocModel, lemur::retrieval::DirichletPriorDocModel, lemur::retrieval::AbsoluteDiscountDocModel, and lemur::retrieval::TwoStageDocModel. |
|
term weighting function, weight(w) = p_seen(w)/p_unseen(w)
|
|
a(d)
Implemented in lemur::retrieval::JelinekMercerDocModel, lemur::retrieval::DirichletPriorDocModel, lemur::retrieval::AbsoluteDiscountDocModel, and lemur::retrieval::TwoStageDocModel. |
|
|
|
|
|
|