Baselines

The following baselines were run using the test collection and short topics against the biocaddie_all index. The baselines/ scripts sweep parameter combinations, and the mkeval.sh script performs leave-one-out cross-validation (LOOCV) for each desired metric.
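As a rough illustration of the LOOCV step, the sketch below holds out each topic in turn, picks the parameter setting with the best mean score on the remaining topics, and records the held-out topic's score under that setting. It is a minimal sketch, not the contents of mkeval.sh: the function name, the data layout, and the example scores are all hypothetical.

```python
from statistics import mean


def loocv(scores):
    """Leave-one-topic-out cross-validation over a parameter sweep.

    scores maps a parameter setting to {topic_id: metric_value}.
    (Hypothetical layout; the real sweep output may be organized differently.)
    Returns (mean held-out score, {topic_id: held-out score}).
    """
    settings = sorted(scores)
    topics = sorted(next(iter(scores.values())))
    held_out = {}
    for topic in topics:
        train = [t for t in topics if t != topic]
        # Choose the setting that does best on every topic except the held-out one.
        best = max(settings, key=lambda s: mean(scores[s][t] for t in train))
        held_out[topic] = scores[best][topic]
    return mean(held_out.values()), held_out


# Made-up per-topic values for two made-up settings.
example_scores = {
    "mu=500": {"T1": 0.40, "T2": 0.10, "T3": 0.30},
    "mu=2500": {"T1": 0.20, "T2": 0.50, "T3": 0.35},
}
cv_mean, per_topic = loocv(example_scores)
```

Because the setting is chosen without looking at the held-out topic, the cross-validated mean is usually lower than the best mean obtained by tuning on all topics at once.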

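| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes | Date |
|---|---|---|---|---|---|---|---|---|
| TFIDF | 0.3349 | 0.5525 | 0.6433 | 0.5552 | 0.5007 | 0.4899 | Sweep b and k1 | 6/22/17 |
| Okapi | 0.342 | 0.5845 | 0.68 (p-value=0.0587) | 0.5597 (p-value=0.0553) | 0.5067 | 0.5143 | Sweep b, k1, k3 | 6/22/17 |
| QL (JM) | 0.2988 | 0.5242 | 0.6667 | 0.5829 | 0.4747 | 0.4974 | Sweep lambda | 6/22/17 |
| QL (Dir) | 0.3284 | 0.544 | 0.6833 | 0.5923 | 0.5013 | 0.5099 | Sweep mu | 6/22/17 |
| QL (TS) | 0.3261 | 0.545 | 0.6667 | 0.5549 | 0.4853 | 0.4906 | Sweep mu and lambda | 6/22/17 |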

+ indicates significant improvement over TFIDF baselines (p < 0.05)

The Okapi results are taken from a manual run rather than extracted from the combined eval files, because the extracted data did not match.

Feedback/SDM models

| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes | Date |
|---|---|---|---|---|---|---|---|---|
| TFIDF | 0.3349 | 0.5525 | 0.6433 | 0.5552 | 0.5007 | 0.4899 | Sweep b and k1 | 6/22/17 |
| RM3 | 0.3379 | 0.5717 | 0.6833 | 0.6031+ | 0.5113 | 0.5249+ | Sweep mu, fbDocs, fbTerms, and lambda | 6/22/17 |
| RM3 (stopped) | 0.3417 | 0.5827+ | 0.68 | 0.5807 | 0.506 | 0.5134 | Sweep mu, fbDocs, fbTerms, and lambda | 6/22/17 |
| Pubmed | 0.3534 | 0.6009+ | 0.6567 | 0.5732 | 0.5207 | 0.5074 | Use mu=2500 for pubmed query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. | 6/22/17 |
| Wikipedia | 0.3553 | 0.6029+ | 0.6767 | 0.5503 | 0.4893 | 0.4981 | Use mu=2500 for wikipedia query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. | 6/22/17 |
| SDM | 0.3372 | 0.5715 | 0.6067 | 0.5526 | 0.512 | 0.5067 | Sweep mu, w1, w2, w3 | 6/22/17 |
| FDM | 0.3418 | 0.5818 | 0.6767 | 0.5707 | 0.5047 | 0.5063 | Sweep mu, w1, w2, w3 | 6/22/17 |
| OKAPI Exp | 0.3725 | 0.6023 | 0.6633 | 0.6007 (p-value=0.0525) | 0.5413 | 0.5569+ | OKAPI expansion | 6/22/17 |


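| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes | Date |
|---|---|---|---|---|---|---|---|---|
| Dir (Krovetz) | 0.3663+ | 0.6022+ | 0.7033+ | 0.6079 (p-value=0.0587) | 0.5173 | 0.5225+ | | 6/22/17 |
| RM3 (Krovetz) | 0.3604 | 0.5809+ | 0.7167+ | 0.5788 | 0.4967 | 0.5368+ | | 6/22/17 |
| Pubmed (Krovetz) | 0.3841 | 0.6225+ | 0.7067+ | 0.5831 | 0.5353 | 0.5162 | | 6/22/17 |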

+ indicates significant improvement over TFIDF baselines (p < 0.05)
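This page does not state which significance test produced the + markers and p-values. As one possible reading, the sketch below runs a one-sided paired randomization (sign-flip) test on per-topic scores, a common choice for comparing retrieval runs. It is an assumption for illustration, not the method used here; the function name and the example scores are hypothetical.

```python
from itertools import product


def paired_sign_flip_pvalue(baseline, system):
    """One-sided exact sign-flip test on paired per-topic scores.

    Returns the fraction of sign assignments whose summed difference is at
    least as large as the observed one. Enumerates all 2**n assignments, so
    it is only practical for a small number of topics.
    """
    diffs = [s - b for b, s in zip(baseline, system)]
    observed = sum(diffs)
    at_least_as_large = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        flipped = sum(sign * d for sign, d in zip(signs, diffs))
        # Small tolerance guards against floating-point rounding on ties.
        if flipped >= observed - 1e-12:
            at_least_as_large += 1
    return at_least_as_large / total


# Made-up per-topic scores for a baseline and a candidate run.
baseline_scores = [0.30, 0.40, 0.50, 0.20]
system_scores = [0.35, 0.45, 0.52, 0.30]
p_value = paired_sign_flip_pvalue(baseline_scores, system_scores)
```

With only four topics the smallest achievable p-value is 1/16, so a result below 0.05 needs more topics; that is one reason differences that look large in the tables can still miss the threshold.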

The results in the tables above can be verified against the attached file:

baselines comparison results.txt
