You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 20 Next »

Baselines

The following baselines were run using the combined, short topics on the biocaddie_all index using the baselines/ scripts to sweep parameter combinations and the mkeval.sh script to perform LOOCV for each desired metric.

ModelMAPNDCGP@20NDCG@20P@100NDCG@100Notes
TFIDF0.25240.4590.46430.41440.36380.3973Sweep b and k1
Okapi0.25480.4660.5+0.4414+0.34190.3981Sweep b, k1, k3
QL (JM)0.25730.51590.5524+0.4886+0.37520.4249Sweep lambda
QL (Dir)0.2837+0.5306+0.5667+0.5054+0.39810.4541+

Sweep mu

QL (TS)0.27940.5391+0.5310.47020.3910.4455Sweep mu and lambda

+ indicates significant improvement over TFIDF baselines (p < 0.05)

Feedback/SDM models

ModelMAPNDCGP@20NDCG@20P@100NDCG@100Notes
QL (Dir)0.28370.53060.56670.50540.39810.4541

 

RM30.28460.53810.57380.50540.41570.4710Sweep mu, fbDocs, fbTerms, and lambda
RM3 (stopped)0.30230.54710.54760.50640.38860.463Sweep mu, fbDocs, fbTerms, and lambda
Pubmed0.3076+0.5853+0.53810.48550.42480.4501Use mu=2500 for pubmed query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval.
Wikipedia0.29470.5956+0.57380.49560.40620.4487Use mu=2500 for wikipedia query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval.
SDM0.28740.55580.48330.47060.40190.4559Sweep mu, w1, w2, w3
FDM0.2930.5594+0.55000.49600.39330.4560Sweep mu, w1, w2, w3
OKAPI Exp0.30110.51060.50710.45520.38710.4375OKAPI expansion assuming k1=1 and k3=1, for now

+ indicates significant improvement over QL (Dir) baselines (p < 0.05)


ModelMAPNDCGP@20NDCG@20P@100NDCG@100Notes
Dir (Krovetz)0.2944+0.5443+0.5905+0.51500.40330.4601+ means significant improvement of Dir (unstemmed)
RM3 (krovetz)






Pubmed (Krovetz)0.3285+0.6008+0.56670.48210.4333+0.4627+ means improvement of Dir (krovetz)





  • No labels