
Baselines

The following baselines were run with the combined short topics against the biocaddie_all index. The scripts in baselines/ sweep the parameter combinations, and the mkeval.sh script performs leave-one-out cross-validation (LOOCV) for each desired metric.
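The LOOCV step can be sketched as follows. This is a minimal illustration, not the contents of mkeval.sh: it assumes that a sweep produces one per-topic score for each parameter setting, that each topic is held out in turn, and that the setting with the best mean score on the remaining topics is the one scored on the held-out topic. The function name, the setting labels, the topic ids, and all numbers are hypothetical.

```python
from statistics import mean


def loocv_score(per_topic_scores):
    """Leave-one-out cross-validation over a parameter sweep.

    per_topic_scores maps a parameter setting to {topic_id: metric_value}.
    Returns (mean held-out score, {topic_id: held-out score}).
    """
    settings = sorted(per_topic_scores)
    topics = sorted(next(iter(per_topic_scores.values())))
    held_out = {}
    for topic in topics:
        # Pick the setting that does best on every topic except this one.
        train = [t for t in topics if t != topic]
        best = max(
            settings,
            key=lambda s: mean(per_topic_scores[s][t] for t in train),
        )
        # Score the held-out topic with that setting.
        held_out[topic] = per_topic_scores[best][topic]
    return mean(held_out.values()), held_out


# Hypothetical sweep output: two settings, three topics, made-up AP values.
sweep = {
    "mu=500": {"T1": 0.20, "T2": 0.40, "T3": 0.30},
    "mu=2500": {"T1": 0.30, "T2": 0.35, "T3": 0.10},
}
average, per_topic = loocv_score(sweep)
```

Because the setting is chosen without looking at the held-out topic, the reported average can be lower than the best single-setting average from the sweep.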

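| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| TFIDF | 0.2524 | 0.459 | 0.4643 | 0.4144 | 0.3638 | 0.3973 | Sweep b and k1 |
| Okapi | 0.2548 | 0.466 | 0.5+ | 0.4414+ | 0.3419 | 0.3981 | Sweep b, k1, k3 |
| QL (JM) | 0.2573 | 0.5159 | 0.5524+ | 0.4886+ | 0.3752 | 0.4249 | Sweep lambda |
| QL (Dir) | 0.2837+ | 0.5306+ | 0.5667+ | 0.5054+ | 0.3981 | 0.4541+ | Sweep mu |
| QL (TS) | 0.2794 | 0.5391+ | 0.531 | 0.4702 | 0.391 | 0.4455 | Sweep mu and lambda |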

+ indicates a significant improvement over the TFIDF baseline (p < 0.05)
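The page does not say which significance test produced the + markers. As one possible illustration only, the sketch below runs an exact one-sided paired randomization (sign-flip) test on per-topic scores; the choice of test, the function name, and the score lists are assumptions, not the method used for the tables on this page.

```python
from itertools import product


def paired_sign_flip_p(baseline, system):
    """Exact one-sided paired randomization test.

    Returns the fraction of sign assignments whose summed per-topic
    difference is at least as large as the observed one.
    """
    diffs = [s - b for b, s in zip(baseline, system)]
    observed = sum(diffs)
    count = 0
    total = 0
    # Enumerate every way of flipping the sign of each paired difference.
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        flipped = sum(sign * d for sign, d in zip(signs, diffs))
        if flipped >= observed - 1e-12:  # small tolerance for float noise
            count += 1
    return count / total


# Hypothetical per-topic scores for six topics (made-up values).
baseline_scores = [0.20, 0.30, 0.10, 0.40, 0.25, 0.35]
system_scores = [0.25, 0.40, 0.25, 0.60, 0.50, 0.65]
p_value = paired_sign_flip_p(baseline_scores, system_scores)
significant = p_value < 0.05
```

Full enumeration visits 2^n sign assignments, so for larger topic sets a random sample of assignments would be the usual substitute.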

Feedback/SDM models

| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| QL (Dir) | 0.2837 | 0.5306 | 0.5667 | 0.5054 | 0.3981 | 0.4541 | |
| QL (Dir) | 0.2837 | 0.5306 | 0.5667 | 0.5054 | 0.3967 | 0.4541 | Sweep mu |
| RM3 | 0.2846 | 0.5381 | 0.5738 | 0.5054 | 0.4157 | 0.4710 | Sweep mu, fbDocs, fbTerms, and lambda |
| RM3 | 0.2982 | 0.5381 | 0.5452 | 0.5152 | 0.4038 | 0.4768 | Sweep mu, fbDocs, fbTerms, and lambda |
| RM3 (stopped) | 0.3023 | 0.5471 | 0.5476 | 0.5064 | 0.3886 | 0.463 | Sweep mu, fbDocs, fbTerms, and lambda |
| RM3 (stopped) | 0.3001 | 0.5423 | 0.5833 | 0.5150 | 0.3938 | 0.4534 | Sweep mu, fbDocs, fbTerms, and lambda |
| Pubmed | 0.3076+ | 0.5853+ | 0.5381 | 0.4855 | 0.4248 | 0.4501 | Use mu=2500 for pubmed query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. |
| Wikipedia | 0.2947 | 0.5956+ | 0.5738 | 0.4956 | 0.4062 | 0.4487 | Use mu=2500 for wikipedia query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. |
| SDM | 0.2874 | 0.5558 | 0.4833 | 0.4706 | 0.4019 | 0.4559 | Sweep mu, w1, w2, w3 |
| FDM | 0.293 | 0.5594+ | 0.5500 | 0.4960 | 0.3933 | 0.4560 | Sweep mu, w1, w2, w3 |
| OKAPI Exp | 0.3011 | 0.5106 | 0.5071 | 0.4552 | 0.3871 | 0.4375 | OKAPI expansion assuming k1=1 and k3=1, for now |

+ indicates a significant improvement over the QL (Dir) baseline (p < 0.05)


| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| Dir (Krovetz) | 0.2944+ | 0.5443+ | 0.5905+ | 0.5150 | 0.4033 | 0.4601 | + means significant improvement over Dir (unstemmed) |
| RM3 (Krovetz) | 0.2858 | 0.5592 | 0.5643 | 0.4999 | 0.3952 | 0.4605 | |
| Pubmed (Krovetz) | 0.3285+ | 0.6008+ | 0.5667 | 0.4821 | 0.4333+ | 0.4627 | + means improvement over Dir (Krovetz) |




