
Baselines

The following baselines were run with the combined, short topics against the biocaddie_all index. The baselines/ scripts sweep the parameter combinations, and the mkeval.sh script performs leave-one-out cross-validation (LOOCV) for each desired metric.
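The LOOCV step can be illustrated with a minimal Python sketch. This is not the mkeval.sh script itself; it assumes the common procedure in which, for each held-out topic, the parameter setting with the best mean metric on the remaining topics is selected and the held-out topic is scored under that setting. The function name `loocv` and the input layout are hypothetical.

```python
from statistics import mean

def loocv(scores):
    """Leave-one-out cross-validation over a parameter sweep.

    scores maps a parameter setting (any hashable label) to a dict of
    per-topic metric values, e.g. {"mu=500": {"T1": 0.2, ...}, ...}.
    Returns (mean held-out metric, {topic: setting chosen for that topic}).
    """
    settings = sorted(scores)                      # fixed order for tie-breaking
    topics = sorted(next(iter(scores.values())))   # assumes every setting covers the same topics
    held_out, chosen = {}, {}
    for topic in topics:
        train = [t for t in topics if t != topic]
        # Pick the setting that does best on all topics except the held-out one.
        best = max(settings, key=lambda s: mean(scores[s][t] for t in train))
        chosen[topic] = best
        held_out[topic] = scores[best][topic]
    return mean(held_out.values()), chosen
```

With one such call per metric, the reported number would be the mean of the held-out values rather than the best value found in the sweep.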

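| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| TFIDF | 0.2524 | 0.459 | 0.4643 | 0.4144 | 0.3638 | 0.3973 | Sweep b and k1 |
| Okapi | 0.2548 | 0.466 | 0.5+ | 0.4414+ | 0.3419 | 0.3981 | Sweep b, k1, k3 |
| QL (JM) | 0.2573 | 0.5159 | 0.5524+ | 0.4886+ | 0.3752 | 0.4249 | Sweep lambda |
| QL (Dir) | 0.2837+ | 0.5306+ | 0.5667+ | 0.5054+ | 0.3981 | 0.4541+ | Sweep mu |
| QL (TS) | 0.2794 | 0.5391+ | 0.531 | 0.4702 | 0.391 | 0.4455 | Sweep mu and lambda |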

+ indicates significant improvement over TFIDF baselines (p < 0.05)
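For reference, the mu and lambda parameters named in the Notes column are the smoothing parameters of the query-likelihood models. The sketch below shows the standard textbook forms, assuming that JM is Jelinek-Mercer, Dir is Dirichlet, and TS is two-stage smoothing. The function names are hypothetical, lambda is taken here as the weight on the collection model (toolkits differ on this convention), and the two-stage form approximates the query background model with the collection model.

```python
def p_jm(tf, doc_len, p_coll, lam):
    """Jelinek-Mercer: linear mix of the document and collection models."""
    p_ml = tf / doc_len if doc_len else 0.0
    return (1.0 - lam) * p_ml + lam * p_coll

def p_dir(tf, doc_len, p_coll, mu):
    """Dirichlet: add mu pseudo-counts distributed by the collection model."""
    return (tf + mu * p_coll) / (doc_len + mu)

def p_two_stage(tf, doc_len, p_coll, mu, lam):
    """Two-stage: Dirichlet smoothing first, then a Jelinek-Mercer style mix.

    Assumption: the query background model is approximated by p_coll.
    """
    return (1.0 - lam) * p_dir(tf, doc_len, p_coll, mu) + lam * p_coll
```

Here `tf` is the term count in the document, `doc_len` the document length in tokens, and `p_coll` the probability of the term in the collection.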

Feedback/SDM models

Green rows are runs with Garrick's framework; all others are runs with Craig's framework.

| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| QL (Dir) | 0.2853 | 0.5304 | 0.5667 | 0.5054 | 0.3986 | 0.4541 | |
| QL (Dir) | 0.2853 | 0.5304 | 0.5667 | 0.5054 | 0.3986 | 0.4541 | Sweep mu={50, 250, 500, 1000, 2500, 5000, 10000} |
| RM3 | 0.2846 | 0.5381 | 0.5738 | 0.5054 | 0.4157 | 0.4710 | Sweep mu, fbDocs, fbTerms, and lambda |
| RM3 | 0.2982 | 0.5381 | 0.5452 | 0.5152 | 0.4038 | 0.4768 | Sweep mu={100, 250, 500, 750, 1000, 2000, 3000}, fbDocs={10, 25, 50, 75, 100}, fbTerms={10, 25, 50, 75, 100}, and lambda=[0.0, 1.0] |
| RM3 (stopped) | 0.3023 | 0.5471 | 0.5476 | 0.5064 | 0.3886 | 0.463 | Sweep mu, fbDocs, fbTerms, and lambda |
| RM3 (stopped) | 0.3001 | 0.5423 | 0.5833 | 0.5150 | 0.3938 | 0.4534 | Sweep mu={100, 250, 500, 750, 1000, 2000, 3000}, fbDocs={10, 25, 50, 75, 100}, fbTerms={10, 25, 50, 75, 100}, and lambda=[0.0, 1.0] |
| Pubmed | 0.2945 | 0.5998+ | 0.5190 | 0.4759 | 0.4190 | 0.4607 | Use mu=2500 for pubmed query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. |
| Wikipedia | 0.2947 | 0.5956+ | 0.5738 | 0.4956 | 0.4062 | 0.4487 | Use mu=2500 for wikipedia query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval. |
| SDM | 0.2874 | 0.5558 | 0.4833 | 0.4706 | 0.4019 | 0.4559 | Sweep mu, w1, w2, w3 |
| FDM | 0.293 | 0.5594+ | 0.5500 | 0.4960 | 0.3933 | 0.4560 | Sweep mu, w1, w2, w3 |
| OKAPI Exp | 0.3011 | 0.5106 | 0.5071 | 0.4552 | 0.3871 | 0.4375 | OKAPI expansion assuming k1=1 and k3=1, for now |

+ indicates significant improvement over QL (Dir) baselines (p < 0.05)
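This page does not state which significance test produced the + markers. As one possible illustration, the sketch below runs a one-sided paired randomization (sign-flip) test on per-topic metric values; a paired t-test is another common choice. The function name, the trial count, and the seed are assumptions made for this example.

```python
import random

def paired_randomization_test(system, baseline, trials=10000, seed=0):
    """One-sided p-value for 'system improves on baseline'.

    system and baseline are per-topic metric values in the same topic order.
    Under the null hypothesis the sign of each per-topic difference is
    arbitrary, so signs are flipped at random and the resulting sums are
    compared with the observed sum of differences.
    """
    diffs = [s - b for s, b in zip(system, baseline)]
    observed = sum(diffs)
    rng = random.Random(seed)  # fixed seed keeps the estimate reproducible
    hits = 0
    for _ in range(trials):
        flipped = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if flipped >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (trials + 1)
```

Under this sketch, a cell would receive a + when the returned value is below 0.05.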


| Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes |
|---|---|---|---|---|---|---|---|
| Dir (Krovetz) | 0.2944+ | 0.5443+ | 0.5905+ | 0.5150 | 0.4033 | 0.4601 | + means significant improvement over Dir (unstemmed) |
| RM3 (Krovetz) | 0.2858 | 0.5592 | 0.5643 | 0.4999 | 0.3952 | 0.4605 | |
| Pubmed (Krovetz) | 0.3285+ | 0.6008+ | 0.5667 | 0.4821 | 0.4333+ | 0.4627 | + means improvement over Dir (Krovetz) |




