You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 39 Next »

Baselines

The following baselines were run using the combined, short topics on the biocaddie_all index using the baselines/ scripts to sweep parameter combinations and the mkeval.sh script to perform LOOCV for each desired metric.

ModelMAPNDCGP@20NDCG@20P@100NDCG@100NotesDate
TFIDF0.27810.53240.54050.45930.40240.4369Sweep b and k15/17/17
Okapi0.28940.5629+0.56430.4927+0.40290.4516Sweep b, k1, k35/17/17
QL (JM)0.26610.53110.53810.48530.38520.437Sweep lambda5/17/17
QL (Dir)0.2976+0.5656+0.56430.50040.41190.457

Sweep mu

5/17/17
QL (TS)0.2931+0.5653+0.57140.48270.40240.4507Sweep mu and lambda5/17/17

+ indicates significant improvement over TFIDF baselines (p < 0.05)

Results when using the combined, orig topics on the biocaddie_all index using the baselines/ scripts to sweep parameter combinations and the mkeval.sh script to perform LOOCV for each desired metric.

ModelMAPNDCGP@20NDCG@20P@100NDCG@100NotesDate
TFIDF0.16610.37490.3950.32340.25050.2962Sweep b and k15/16/17
Okapi0.13480.32790.32250.25580.2120.2547Sweep b, k1, k35/16/17
QL (JM)0.12830.33880.28250.20790.230.2508Sweep lambda5/16/17
QL (Dir)0.14470.36890.2850.23240.24150.2666

Sweep mu

5/16/17
QL (TS)0.16060.37240.360.3040.24750.2991Sweep mu and lambda5/16/17

***No improvement over TFIDF baselines when using original queries.


Feedback/SDM models

Green rows are runs with Garrick's framework, all others are with Craig's framework.

ModelMAPNDCGP@20NDCG@20P@100NDCG@100NotesDate
QL (Dir)0.2976+0.5656+0.56430.50040.41190.457

 

5/15/17
QL (Dir)0.28530.53040.56670.50540.39860.4541Sweep mu5/15/17
RM30.28460.53810.57380.50540.41570.4710Sweep mu, fbDocs, fbTerms, and lambda5/15/17
RM30.28460.53950.56190.50720.41570.4710Sweep mu, fbDocs, fbTerms, and lambda5/16/17
RM3 (stopped)0.29480.58340.55710.50040.38050.4599Sweep mu, fbDocs, fbTerms, and lambda5/17/17
RM3 (stopped)0.30230.54830.57380.51180.39050.4626Sweep mu, fbDocs, fbTerms, and lambda5/16/17
Pubmed0.29450.5998+0.51900.47590.41900.4607Use mu=2500 for pubmed query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval.5/15/17
Wikipedia0.30510.5894+0.56190.48210.39480.4613Use mu=2500 for wikipedia query sweeping fbDocs, fbTerms, and lambda. Sweep mu for final retrieval.5/15/17
SDM0.28740.55580.48330.47060.40190.4559Sweep mu, w1, w2, w35/16/17
FDM0.2930.5594+0.55000.49600.39330.4560Sweep mu, w1, w2, w35/8/17
OKAPI Exp0.32500.58490.58570.48940.42380.4783OKAPI expansion assuming k1=1 and k3=1, for now5/17/17

+ indicates significant improvement over QL (Dir) baselines (p < 0.05)

Document expansion models

ModelMAPNDCGP@20NDCG@20P@100NDCG@100NotesDate
QL (Dir)0.28530.53040.56670.50540.39860.4541Sweep mu5/15/17
Pubmed RM30.29490.5806+0.56430.47620.38190.4431Sweep mu, fbDocs, fbTerms, alpha, and lambda5/17/17
Wikipedia RM30.28000.5795+0.55240.47120.37240.4254Sweep mu, fbDocs, fbTerms, alpha, and lambda5/17/17
Pubmed+Wikipedia RM30.29230.5725+0.56670.46000.39330.4378Sweep mu, fbDocs, fbTerms, alpha, and lambda5/17/17


ModelMAPNDCGP@20NDCG@20P@100NDCG@100NotesDate
Dir (Krovetz)0.2959+0.5443+0.5905+0.51500.4124+0.4655+ means significant improvement of Dir (unstemmed)5/15/17
RM3 (krovetz)0.30290.57340.56430.47140.40330.4753
5/17/17
Pubmed (Krovetz)0.3285+0.6008+0.56670.48210.4333+0.4627+ means improvement of Dir (krovetz)





  • No labels