...
The following baselines were run using the test, short topics on the biocaddie_all index using the baselines/ scripts to sweep parameter combinations and the mkeval.sh script to perform LOOCV for each desired metric.
Model | MAP | NDCG | P@20 | NDCG@20 | P@100 | NDCG@100 | Notes | Date |
---|---|---|---|---|---|---|---|---|
TFIDF | 0.3349 | 0.5525 | 0.6433 | 0.5552 | 0.5007 | 0.4899 | Sweep b and k1 | 6/22/17 |
Okapi | 0.342 | 0.5845 | 0.68 (p-value=0.0587) | 0.5597 (p-value=0.0553) | 0.5067 | 0.5143 | Sweep b, k1, k3 | 6/22/17 |
QL (JM) | 0.2988 | 0.5242 | 0.6667 | 0.5829 | 0.4747 | 0.4974 | Sweep lambda | 6/22/17 |
QL (Dir) | 0.3284 | 0.544 | 0.6833 | 0.5923 | 0.5013 | 0.5099 | Sweep mu | 6/22/17 |
QL (TS) | 0.3261 | 0.545 | 0.6667 | 0.5549 | 0.4853 | 0.4906 | Sweep mu and lambda | 6/22/17 |
+ indicates significant improvement over TFIDF baselines (p < 0.05)
...