Transform the Biocaddie "short" queries into Boolean queries using the Indri query language. Run the Boolean queries and record the results on the wiki.
Indri allows you to specify Boolean-style queries using the #band (Boolean AND), #or, and #not operators. See the Indri Query Language reference:
https://www.lemurproject.org/lemur/IndriQueryLanguage.php#belief
I'd like to see how the Biocaddie queries perform when at least two query terms are present in the document. For example, for the query "protein sequencing bacterial chemotaxis," you would produce the following Boolean version:
#or(
#band( protein sequencing )
#band( protein bacterial )
#band( protein chemotaxis )
#band( sequencing bacterial )
#band( sequencing chemotaxis )
#band( bacterial chemotaxis )
)
(Note that spacing and new lines don't matter.)
You will probably want to write a script to transform the queries into this format automatically.