Uploaded image for project: 'National Data Service'
  1. National Data Service
  2. NDS-912

Run Boolean retrievals in Indri

XMLWordPrintableJSON

    • Icon: Task Task
    • Resolution: Fixed
    • Icon: Normal Normal
    • None
    • None
    • None

      Transform the Biocaddie "short" queries into Boolean queries using the Indri query language. Run the Boolean queries and record the results on the wiki.

      Indri allows you to specify Boolean-style queries using the #band (Boolean AND), #or, and #not operators. See the Indri Query Language reference:

      https://www.lemurproject.org/lemur/IndriQueryLanguage.php#belief

      I'd like to see how the Biocaddie queries perform when at least two query terms are present in the document. For example, for the query "protein sequencing bacterial chemotaxis," you would produce the following Boolean version:

      #or(
        #band( protein sequencing )
        #band( protein bacterial )
        #band( protein chemotaxis )
        #band( sequencing bacterial )
        #band( sequencing chemotaxis )
        #band( bacterial chemotaxis )
      )

      (Note that spacing and new lines don't matter.)

      You will probably want to write a script to transform the queries into this format automatically.

              gsherma2 Garrick Sherman
              gsherma2 Garrick Sherman
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved:

                  Estimated:
                  Original Estimate - 6 hours
                  6h
                  Remaining:
                  Remaining Estimate - 6 hours
                  6h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified