Skip to content
Change the repository type filter

All

    Repositories list

    • TREC KBA Website
      HTML
      0100Updated Aug 19, 2019Aug 19, 2019
    • Streamcorpus website
      HTML
      0000Updated Aug 19, 2019Aug 19, 2019
    • stop word lists in several languages
      Python
      222100Updated Mar 25, 2017Mar 25, 2017
    • framework for making streamcorpus data
      HTML
      MIT License
      41100Updated Mar 11, 2017Mar 11, 2017
    • common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text
      Scala
      193400Updated Sep 30, 2016Sep 30, 2016
    • MOVED to
      0000Updated Dec 31, 2014Dec 31, 2014
    • kba-tools

      Public
      Tools for working with TREC KBA entities, training data, and run submissions
      Python
      2500Updated Nov 16, 2014Nov 16, 2014
    • scoring tools for TREC KBA
      Python
      MIT License
      5200Updated Nov 16, 2014Nov 16, 2014
    • Python
      MIT License
      0160Updated Aug 25, 2014Aug 25, 2014
    • MIT License
      0000Updated Jul 20, 2014Jul 20, 2014
    • integrate factorie language analyzer into streamcorpus-pipeline
      Python
      MIT License
      1000Updated Jun 26, 2014Jun 26, 2014
    • Wrappers for generating one-word-per-line output representing all the goodies from Stanford CoreNLP, so we can include it in the KBA stream corpus.
      Java
      0400Updated Jan 17, 2013Jan 17, 2013
    • Tools for working with TREC KBA Corpora
      Python
      4500Updated Dec 14, 2012Dec 14, 2012
    • This project contains some Hadoop code for working with the TREC Knowledge Base Acceleration dataset. In particular, it provides classes to read/write topic files, read/write run files, and expose the documents in the Thrift files as Hadoop-readable objects.
      Java
      5000Updated Jul 24, 2012Jul 24, 2012