boudinfl

Florian Boudin's repositories

pke

Python Keyphrase Extraction module

Language:PythonGPL-3.01579 30 147

ake-datasets

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

Language:ShellApache-2.0146 8 3

takahe

takahe is a multi-sentence compression module

Language:PythonMIT53 5 6

sume

Sume is an implementation of the concept-based ILP model for summarization.

Language:PythonGPL-3.037 5 1

ir-using-kg

Keyphrase Generation for Scientific Document Retrieval

Language:Python11 4 1

acm-cr

ACM-CR: A Manually Annotated Test Collection for Citation Recommendation

Language:TeXUnlicense9 50

hulth-2003-pre

Preprocessed Inspec keyphrase extraction benchmark dataset

Language:Shell8 20

semeval-2010-pre

Preprocessed SemEval-2010 benchmark dataset for keyphrase extraction

7 20

duc-2001-pre

Preprocessed DUC 2001 keyphrase extraction benchmark dataset

Apache-2.06 20

krapivin-2009-pre

Preprocessed Krapivin keyphrase extraction benchmark dataset

Language:Python5 30

redefining-absent-keyphrases

Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"

Language:PythonApache-2.05 50

marujo-2012-pre

Preprocessed Marujo keyphrase extraction benchmark dataset

Language:ShellApache-2.04 20

CLIREC

CLinical Information Retrieval Evaluation Collection

Language:Jupyter NotebookNOASSERTION2 40

cross-language_IR

Un cours de deux heures sur la recherche d'information cross-lingue

Language:TeXNOASSERTION2 40

boudinfl.github.io

website

Language:HTML1 20

pke-benchmarking

Language:Jupyter Notebook1 10

silk

silk: Unsupervised Domain Adaptation for Keyphrase Generation using Citation Contexts

Language:PythonCC0-1.01 10

wikinews-2013-pre

Preprocessed Wikinews Keyphrase benchmark dataset

Language:PythonApache-2.01 20

anserini

A Lucene toolkit for replicable information retrieval research

Language:Java020

bib

my bibliography in xml format

Language:TeX020

boudinfl

Config files for my GitHub profile.

010

boudinfl_cv

Language:TeX000

corenlp_parser

Minimal CoreNLP XML Parser in Python

Language:PythonGPL-3.0020

gh-pages-minima-starter

A minimal example for running Github Pages with the minima theme.

Language:HTMLMIT010

golem

Unlicense020

ir-course

010

minima

Minima is a one-size-fits-all Jekyll theme for writers.

Language:SCSSMIT010

s2orc-doc2json

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Language:PythonApache-2.0000

talias

TALIAS project

020

witten-1999-pre

Preprocessed CSTR keyphrase extraction dataset

Language:Shell020