JHU Human Language Technology Center of Excellence's repositories
golden-horse
Named Entity Recognition for Chinese social media (Weibo), from the EMNLP 2015 paper.
concrete-python
Python modules and scripts for working with Concrete, a data serialization format for NLP
clir-tutorial
SIGIR 2023 tutorial on cross-language information retrieval.
concrete-js
JavaScript library for working with Concrete, a data serialization format for NLP
concrete-stanford
Concrete-Stanford: Wraps Stanford NLP with utilities to fit it into a Concrete-compliant workflow
annotated-nyt
Java wrappers and utilities for reading the Annotated NYT corpus
cmn-renmin-ocr-ner-dataset
NER annotations of the Chinese newspaper Renmin
peer_measure
Implementation of the measure Probability of Equal Expected Rank
ner-for-ir-collection
Dataset for exploring the uses of named entity recognition in information retrieval
turkle-client
Client for the Turkle annotation platform