Beast code in Giters

xiao's repositories

anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Language: Python | License: MIT | 0 | 2 | 0

ASER

ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11-billion-token unstructured textual data.

Language: Python | License: MIT | 0 | 2 | 0

AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language: C++ | License: Apache-2.0 | 0 | 2 | 0

BERT-NER

Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).

Language: Python | 0 | 2 | 0

biobert

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Language: Python | License: NOASSERTION | 0 | 3 | 0

CoreNLP

Stanford CoreNLP: A Java suite of core NLP tools.

Language: Java | License: GPL-3.0 | 0 | 1 | 0

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | 0 | 2 | 0

GCDT

Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling

Language: PLSQL | License: BSD-3-Clause | 0 | 2 | 0

giza-pp

GIZA++ is a statistical machine translation toolkit used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool, which generates the word classes necessary for training some of the alignment models.

Language: C++ | 0 | 2 | 0

iclr2016

Python code for training all models in the ICLR paper, "Towards Universal Paraphrastic Sentence Embeddings". These models achieve strong performance on semantic similarity tasks without any training or tuning on the training data for those tasks. They can also produce features that are at least as discriminative as skip-thought vectors for semantic similarity tasks. Moreover, this code can achieve state-of-the-art results on entailment and sentiment tasks.

Language: Python | 0 | 2 | 0

is-xhuang1994

Distinguish Bots from Humans on Twitter

Language: Jupyter Notebook | License: MIT | 0 | 2 | 0

LM-LSTM-CRF

Empower Sequence Labeling with Task-Aware Language Model

Language: Python | License: Apache-2.0 | 0 | 3 | 0

MACROSCORE

MACROSCORE project at ISI - Micro Feature Extraction direction

Language: Python | 0 | 2 | 0

mmner

Massively Multilingual Transfer for NER

Language: Python | 0 | 2 | 0

mrc-for-flat-nested-ner

The code for "A Unified MRC Framework for Named Entity Recognition"

Language: Python | License: Apache-2.0 | 0 | 2 | 0

NER-GRN

Code for our AAAI 2019 paper "GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition"

Language: Python | 0 | 1 | 0

nlproc-cookbook

Language: Python | 0 | 2 | 0

nltk_contrib

NLTK Contrib

Language: Python | License: NOASSERTION | 0 | 2 | 0

OntoNotes-5.0-NER-BIO

A BIO formatted Named Entity Recognition data set extracted from the OntoNotes 5.0 release.

Language: Python | 0 | 2 | 0

para-nmt-50m

Pre-trained models, code, and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"

Language: Python | 0 | 3 | 0

python-wordsegment

English word segmentation, written in pure Python and based on a trillion-word corpus.

Language: Python | License: NOASSERTION | 0 | 1 | 0

scpn

Syntactically controlled paraphrase networks

Language: Python | 0 | 2 | 0

semi-supervised-baselines

Code for "Strong Baselines for Neural Semi-supervised Learning under Domain Shift" (Ruder & Plank, ACL 2018)

Language: Python | 0 | 2 | 0

Vanilla_NER

Vanilla Sequence Labeling with Char-LSTM-CRF

Language: Python | License: Apache-2.0 | 0 | 2 | 0

xhuang28

NewBioNer

blockchain_bootcamp

commonsenseqa
