Nln's repositories
unilm
UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities
BoundaryNet
BoundaryNet - A Semi-Automatic Layout Annotation Tool
lit
The Language Interpretability Tool: Interactively analyze NLP models for model understanding in an extensible and framework agnostic interface.
ct_warc_to_doc
Source code to extract content from commoncrawl news corpus and upload to S3
faiss
A library for efficient similarity search and clustering of dense vectors.
parser
A collection of state-of-the-art syntactic parsing models based on Biaffine Parser.
crfpar
Code for ACL'20 paper "Efficient Second-Order TreeCRF for Neural Dependency Parsing" and IJCAI'20 paper "Fast and Accurate Neural CRF Constituency Parsing".
gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
ProphetNet
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training https://arxiv.org/pdf/2001.04063.pdf
docker-elk
The Elastic stack (ELK) powered by Docker and Compose.
Clinical-Trial-Parser
Library for converting clinical trial eligibility criteria to a machine-readable format.
SpanBERT
Code for using and evaluating SpanBERT.
doccano
Open source text annotation tool for machine learning practitioner.
PreSumm
code for EMNLP 2019 paper Text Summarization with Pretrained Encoders
clinicalBERT
repository for Publicly Available Clinical BERT Embeddings
wikiextractor
A tool for extracting plain text from Wikipedia dumps
babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
BioRelEx
BioRelEx: Biological Relation Extraction Benchmark
biobert
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
few-shot
Repository for few-shot learning machine learning projects
leo
Implementation of Meta-Learning with Latent Embedding Optimization
deeptype
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"
MAML-Pytorch
Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning (MAML)
biobert-pretrained
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
DeepFaceLab
DeepFaceLab is a tool that utilizes deep learning to recognize and swap faces in pictures and videos. Includes prebuilt ready to work standalone Windows 7,8,10 binary (look readme.md).