Hang Dong's starred repositories
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
graph-based-deep-learning-literature
Links to conference publications in graph-based deep learning
Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
structural-probes
Codebase for testing whether hidden states of neural networks encode discrete structures.
clinicalBERT
ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop)
neat-vision
Neat (Neural Attention) Vision is a framework-agnostic visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks.
AttentionXML
Implementation for "AttentionXML: Label Tree-based Attention-Aware Deep Model for High-Performance Extreme Multi-Label Text Classification"
OWL2Vec-Star
Embedding OWL ontologies
MedCATtrainer
A simple interface to inspect, improve and add concepts to biomedical NER+L -> MedCAT.
asreview-covid19
Extension that adds Covid-19 related datasets to ASReview
Geonames-embeddings
Embeddings for all geonames populated locations with population greater than 0
collections-as-data
Jupyter Notebooks for reuse in analysis of National Library of Scotland's collections as data
Automated-Health-Responses
A prototype project for automated, physician-like responses to medical questions
gate-cloud-python-example
Example of using the GATE Cloud online API
domain-specific-bert-scripts
Pretraining/finetuning scripts used for domain-specific BERT training.
bio-yodie-resource-prep
Scripts to prepare the informational resources required by GATE Bio-YODIE.
cantemist2020-ner
CANTEMIST (CANcer TExt Mining Shared Task – tumor named entity recognition), NER track
Awesome-COVID-NLP-tools
A curated list of COVID-19-related NLP tools that may be of interest to researchers working on COVID-19 and to those in the clinical NLP domain.