Chao-Chun (Joe) Hsu's repositories
Event-Extraction
Wordnet, Verbnet, and Dependency Parsing
snomed-ontology-parser
Analyzing medical concept distribution of clinical text with Snomed ontology.
piccollage-intern-interview
Pairs, Trapped rainwater
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
CSCI5622-machine-learning
Course information for CSCI 5622 in Fall 2019
edgar-crawler
Download financial reports from SEC's EDGAR quickly. Extract clean textual data from specific item sections and bootstrap your financial research. Software from the research paper published in ECONLP 2021.
flatten_tokenize_convert_chinese_gigaword
Dump the text of the Gigaword dataset into headline and paragraph files including Chinese word tokenization and simplified-to-tranditional Chinese conversion
metrics
Machine learning metrics for distributed, scalable PyTorch applications.
mimic3-benchmarks
Python suite to construct benchmark machine learning datasets from the MIMIC-III clinical database.
Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
My_first_JavaGame
After a semester of learning Java, I finish a five-in-a-row by myself.
pytorch-lightning
The lightweight PyTorch wrapper for ML researchers. Scale your models. Write less boilerplate
QANet-pytorch
A PyTorch implementation of QANet.
s2-folks
Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.
s2orc-doc2json
Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
transformers-bloom-inference
Fast Inference Solutions for BLOOM