John Bosco's repositories
komt
Korean Multi-task Instruction Tuning
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
AdapterEM
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning
ContextualBlocker-for-EM
A Graph-Based Blocking Approach for Entity Matching Using Contrastively Learned Embeddings
boscoj2008.github.io
Data Science Portfolio
youtube_tutorial
This is an example of how to push to GitHub
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
udapter
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specific adaptation. This repository includes the code for "UDapter: Language Adaptation for Truly Universal Dependency Parsing"
nlpaug
Data augmentation for NLP
A-Roadmap-for-Transfer-Learning
flowchart
transformer-series
This repo focuses on using Transformers for various downstream NLP tasks
BRIO
ACL 2022: BRIO: Bringing Order to Abstractive Summarization
ember
Data and code for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction"
584-final
Sentence Embeddings using Supervised Contrastive Learning. Danqi Liao.
best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. May be useful.
T-DNA
Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation" by Shizhe Diao et al.
google-research
Google Research
DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
diabetes-classification
KNN algorithm, data file and notebook
ditto
Code for the paper "Deep Entity Matching with Pre-trained Language Models"
Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
sccl
PyTorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
dacon
Data Augmentation for Entity Matching using Consistency Learning
emea
AE
KNN-BERT
Code for the paper "KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier"
SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
SupCL-Seq
Supervised Contrastive Learning for Downstream Optimized Sequence Representations