Toan Nguyen's repositories
transformers_without_tears
Transformers without Tears: Improving the Normalization of Self-Attention
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
An-Introduction-to-Statistical-Learning
This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.
cbz.sh
Simple script to create a cbz archive from a folder.
chinese-hanviet-cognates
A Python notebook that outputs common Han Viet cognates for Chinese words.
computer-science
:mortar_board: Path to a free self-taught education in Computer Science!
fairscale
PyTorch extensions for high performance and large scale training.
FasterTransformer
Transformer related optimization, including BERT, GPT
fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/pdf/2106.04718.pdf
fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
genki-study-resources
A collection of exercises for practicing what is taught in Genki: An Integrated Course in Elementary Japanese.
ISLR-python
An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code
mega
Sequence modeling with Mega.
minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
multilingual-nlm
Code for "Unsupervised Multilingual Word Embedding with Limited Resources using Neural Language Models" and "Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora"
pyprobml
Python code for "Probabilistic Machine learning" book by Kevin Murphy
RRHF
RRHF & Wombat
sciblog_support
Support content for my blog
Self-Learning
Books Papers, Courses & more I have to learn soon
terashuf
terashuf shuffles multi-terabyte text files using limited memory
transducer-tutorial
Example code for a neural transducer model.
zhongwen
Fork of the popular "Zhongwen" Chrome extension, adapted for Chinese-Vietnamese.