bcui19's repositories
CompilerGym
A reinforcement learning toolkit for compiler optimizations
composer
Train neural networks up to 7x faster
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
mosaic_examples
Fast and flexible reference benchmarks
NeMo
NeMo: a toolkit for conversational AI
off-belief-learning
Implementation of the Off Belief Learning algorithm.
PCC-pytorch
A PyTorch implementation of the paper "Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control"
probability
Probabilistic reasoning and statistical analysis in TensorFlow
RL4LMs
A modular RL library to fine-tune language models to human preferences
rlmeta
RLMeta is a lightweight, flexible framework for distributed reinforcement learning research.
scratch
Scratch work
streaming
A Data Streaming Library for Efficient Neural Network Training
tensorflow
Computation using data flow graphs for scalable machine learning
toolbox
Essential guides and programming tools in my toolbox (with focus on ML Training)