Vivek Padman's repositories
vastai_temp
temporary repo
vieveks.github.io
personal website
minijax
codes for different llm architectures in jax and haiku
tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
nanoGPT-understanding-
The simplest, fastest repository for training/finetuning medium-sized GPTs.
alphazero_chess
My opensource modular implementation of alphazero, muzero and other algos on chess and tic tac toe environments
pytorch-alpha-zero
to try out alphazero training and understand the algorithm
pingu
Your personal robotic home assistant
Unlearning
Different algorithms to achieve unlearning
Contilearn
to make LLMs learn at the go
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
langchain
⚡ Building applications with LLMs through composability ⚡
ChessGPT
ChessGPT - Bridging Policy Learning and Language Modeling
Eureka_vivek
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
tradez
trading platform
torch_rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
tf_agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
reinforcement-learning_dennybritz
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.