vieveks

0

followers

following

stars

Vivek Padman's repositories

vastai_temp

temporary repo

Language:Jupyter Notebook000

vieveks.github.io

personal website

Language:HTML100

minijax

codes for different llm architectures in jax and haiku

Language:Jupyter NotebookMIT000

reasoning_agent

000

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language:PythonMIT000

nanoGPT-understanding-

The simplest, fastest repository for training/finetuning medium-sized GPTs.

MIT000

alphazero_chess

My opensource modular implementation of alphazero, muzero and other algos on chess and tic tac toe environments

Language:Python000

pytorch-alpha-zero

to try out alphazero training and understand the algorithm

Language:Python000

pingu

Your personal robotic home assistant

Language:Python000

Unlearning

Different algorithms to achieve unlearning

Language:Jupyter Notebook000

Contilearn

to make LLMs learn at the go

Language:Python000

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Apache-2.0000

Vishanu

A general purpose cyber virus and anti virus

Language:Python100

Anti_AI

Agent that acts like a second layer of cognition against other ai. Basically a firewall to your brain

Language:Jupyter Notebook100

langchain

⚡ Building applications with LLMs through composability ⚡

MIT000

ChessGPT

ChessGPT - Bridging Policy Learning and Language Modeling

Apache-2.0000

Eureka_vivek

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"

MIT000

Gym

Language:Python000

tradez

trading platform

Language:Python000

rl_agent_trials

Language:Python000

RL_code_implementations

Language:Jupyter Notebook100

torch_rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

MIT000

tf_agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Apache-2.0000

Self_driving_game

Language:Python000

reinforcement-learning_dennybritz

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

MIT000

HusePricePrediction

Language:Jupyter NotebookApache-2.0000