Vivek Padman (vieveks)

Vivek Padman's repositories

vastai_temp

Temporary repo

Language: Jupyter Notebook, Stargazers: 0, Issues: 0

vieveks.github.io

Personal website

Language: HTML, Stargazers: 1, Issues: 0

minijax

Code for different LLM architectures in JAX and Haiku

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python, License: MIT, Stargazers: 0, Issues: 0

nanoGPT-understanding-

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License: MIT, Stargazers: 0, Issues: 0

alphazero_chess

My open-source modular implementation of AlphaZero, MuZero, and other algorithms on chess and tic-tac-toe environments

Language: Python, Stargazers: 0, Issues: 0

pytorch-alpha-zero

Trying out AlphaZero training to understand the algorithm

Language: Python, Stargazers: 0, Issues: 0

pingu

Your personal robotic home assistant

Language: Python, Stargazers: 0, Issues: 0

Unlearning

Different algorithms to achieve unlearning

Language: Jupyter Notebook, Stargazers: 0, Issues: 0

Contilearn

Making LLMs learn on the go

Language: Python, Stargazers: 0, Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

License: Apache-2.0, Stargazers: 0, Issues: 0

Vishanu

A general-purpose cyber virus and antivirus

Language: Python, Stargazers: 1, Issues: 0

Anti_AI

An agent that acts as a second layer of cognition against other AI; basically a firewall for your brain

Language: Jupyter Notebook, Stargazers: 1, Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

License: MIT, Stargazers: 0, Issues: 0

ChessGPT

ChessGPT - Bridging Policy Learning and Language Modeling

License: Apache-2.0, Stargazers: 0, Issues: 0

Eureka_vivek

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"

License: MIT, Stargazers: 0, Issues: 0

tradez

Trading platform

Language: Python, Stargazers: 0, Issues: 0

torch_rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

License: MIT, Stargazers: 0, Issues: 0

tf_agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

License: Apache-2.0, Stargazers: 0, Issues: 0

reinforcement-learning_dennybritz

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

License: MIT, Stargazers: 0, Issues: 0