Beast code in Giters

hdchao's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | 52181 436 130

grok-1

Grok open release

Language: Python | License: Apache-2.0 | 49194 561 202

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | 23290 263 63

style2paints

sketch + style = paints :art: (TOG2018/SIGGRAPH2018ASIA)

Language: JavaScript | License: Apache-2.0 | 17941 558 211

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | 10987 165 214

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | 8792 81 36

llm-action

This project aims to share the technical principles behind large models, along with hands-on experience.

Language: HTML | License: Apache-2.0 | 8087 78 20

low_cost_robot

Language: Python | License: MIT | 2830 46 26

Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Language: Python | License: MIT | 2809 120 81

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language: MATLAB | 2630 300

stable-diffusion-tutorial

The most complete Stable Diffusion tutorial series on the web, from beginner to advanced, three months in the making

1208 11 2

Machine-Learning-for-Algorithmic-Trading-Second-Edition_Original

Machine Learning for Algorithmic Trading, Second Edition - published by Packt

Language: Jupyter Notebook | License: MIT | 1155 66 16

Deep-Reinforcement-Learning-Hands-On-Second-Edition

Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt

Language: Jupyter Notebook | License: MIT | 1101 25 44

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language: Jupyter Notebook | License: MIT | 1037 17 25

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language: Python | License: MIT | 954 8 9

gdrl

Grokking Deep Reinforcement Learning

Language: Jupyter Notebook | License: BSD-3-Clause | 774 30 31

Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E, including jump-starting GPT-4, speech-to-text, text-to-speech, text-to-image generation with DALL-E, Google Cloud AI, HuggingGPT, and more

Language: Jupyter Notebook | License: MIT | 754 22 3

Python-for-Finance-Cookbook

Python for Finance Cookbook, published by Packt

Language: Jupyter Notebook | 709 38 14

humanoid-gym

Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695

Language: Python | 564 12 16

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language: Jupyter Notebook | License: MIT | 560 7 3

NBCE

Naive Bayes-based Context Extension

Language: Python | 308 6 7

smalldiffusion

Simple and readable code for training and sampling from diffusion models

Language: Python | License: MIT | 181 4 1

Python-for-Finance-Cookbook-2E

The repository of "Python for Finance Cookbook" 2nd edition

Language: Jupyter Notebook | License: MIT | 124 8 11

LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.

Language: Python | 98 20
