hdchao's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:52181Issues:436Issues:130

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49194Issues:561Issues:202

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:23290Issues:263Issues:63

style2paints

sketch + style = paints :art: (TOG2018/SIGGRAPH2018ASIA)

Language:JavaScriptLicense:Apache-2.0Stargazers:17941Issues:558Issues:211

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:10987Issues:165Issues:214

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8792Issues:81Issues:36

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

Language:HTMLLicense:Apache-2.0Stargazers:8087Issues:78Issues:20

Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Language:PythonLicense:MITStargazers:2809Issues:120Issues:81

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language:MATLABStargazers:2630Issues:30Issues:0

stable-diffusion-tutorial

全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作

Machine-Learning-for-Algorithmic-Trading-Second-Edition_Original

Machine Learning for Algorithmic Trading, Second Edition - published by Packt

Language:Jupyter NotebookLicense:MITStargazers:1155Issues:66Issues:16

Deep-Reinforcement-Learning-Hands-On-Second-Edition

Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt

Language:Jupyter NotebookLicense:MITStargazers:1101Issues:25Issues:44

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language:Jupyter NotebookLicense:MITStargazers:1037Issues:17Issues:25

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language:PythonLicense:MITStargazers:954Issues:8Issues:9

gdrl

Grokking Deep Reinforcement Learning

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:774Issues:30Issues:31

Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more

Language:Jupyter NotebookLicense:MITStargazers:754Issues:22Issues:3

Python-for-Finance-Cookbook

Python for Finance Cookbook, published by Packt

Language:Jupyter NotebookStargazers:709Issues:38Issues:14

humanoid-gym

Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookLicense:MITStargazers:560Issues:7Issues:3

NBCE

Naive Bayes-based Context Extension

smalldiffusion

Simple and readable code for training and sampling from diffusion models

Language:PythonLicense:MITStargazers:181Issues:4Issues:1

Python-for-Finance-Cookbook-2E

The repository of "Python for Finance Cookbook" 2nd edition

Language:Jupyter NotebookLicense:MITStargazers:124Issues:8Issues:11

LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.

Language:PythonStargazers:98Issues:2Issues:0

lite-sora

An initiative to replicate Sora

Language:PythonLicense:Apache-2.0Stargazers:94Issues:3Issues:3

diffusion

From-scratch diffusion model implemented in PyTorch.

Language:Jupyter NotebookLicense:MITStargazers:54Issues:2Issues:1

ParaDance

Offers a toolset for comprehensive, multi-faceted large-scale data analysis and optimizations

Language:PythonLicense:MITStargazers:22Issues:4Issues:0