0x1DA9430

Ke Yan's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

1525100

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonMIT225600

decision-mamba

Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces

Language:PythonMIT2000

Awesome-state-space-models

Collection of papers on state-space models

47700

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonApache-2.0187700

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookMIT2024900

awesome-rl

Reinforcement learning resources curated

870700

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Language:Jupyter NotebookMIT193000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT2893600

easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Language:Jupyter NotebookNOASSERTION864900

aiXcoder-7B

official repository of aiXcoder-7B Code Large Language Model

Language:PythonApache-2.0215300

Jamba

PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"

Language:PythonMIT9900

lightning-whisper-mlx

An extremely fast implementation of whisper optimized for Apple Silicon using MLX.

Language:Python46900

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonMIT1788600

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.04169000

Mamba-ND

Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

Language:Python3300

WeatherBench

A benchmark dataset for data-driven weather forecasting

Language:Jupyter NotebookMIT67900

mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Language:PythonApache-2.087800

S5

Language:PythonMIT23000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02042900

grok-1

Grok open release

Language:PythonApache-2.04915700

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT5157600

mamba-notes

Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)

13100

DRL

Deep Reinforcement Learning

NOASSERTION304600

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.01198400