Ke Yan's starred repositories
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
decision-mamba
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
Awesome-state-space-models
Collection of papers on state-space models
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
awesome-rl
Reinforcement learning resources curated
Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Awesome-Mamba-Papers
Awesome Papers related to Mamba.
aiXcoder-7B
official repository of aiXcoder-7B Code Large Language Model
lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
WeatherBench
A benchmark dataset for data-driven weather forecasting
mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.