yao teng's starred repositories

Language:Jupyter NotebookStargazers:11Issues:0Issues:0
Language:PythonStargazers:10Issues:0Issues:0

KAISR-lite

A lite version of KAIR with SISR codes

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:452Issues:0Issues:0

Dimba

Transformer-Mamba Diffusion Models

Language:PythonStargazers:38Issues:0Issues:0

RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Language:PythonLicense:Apache-2.0Stargazers:261Issues:0Issues:0
Language:PythonStargazers:675Issues:0Issues:0

AMD

[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Language:PythonStargazers:7Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0
Language:PythonStargazers:663Issues:0Issues:0

Awesome-Game-Analysis

a comprehensive collection of video game tech analysis resources

Language:PythonLicense:CC0-1.0Stargazers:759Issues:0Issues:0

gemma-2B-10M

Gemma 2B with 10M context length using Infini-attention.

Language:PythonStargazers:861Issues:0Issues:0

long-context-attention

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Language:PythonStargazers:172Issues:0Issues:0

rfpp

The codebase of our paper "Improving the Training of Rectified Flows"

Language:PythonStargazers:36Issues:0Issues:0

d3pm

Minimal Implementation of a D3PM in pytorch

Language:Jupyter NotebookStargazers:140Issues:0Issues:0

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language:PythonLicense:NOASSERTIONStargazers:446Issues:0Issues:0

Yuan2.0-M32

Mixture-of-Experts (MoE) Language Model

Language:PythonLicense:Apache-2.0Stargazers:143Issues:0Issues:0

DiG

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Language:PythonLicense:MITStargazers:79Issues:0Issues:0

DMP

Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

DiM-DiffusionMamba

The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Language:PythonStargazers:82Issues:0Issues:0

video-mamba-suite

The suite of modeling video with Mamba

Language:PythonLicense:MITStargazers:163Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:243Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

ambient-tweedie

[ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"

Language:PythonLicense:GPL-3.0Stargazers:37Issues:0Issues:0
Language:Jupyter NotebookStargazers:4Issues:0Issues:0
Language:PythonLicense:MITStargazers:110Issues:0Issues:0

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Language:PythonStargazers:378Issues:0Issues:0

zigma

A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model"

Language:PythonLicense:Apache-2.0Stargazers:188Issues:0Issues:0

XQL

Extreme Q-Learning: Max Entropy RL without Entropy

Language:PythonStargazers:72Issues:0Issues:0