崔文耀's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49481 | Issues: 564 | Issues: 209

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21856 | Issues: 185 | Issues: 490

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language: Python | License: MIT | Stargazers: 13491 | Issues: 116 | Issues: 1083

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Language: Python | License: MIT | Stargazers: 11327 | Issues: 159 | Issues: 306

fastllm

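A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single card. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.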

Language: C++ | License: Apache-2.0 | Stargazers: 3296 | Issues: 41 | Issues: 364

DecryptPrompt

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2577 | Issues: 24 | Issues: 27

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1499 | Issues: 16 | Issues: 25

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 1261 | Issues: 27 | Issues: 47

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

mamba.py

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Language: Python | License: MIT | Stargazers: 953 | Issues: 7 | Issues: 39

mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Language: Python | License: Apache-2.0 | Stargazers: 904 | Issues: 7 | Issues: 30

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 624 | Issues: 8 | Issues: 46

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language: Python | License: MIT | Stargazers: 600 | Issues: 10 | Issues: 37

Awesome-state-space-models

Collection of papers on state-space models

streamlit-echarts

A Streamlit component to render ECharts.

Language: Python | License: MIT | Stargazers: 527 | Issues: 8 | Issues: 32

lost-in-the-middle

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Language: Python | License: MIT | Stargazers: 305 | Issues: 5 | Issues: 14

mega

Sequence modeling with Mega.

Language: Python | License: MIT | Stargazers: 297 | Issues: 126 | Issues: 16

accelerated-scan

Accelerated First Order Parallel Associative Scan

Language: Python | License: MIT | Stargazers: 152 | Issues: 8 | Issues: 6

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Language: Python | License: MIT | Stargazers: 88 | Issues: 3 | Issues: 4

DenseSSM

A repository for DenseSSMs

hippogriff

Griffin MQA + Hawk Linear RNN Hybrid

Language: Python | License: MIT | Stargazers: 83 | Issues: 4 | Issues: 8

mamba-mini

An efficient PyTorch implementation of selective scan in one file that works on both CPU and GPU, with the corresponding mathematical derivation. It is probably the code closest to selective_scan_cuda in mamba.

HGRN

[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Sequence Modeling

rnn-icrag

Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"

Language: Python | Stargazers: 24 | Issues: 2 | Issues: 0

resonance_rope

[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.

Language: Python | License: Apache-2.0 | Stargazers: 21 | Issues: 2 | Issues: 0