locussam's starred repositories

anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

Language:JavaScriptLicense:MITStargazers:25200Issues:197Issues:1651

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptLicense:AGPL-3.0Stargazers:17825Issues:93Issues:362

triton

Development repository for the Triton language and compiler

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12588Issues:134Issues:214

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8963Issues:101Issues:1342

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7652Issues:106Issues:290

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookLicense:MITStargazers:6565Issues:232Issues:34

courses

Anthropic's educational courses

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:6441Issues:50Issues:16

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:4516Issues:37Issues:1432

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language:MATLABStargazers:3714Issues:36Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonLicense:BSD-2-ClauseStargazers:3315Issues:39Issues:98

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2487Issues:35Issues:7

darkriscv

opensouce RISC-V cpu core implemented in Verilog from scratch in one night!

Language:VerilogLicense:BSD-3-ClauseStargazers:2109Issues:94Issues:40

rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

Language:C++License:MITStargazers:1414Issues:22Issues:81
Language:VerilogLicense:Apache-2.0Stargazers:1220Issues:38Issues:115

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Language:PythonLicense:Apache-2.0Stargazers:1055Issues:47Issues:0

Awesome-LLMs-on-device

Awesome LLMs on Device: A Comprehensive Survey

AlphaFold3

Open source implementation of AlphaFold3

Language:PythonLicense:Apache-2.0Stargazers:841Issues:22Issues:6

Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Language:CudaLicense:Apache-2.0Stargazers:615Issues:6Issues:19

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++License:Apache-2.0Stargazers:535Issues:12Issues:87

Yi-Coder

🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion parameters.

hai-platform

一种任务级GPU算力分时调度的高性能深度学习训练平台

Language:PythonLicense:LGPL-3.0Stargazers:305Issues:8Issues:15

fast-voice-assistant

⚡ Insanely fast AI voice assistant with <500ms response times

Language:PythonLicense:MITStargazers:291Issues:0Issues:0

sarathi-serve

A low-latency & high-throughput serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:219Issues:6Issues:17

ml-slowfast-llava

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Language:PythonLicense:NOASSERTIONStargazers:158Issues:10Issues:1

LongRoPE

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

Language:PythonLicense:MITStargazers:89Issues:4Issues:9

MagicDec

Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

Language:JavaScriptLicense:Apache-2.0Stargazers:65Issues:4Issues:3