locussam

followers

following

stars

locussam's starred repositories

anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

Language:JavaScriptMIT25200 197 1651

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptAGPL-3.017825 93 362

triton

Development repository for the Triton language and compiler

Language:C++MIT13217 193 1464

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.012588 134 214

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.08963 101 1342

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookApache-2.07652 106 290

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookMIT6565 232 34

courses

Anthropic's educational courses

Language:Jupyter NotebookNOASSERTION6441 50 16

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonApache-2.04516 37 1432

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language:MATLAB3714 360

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonBSD-2-Clause3315 39 98

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookApache-2.02487 35 7

darkriscv

opensouce RISC-V cpu core implemented in Verilog from scratch in one night!

Language:VerilogBSD-3-Clause2109 94 40

rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

Language:C++MIT1414 22 81

vortex

Language:VerilogApache-2.01220 38 115

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

claude-prompt-generator

Language:PythonApache-2.01055 470

Awesome-LLMs-on-device

Awesome LLMs on Device: A Comprehensive Survey

AlphaFold3

Open source implementation of AlphaFold3

Language:PythonApache-2.0841 22 6

Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Language:CudaApache-2.0615 6 19

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++Apache-2.0535 12 87

Yi-Coder

🌟 Yi-Coder is a series of open-source code language models that delivers state-of-the-art coding performance with fewer than 10 billion parameters.

Language:HTML337 11 5

hai-platform

一种任务级GPU算力分时调度的高性能深度学习训练平台

Language:PythonLGPL-3.0305 8 15

fast-voice-assistant

⚡ Insanely fast AI voice assistant with <500ms response times

Language:PythonMIT29100

sarathi-serve

A low-latency & high-throughput serving engine for LLMs

Language:PythonApache-2.0219 6 17

ml-slowfast-llava

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Language:PythonNOASSERTION158 10 1

LongRoPE

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

Language:Python124 6 3

TEAL

Language:PythonMIT89 4 9

MagicDec

Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

Language:JavaScriptApache-2.065 4 3