lxuechen

Xuechen Li's starred repositories

quicktype

Generate types and converters from JSON, Schema, and GraphQL

Language:TypeScriptApache-2.012248 92 1311

pgvector

Open-source vector similarity search for Postgres

Language:CNOASSERTION11888 90 572

Stockfish

A free and strong UCI chess engine

Language:C++GPL-3.011349 247 948

uvloop

Ultra fast asyncio event loop.

Language:CythonApache-2.010296 224 358

trio

Trio – a friendly Python library for async concurrency and I/O

Language:PythonNOASSERTION6108 83 810

gateway

A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

Language:TypeScriptMIT5971 35 306

schemaorg

Schema.org - schemas and supporting software

Language:HTMLApache-2.05375 406 2218

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonApache-2.05298 53 535

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonApache-2.04658 25 83

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04552 50 297

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

GPL-3.02549 84 6

swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

Language:PythonMIT2518 30 149

graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Language:PythonNOASSERTION2078 21 20

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01959 44 120

sqlacodegen

Automatic model code generator for SQLAlchemy

Language:PythonNOASSERTION1862 29 208

distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

Language:C++MIT1380 27 52

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.01137 39 75

CodeGPT

The leading open-source AI copilot for JetBrains. Connect to any model in any environment, and customize your coding experience in any way you like.

Language:JavaApache-2.01000 18 414

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonMIT975 15 38

cc_net

Tools to download and cleanup Common Crawl data

Language:PythonMIT962 23 44

LLMUnity

Create characters in Unity with LLMs!

Language:C#MIT567 12 88

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured output. Works also with models not fine-tuned to JSON output and function calls.

Language:PythonNOASSERTION468 10 42