zzxxxl's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:92529Issues:680Issues:7604

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:46087Issues:341Issues:3790

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:37324Issues:389Issues:67

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:35546Issues:246Issues:5116

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:31293Issues:282Issues:3814

LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:17650Issues:109Issues:1151

ceph

Ceph is a distributed object, block, and file storage platform

Language:C++License:NOASSERTIONStargazers:13915Issues:657Issues:0

wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Language:PythonLicense:MITStargazers:8910Issues:59Issues:3311

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language:PythonLicense:MITStargazers:8376Issues:67Issues:176

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4525Issues:50Issues:294

trader

交易模块

Language:PythonLicense:Apache-2.0Stargazers:3513Issues:36Issues:6

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language:PythonLicense:MITStargazers:2708Issues:45Issues:51

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2434Issues:34Issues:7

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1524Issues:29Issues:174

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1195Issues:40Issues:11

CUDA-Learn-Notes

🎉 CUDA Learn Notes with PyTorch: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:1169Issues:12Issues:5

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language:C++License:NOASSERTIONStargazers:1066Issues:26Issues:195

gpt-assistant-android

免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.

Language:JavaLicense:GPL-3.0Stargazers:640Issues:10Issues:47

flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Language:CudaLicense:Apache-2.0Stargazers:560Issues:4Issues:5

DB-GPT

An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)

Language:PythonLicense:Apache-2.0Stargazers:532Issues:10Issues:45

DistServe

Disaggregated serving system for Large Language Models (LLMs).

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:281Issues:4Issues:37

astra-sim

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

Language:C++License:MITStargazers:242Issues:14Issues:92

vidur

A large-scale simulation framework for LLM inference

Language:PythonLicense:MITStargazers:237Issues:6Issues:17

refusal_direction

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Language:PythonLicense:Apache-2.0Stargazers:76Issues:0Issues:0

LLM-Extrapolation

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

lottery-ticket-adaptation

Lottery Ticket Adaptation

Language:PythonLicense:Apache-2.0Stargazers:33Issues:3Issues:3

Evaluation-Multimodal-LLMs-Survey

A Survey on Benchmarks of Multimodal Large Language Models

RepresentationSurgery

Representation Surgery for Multi-Task Model Merging. ICML, 2024.

Language:PythonLicense:MITStargazers:23Issues:3Issues:2