Beast code in Giters

zzxxxl's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | 92529 680 7604

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | 46087 341 3790

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook | License: Apache-2.0 | 37324 389 67

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | 35546 246 5116

Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built on Langchain with language models such as ChatGLM, Qwen, and Llama

Language: TypeScript | License: Apache-2.0 | 31293 282 3814

LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

Language: C++ | License: MIT | 23454 171 835

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | 17650 109 1151

ceph

Ceph is a distributed object, block, and file storage platform

Language: C++ | License: NOASSERTION | 13915 6570

wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Language: Python | License: MIT | 8910 59 3311

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language: Python | License: MIT | 8376 67 176

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | 4525 50 294

trader

Trading module

Language: Python | License: Apache-2.0 | 3513 36 6

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language: Python | License: MIT | 2708 45 51

lectures

Material for cuda-mode lectures

Language: Jupyter Notebook | License: Apache-2.0 | 2434 34 7

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language: Jupyter Notebook | License: Apache-2.0 | 1524 29 174

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | 1195 40 11

CUDA-Learn-Notes

🎉 CUDA Learn Notes with PyTorch: fp32, fp16/bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot prod, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda | License: GPL-3.0 | 1169 12 5

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | 1066 26 195

gpt-assistant-android

A free ChatGPT API voice assistant for Android, activated via the volume keys for voice interaction, with support for web access, Vision photo recognition, and question templates.

Language: Java | License: GPL-3.0 | 640 10 47

flash-attention-minimal

DB-GPT

DistServe

astra-sim

vidur

refusal_direction

LLM-Extrapolation

lottery-ticket-adaptation

Evaluation-Multimodal-LLMs-Survey

RepresentationSurgery

Proteus