Beast code in Giters

Tomas Lyu's starred repositories

Flowise

Drag & drop UI to build your customized LLM flow

Language: TypeScript, License: Apache-2.0, Stars: 29253

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript, License: NOASSERTION, Stars: 43675

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python, License: Apache-2.0, Stars: 15950

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook, License: NOASSERTION, Stars: 25287

llms-from-scratch-cn

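Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work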

Language: Jupyter Notebook, License: NOASSERTION, Stars: 948

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

Language: Python, License: MIT, Stars: 1193

buffer-of-thought-llm

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Language: Python, Stars: 463

mem0

The memory layer for Personalized AI

Language: Python, License: Apache-2.0, Stars: 20525

cake

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Language: Rust, License: NOASSERTION, Stars: 2413

awesome-LLM-resourses

🧑‍🚀 The world's best collection of Chinese-language LLM resources

Stars: 704

llama-models

Utilities intended for use with Llama models.

Language: Python, License: NOASSERTION, Stars: 3634

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook, Stars: 11461

FasterTransformer

Transformer-related optimizations, including BERT and GPT

Language: C++, License: Apache-2.0, Stars: 5747

text-generation-inference

Large Language Model Text Generation Inference

Language: Python, License: Apache-2.0, Stars: 8666

bloop

bloop is a fast code search engine written in Rust.

Language: Rust, License: Apache-2.0, Stars: 9383

tinysearch

🔍 Tiny, full-text search engine for static websites built with Rust and Wasm

Language: Rust, License: Apache-2.0, Stars: 2703

cozo

A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI!

Language: Rust, License: MPL-2.0, Stars: 3322

fastembed-rs

Library for generating vector embeddings, reranking in Rust

Language: Rust, License: Apache-2.0, Stars: 237

rag-api-server

A RAG API server written in Rust following OpenAI specs

Language: Rust, License: Apache-2.0, Stars: 20

trieve

All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API

Language: Rust, License: NOASSERTION, Stars: 1302

aichat

All-in-one AI CLI tool featuring Chat-REPL, Shell Assistant, RAG, AI tools & agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

Language: Rust, License: Apache-2.0, Stars: 3687

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python, License: GPL-3.0, Stars: 4826

Streamer-Sales

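Streamer-Sales (销冠, "top seller"): a sales-livestreamer LLM 🛒🎁 that presents products based on their given features, with the aim of stimulating viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that looks up real-time information on the web 🌐, and ASR (speech-to-text) 🎙️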

Language: Python, License: Apache-2.0, Stars: 2222

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python, License: MIT, Stars: 16392

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language: Rust, License: Apache-2.0, Stars: 2503

llm-chain

`llm-chain` is a powerful Rust crate for building chains with large language models, allowing you to summarise text and complete complex tasks

Language: Rust, License: MIT, Stars: 1296

mistral.rs

Blazingly fast LLM inference.

Language: Rust, License: MIT, Stars: 3287

llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Language: Rust, License: Apache-2.0, Stars: 6059

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++, License: Apache-2.0, Stars: 8047

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

Language: Python, License: Apache-2.0, Stars: 11283
