Tomas Lyu (ltlvtao)

Location: Shenzhen, Guangdong, China

Tomas Lyu's starred repositories

Flowise

Drag & drop UI to build your customized LLM flow

Language: TypeScript | License: Apache-2.0 | Stargazers: 29253 | Issues: 0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 43675 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 15950 | Issues: 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 25287 | Issues: 0

llms-from-scratch-cn

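Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work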

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 948 | Issues: 0

HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

Language: Python | License: MIT | Stargazers: 1193 | Issues: 0

buffer-of-thought-llm

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Language: Python | Stargazers: 463 | Issues: 0

mem0

The memory layer for Personalized AI

Language: Python | License: Apache-2.0 | Stargazers: 20525 | Issues: 0

cake

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Language: Rust | License: NOASSERTION | Stargazers: 2413 | Issues: 0

awesome-LLM-resourses

🧑‍🚀 The world's best collection of Chinese-language LLM resources

Stargazers: 704 | Issues: 0

llama-models

Utilities intended for use with Llama models.

Language: Python | License: NOASSERTION | Stargazers: 3634 | Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | Stargazers: 11461 | Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | Stargazers: 5747 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8666 | Issues: 0

bloop

bloop is a fast code search engine written in Rust.

Language: Rust | License: Apache-2.0 | Stargazers: 9383 | Issues: 0

tinysearch

🔍 Tiny, full-text search engine for static websites built with Rust and Wasm

Language: Rust | License: Apache-2.0 | Stargazers: 2703 | Issues: 0

cozo

A transactional, relational-graph-vector database that uses Datalog for query. The hippocampus for AI!

Language: Rust | License: MPL-2.0 | Stargazers: 3322 | Issues: 0

fastembed-rs

Rust library for generating vector embeddings and reranking

Language: Rust | License: Apache-2.0 | Stargazers: 237 | Issues: 0

rag-api-server

A RAG API server written in Rust following OpenAI specs

Language: Rust | License: Apache-2.0 | Stargazers: 20 | Issues: 0

trieve

All-in-one infrastructure for search, recommendations, RAG, and analytics offered via API

Language: Rust | License: NOASSERTION | Stargazers: 1302 | Issues: 0

aichat

All-in-one AI CLI tool featuring Chat-REPL, Shell Assistant, RAG, AI tools & agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.

Language: Rust | License: Apache-2.0 | Stargazers: 3687 | Issues: 0

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python | License: GPL-3.0 | Stargazers: 4826 | Issues: 0

Streamer-Sales

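Streamer-Sales (销冠, "Sales Champion"): a livestream sales host LLM 🛒🎁 that presents products based on their given features, framed to spark the user's desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️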

Language: Python | License: Apache-2.0 | Stargazers: 2222 | Issues: 0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 16392 | Issues: 0

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language: Rust | License: Apache-2.0 | Stargazers: 2503 | Issues: 0

llm-chain

`llm-chain` is a powerful Rust crate for building chains in large language models, allowing you to summarise text and complete complex tasks

Language: Rust | License: MIT | Stargazers: 1296 | Issues: 0

mistral.rs

Blazingly fast LLM inference.

Language: Rust | License: MIT | Stargazers: 3287 | Issues: 0

llm

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Language: Rust | License: Apache-2.0 | Stargazers: 6059 | Issues: 0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 8047 | Issues: 0

MemGPT

Create LLM agents with long-term memory and custom tools 📚🦙

Language: Python | License: Apache-2.0 | Stargazers: 11283 | Issues: 0