Linpeng Tang (chtlp)

Company: Moqi

Location: Beijing

Linpeng Tang's starred repositories

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1730 | Issues: 0

LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 440 | Issues: 0

NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Language: Python | License: NOASSERTION | Stargazers: 3757 | Issues: 0

DB-GPT

AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents

Language: Python | License: MIT | Stargazers: 12480 | Issues: 0

langfuse

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

Language: TypeScript | License: NOASSERTION | Stargazers: 4701 | Issues: 0

DataX

DataX is the open-source version of Alibaba Cloud DataWorks Data Integration.

Language: Java | License: NOASSERTION | Stargazers: 15495 | Issues: 0

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Language: Python | License: Apache-2.0 | Stargazers: 2817 | Issues: 0

bisheng

Bisheng is an open LLM DevOps platform for next-generation AI applications.

Language: Python | License: Apache-2.0 | Stargazers: 8143 | Issues: 0

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language: Python | License: MIT | Stargazers: 9586 | Issues: 0

setfit

Efficient few-shot learning with Sentence Transformers

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2081 | Issues: 0

guardrails

Adding guardrails to large language models.

Language: Python | License: Apache-2.0 | Stargazers: 3628 | Issues: 0

fastText

Library for fast text representation and classification.

Language: HTML | License: MIT | Stargazers: 25735 | Issues: 0

WikiChat

WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.

Language: Python | License: Apache-2.0 | Stargazers: 904 | Issues: 0

papermage

Library supporting NLP and CV research on scientific papers

Language: Python | License: Apache-2.0 | Stargazers: 641 | Issues: 0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6364 | Issues: 0

splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language: Python | License: NOASSERTION | Stargazers: 703 | Issues: 0

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language: Python | License: Apache-2.0 | Stargazers: 1132 | Issues: 0

quokka

Making data lakes work for time series

Language: Python | License: Apache-2.0 | Stargazers: 1102 | Issues: 0

ray-llm

RayLLM - LLMs on Ray

Language: Python | License: Apache-2.0 | Stargazers: 1197 | Issues: 0

rivet

The open-source visual AI programming environment and TypeScript library

Language: TypeScript | License: MIT | Stargazers: 2570 | Issues: 0

openllmetry

Open-source observability for your LLM application, based on OpenTelemetry

Language: Python | License: Apache-2.0 | Stargazers: 1525 | Issues: 0

pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language: HTML | License: NOASSERTION | Stargazers: 3546 | Issues: 0

UniIR

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)

Language: Python | License: MIT | Stargazers: 80 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5365 | Issues: 0

instill-core

🔮 Instill Core is a full-stack AI infrastructure tool for data, model, and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications.

Language: Makefile | License: NOASSERTION | Stargazers: 2000 | Issues: 0

GPTs

Leaked prompts of GPTs

Stargazers: 27757 | Issues: 0

phoenix

AI Observability & Evaluation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3024 | Issues: 0

dingo

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

Language: Java | License: Apache-2.0 | Stargazers: 906 | Issues: 0

xyflow

React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely customizable.

Language: TypeScript | License: MIT | Stargazers: 22555 | Issues: 0