chenlinks's starred repositories

nebula

A distributed, fast open-source graph database featuring horizontal scalability and high availability

Language: C++ | License: Apache-2.0 | Stargazers: 10678 | Issues: 0

WanJuan1.0

WanJuan 1.0 multimodal corpus

License: CC-BY-4.0 | Stargazers: 535 | Issues: 0

langgraph

Build resilient language agents as graphs.

Language: Python | License: MIT | Stargazers: 5875 | Issues: 0

langgraph-studio

Desktop app for prototyping and debugging LangGraph applications locally.

Stargazers: 1595 | Issues: 0

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language: Python | License: MIT | Stargazers: 3589 | Issues: 0

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language: Python | License: BSD-3-Clause | Stargazers: 662 | Issues: 0

chat_zhenhuan

ChatGLM fine-tuned on a Zhen Huan (甄嬛) corpus

Language: Python | Stargazers: 82 | Issues: 0

xtuner

An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3788 | Issues: 0

LabelLLM

The Open-Source Data Annotation Platform

Language: TypeScript | License: Apache-2.0 | Stargazers: 511 | Issues: 0

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language: Python | License: GPL-3.0 | Stargazers: 5094 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python | License: AGPL-3.0 | Stargazers: 9436 | Issues: 0

labelU

Data annotation toolbox that supports image, audio, and video data.

Language: Python | Stargazers: 795 | Issues: 0

EmoLLM

Mental health large language model (LLM): The Big Model of Mental Health, Finetune, InternLM2, InternLM2.5, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3, GLM4, Qwen2, LLama3.1

Language: Python | License: MIT | Stargazers: 773 | Issues: 0

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Language: Python | License: AGPL-3.0 | Stargazers: 11689 | Issues: 0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language: Python | License: AGPL-3.0 | Stargazers: 4931 | Issues: 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2873 | Issues: 0

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language: Python | License: BSD-3-Clause | Stargazers: 1442 | Issues: 0

InternEvo

InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

Language: Python | License: Apache-2.0 | Stargazers: 285 | Issues: 0

InternLM

Official release of InternLM2.5 base and chat models, with 1M context support.

Language: Python | License: Apache-2.0 | Stargazers: 6284 | Issues: 0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 3827 | Issues: 0

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language: Python | License: Apache-2.0 | Stargazers: 2529 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 4304 | Issues: 0

maestro

Maestro: Netflix’s Workflow Orchestrator

Language: Java | License: Apache-2.0 | Stargazers: 3256 | Issues: 0

jieba

Jieba Chinese word segmentation

Language: Python | License: MIT | Stargazers: 33154 | Issues: 0

Tutorial

LLM & VLM Tutorial

Language: Python | Stargazers: 1341 | Issues: 0

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language: Python | License: Apache-2.0 | Stargazers: 474 | Issues: 0

logback-elasticsearch-appender

Logback Elasticsearch Appender

Language: Java | License: NOASSERTION | Stargazers: 233 | Issues: 0

OpenSearch

🔎 Open source distributed and RESTful search engine.

Language: Java | License: Apache-2.0 | Stargazers: 9594 | Issues: 0

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language: Java | License: MIT | Stargazers: 43022 | Issues: 0

mica

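Core toolkit for Spring Cloud microservice development. Utility classes, CAPTCHA, HTTP, Redis, ip2region, XSS, and more, ready to use out of the box. 🔝 🔝 Remember to star the repository (top right) to follow updates!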

Language: Java | License: LGPL-3.0 | Stargazers: 2109 | Issues: 0