Linpeng Tang (chtlp)

chtlp

Geek Repo

Company:Moqi

Location:Beijing

Github PK Tool:Github PK Tool

Linpeng Tang's starred repositories

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:73708Issues:405Issues:2889

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:29239Issues:357Issues:1530

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28527Issues:187Issues:4489

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:18057Issues:208Issues:373

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:14027Issues:107Issues:316

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12222Issues:88Issues:343

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:10952Issues:98Issues:356

kubeasz

使用Ansible脚本安装K8S集群,介绍组件交互原理,方便直接,不受国内网络环境影响

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4476Issues:76Issues:88

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Language:PythonLicense:Apache-2.0Stargazers:3976Issues:56Issues:291

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3113Issues:26Issues:129

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1970Issues:17Issues:158

s3proxy

Access other storage backends via the S3 API

Language:JavaLicense:Apache-2.0Stargazers:1690Issues:20Issues:368

kor

LLM(😽)

Language:PythonLicense:MITStargazers:1582Issues:14Issues:78

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1430Issues:36Issues:117

visualblocks

Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-and-drop ML components, including models, user inputs, processors, and visualizations.

Language:PythonLicense:Apache-2.0Stargazers:1141Issues:21Issues:8

MyScaleDB

An open-source, high-performance SQL vector database built on ClickHouse.

Language:C++License:Apache-2.0Stargazers:784Issues:12Issues:13

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

Auto-PPT

Auto generate pptx using gpt-3.5, Free to use online / 通过gpt-3.5生成PPT,免费在线使用

Language:PythonLicense:MITStargazers:483Issues:4Issues:22

open-box

Generalized and Efficient Blackbox Optimization System

Language:PythonLicense:NOASSERTIONStargazers:363Issues:4Issues:64

PyPaperBot

PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, and SciHub.

Language:PythonLicense:MITStargazers:330Issues:7Issues:33

alexandria

Full text search engine powering Alexandria.org - the open search engine.

Language:C++License:NOASSERTIONStargazers:186Issues:3Issues:27
Language:GoLicense:Apache-2.0Stargazers:98Issues:1Issues:0

altinity-dashboard

Altinity Dashboard helps you manage ClickHouse installations controlled by clickhouse-operator.

Language:TypeScriptLicense:Apache-2.0Stargazers:61Issues:5Issues:36

myscale-telemetry

Open-source observability for your LLM application.

Language:PythonLicense:MITStargazers:37Issues:5Issues:3
Language:PythonLicense:Apache-2.0Stargazers:25Issues:4Issues:0

tantivy_warc_indexer

builds a tantivy index from common crawl warc.wet files

Language:RustStargazers:9Issues:1Issues:0