Linpeng Tang (chtlp)

Company: Moqi

Location: Beijing

Linpeng Tang's starred repositories

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app with one click.

Language: TypeScript | License: MIT | Stargazers: 72241 | Issues: 0

kubeasz

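Installs K8S clusters using Ansible scripts and explains how the components interact. Simple and direct, and unaffected by network conditions in mainland China.

Language: Jinja | Stargazers: 10205 | Issues: 0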

visualblocks

Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-and-drop ML components, including models, user inputs, processors, and visualizations.

Language: Python | License: Apache-2.0 | Stargazers: 1109 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 22 | Issues: 0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 25045 | Issues: 0

Auto-PPT

Automatically generates pptx files using gpt-3.5. Free to use online.

Language: Python | License: MIT | Stargazers: 446 | Issues: 0

altinity-dashboard

Altinity Dashboard helps you manage ClickHouse installations controlled by clickhouse-operator.

Language: TypeScript | License: Apache-2.0 | Stargazers: 60 | Issues: 0

myscale-telemetry

Open-source observability for your LLM application.

Language: Python | License: MIT | Stargazers: 36 | Issues: 0
Language: Go | License: Apache-2.0 | Stargazers: 83 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 24 | Issues: 0

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 13464 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 25676 | Issues: 0

torchtitan

A native PyTorch Library for large model training

Language: Python | License: BSD-3-Clause | Stargazers: 1256 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3066 | Issues: 0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python | License: MIT | Stargazers: 17768 | Issues: 0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python | License: MIT | Stargazers: 11798 | Issues: 0

Fengshenbang-LM

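Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language research center of the IDEA research institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.

Language: Python | License: Apache-2.0 | Stargazers: 3947 | Issues: 0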

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

License: Apache-2.0 | Stargazers: 688 | Issues: 0

MyScaleDB

An open-source, high-performance SQL vector database built on ClickHouse.

Language: C++ | License: Apache-2.0 | Stargazers: 742 | Issues: 0

MNBVC

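MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, built to match the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) internet writing. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.

License: MIT | Stargazers: 3163 | Issues: 0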

tantivy_warc_indexer

builds a tantivy index from common crawl warc.wet files

Language: Rust | Stargazers: 9 | Issues: 0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python | License: Apache-2.0 | Stargazers: 4427 | Issues: 0

alexandria

Full text search engine powering Alexandria.org - the open search engine.

Language: C++ | License: NOASSERTION | Stargazers: 186 | Issues: 0

s3proxy

Access other storage backends via the S3 API

Language: Java | License: Apache-2.0 | Stargazers: 1645 | Issues: 0

PyPaperBot

PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, and SciHub.

Language: Python | License: MIT | Stargazers: 323 | Issues: 0

QAnything

Question and Answer based on Anything.

Language: Python | License: Apache-2.0 | Stargazers: 10401 | Issues: 0

kor

LLM(😽)

Language: Python | License: MIT | Stargazers: 1549 | Issues: 0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 27794 | Issues: 0

open-box

Generalized and Efficient Blackbox Optimization System

Language: Python | License: NOASSERTION | Stargazers: 351 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1668 | Issues: 0