Enming Yuan's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:165043Issues:1561Issues:2405

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:107462Issues:1392Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:62509Issues:262Issues:1530

ChatGPT

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Language:RustLicense:AGPL-3.0Stargazers:51780Issues:435Issues:1022

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38394Issues:384Issues:1619

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35872Issues:349Issues:1729

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35084Issues:353Issues:305

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29189Issues:341Issues:267

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25181Issues:222Issues:452

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23526Issues:217Issues:3585

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12038Issues:135Issues:197

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11261Issues:167Issues:224

bob-plugin-openai-translator

基于 ChatGPT API 的文本翻译、文本润色、语法纠错 Bob 插件,让我们一起迎接不需要巴别塔的新时代!Licensed under CC BY-NC-SA 4.0

Language:JavaScriptLicense:NOASSERTIONStargazers:5490Issues:32Issues:94

voila

Voilà turns Jupyter notebooks into standalone web applications

Language:PythonLicense:NOASSERTIONStargazers:5333Issues:75Issues:728

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1898Issues:19Issues:77

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1358Issues:118Issues:15

basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

Language:PythonLicense:MITStargazers:1287Issues:22Issues:59

nanoT5

Fast & Simple repository for pre-training and fine-tuning T5-style models

Language:PythonLicense:Apache-2.0Stargazers:944Issues:17Issues:33

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Language:PythonLicense:MITStargazers:604Issues:11Issues:13

Reference-arithmetic-coding

Clear implementation of arithmetic coding for educational purposes in Java, Python, C++.

CoLT5-attention

Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch

Language:PythonLicense:MITStargazers:218Issues:8Issues:7

simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT

Language:PythonLicense:MITStargazers:199Issues:10Issues:3

degen

Official Repository for "The Curious Case of Neural Text Degeneration"

Language:HTMLLicense:GPL-3.0Stargazers:154Issues:5Issues:2

hpman

A hyperparameter manager for deep learning experiments.

Language:PythonLicense:MITStargazers:94Issues:8Issues:10
Language:ShellLicense:Apache-2.0Stargazers:62Issues:5Issues:1

ssd-lm

Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control

Language:C++License:GPL-3.0Stargazers:14Issues:2Issues:0