Zhiyu Chen's starred repositories

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36800Issues:425Issues:1643

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:33881Issues:354Issues:298

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:32693Issues:234Issues:4222

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

qlib

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.

Language:PythonLicense:MITStargazers:14441Issues:288Issues:883

the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language:PythonLicense:AGPL-3.0Stargazers:9940Issues:100Issues:35

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Language:PythonLicense:Apache-2.0Stargazers:9331Issues:91Issues:115

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8087Issues:73Issues:390

SharpWxDump

ๅพฎไฟกๅฎขๆˆท็ซฏๅ–่ฏ๏ผŒๅฏ่Žทๅ–็”จๆˆทไธชไบบไฟกๆฏ(ๆ˜ต็งฐ/่ดฆๅท/ๆ‰‹ๆœบ/้‚ฎ็ฎฑ/ๆ•ฐๆฎๅบ“ๅฏ†้’ฅ(็”จๆฅ่งฃๅฏ†่Šๅคฉ่ฎฐๅฝ•))๏ผ›ๆ”ฏๆŒ่Žทๅ–ๅคš็”จๆˆทไฟกๆฏ๏ผŒไธๅฎšๆœŸๆ›ดๆ–ฐๆ–ฐ็‰ˆๆœฌๅ็งป๏ผŒ็›ฎๅ‰ๆ”ฏๆŒๆ‰€ๆœ‰ๆ–ฐ็‰ˆๆœฌใ€ๆญฃๅผ็‰ˆๆœฌ

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3284Issues:57Issues:94

scikit-llm

Seamlessly integrate LLMs into scikit-learn.

Language:PythonLicense:MITStargazers:2936Issues:39Issues:51

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2120Issues:26Issues:54

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2100Issues:33Issues:100

transformers_tasks

โญ๏ธ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Language:Jupyter NotebookStargazers:2024Issues:16Issues:86

Data-Copilot

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Language:PythonLicense:MITStargazers:1311Issues:11Issues:45

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language:PythonLicense:NOASSERTIONStargazers:992Issues:20Issues:36

LLM-Adapters

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:986Issues:12Issues:56

nanoT5

Fast & Simple repository for pre-training and fine-tuning T5-style models

Language:PythonLicense:Apache-2.0Stargazers:932Issues:16Issues:31

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonLicense:Apache-2.0Stargazers:733Issues:8Issues:41

OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

Language:PythonLicense:Apache-2.0Stargazers:513Issues:7Issues:15

inseq

Interpretability for sequence generation models ๐Ÿ› ๐Ÿ”

Language:PythonLicense:Apache-2.0Stargazers:325Issues:10Issues:79

HugNLP

HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!๐Ÿ˜Š HugNLP will released to @HugAILab

Language:PythonStargazers:250Issues:8Issues:0

causalai

Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:236Issues:7Issues:11

pyrelational

pyrelational is a python active learning library for rapidly implementing active learning pipelines from data management, model development (and Bayesian approximation), to creating novel active learning strategies.

Language:PythonLicense:Apache-2.0Stargazers:150Issues:10Issues:15

tasksource

Datasets collection and preprocessings framework for NLP extreme multitask learning

Language:PythonLicense:Apache-2.0Stargazers:132Issues:4Issues:8

RLTF

Accepted by Transactions on Machine Learning Research (TMLR)

Language:PythonLicense:BSD-3-ClauseStargazers:110Issues:2Issues:5

ir_measures

provides a common interface to many IR measure tools

Language:PythonLicense:Apache-2.0Stargazers:68Issues:4Issues:35

hf-spacerini

Plug-and-play Search Interfaces with Pyserini and Hugging Face

Language:PythonLicense:Apache-2.0Stargazers:31Issues:3Issues:7

typo

A python package to simulate typographical errors.

Language:PythonLicense:MITStargazers:30Issues:2Issues:2
Language:Jupyter NotebookStargazers:25Issues:0Issues:0