hdchao's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 89677 | Issues: 677 | Issues: 7249

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35875 | Issues: 349 | Issues: 1729

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33995 | Issues: 341 | Issues: 2656

JARVIS

JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | Stargazers: 23401 | Issues: 386 | Issues: 177

style2paints

sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)

Language: JavaScript | License: Apache-2.0 | Stargazers: 17941 | Issues: 558 | Issues: 211

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 17898 | Issues: 168 | Issues: 1214

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 15537 | Issues: 650 | Issues: 849

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stargazers: 9897 | Issues: 66 | Issues: 104

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python | License: MIT | Stargazers: 7652 | Issues: 143 | Issues: 46

DeepLearning

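Python for "Deep Learning": mathematical derivations, analysis of the underlying principles, and source-level code implementations for the book Deep Learning (known in Chinese as the "Flower Book")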

Language: Python | License: MIT | Stargazers: 6240 | Issues: 190 | Issues: 6

AlgoXY

Book of Elementary Functional Algorithms and Data structures

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5882 | Issues: 76 | Issues: 529

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Language: Python | License: Apache-2.0 | Stargazers: 2858 | Issues: 35 | Issues: 37

RecLearn

Recommender Learning with Tensorflow2.x

Language: Python | License: MIT | Stargazers: 1841 | Issues: 35 | Issues: 82

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language: Python | License: MIT | Stargazers: 1180 | Issues: 25 | Issues: 15

llm_agents

Build agents which are controlled by LLMs

Language: Python | License: MIT | Stargazers: 910 | Issues: 10 | Issues: 3

LaMini-LM

LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions

openrl

Unified Reinforcement Learning Framework

Language: Python | License: Apache-2.0 | Stargazers: 604 | Issues: 7 | Issues: 57

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language: Python | License: Apache-2.0 | Stargazers: 532 | Issues: 13 | Issues: 20

NBCE

Naive Bayes-based Context Extension

Gym-Trading-Env

A simple, easy, customizable Gymnasium environment for trading.

Language: Python | License: MIT | Stargazers: 278 | Issues: 14 | Issues: 16

Essential-Math-For-AI

This repository contains the supplementary material associated with my book: Essential Math for AI published by O'Reilly Media

Language: Jupyter Notebook | Stargazers: 267 | Issues: 8 | Issues: 0

ContinualLM

An Extensible Continual Learning Framework Focused on Language Models (LMs)

awesome-distributed-ml

A curated list of awesome projects and papers for distributed training or inference

PromptPG

Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".

Language: Python | License: MIT | Stargazers: 136 | Issues: 4 | Issues: 4

Language: Python | License: Apache-2.0 | Stargazers: 93 | Issues: 5 | Issues: 9

CROLoss

Code for paper CROLoss: Towards a Customizable Loss for Retrieval Models in Recommender Systems

Language: Python | License: MIT | Stargazers: 8 | Issues: 2 | Issues: 0