Ming Zhu's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:92456Issues:679Issues:7589

cs-video-courses

List of Computer Science courses with video lectures.

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36949Issues:429Issues:1642

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36481Issues:349Issues:1763

OpenHands

🙌 OpenHands: Code Less, Make More

Language:PythonLicense:MITStargazers:31376Issues:286Issues:1351

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29352Issues:339Issues:267

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18544Issues:153Issues:469

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14697Issues:263Issues:206

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:10810Issues:137Issues:162

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9522Issues:123Issues:136

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7670Issues:143Issues:46

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:6547Issues:68Issues:158

Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonLicense:MITStargazers:5200Issues:67Issues:200

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4445Issues:49Issues:289

codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

Language:PythonLicense:MITStargazers:3759Issues:38Issues:110

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2571Issues:36Issues:100

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2210Issues:32Issues:87

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:966Issues:12Issues:30

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

pyan

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.

Language:PythonLicense:GPL-2.0Stargazers:626Issues:16Issues:0
Language:PythonLicense:Apache-2.0Stargazers:240Issues:9Issues:4
Language:PythonLicense:Apache-2.0Stargazers:131Issues:6Issues:11

ToolVerifier

This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.

License:CC0-1.0Stargazers:17Issues:3Issues:0
Language:RStargazers:14Issues:1Issues:0