Ming Zhu's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonLicense:MITStargazers:85727Issues:656Issues:6792

cs-video-courses

List of Computer Science courses with video lectures.

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36733Issues:422Issues:1643

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34880Issues:347Issues:1678

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28958Issues:340Issues:266

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18269Issues:156Issues:467

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14111Issues:261Issues:199

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:10790Issues:135Issues:162

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8772Issues:116Issues:115

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7611Issues:142Issues:46

machine-learning-cheat-sheet

Classical equations and diagrams in machine learning

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:5614Issues:62Issues:140

Firefly

Firefly: 大模型训练工具,支持训练Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonLicense:MITStargazers:4813Issues:65Issues:161

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4349Issues:49Issues:283

codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

Language:PythonLicense:MITStargazers:3684Issues:36Issues:106

Play-Leetcode

My Solutions to Leetcode problems. All solutions support C++ language, some support Java and Python. Multiple solutions will be given by most problems. Enjoy:) 我的Leetcode解答。所有的问题都支持C++语言,一部分问题支持Java语言。近乎所有问题都会提供多个算法解决。大家加油!:)

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2500Issues:37Issues:97

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1920Issues:34Issues:74

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

pyan

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.

Language:PythonLicense:GPL-2.0Stargazers:607Issues:16Issues:0
Language:PythonLicense:Apache-2.0Stargazers:119Issues:6Issues:10
Language:PythonLicense:Apache-2.0Stargazers:40Issues:3Issues:1

ToolVerifier

This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.

License:CC0-1.0Stargazers:15Issues:0Issues:0
Language:RStargazers:5Issues:1Issues:0