Ming Zhu's starred repositories

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:835Issues:0Issues:0

ToolVerifier

This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.

License:CC0-1.0Stargazers:15Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:40Issues:0Issues:0

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonLicense:MITStargazers:4814Issues:0Issues:0

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language:JavaScriptStargazers:656Issues:0Issues:0

pyan

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.

Language:PythonLicense:GPL-2.0Stargazers:607Issues:0Issues:0

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8803Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:119Issues:0Issues:0

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:5612Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1919Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:15422Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:8971Issues:0Issues:0

codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

Language:PythonLicense:MITStargazers:3684Issues:0Issues:0
Language:PythonLicense:MITStargazers:19420Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:4874Issues:0Issues:0

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14107Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonLicense:MITStargazers:85672Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34854Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18264Issues:0Issues:0

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2497Issues:0Issues:0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:10792Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28954Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4348Issues:0Issues:0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36730Issues:0Issues:0

cs-video-courses

List of Computer Science courses with video lectures.

Stargazers:65231Issues:0Issues:0
Language:PythonLicense:MITStargazers:1412Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7612Issues:0Issues:0
Language:RStargazers:5Issues:0Issues:0

machine-learning-cheat-sheet

Classical equations and diagrams in machine learning

Language:TeXStargazers:6560Issues:0Issues:0