Beast code in Giters

Ming Zhu's starred repositories

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonApache-2.083500

ToolVerifier

This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.

CC0-1.01500

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonMIT481400

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language:JavaScript65600

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.

Language:PythonGPL-2.060700

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.0880300

TACO

Language:PythonApache-2.011900

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonMIT561200

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.0191900

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.01542200

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:Python897100

codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

Language:PythonMIT368400

babyagi

Language:PythonMIT1942000

Firefly

Firefly: 大模型训练工具，支持训练Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:Python487400

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION1410700

langchain

🦜🔗 Build context-aware reasoning applications

Language:PythonMIT8567200

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.03485400

ml_timeline

59500

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.01826400

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！

Language:Jupyter NotebookApache-2.0249700

mingzhu0527

Ming Zhu's starred repositories

SPIN

ToolVerifier

xLAM

TaskWeaver

gpu_poor

pyan

mistral-inference

TACO

DeepSeek-Coder

Medusa

generative_agents

WizardLM

codeinterpreter-api

babyagi

Firefly

evals

langchain

FastChat

ml_timeline

alpaca-lora

Alpaca-CoT

dolly

stanford_alpaca

trlx

Open-Assistant

cs-video-courses

the-pile

PaLM-rlhf-pytorch

DtACI

machine-learning-cheat-sheet