Beast code in Giters

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:Python5651 57 277

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonMIT5200 67 200

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT4445 49 289

codeinterpreter-api

👾 Open source implementation of the ChatGPT Code Interpreter

Language:PythonMIT3759 38 110

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！

Language:Jupyter NotebookApache-2.02571 36 100

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.02210 32 87

the-pile

Language:PythonMIT1470 31 100

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonApache-2.0966 12 30

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language:JavaScript777 5 14

pyan

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which functions are called and how often; there are various tools that will generate a call graph in that way, usually using debugger or profiling trace hooks - for example: https://pycallgraph.readthedocs.org/ This code was originally written by Edmund Horner, and then modified by Juha Jeronen. See README for the original blog posts and links to their repositories.

Language:PythonGPL-2.0626 160

mingzhu0527

Ming Zhu's starred repositories

langchain

cs-video-courses

Open-Assistant

FastChat

OpenHands

stanford_alpaca

babyagi

alpaca-lora

generative_agents

evals

dolly

mistral-inference

WizardLM

PaLM-rlhf-pytorch

DeepSeek-Coder

Firefly

TaskWeaver

trlx

codeinterpreter-api

Alpaca-CoT

Medusa

the-pile

SPIN

gpu_poor

pyan

ml_timeline

xLAM

TACO

ToolVerifier

DtACI