ldwang's repositories
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
alignment-handbook
Robust recipes for to align language models with human and AI preferences
DeepSpeedExamples
Example models using DeepSpeed
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
chemcrow-public
Chemcrow
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
GraphGPT
"GraphGPT: Graph Instruction Tuning for Large Language Models"
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain
langflow
⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
leetcode
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
llm_finetuning
Large language Model fintuning bloom , opt , gpt, gpt2 ,llama,llama-2,cpmant and so on
LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
MS-AMP
Microsoft Automatic Mixed Precision Library
NeMo
NeMo: a toolkit for conversational AI
OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Qwen-Agent
Agent framework and applications built upon Qwen, featuring Code Interpreter and Chrome browser extension.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
trl
Train transformer language models with reinforcement learning.
Yi
A series of large language models trained from scratch by developers @01-ai