Xiang LIU's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29758Issues:194Issues:4671

gpt-researcher

LLM based autonomous agent that does online comprehensive research on any given topic

Language:PythonLicense:Apache-2.0Stargazers:13762Issues:110Issues:323

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:11246Issues:97Issues:468

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonLicense:Apache-2.0Stargazers:6716Issues:71Issues:123

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language:Jupyter NotebookLicense:MITStargazers:4089Issues:128Issues:28

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Language:PythonLicense:Apache-2.0Stargazers:2633Issues:33Issues:33

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1708Issues:17Issues:79

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1681Issues:24Issues:38

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:539Issues:32Issues:109

RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Language:PythonLicense:Apache-2.0Stargazers:495Issues:9Issues:46

PyramidKV

The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Language:Jupyter NotebookLicense:MITStargazers:472Issues:19Issues:15

aideml

AIDE: the Machine Learning CodeGen Agent

Language:PythonLicense:MITStargazers:304Issues:18Issues:7

Lamini-Memory-Tuning

Banishing LLM Hallucinations Requires Rethinking Generalization

LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

label-words-are-anchors

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Language:PythonLicense:MITStargazers:142Issues:2Issues:28
Language:PythonStargazers:125Issues:0Issues:7

VisualSketchpad

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:89Issues:7Issues:2

Awesome-TimeSeries-LLM-FM

The collection of resources about LLM for Time series tasks

MMLU-Pro

The scripts for MMLU-Pro

Language:PythonLicense:Apache-2.0Stargazers:71Issues:1Issues:8

batch-prompting

[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.

Pruner-Zero

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs

Language:PythonLicense:MITStargazers:61Issues:4Issues:6

BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Language:PythonLicense:MITStargazers:59Issues:3Issues:7

ExCP

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Language:PythonLicense:Apache-2.0Stargazers:35Issues:3Issues:5

OwLore

Official Pytorch Implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei Liu

GreenTrainer

Code for paper "Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation" (ICLR'24)

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

RLHFlow.github.io

Webpage for RLHFlow

Language:HTMLStargazers:7Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:3Issues:0Issues:0