Beast code in Giters

Yixin's starred repositories

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

MIT | 93700

A prototype repo for hybrid training that combines pipeline parallelism and distributed data parallelism, with comments on the core code snippets. Feel free to copy the code and open discussions about any problems you have encountered.

Language: Python | 4400

GEAR

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Language: Python | MIT | 12000

Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Language: Cuda | 10400

DejaVu

Language: Python | 25400

ZO-LLM

[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".

Language: Python | GPL-3.0 | 5400

sparse_gpu_operator

GPU operators for sparse tensor operations

Language: Python | 2600

FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

Language: Python | Apache-2.0 | 18800

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Language: Python | MIT | 24400

awesome-representation-engineering

A resource repository for representation engineering in large language models

Apache-2.0 | 2100

open-box

Generalized and Efficient Blackbox Optimization System

Language: Python | NOASSERTION | 36200

OpenBA-v2

OpenBA-V2: a 3B LLM (large language model) with a T5 architecture, built by pruning OpenBA-15B and continuing pretraining.

Language: Python | Apache-2.0 | 2100

Awesome-LLM-System-Papers

44700

TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Language: Python | 14700

Pruning-LLMs

The framework to prune LLMs to any size and any config.

Language: Python | Apache-2.0 | 9300

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | Apache-2.0 | 129700

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | Apache-2.0 | 2844100

mintaka

Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)

Language: Python | CC-BY-4.0 | 10000

KG_RAG

Empower large language models (LLMs) using knowledge-graph-based retrieval-augmented generation (KG-RAG) for knowledge-intensive tasks

Language: Jupyter Notebook | Apache-2.0 | 55300

Awesome-Knowledge-Graph-Reasoning

AKGR: Awesome Knowledge Graph Reasoning is a collection of knowledge graph reasoning works, including papers, code, and datasets

103700

DetIE

The code for the paper 'DetIE: Multilingual Open Information Extraction Inspired by Object Detection' by Vasilkovsky et al.

Language: Python | 2000

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | MIT | 3419100

rpca

Python implementation of robust principal component analysis

Language: Python | MIT | 1500

godec

Python implementation of the GoDec algorithm from Zhou and Tao (ICML 2011) for low-rank and sparse representation

Language: Python | MIT | 3200

lion-pytorch

🦁 Lion, a new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in PyTorch

Language: Python | MIT | 198100

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | MIT | 1931100

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

105100

Conference-Accepted-Paper-List

Accepted paper lists for several conferences (including AI, ML, and robotics)

MIT | 90900

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language: Python | Apache-2.0 | 76200

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language: Jupyter Notebook | MIT | 246500

Dereck0602