Yixin (Dereck0602)

Yixin's starred repositories

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

License: MIT | Stargazers: 937 | Issues: 0

llama-pipeline-parallel

A prototype repo for hybrid training with pipeline parallelism and distributed data parallelism, with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encountered.

Language: Python | Stargazers: 44 | Issues: 0

GEAR

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Language: Python | License: MIT | Stargazers: 120 | Issues: 0

Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Language: Cuda | Stargazers: 104 | Issues: 0
Language: Python | Stargazers: 254 | Issues: 0

ZO-LLM

[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".

Language: Python | License: GPL-3.0 | Stargazers: 54 | Issues: 0

sparse_gpu_operator

GPU operators for sparse tensor operations

Language: Python | Stargazers: 26 | Issues: 0

FlagGems

FlagGems is an operator library for large language models, implemented in the Triton language.

Language: Python | License: Apache-2.0 | Stargazers: 188 | Issues: 0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Language: Python | License: MIT | Stargazers: 244 | Issues: 0

awesome-representation-engineering

A resource repository for representation engineering in large language models

License: Apache-2.0 | Stargazers: 21 | Issues: 0

open-box

Generalized and Efficient Blackbox Optimization System

Language: Python | License: NOASSERTION | Stargazers: 362 | Issues: 0

OpenBA-v2

OpenBA-V2: a 3B LLM (Large Language Model) with the T5 architecture, built by pruning OpenBA-15B and continuing pretraining from it.

Language: Python | License: Apache-2.0 | Stargazers: 21 | Issues: 0

TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Language: Python | Stargazers: 147 | Issues: 0

Pruning-LLMs

A framework for pruning LLMs to any size and any config.

Language: Python | License: Apache-2.0 | Stargazers: 93 | Issues: 0

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1297 | Issues: 0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 28441 | Issues: 0

mintaka

Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)

Language: Python | License: CC-BY-4.0 | Stargazers: 100 | Issues: 0

KG_RAG

Empower Large Language Models (LLMs) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge-intensive tasks

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 553 | Issues: 0

Awesome-Knowledge-Graph-Reasoning

AKGR: Awesome Knowledge Graph Reasoning is a collection of knowledge graph reasoning works, including papers, code, and datasets

Stargazers: 1037 | Issues: 0

DetIE

The code for the paper 'DetIE: Multilingual Open Information Extraction Inspired by Object Detection' by Vasilkovsky et al.

Language: Python | Stargazers: 20 | Issues: 0

llama_index

LlamaIndex is a data framework for your LLM applications

Language: Python | License: MIT | Stargazers: 34191 | Issues: 0

rpca

Python implementation of robust principal component analysis

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

godec

Python implementation of the GoDec algorithm from Zhou and Tao (ICML 2011) for low-rank and sparse representation

Language: Python | License: MIT | Stargazers: 32 | Issues: 0

lion-pytorch

🦁 Lion, a new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in PyTorch

Language: Python | License: MIT | Stargazers: 1981 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19311 | Issues: 0

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

Stargazers: 1051 | Issues: 0

Conference-Accepted-Paper-List

Accepted paper lists for some conferences (including AI, ML, and robotics)

License: MIT | Stargazers: 909 | Issues: 0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language: Python | License: Apache-2.0 | Stargazers: 762 | Issues: 0

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language: Jupyter Notebook | License: MIT | Stargazers: 2465 | Issues: 0