Dominic789654

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python · MIT · 1708 17 79

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language: Python · Apache-2.0 · 1681 24 38

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language: Python · MIT · 539 32 109

RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Language: Python · Apache-2.0 · 495 9 46

PyramidKV

The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Language: Jupyter Notebook · MIT · 472 19 15

aideml

AIDE: the Machine Learning CodeGen Agent

Language: Python · MIT · 304 18 7

Lamini-Memory-Tuning

Banishing LLM Hallucinations Requires Rethinking Generalization

249 8 2

LLM-Merging

LLM-Merging: Building LLMs Efficiently through Merging

Language: Python · 149 17 13

label-words-are-anchors

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Language: Python · MIT · 142 2 28

AutoSurvey

Language: Python · 12507

VisualSketchpad

Code for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Language: Jupyter Notebook · Apache-2.0 · 89 7 2

Awesome-TimeSeries-LLM-FM

A collection of resources on LLMs for time-series tasks

81 20

MMLU-Pro

The scripts for MMLU-Pro

Language: Python · Apache-2.0 · 71 1 8

batch-prompting

[EMNLP 2023 Industry Track] A simple prompting approach that enables LLMs to run inference in batches.

Language: Python · 64 7 1

Pruner-Zero

Pruner-Zero: Evolving Symbolic Pruning Metric from Scratch for LLMs

Language: Python · MIT · 61 4 6

BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Language: Python · MIT · 59 3 7

ExCP

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Language: Python · Apache-2.0 · 35 3 5

chunk-attention

Language: Python · MIT · 27 7 1

OwLore

Official PyTorch implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, and Shiwei Liu

Language: Python · 15 2 2

GreenTrainer

Code for the paper "Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation" (ICLR'24)

Language: Python · MIT · 900

RLHFlow.github.io

Webpage for RLHFlow

Language: HTML · 7 10

llama-python-streamingllm

Language: Python · GPL-3.0 · 300