Zhiqing Sun's starred repositories
llama-recipes
Scripts for fine-tuning Llama 2 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization & question answering, and a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Llama 2 for WhatsApp & Messenger.
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
LLMTest_NeedleInAHaystack
Simple retrieval tests against LLMs at various context lengths to measure accuracy
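The underlying test protocol is simple enough to sketch. Below is a minimal, hypothetical version in Python: `query_model` stands in for whatever LLM API is under test, and the needle, question, and depth grid are illustrative, not the repository's actual configuration.

```python
# Minimal needle-in-a-haystack sketch. `query_model` is a hypothetical
# stand-in for the LLM under test; the needle, question, and grading
# rule are illustrative, not the repository's actual settings.

NEEDLE = "The best thing to do in San Francisco is eat a sandwich in Dolores Park."
QUESTION = "What is the best thing to do in San Francisco?"

def insert_needle(haystack: str, depth: float) -> str:
    """Place the needle at a relative depth (0.0 = start, 1.0 = end)."""
    cut = int(len(haystack) * depth)
    return haystack[:cut] + " " + NEEDLE + " " + haystack[cut:]

def run_test(haystack: str, query_model, depths=(0.0, 0.25, 0.5, 0.75, 1.0)):
    """For each depth, bury the needle in the context and check recall."""
    results = {}
    for depth in depths:
        context = insert_needle(haystack, depth)
        answer = query_model(context + "\n\n" + QUESTION)
        # Crude grading: did the answer recover the needle's key phrase?
        results[depth] = "Dolores Park" in answer
    return results
```

Sweeping this over haystacks of increasing length yields the accuracy-vs-context-length grid the repository reports.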
hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
gpt_paper_assistant
GPT-4-based personalized arXiv paper assistant bot
weatherbench2
A benchmark for the next generation of data-driven global weather models.
UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
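For intuition, the generic stick-breaking construction behind those attention heads can be sketched as follows. This is a simplified, hypothetical rendering of the standard stick-breaking weighting, not ModuleFormer's implementation: each item claims a sigmoid-sized fraction of the probability mass left over by the items before it, so weights sum to at most 1 without a softmax.

```python
import torch

def stick_breaking_weights(logits: torch.Tensor) -> torch.Tensor:
    """Sketch of stick-breaking weights along the last dimension.

    Simplified illustration of the general construction; names and
    details are assumptions, not ModuleFormer's actual code.
    """
    betas = torch.sigmoid(logits)  # fraction of remaining mass each item claims
    # Mass remaining before each item: prod of (1 - beta) over predecessors,
    # computed in log space for numerical stability.
    log_remainder = torch.cumsum(torch.log1p(-betas), dim=-1)
    log_remainder = torch.roll(log_remainder, shifts=1, dims=-1)
    log_remainder[..., 0] = 0.0  # nothing precedes the first item
    return betas * log_remainder.exp()
```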
Copyisallyouneed
[ICLR 2023] Codebase for the Copy-Generator model, including an implementation of kNN-LM
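Since the repo bundles a kNN-LM implementation, here is a generic sketch of the kNN-LM interpolation (Khandelwal et al., 2020) it refers to: the LM's next-token distribution is mixed with a distribution built from the nearest neighbors of the current context in a datastore. The brute-force search and parameter names below are illustrative, not the repository's code.

```python
import numpy as np

def knn_lm_next_token(p_lm: np.ndarray, context_key: np.ndarray,
                      datastore_keys: np.ndarray, datastore_values: np.ndarray,
                      k: int = 8, lam: float = 0.25) -> np.ndarray:
    """Sketch of kNN-LM: p(y|x) = lam * p_kNN(y|x) + (1 - lam) * p_LM(y|x).

    p_kNN softmaxes negative distances to the k nearest datastore keys and
    places that mass on each neighbor's recorded next token. Brute-force
    search here; a real datastore would use FAISS or similar.
    """
    # Nearest neighbors of the current context representation.
    dists = np.linalg.norm(datastore_keys - context_key, axis=1)
    nn = np.argsort(dists)[:k]
    # Softmax over negative distances.
    weights = np.exp(-dists[nn])
    weights /= weights.sum()
    # Scatter neighbor mass onto their recorded next tokens.
    p_knn = np.zeros_like(p_lm)
    np.add.at(p_knn, datastore_values[nn], weights)
    return lam * p_knn + (1.0 - lam) * p_lm
```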
triton-autodiff
An experiment in using Tangent to autodiff Triton
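Tangent is Google's source-to-source autodiff library for plain Python. A minimal example of what it does on ordinary Python code is below; applying this machinery to Triton kernels is the experiment the repository explores, and this snippet does not touch Triton itself.

```python
import tangent

def f(x):
    # A plain-Python function Tangent can transform source-to-source.
    return x * x + 3.0 * x

# tangent.grad generates a new Python function computing df/dx.
df = tangent.grad(f)
print(df(2.0))  # d/dx (x^2 + 3x) at x=2 -> 7.0
```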
Accelerating-Diffusion-based-Combinatorial-Optimization-Solvers-by-Progressive-Distillation
Code for Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation