wutaiqiang

https://wutaiqiang.github.io/

wutaiqiang's starred repositories

LLM-Barber

Code for the paper "LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models".

Language: Python | License: MIT | 400

MAZero

Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.

Language: Python | License: GPL-3.0 | 800

MoSLoRA

Language: Python | 2700

Reliable-LLM

Language: JavaScript | 1500

EMO

[ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)

Language: Python | 11000

Awesome-LoRAs

1500

Awesome-LoRA

Awesome Low-Rank Adaptation

1500

ShiArthur03

Language: MATLAB | License: GPL-3.0 | 1036000

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | 115700

Taiwan-LLM

Traditional Mandarin LLMs for Taiwan

Language: Python | License: Apache-2.0 | 120400

awesome-implicit-representations

A curated list of resources on implicit neural representations.

License: MIT | 242700

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Language: Python | License: MIT | 6600

DSKD

Repo for Paper "Dual-Space Knowledge Distillation for Large Language Models".

Language: Python | 2400

QDMN

[ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation

Language: Python | 14100

ChartMimic

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Language: Python | 7600

FLoRA

Language: Python | 2300

butterfly-oft

Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"

7200

sea-llm

Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"

Language: Python | License: MIT | 1000

redunet_demo

Language: Jupyter Notebook | 8000

MuGSI

MuGSI: Distilling GNNs with Multi-Granularity Structural Information for Graph Classification

Language: Python | 300

qaap

[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions

Language: Python | 900

SCMoE

Language: Python | 600

ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Language: Python | License: MIT | 92200

iLLaMA

Adapting LLaMA Decoder to Vision Transformer

Language: Python | 2500

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 690900

MiniGPT4o

Language: Python | 500

MedDr

A generalist foundation model for healthcare capable of handling diverse medical data modalities.

Language: Python | License: MIT | 3800

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License: MIT | 328500

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

License: MIT | 73500