wutaiqiang's starred repositories

LLM-Barber

Code for the paper "LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models".

Language: Python | License: MIT | Stargazers: 4 | Issues: 0

MAZero

Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.

Language: Python | License: GPL-3.0 | Stargazers: 8 | Issues: 0
Language: Python | Stargazers: 27 | Issues: 0
Language: JavaScript | Stargazers: 15 | Issues: 0

EMO

[ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)

Language: Python | Stargazers: 110 | Issues: 0

Awesome-LoRA

Awesome Low-Rank Adaptation

Stargazers: 15 | Issues: 0
Language: MATLAB | License: GPL-3.0 | Stargazers: 10360 | Issues: 0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1157 | Issues: 0

Taiwan-LLM

Traditional Mandarin LLMs for Taiwan

Language: Python | License: Apache-2.0 | Stargazers: 1204 | Issues: 0

awesome-implicit-representations

A curated list of resources on implicit neural representations.

License: MIT | Stargazers: 2427 | Issues: 0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers: 921 | Issues: 0

math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Language: Python | License: MIT | Stargazers: 66 | Issues: 0

DSKD

Repo for Paper "Dual-Space Knowledge Distillation for Large Language Models".

Language: Python | Stargazers: 24 | Issues: 0

QDMN

[ECCV 2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation

Language: Python | Stargazers: 141 | Issues: 0

ChartMimic

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Language: Python | Stargazers: 76 | Issues: 0
Language: Python | Stargazers: 23 | Issues: 0

butterfly-oft

Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"

Stargazers: 72 | Issues: 0

sea-llm

Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"

Language: Python | License: MIT | Stargazers: 10 | Issues: 0
Language: Jupyter Notebook | Stargazers: 80 | Issues: 0

MuGSI

MuGSI: Distilling GNNs with Multi-Granularity Structural Information for Graph Classification

Language: Python | Stargazers: 3 | Issues: 0

qaap

[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions

Language: Python | Stargazers: 9 | Issues: 0
Language: Python | Stargazers: 6 | Issues: 0

ToRA

[ICLR 2024] ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools.

Language: Python | License: MIT | Stargazers: 922 | Issues: 0

iLLaMA

Adapting LLaMA Decoder to Vision Transformer

Language: Python | Stargazers: 25 | Issues: 0

lora

Using low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6909 | Issues: 0
Language: Python | Stargazers: 5 | Issues: 0

MedDr

A generalist foundation model for healthcare capable of handling diverse medical data modalities.

Language: Python | License: MIT | Stargazers: 38 | Issues: 0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License: MIT | Stargazers: 3285 | Issues: 0

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

License: MIT | Stargazers: 735 | Issues: 0