MIT HAN Lab's repositories
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
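A minimal sketch of the attention-sink idea behind this repo (my paraphrase, not the library's actual API): the KV cache keeps a few initial "sink" tokens plus a sliding window of the most recent tokens, so memory stays bounded during arbitrarily long streaming. `n_sink` and `window` are illustrative parameters.

```python
# Hypothetical KV-cache eviction policy with attention sinks:
# retain the first n_sink tokens and the last `window` tokens,
# evicting everything in between.
import torch

def evict_kv_cache(keys: torch.Tensor, values: torch.Tensor,
                   n_sink: int = 4, window: int = 1020):
    """keys/values: [batch, heads, seq_len, head_dim] (illustrative layout)."""
    seq_len = keys.size(2)
    if seq_len <= n_sink + window:
        return keys, values  # cache still fits; nothing to evict
    keys = torch.cat([keys[:, :, :n_sink], keys[:, :, -window:]], dim=2)
    values = torch.cat([values[:, :, :n_sink], values[:, :, -window:]], dim=2)
    return keys, values
```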
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
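The shift operation itself is simple enough to sketch. This follows the paper's default of shifting 1/8 of the channels toward each temporal neighbor, with zero padding at the clip boundary; the [N, T, C, H, W] layout is assumed for clarity.

```python
# Sketch of the temporal shift: part of the channels carry features
# from the next/previous frame, giving temporal modeling at zero FLOPs.
import torch

def temporal_shift(x: torch.Tensor, fold_div: int = 8) -> torch.Tensor:
    """x: [N, T, C, H, W]."""
    n, t, c, h, w = x.shape
    fold = c // fold_div
    out = torch.zeros_like(x)
    out[:, :-1, :fold] = x[:, 1:, :fold]                   # shift toward the past
    out[:, 1:, fold:2 * fold] = x[:, :-1, fold:2 * fold]   # shift toward the future
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]              # remaining channels unchanged
    return out
```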
once-for-all
[ICLR 2020] Once-for-All: Train One Network and Specialize It for Efficient Deployment
proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
data-efficient-gans
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
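A sketch of where differentiable augmentation sits in the GAN training loop: the same transform T is applied to both real and generated images, so the discriminator never sees un-augmented samples, and because T is differentiable, gradients still flow back to the generator. The single brightness op and hinge losses below are illustrative; the actual repo composes color, translation, and cutout augmentations.

```python
import torch

def diff_augment(x: torch.Tensor) -> torch.Tensor:
    # Random brightness: differentiable w.r.t. x, randomized per image.
    return x + (torch.rand(x.size(0), 1, 1, 1, device=x.device) - 0.5)

def d_hinge_loss(D, G, real: torch.Tensor, z: torch.Tensor) -> torch.Tensor:
    fake = G(z).detach()
    return (torch.relu(1.0 - D(diff_augment(real))).mean()
            + torch.relu(1.0 + D(diff_augment(fake))).mean())

def g_hinge_loss(D, G, z: torch.Tensor) -> torch.Tensor:
    # No detach here: gradients pass through diff_augment into G.
    return -D(diff_augment(G(z))).mean()
```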
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution dense prediction.
torchquantum
A PyTorch-based framework for quantum-classical simulation, quantum machine learning, quantum neural networks, and parameterized quantum circuits, with support for easy deployment on real quantum computers.
gan-compression
[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
torchsparse
TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
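SmoothQuant's core trick is a per-input-channel rescaling, s_j = max|X_j|^α / max|W_j|^(1−α), that migrates quantization difficulty from activations to weights while leaving the layer's output mathematically unchanged. A hedged sketch (tensor shapes and names are illustrative, not the repo's API):

```python
import torch

def smooth(activations: torch.Tensor, weight: torch.Tensor, alpha: float = 0.5):
    """activations: [tokens, in_features]; weight: [out_features, in_features]."""
    act_scales = activations.abs().amax(dim=0)           # per input channel
    w_scales = weight.abs().amax(dim=0)
    s = (act_scales.pow(alpha) / w_scales.pow(1 - alpha)).clamp(min=1e-5)
    # X @ W.T == (X / s) @ (W * s).T, so the output is unchanged, but the
    # scaled activations have a flatter range and quantize more accurately.
    return activations / s, weight * s
```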
anycost-gan
[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing
tinyengine
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
fastcomposer
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
tiny-training
[NeurIPS 2022] On-Device Training Under 256KB Memory
distrifuser
[CVPR 2024] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
offsite-tuning
Offsite-Tuning: Transfer Learning without Full Model
flatformer
[CVPR 2023] FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
patch_conv
Patch convolution to avoid the large GPU memory usage of Conv2D
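A rough sketch of the idea, assuming stride 1, dilation 1, groups=1, and an odd square kernel (the height-wise split and names are illustrative, not the repo's API): the input is padded once, then convolved one horizontal slab at a time so that only a small slice of intermediate activations is live at any moment, producing output identical to a single Conv2d call.

```python
import torch
import torch.nn.functional as F

def patch_conv2d(x: torch.Tensor, conv: torch.nn.Conv2d, n_patches: int = 4):
    k = conv.kernel_size[0]
    p = conv.padding[0]
    x = F.pad(x, (p, p, p, p))            # pad once, then convolve with padding=0
    out_h = x.size(2) - k + 1
    chunk = (out_h + n_patches - 1) // n_patches
    outs = []
    for r0 in range(0, out_h, chunk):
        r1 = min(r0 + chunk, out_h)
        rows = x[:, :, r0:r1 + k - 1]     # input rows needed for output rows r0..r1
        outs.append(F.conv2d(rows, conv.weight, conv.bias))
    return torch.cat(outs, dim=2)
```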
spatten-llm
[HPCA 2021] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
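A loose sketch of cascade token pruning, one of SpAtten's two pruning mechanisms: each token accumulates the attention it receives across heads and layers, and the lowest-scoring tokens are dropped for all subsequent layers. The keep ratio and tensor shapes below are illustrative assumptions.

```python
import torch

def prune_tokens(hidden: torch.Tensor, attn_probs: torch.Tensor,
                 cum_scores: torch.Tensor, keep_ratio: float = 0.75):
    """hidden: [tokens, dim]; attn_probs: [heads, q_tokens, k_tokens]."""
    # Accumulate the attention each token receives (summed over heads and queries).
    cum_scores = cum_scores + attn_probs.sum(dim=(0, 1))
    k = max(1, int(keep_ratio * hidden.size(0)))
    keep = torch.topk(cum_scores, k).indices.sort().values  # preserve token order
    return hidden[keep], cum_scores[keep]
```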