There are 3 repositories under the efficient-model topic.
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Efficient 3D Backbone Network for Temporal Modeling
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
[KDD'22] Learned Token Pruning for Transformers
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)
Any-Precision Deep Neural Networks (AAAI 2021)
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]
[ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation
Concise, Modular, Human-friendly PyTorch implementation of EfficientNet with Pre-trained Weights.
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
Efficient Class Incremental Learning for Object Detection
NeurIPS CD 2019, MicroNet Challenge hosted by Google and DeepMind researchers: "Efficient Model for Image Classification With Regularization Tricks".
Official code base for ‘Lite-Mind: Towards Efficient and Robust Brain Representation Learning’
Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
Extremely light-weight MixNet with Top-1 75.7% and 2.5M params
On Efficient Variants of Segment Anything Model: A Survey
Implementation of efficient backbones for computer vision tasks.
Exploring Variational Deep Q Networks. A study undertaken for the University of Cambridge's R244 Computer Science Masters Course. Inspired by https://arxiv.org/abs/1711.11225/.
Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.
Code for the NAACL paper "When Quantization Affects Confidence of Large Language Models?"
Tagsy, your friendly Discord bot, designed to enhance server interaction with its intuitive tagging system