Minsoo Kim's repositories
kd-qat-large-enc
[EMNLP 2022 main] Code for "Understanding and Improving Knowledge Distillation for Quantization-Aware-Training of Large Transformer Encoders"
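The paper studies knowledge distillation from a full-precision teacher during quantization-aware training. As a minimal sketch of the general idea (not the paper's exact recipe), a soft-label logit distillation loss looks like this; the temperature `T` and mixing weight `alpha` are illustrative hyperparameters:

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Soft-label distillation: KL(teacher || student) at temperature T,
    mixed with the ordinary cross-entropy on hard labels."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients match the hard-label term
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```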
mbv1_brevitas
A MobileNetV1 quantization-aware training framework built on Brevitas
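Brevitas provides drop-in quantized layers; the mechanism underneath is fake quantization with a straight-through estimator. A framework-agnostic sketch in plain PyTorch (not Brevitas's API):

```python
import torch

def fake_quant(w, num_bits=8):
    """Uniform symmetric fake quantization: quantize-dequantize in the
    forward pass, identity (straight-through) in the backward pass."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = (w.detach().abs().max() / qmax).clamp_min(1e-8)
    w_q = torch.clamp(torch.round(w / scale), -qmax, qmax) * scale
    return w + (w_q - w).detach()  # STE: forward w_q, backward grad w.r.t. w
```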
efficientdet-pytorch
A PyTorch implementation of EfficientDet faithful to the original Google implementation, with ported weights
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
llm-awq
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
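AWQ's core observation is that a small fraction of weight channels matter disproportionately, and activation magnitudes reveal which ones; scaling those channels up before quantization (and folding the inverse scale into the preceding op) protects them. A rough sketch of the per-channel scaling, with the exponent `alpha` standing in for the paper's grid-searched value:

```python
import torch

def awq_scale_weights(W, X, alpha=0.5, num_bits=4):
    """W: (out, in) weight; X: (tokens, in) calibration activations.
    Scale salient input channels by activation magnitude before quantizing."""
    s = X.abs().mean(dim=0).pow(alpha).clamp_min(1e-5)  # per-input-channel scale
    W_scaled = W * s  # broadcasts over output rows
    qmax = 2 ** (num_bits - 1) - 1
    step = W_scaled.abs().max() / qmax
    W_q = torch.clamp(torch.round(W_scaled / step), -qmax, qmax) * step
    return W_q / s  # at inference, equivalently fold 1/s into the previous op
```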
lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
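LSQ learns the quantization step size jointly with the weights; gradients reach the step size through the rounding via a straight-through estimator, with a gradient scale to keep its updates stable. A condensed sketch of the quantizer:

```python
import torch

def lsq_quantize(v, s, num_bits=8):
    """LSQ-style quantizer: s is a learnable step size (an nn.Parameter).
    The grad-scale trick keeps s's gradient comparable to weight gradients."""
    Qp = 2 ** (num_bits - 1) - 1
    Qn = -(2 ** (num_bits - 1))
    g = 1.0 / ((v.numel() * Qp) ** 0.5)
    s_scaled = s * g + (s - s * g).detach()  # forward value s, gradient scaled by g
    v_bar = torch.clamp(v / s_scaled, Qn, Qp)
    v_hat = v_bar + (torch.round(v_bar) - v_bar).detach()  # STE round
    return v_hat * s_scaled
```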
model-quantization
A collection of model quantization algorithms
Pretrained-Language-Model
Pretrained language models and related optimization techniques developed by Huawei Noah's Ark Lab.
TernGEMM
TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference
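With weights restricted to {-1, 0, +1}, a matrix multiply reduces to additions and subtractions: accumulate inputs where the weight is +1, subtract where it is -1. A sketch of the decomposition (the library itself uses bit-packed SIMD kernels; the two mask multiplies below stand in for those):

```python
import torch

def ternary_matmul(x, w_ternary):
    """x: (batch, in); w_ternary: (in, out) with entries in {-1, 0, +1}.
    Multiplication-free GEMM idea: add where w=+1, subtract where w=-1."""
    pos = (w_ternary == 1).to(x.dtype)
    neg = (w_ternary == -1).to(x.dtype)
    return x @ pos - x @ neg
```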
TSLD
[NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
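For generative LMs, logit distillation applies per token, and TSLD weights each token's distillation loss. A hedged sketch: the per-token weight below uses teacher confidence as an illustrative stand-in, not the paper's exact scaling:

```python
import torch
import torch.nn.functional as F

def token_scaled_kd(student_logits, teacher_logits, T=1.0):
    """student/teacher logits: (batch, seq, vocab). Per-token KL, weighted by
    a token-level scale; teacher max-prob here is a hypothetical proxy for
    the paper's token scaling."""
    t_prob = F.softmax(teacher_logits / T, dim=-1)
    s_logp = F.log_softmax(student_logits / T, dim=-1)
    kl = (t_prob * (t_prob.clamp_min(1e-9).log() - s_logp)).sum(-1)  # (batch, seq)
    w = t_prob.max(dim=-1).values  # hypothetical per-token confidence weight
    return (w * kl).mean() * (T * T)
```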
Yet-Another-EfficientDet-Pytorch
A PyTorch re-implementation of the official EfficientDet with real-time SOTA performance and pretrained weights.