Bo Zheng's repositories
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
focal_calibration
Code for the paper "Calibrating Deep Neural Networks using Focal Loss"
mlx-examples
Examples in the MLX framework
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
unilm
UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities
xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.