Divebomb's starred repositories
Knowledge-Distillation-Zoo
Pytorch implementation of various Knowledge Distillation (KD) methods.
Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
line_profiler
Line-by-line profiling for Python
Megatron-LM
Ongoing research training transformer models at scale
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
KG-MM-Survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models