XiaobingZhang's repositories
Resnext3d-for-video-classification
Using https://github.com/facebookresearch/ClassyVision to implement Resnext3d
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
HBONet
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
llama.cpp
LLM inference in C/C++
mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
opacus
Training PyTorch models with differential privacy
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
tensorrtllm_backend
The Triton TensorRT-LLM Backend
torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
tutorials
PyTorch tutorials.
vision
Datasets, Transforms and Models specific to Computer Vision
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs