wenhuach's repositories
triton-inference-server
The Triton Inference Server provides a cloud inferencing solution optimized for NVIDIA GPUs.
ABINet
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
AVID-CMA
Audio Visual Instance Discrimination with Cross-Modal Agreement
Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
CPM
Introduction to CPM
DynaVSR
DynaVSR: Dynamic Adaptive Blind Video Super-Resolution
EssentialMC2
EssentialMC2 Video Understanding.
ffmpeg-libav-tutorial
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
HAWQ
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
mediapipe
MediaPipe is the simplest way for researchers and developers to build world-class ML solutions and applications for mobile, edge, cloud and the web.
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
nni
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
openvino_training_extensions
Trainable models and NN optimization tools
pensieve
Neural Adaptive Video Streaming with Pensieve (SIGCOMM '17)
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
smplify-x
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
smplpix
SMPLpix: Neural Avatars from 3D Human Models
Ultra-Light-Fast-Generic-Face-Detector-1MB
💎 1MB lightweight face detection model
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
youtube-dl
A fork of youtube-dl, for archival purposes.
YOWO
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization