CV Newbie's repositories
4D-Facial-Avatars
Dynamic Neural Radiance Fields for Monocular 4D Facial Avater Reconstruction
AAAI22-one-shot-talking-face
Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
awesome-computer-vision
A curated list of awesome computer vision resources
awesome-faceReenactment
papers about Face Reenactment/Talking Face Generation
awesome-gan-inversion
A collection of resources on GAN inversion.
Awesome-Image-Harmonization
A curated list of papers, code and resources pertaining to image harmonization.
awesome-NeRF
A curated list of awesome neural radiance fields papers
awesome-neural-rendering
A collection of resources on neural rendering.
curated-list-of-awesome-3D-Morphable-Model-software-and-data
The idea of this list is to collect shared data and algorithms around 3D Morphable Models. You are invited to contribute to this list by adding a pull request. The original list arised from the Dagstuhl seminar on 3D Morphable Models https://www.dagstuhl.de/19102 in March 2019.
Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
espnet
End-to-End Speech Processing Toolkit
FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
flame-fitting
Example code for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 3D keypoints and 3D scans.
insightface
State-of-the-art 2D and 3D Face Analysis Project
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOX, YOLOP, YOLOv5, YOLOR, NanoDet, YOLOX, SCRFD, YOLOX . MNN, NCNN, TNN, ONNXRuntime, CPU/GPU.
MIMDet
MIMDet: Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
neural-head-avatars
Official PyTorch implementation of "Neural Head Avatars from Monocular RGB Videos"
pytracking
Visual tracking library based on PyTorch.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
semantic-segmentation
SOTA Semantic Segmentation Models in PyTorch
stylegan2-ada-pytorch
StyleGAN2-ADA - Official PyTorch implementation
torch-ngp
A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)
yolov7
🔥🔥🔥🔥 YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥