azuredsky's repositories
Background-Matting
Background Matting: The World is Your Green Screen
AA-TransUNet
The repository for paper AA-TransUNet: Attention Augmented TransUNet For Nowcasting Tasks.
Car_ReIdentification_application
A car re-identification app based on multi-feature fusion technique
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
chinese_speech_pretrain
chinese speech pretrained models
contextual_loss_pytorch
Contextual Loss (CX) and Contextual Bilateral Loss (CoBi).
DL-Demos
Demos for deep learning
face_landmark
A simple method for face alignment based on wingloss and mutitask learning :)
FBA-Matting
Official repository for the paper F, B, Alpha Matting
Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
FGT
[ECCV 2022] Flow-Guided Transformer for Video Inpainting
HR-VITON
Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022).
HRFAE
Official implementation for paper High Resolution Face Age Editing
indexnet_matting
Indices Matter: Learning to Index for Deep Image Matting
learnopencv
Learn OpenCV : C++ and Python Examples
NanoTrack
Deep learning-based mobile model deployment(Object Tracking). Lightweight Object Tracking, NCNN,
pfld_106_face_landmarks
106点人脸关键点检测的PFLD算法实现
PIPNet
Efficient facial landmark detector
SADMA
SADMA: SAtellite baseD MArine debris detection
talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
talking_face_preprocessing
Preprocessing Scipts for Talking Face Generation
temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
vit-vqvae
VQ-VAE implementation using Vision Transformers for both the encoder and decoder
Wav2Lip-GFPGAN
High quality Lip sync