xiaohuaibaoguigui

xiaohuaibaoguigui's starred repositories

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonMIT257800

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT434200

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

Language:PythonMIT11600

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION1137700

Wav2Lip-GFPGAN

High quality Lip sync

Language:Python97400

speaker-verification

Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN

Language:Python8500

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT1930500

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language:PythonApache-2.0780000

Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Language:PythonCC-BY-4.091400

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:Python992000

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Language:Python53400

librga

Language:CApache-2.024800

AudioDVP

AudioDVP:Photorealistic Audio-driven Video Portraits

Language:Python29500

mmrotate

OpenMMLab Rotated Object Detection Toolbox and Benchmark

Language:PythonApache-2.0181200

imagefusion-rfn-nest

RFN-Nest(Information Fusion, 2021, Highly Cited Paper) - PyTorch =1.5，Python=3.7

Language:Python10800

YOLOv5_NCNN

🍅 Deploy ncnn on mobile phones. Support Android and iOS. 移动端ncnn部署，支持Android与iOS。

Language:C++GPL-3.0143900

ncnn-android-yolov7

Android Live Demo inferenece of Yolov7 using ncnn

Language:C++GPL-3.012200

PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

Language:PythonApache-2.0847300

Remote-Sensing-RVSA

The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"

Language:PythonMIT39900

MobileStyleGAN.pytorch

An official implementation of MobileStyleGAN in PyTorch

Language:PythonApache-2.066300

stylegan2

StyleGAN2 - Official TensorFlow Implementation with practical improvements

Language:PythonNOASSERTION35400

pixel2style2pixel

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework

Language:Jupyter NotebookMIT316400