xiaohuaibaoguigui

xiaohuaibaoguigui's starred repositories

DragonianVoice

A C++ inference library for multiple SVC/TTS models

Language: C | License: AGPL-3.0 | Stargazers: 970 | Issues: 0

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | Stargazers: 2578 | Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4342 | Issues: 0

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | Stargazers: 116 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 11377 | Issues: 0

Wav2Lip-GFPGAN

High quality Lip sync

Language: Python | Stargazers: 974 | Issues: 0

speaker-verification

Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN

Language: Python | Stargazers: 85 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19305 | Issues: 0

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Language: Python | License: Apache-2.0 | Stargazers: 7800 | Issues: 0

Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

Language: Python | License: CC-BY-4.0 | Stargazers: 914 | Issues: 0

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.

Language: Python | Stargazers: 9920 | Issues: 0

wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.

Language: Python | Stargazers: 534 | Issues: 0

Language: C | License: Apache-2.0 | Stargazers: 248 | Issues: 0

AudioDVP

AudioDVP: Photorealistic Audio-driven Video Portraits

Language: Python | Stargazers: 295 | Issues: 0

mmrotate

OpenMMLab Rotated Object Detection Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 1812 | Issues: 0

imagefusion-rfn-nest

RFN-Nest (Information Fusion, 2021, Highly Cited Paper) - PyTorch =1.5, Python=3.7

Language: Python | Stargazers: 108 | Issues: 0

YOLOv5_NCNN

🍅 Deploy ncnn on mobile phones. Supports Android and iOS.

Language: C++ | License: GPL-3.0 | Stargazers: 1439 | Issues: 0

ncnn-android-yolov7

Android live demo of YOLOv7 inference using ncnn

Language: C++ | License: GPL-3.0 | Stargazers: 122 | Issues: 0

PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

Language: Python | License: Apache-2.0 | Stargazers: 8473 | Issues: 0

Remote-Sensing-RVSA

The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"

Language: Python | License: MIT | Stargazers: 399 | Issues: 0

MobileStyleGAN.pytorch

An official implementation of MobileStyleGAN in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 663 | Issues: 0

stylegan2

StyleGAN2 - Official TensorFlow Implementation with practical improvements

Language: Python | License: NOASSERTION | Stargazers: 354 | Issues: 0

pixel2style2pixel

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework

Language: Jupyter Notebook | License: MIT | Stargazers: 3164 | Issues: 0

pixel2style2pixel-mobilenetv3

Re-implementation of pSp that uses mobilenet-v3 and stylegan2-256p

Language: Python | License: MIT | Stargazers: 18 | Issues: 0

Alienify

Fine-tune StyleGAN2 on your custom data!

Language: Jupyter Notebook | Stargazers: 7 | Issues: 0

Pytorch-Multi-Task-Multi-class-Classification

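Aims to build a general solution for classification problems in the PyTorch framework, handling single-task multi-class and multi-task multi-class classification problems in batch.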

Language: Python | Stargazers: 54 | Issues: 0

deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Language: Python | License: MIT | Stargazers: 897 | Issues: 0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models, support ONNXRuntime, MNN, TNN, NCNN and TensorRT.

Language: C++ | License: GPL-3.0 | Stargazers: 3554 | Issues: 0

commonvoice-voiceclassifier

Get gender & age from voice

Language: Python | License: MIT | Stargazers: 3 | Issues: 0

SpeakerProfiling

Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

Language: Python | License: MIT | Stargazers: 57 | Issues: 0