wiplug

Vatary's starred repositories

scrcpy

Display and control your Android device

Language: C | Apache-2.0 | 103932 1221 4463

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Language: Python | NOASSERTION | 34192 305 873

Depix

Recovers passwords from pixelized screenshots

Language: Python | NOASSERTION | 25271 3990

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | Apache-2.0 | 10376 195 2101

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.

Language: Python | 9546 162 634

computervision-recipes

Best practices, code samples, and documentation for computer vision.

Language: Jupyter Notebook | MIT | 9315 285 259

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language: Jupyter Notebook | Apache-2.0 | 7236 117 1449

real-url

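Retrieves the real streaming media addresses (live sources) and danmaku (bullet comments) of 58 live-streaming platforms, including Douyu, Huya, Bilibili, Douyin, and Kuaishou. The live sources can be played in players such as PotPlayer and flv.js.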

Language: Python | GPL-2.0 | 7142 100 416

BackgroundMattingV2

Real-Time High-Resolution Background Matting

Language: Python | MIT | 6700 150 194

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language: Python | MIT | 4134 76 318

White-box-Cartoonization

Official TensorFlow implementation of the CVPR 2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Language: Python | 3920 76 103

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | Apache-2.0 | 3791 90 980

MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Language: Python | Apache-2.0 | 3635 103 203

AdelaiDepth

This repo contains the projects 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They address monocular depth estimation and 3D scene reconstruction from a single image.

Language: Python | CC0-1.0 | 1036 36 76

ubisoft-laforge-animation-dataset

Ubisoft La Forge - Animation Dataset

Language: Python | NOASSERTION | 947 30 13

hifi3dface

Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".

Language: Python | NOASSERTION | 738 37 51

camera_calibration

Accurate geometric camera calibration with generic camera models

Language: C++ | BSD-3-Clause | 665 28 65

CIPS-3D

3D-aware GANs based on NeRF (arXiv).

Language: Python | MIT | 604 29 39

img2pose

The official PyTorch implementation of img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation - CVPR 2021

Language: Python | NOASSERTION | 577 22 78

ov2slam

OV²SLAM is a fully online and versatile visual SLAM for real-time applications

Language: C++ | GPL-3.0 | 569 20 65

openchat

OpenChat: an easy-to-use open-source chat framework based on neural networks

Language: Python | Apache-2.0 | 438 16 25

muspy

A toolkit for symbolic music generation

Language: Python | MIT | 415 6 54

randomCNN-voice-transfer

Audio style transfer with a shallow CNN that uses random parameters.

Language: Python | 397 22 22

This is an official implementation of our CVPR 2021 paper "Deep Dual Consecutive Network for Human Pose Estimation" (https://openaccess.thecvf.com/content/CVPR2021/papers/Liu_Deep_Dual_Consecutive_Network_for_Human_Pose_Estimation_CVPR_2021_paper.pdf)

Language: Python | 363 10 48