Vatary (wiplug)

wiplug

Geek Repo

Location:BeiJing.China

Github PK Tool:Github PK Tool


Organizations
AvatarWorld

Vatary's starred repositories

scrcpy

Display and control your Android device

Language:CLicense:Apache-2.0Stargazers:103932Issues:1221Issues:4463

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34192Issues:305Issues:873

Depix

Recovers passwords from pixelized screenshots

Language:PythonLicense:NOASSERTIONStargazers:25271Issues:399Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10376Issues:195Issues:2101

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

computervision-recipes

Best Practices, code samples, and documentation for Computer Vision.

Language:Jupyter NotebookLicense:MITStargazers:9315Issues:285Issues:259

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7236Issues:117Issues:1449

real-url

获取斗鱼&虎牙&哔哩哔哩&抖音&快手等 58 个直播平台的真实流媒体地址(直播源)和弹幕,直播源可在 PotPlayer、flv.js 等播放器中播放。

Language:PythonLicense:GPL-2.0Stargazers:7142Issues:100Issues:416

BackgroundMattingV2

Real-Time High-Resolution Background Matting

Language:PythonLicense:MITStargazers:6700Issues:150Issues:194

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language:PythonLicense:MITStargazers:4134Issues:76Issues:318

White-box-Cartoonization

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:3791Issues:90Issues:980

MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Language:PythonLicense:Apache-2.0Stargazers:3635Issues:103Issues:203

AdelaiDepth

This repo contains the projects: 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They aim to solve the monocular depth estimation, 3D scene reconstruction from single image problems.

Language:PythonLicense:CC0-1.0Stargazers:1036Issues:36Issues:76

ubisoft-laforge-animation-dataset

Ubisoft La Forge - Animation Dataset

Language:PythonLicense:NOASSERTIONStargazers:947Issues:30Issues:13

hifi3dface

Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".

Language:PythonLicense:NOASSERTIONStargazers:738Issues:37Issues:51

camera_calibration

Accurate geometric camera calibration with generic camera models

Language:C++License:BSD-3-ClauseStargazers:665Issues:28Issues:65

CIPS-3D

3D-aware GANs based on NeRF (arXiv).

Language:PythonLicense:MITStargazers:604Issues:29Issues:39

img2pose

The official PyTorch implementation of img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation - CVPR 2021

Language:PythonLicense:NOASSERTIONStargazers:577Issues:22Issues:78

ov2slam

OV²SLAM is a Fully Online and Versatile Visual SLAM for Real-Time Applications

Language:C++License:GPL-3.0Stargazers:569Issues:20Issues:65

openchat

OpenChat: Easy to use opensource chatting framework via neural networks

Language:PythonLicense:Apache-2.0Stargazers:438Issues:16Issues:25

muspy

A toolkit for symbolic music generation

Language:PythonLicense:MITStargazers:415Issues:6Issues:54

randomCNN-voice-transfer

Audio style transfer with shallow random parameters CNN.

DCPose

This is an official implementation of our CVPR 2021 paper "Deep Dual Consecutive Network for Human Pose Estimation" (https://openaccess.thecvf.com/content/CVPR2021/papers/Liu_Deep_Dual_Consecutive_Network_for_Human_Pose_Estimation_CVPR_2021_paper.pdf)

DST

Deformable Style Transfer (ECCV 2020)

Language:Jupyter NotebookStargazers:261Issues:10Issues:7

Neural-Style-Transfer-Audio

This is PyTorch Implementation of Neural Style Transfer Algorithm which is modified for Audios.

Language:PythonLicense:MITStargazers:77Issues:6Issues:0

feel-the-music

Code for our ICCC'20 paper - "Feel The Music: Automatically Generating A Dance For An Input Song"

Language:PythonStargazers:76Issues:4Issues:0

vrtist

Virtual Reality tool for storytelling

Language:C#License:NOASSERTIONStargazers:59Issues:10Issues:1
Language:PythonLicense:NOASSERTIONStargazers:33Issues:4Issues:6