lwzbuaa's repositories
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
faceswap
Deepfakes Software For All
Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
ssterm
A simple console-based serial port terminal, written in Python.
Axiom
An FFmpeg GUI for Windows
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
rhubarb-lip-sync
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project that requires animating mouths based on existing recordings.
WebTemplateStudio
Microsoft Web Template Studio quickly builds web applications using a wizard-based UI to turn your needs into a foundation of best patterns and practices
libreoffice
LibreOffice - powerful office suite
DialoGPT
Large-scale pretraining for dialogue
DeepSpeech
A PaddlePaddle implementation of ASR.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Unpaired-Portrait-Drawing
Code for Unpaired Portrait Drawing Generation via Asymmetric Cycle Mapping (CVPR 2020)
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
defish
Algorithmic correction of fisheye lens distortion
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
libde265
Open h.265 video codec implementation.
3D-BoNet
🔥3D-BoNet in Tensorflow (NeurIPS 2019, Spotlight)
WeChat-MiniProgram-AR-3D
A WeChat MiniProgram 3D that includes a Panorama Viewer and a 3D Viewer using the device orientation control.
Knover
Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
MobileNet-Yolo
MobileNetV2-YoloV3-Nano: 0.5BFlops 3MB HUAWEI P40: 6ms/img, YoloFace-500k:0.1Bflops 420KB:fire::fire::fire:
detr
End-to-End Object Detection with Transformers
iSeeBetter
iSeeBetter: Spatio-Temporal Video Super Resolution using Recurrent-Generative Back-Projection Networks | Python3 | PyTorch | GANs | CNNs | ResNets | RNNs | Published in Springer Journal of Computational Visual Media, September 2020, Tsinghua University Press
YOLObile
This is the implementation of YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
APDrawingGAN
Code for APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs (CVPR 2019 Oral)