Fengdalu

Dalu Feng's repositories

FacePose_pytorch

🔥🔥The pytorch implement of the head pose estimation(yaw,roll,pitch) and emotion detection with SOTA performance in real time.Easy to deploy, easy to use, and high accuracy.Solve all problems of face detection at one time.(极简，极快，高效是我们的宗旨)

Language:PythonMIT100

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'20 Lipreading using Temporal Convolutional Networks

Language:PythonNOASSERTION1 10

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language:PythonApache-2.0000

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonNOASSERTION000

awesome-audio-visualization

A curated list about Audio Visualization.

Language:Shell010

Awesome-Video-Datasets

Video datasets

010

bark

🔊 Text-Prompted Generative Audio Model

Language:PythonNOASSERTION000

chinese_text_normalization

Chinese text normalization for speech processing

Language:PythonMIT010

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonGPL-3.0020

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonBSD-3-Clause010

fairscale

PyTorch extensions for high performance and large scale training.

Language:PythonNOASSERTION010

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.0000

gpu-burn

Multi-GPU CUDA stress test

Language:C++BSD-2-Clause010

lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Language:PythonApache-2.0000

LRW_ID

The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Padding" (ECCV 2022)

000