Dalu Feng (Fengdalu)

Fengdalu

Geek Repo

Github PK Tool:Github PK Tool


Organizations
VIPL-Audio-Visual-Speech-Understanding

Dalu Feng's repositories

FacePose_pytorch

🔥🔥The pytorch implement of the head pose estimation(yaw,roll,pitch) and emotion detection with SOTA performance in real time.Easy to deploy, easy to use, and high accuracy.Solve all problems of face detection at one time.(极简,极快,高效是我们的宗旨)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'20 Lipreading using Temporal Convolutional Networks

Language:PythonLicense:NOASSERTIONStargazers:1Issues:1Issues:0

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

awesome-audio-visualization

A curated list about Audio Visualization.

Language:ShellStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

chinese_text_normalization

Chinese text normalization for speech processing

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

fairscale

PyTorch extensions for high performance and large scale training.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gpu-burn

Multi-GPU CUDA stress test

Language:C++License:BSD-2-ClauseStargazers:0Issues:1Issues:0

lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LRW_ID

The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Padding" (ECCV 2022)

Stargazers:0Issues:0Issues:0

mdistiller

The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679

Language:PythonStargazers:0Issues:0Issues:0
Language:CLicense:MITStargazers:0Issues:1Issues:0

nvjpeg-python

nvjpeg for python

Language:CLicense:MITStargazers:0Issues:1Issues:0

RGB_HSV_HSL

a pure pytorch implementation of color space conversion, including rgb2hsl, rgb2hsv, hsv2rgb, hsl2rgb

Language:PythonStargazers:0Issues:1Issues:0

SCPapers

Must-read Papers on Sememe Computation

License:MITStargazers:0Issues:1Issues:0

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:PythonStargazers:0Issues:1Issues:0

stanfordacm

Stanford ACM-ICPC related materials

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

stargan

StarGAN - Official PyTorch Implementation (CVPR 2018)

License:MITStargazers:0Issues:0Issues:0

torchnvjpeg

Decode JPEG image on GPU using PyTorch

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Wave-U-Net-for-Speech-Enhancement

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0