Sining Sun's repositories
a-PyTorch-Tutorial-to-Object-Detection
SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
Chinese-FastSpeech2
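Continued training on the Biaobei (标贝) data, with improvements to the original FastSpeech2 model: a prosody representation and a prosody prediction module are introduced, making Chinese pronunciation more vivid and rhythmic.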
chinese-xinhua
📙 Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.
facestar
Facestar dataset. High quality audio-visual recordings of human conversational speech.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
lip-reading-deeplearning
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Lipreading_using_Temporal_Convolutional_Networks
ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
NKF-AEC
Acoustic Echo Cancellation with Neural Kalman Filtering
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch-cpp
C++ Implementation of PyTorch Tutorials for Everyone
Pytorch_Retinaface
RetinaFace reaches 80.99% on the WIDER FACE hard validation set using MobileNet0.25.
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
vector-quantize-pytorch
Vector Quantization, in Pytorch
yolov5-face
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/