Sining Sun (snsun)

snsun

Geek Repo

Company:duxiaoman

Location:Beijing

Github PK Tool:Github PK Tool

Sining Sun's repositories

a-PyTorch-Tutorial-to-Object-Detection

SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

License:Apache-2.0Stargazers:2Issues:0Issues:0
Language:CStargazers:1Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:1Issues:0Issues:0
Stargazers:1Issues:0Issues:0

megatts2

Unoffical implementation of Megatts2

License:MITStargazers:1Issues:0Issues:0

av-se

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Stargazers:0Issues:0Issues:0

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Stargazers:0Issues:1Issues:0

Chinese-FastSpeech2

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

Language:PythonStargazers:0Issues:0Issues:0

chinese-xinhua

:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

facestar

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

lingvo

Lingvo

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

lip-reading-deeplearning

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Stargazers:0Issues:0Issues:0

NKF-AEC

Acoustic Echo Cancellation with Nerual Kalman Filtering

Language:HTMLStargazers:0Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

pytorch-cpp

C++ Implementation of PyTorch Tutorials for Everyone

Language:C++License:MITStargazers:0Issues:0Issues:0

Pytorch_Retinaface

Retinaface get 80.99% in widerface hard val using mobilenet0.25.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

yolov5-face

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0