macroustc

macroustc's repositories

faceswap

Deepfakes Software For All

License: GPL-3.0 | Stargazers: 0 | Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

License: MIT | Stargazers: 0 | Issues: 0

natural-speech-pytorch

Implementation of the neural network proposed in Natural Speech, a text-to-speech generator from Microsoft Research that is, for the first time, indistinguishable from human recordings

License: MIT | Stargazers: 0 | Issues: 0

PaddleNLP

Easy-to-use and powerful NLP library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including end-to-end systems for Neural Search, Question Answering, Information Extraction and Sentiment Analysis.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

ERNIE

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Stargazers: 0 | Issues: 0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Stargazers: 0 | Issues: 0

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

License: MIT | Stargazers: 0 | Issues: 0

ymir

YMIR, a streamlined model development product.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

License: MIT | Stargazers: 0 | Issues: 0

FACIAL

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

UTMOS22

UT-Sarulab MOS prediction system using SSL models

License: MIT | Stargazers: 0 | Issues: 0

FlatTN

Chinese Text Normalization and Dataset

Stargazers: 0 | Issues: 0

Muskits

An open-source music processing toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

License: MIT | Stargazers: 0 | Issues: 0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

License: MIT | Stargazers: 0 | Issues: 0

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

License: NOASSERTION | Stargazers: 0 | Issues: 0

DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

License: MPL-2.0 | Stargazers: 0 | Issues: 0

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

License: MIT | Stargazers: 0 | Issues: 0

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

License: MIT | Stargazers: 0 | Issues: 0

wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0

recasepunc

Model for recasing and repunctuating ASR transcripts

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for Hugging Face transformer models 🚀

License: Apache-2.0 | Stargazers: 0 | Issues: 0

speech-synthesis-paper

List of speech synthesis papers.

License: MIT | Stargazers: 0 | Issues: 0