Đỗ Trí Nhân's repositories
MusicVoiceConversion
Sing any popular song with your voice
SED_SoftLabel
Sound Event Classification With Soft Label
MSPSleepStageClassification
MultiScale MultiPeriod Sleep Stage Classification
SoundEventClassification
Sound Event Classification - Experiment for DCase Challenge
Webdev-Checkinchall
Tourism web
AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Awesome-AI
A curated list of awesome things related to artificial intelligence tools around the world wide web
LeetcodeAlgorithms
Leetcode solutions
malaya-speech
Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
python-design-patterns
A collection of design patterns/idioms in Python
SCCup2022-Synthetic-Speech-Attribution
The IEEE Signal Processing Society’s 2022 Signal Processing Cup (SP Cup) will be a synthetic speech attribution challenge. Teams will be requested to design and develop a system for synthetic speech attribution. This means, given an audio recording representing a synthetically generated speech track, to detect which method among a list of candidate ones has been used to synthesize the speech. The detector must rely on the analysis of the speech signal through signal processing and machine learning techniques.
Torch-Modules-Compilation
A compilation of implementations of various ML papers, especially in computer vision.
tts-generation-webui
TTS Generation Web UI (Bark, MusicGen, Tortoise)
TTS-Objective-Metrics
Objective metrics used in several text-to-speech (TTS) papers.
v-nhandt21.github.io
Do Tri Nhan 's Portfolio
VoiceAnonymous
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software