Đỗ Trí Nhân's repositories

Viphoneme

Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA

Language:PythonLicense:NOASSERTIONStargazers:71Issues:3Issues:9

Vinorm

Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syllables

Language:MakefileLicense:NOASSERTIONStargazers:51Issues:3Issues:10

ViSV2TTS

Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS

Language:PythonLicense:NOASSERTIONStargazers:44Issues:3Issues:9

MusicVoiceConversion

Sing any popular song with your voice

ViMFA

Montreal Forced Aligner for Vietnamese

Language:PythonLicense:MITStargazers:8Issues:2Issues:2

SED_SoftLabel

Sound Event Classification With Soft Label

Language:Jupyter NotebookStargazers:2Issues:1Issues:0

GPT4VN

Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn

Language:PythonStargazers:1Issues:0Issues:0

MSPSleepStageClassification

MultiScale MultiPeriod Sleep Stage Classification

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

SoundEventClassification

Sound Event Classification - Experiment for DCase Challenge

Language:PythonStargazers:1Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:JavaScriptStargazers:1Issues:1Issues:0

AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Language:PythonStargazers:0Issues:0Issues:0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Awesome-AI

A curated list of awesome things related to artificial intelligence tools around the world wide web

Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

LeetcodeAlgorithms

Leetcode solutions

License:MITStargazers:0Issues:0Issues:0

malaya-speech

Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

python-design-patterns

A collection of design patterns/idioms in Python

Language:PythonStargazers:0Issues:0Issues:0

SCCup2022-Synthetic-Speech-Attribution

The IEEE Signal Processing Society’s 2022 Signal Processing Cup (SP Cup) will be a synthetic speech attribution challenge. Teams will be requested to design and develop a system for synthetic speech attribution. This means, given an audio recording representing a synthetically generated speech track, to detect which method among a list of candidate ones has been used to synthesize the speech. The detector must rely on the analysis of the speech signal through signal processing and machine learning techniques.

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Torch-Modules-Compilation

A compilation of implementations of various ML papers, especially in computer vision.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen, Tortoise)

Language:PythonStargazers:0Issues:0Issues:0

TTS-Objective-Metrics

Objective metrics used in several text-to-speech (TTS) papers.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

v-nhandt21.github.io

Do Tri Nhan 's Portfolio

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

VoiceAnonymous

Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software

License:NOASSERTIONStargazers:0Issues:0Issues:0