v-nhandt21

followers

following

stars

Đỗ Trí Nhân's repositories

Viphoneme

Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA

Language:PythonNOASSERTION71 3 9

Vinorm

Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syllables

Language:MakefileNOASSERTION51 3 10

ViSV2TTS

Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS

Language:PythonNOASSERTION44 3 9

MusicVoiceConversion

Sing any popular song with your voice

Language:Python11 3 1

ViMFA

Montreal Forced Aligner for Vietnamese

Language:PythonMIT8 2 2

SED_SoftLabel

Sound Event Classification With Soft Label

Language:Python4 1 1

AudioClassificationLabelTool

Language:Python2 10

FastspeechStyle

Language:Jupyter Notebook2 10

GPT4VN

Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn

Language:Python100

MSPSleepStageClassification

MultiScale MultiPeriod Sleep Stage Classification

Language:PythonMIT1 10

SoundEventClassification

Sound Event Classification - Experiment for DCase Challenge

Language:Python1 10

SyntheticSpeechAttribution

Language:Python1 30

TTS_Flask_demo

Language:Jupyter Notebook1 10

v-nhandt21

1 10

Webdev-Checkinchall

Tourism web

Language:JavaScript1 10

AdaSpeech

An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"

Language:Python000

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT000

Awesome-AI

A curated list of awesome things related to artificial intelligence tools around the world wide web

000

grokking

Language:HTML000

LeetcodeAlgorithms

Leetcode solutions

MIT000

malaya-speech

Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/

Language:Jupyter NotebookMIT000

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language:PythonApache-2.0000

python-design-patterns

A collection of design patterns/idioms in Python

Language:Python000

SCCup2022-Synthetic-Speech-Attribution

The IEEE Signal Processing Society’s 2022 Signal Processing Cup (SP Cup) will be a synthetic speech attribution challenge. Teams will be requested to design and develop a system for synthetic speech attribution. This means, given an audio recording representing a synthetically generated speech track, to detect which method among a list of candidate ones has been used to synthesize the speech. The detector must rely on the analysis of the speech signal through signal processing and machine learning techniques.

Language:Python010

st_AED_deliverables

Language:Python000

Torch-Modules-Compilation

A compilation of implementations of various ML papers, especially in computer vision.

Language:PythonNOASSERTION000

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen, Tortoise)

Language:Python000

TTS-Objective-Metrics

Objective metrics used in several text-to-speech (TTS) papers.

Language:PythonGPL-3.0000

v-nhandt21.github.io

Do Tri Nhan 's Portfolio

Language:JavaScriptMIT000

VoiceAnonymous

Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software

NOASSERTION000