interspeech

There are 2 repositories under interspeech topic.

DmitryRyumin / INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
interspeech speech-technology machine-translation speech-synthesis asr prosody self-supervised-learning speech-production speech-coding transmission acoustic adaptation robustness signal-processing speech-recognition audio-signals speech-analysis linguistic-analysis language-modeling lexical-analysis
586
gabrielmittag / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
speech-quality deep-learning interspeech icassp tts pytorch voice-conversion text-to-speech speech-synthesis quality-of-experience
Language:Python 586
soham97 / awesome-sound_event_detection
Reading list for research topics in Sound AI
audio-processing icassp interspeech sound-event-detection acoustic-scene-classification audio-captioning audio-generation audio-retrieval representation-learning zero-shot-learning
136
DmitryRyumin / NewEraAI-Papers
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
artificial-intelligence computer-vision deep-learning icassp image-processing interspeech mashine-learning neural-networks signal-processing text-classification video-processing cvpr emnlp iccv ismir natural-language-processing
Language:Python 71
hechmik / voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
voxceleb sound machine-learning deep-learning gender-recognition age-prediction interspeech asru2021 voxceleb-enrichment age age-regression
Language:Jupyter Notebook 56
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
interspeech pyannote speaker-diarization
Language:Jupyter Notebook 51
ronggong / interspeech2018_submission01
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
beijing-opera singing-voice cnn keras hsmm hmm forced-alignment interspeech
Language:Python 46
coolEphemeroptera / AESRC2020
a deep accent recognition network
ctc resnet keras mtl interspeech accent-recognition asr cosface arcface circle-loss netvlad ghostvlad speaker-recognition crnn
Language:Python 45
doerlbh / MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
speaker-diarization paper speaker-recognition online-learning bandit-algorithms contextual-bandits interspeech2020 interspeech acml self-supervised-learning online-speaker-diarization
Language:Cuda 25
Lhx94As / PHO-LID
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
interspeech pytorch spoken-language-identification
Language:Python 17
doheejin / SB_loss_PA
This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).
assessment balanced-loss interspeech2023 loss-functions pronunciation pronunciation-scoring scoring-functions score-balanced-loss automatic-pronunciation-assessment language-learning nlp apa interspeech
Language:Python 14
cmu-mlsp / Learning_from_weak_labels
[Interspeech 2022] Tutorial - Learning from Weak Labels
interspeech weak-label
Language:MATLAB 8
whydinkov / interspeech-2019
Interspeech 2019 experiments
keras sklearn interspeech nlp audio-processing
Language:Python 8
jlinear / ReMASC_Exp
Baseline Experiments for ReMASC dataset.
remasc vcs replay-attack interspeech
Language:C 5
allyoushawn / timit_gas
The implementation code for the paper "Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries"
deep-learning interspeech rnn speech-processing tensorflow interspeech2017
Language:Python 4
ChingtingC / Code-Switching-Sentence-Generation-by-GAN
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)
code-switching generative-adversarial-network interspeech
Language:Python 1
INTERSPEECH-2024 / MER
Official repo for "Multi-Corpus Emotion Recognition Method based on Cross-Modal Gated Attention Fusion" in INTERSPEECH 2024
interspeech interspeech2024 computational-linguistics human-computer-interaction multimodal-emotion-recognition transformers gated-feature-fusion
Language:Python 1
Nexdata-AI / Interspeech2020-Accented-English-Speech-Recognition-Competition-Data
Interspeech2020 Accented English Speech Recognition Competition Data
asr asr-model audio dataset deep-learning deep-neural-networks interspeech recognition speech speech-recognition speech-to-text
1
KarelianSpeech / AnKaS
AnKaS: Development and Analysis of the Database of Livvi-Karelian Speech Annotations [INTERSPEECH 2024]
interspeech interspeech2024 ankas
Language:JavaScript 0

interspeech

DmitryRyumin / INTERSPEECH-2023-Papers

gabrielmittag / NISQA

soham97 / awesome-sound_event_detection

DmitryRyumin / NewEraAI-Papers

hechmik / voxceleb_enrichment_age_gender

FrenchKrab / IS2023-powerset-diarization

ronggong / interspeech2018_submission01

coolEphemeroptera / AESRC2020

doerlbh / MiniVox

Lhx94As / PHO-LID

doheejin / SB_loss_PA

cmu-mlsp / Learning_from_weak_labels

whydinkov / interspeech-2019

jlinear / ReMASC_Exp

allyoushawn / timit_gas

ChingtingC / Code-Switching-Sentence-Generation-by-GAN

INTERSPEECH-2024 / MER

Nexdata-AI / Interspeech2020-Accented-English-Speech-Recognition-Competition-Data

KarelianSpeech / AnKaS