speech-analysis

There are 9 repositories under speech-analysis topic.

jianchang512 / clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频
clonevoice speech-analysis sts tts voice-assistant
Language:Python 8807
praat / praat.github.io
Praat: Doing Phonetics By Computer
speech phonetics acoustics speech-analysis
Language:C 1779
mmorise / World
A high-quality speech analysis, manipulation and synthesis system
speech-analysis speech-synthesis vocoder
Language:C++ 1276
haoheliu / voicefixer
General Speech Restoration
declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder
Language:Python 1230
DmitryRyumin / INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission
685
gemengtju / Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
speech-separation speech-processing speech-analysis deep-learning deep-neural-networks signal-processing
Language:MATLAB 466
jcvasquezc / DisVoice
feature extraction from speech signals
articulation pathological-speech phonation prosody signal-processing speech-analysis
Language:Jupyter Notebook 386
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speech-emotion-recognition speechrecognition speech-recognizer deeplearning neural-network neural-networks beamforming timit librispeech speech-analysis speech-api
Language:HTML 371
Shahabks / my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
speech-analysis acoustic-model voice-analysis python-library praatscript
Language:Python 329
haoheliu / voicefixer_main
General Speech Restoration
speech-processing speech-enhancement speech-analysis speech-synthesis machine-learning tts speech-to-text speech
Language:Python 282
Shahabks / myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
speech-analysis voice-recognition acoustic-features prosody phonemes speech-patterns python-library acoustic-model
Language:Python 265
HidekiKawahara / legacy_STRAIGHT
A vocoder framework which had been widely used in research community since 1999.
speech-analysis speech-synthesis vocoder
Language:Matlab 181
at16k / at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
speech-recognition speech-to-text speech-api speech-recognizer speech-analysis voice-recognition voice-commands automatic-speech-recognition asr asr-model pretrained-models
Language:Python 130
philipperemy / tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
ctc ctc-loss tensorflow-1-0 tensorflow speech-recognition speech-to-text speech-analysis deep-learning machine-learning tutorial
Language:Python 130
itsp
Speech-Interaction-Technology-Aalto-U / itsp
Introduction to Speech Processing
speaker-recognition speech-analysis speech-enhancement speech-modelling speech-processing voice-activity-detection speech-coding speech-quality-evaluation
Language:Jupyter Notebook 104
LimingShi / Bayesian-Pitch-Tracking-Using-Harmonic-model
Pitch detection and pitch tracking, voicing unvoicing detection (VAD)，基音检测
vad-detection voicing-unvoicing-detection pitch-estimation fundamental-frequency-estimation onset-detection speech-analysis pitch-detection
Language:MATLAB 98
JusperLee / Calculate-SNR-SDR
Script to calculate SNR and SDR using python
sdr speech-analysis speech-separation
Language:Python 91
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
deep-learning regression-algorithms speech speech-analysis speech-processing
Language:Python 90
google / localized-narratives
Localized Narratives
computer-vision image-captioning speech-analysis
Language:HTML 85
CSTR-Edinburgh / magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
merlin phase-spectra speech-analysis synthesis tts vocoder
Language:Python 80
RichardHladik / outotune
An opensource harmonizer implementation leveraging the DISTRHO Plugin Framework.
realtime-audio speech-analysis harmonizer
Language:C++ 73
hyeonsangjeon / computing-Korean-STT-error-rates
STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지
cer wer word-error-rate character-error-rate korean speech-to-text speech-recognition evaluation-metrics text-evaluation normalization text-digitisation computing-error-rates aws amazon transcribe test evaluation-functions evaluate speech-analysis
Language:Python 65
mjpyeon / wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
wavenet-keras speech-analysis speech-api deep-learning deep-neural-networks supervised-learning deepmind speaker-recognition speaker-identification speaker-verification speech-emotion-recognition
Language:Python 64
HidekiKawahara / SparkNG
MATLAB real-time/interactive speech tools. This series is obsolete. SP3ARK is the up-to-date series (will be).
gui-application matlab-realtime speech-analysis speech-production
Language:MATLAB 59
lennes / spect
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
praat speech analysis annotation corpus-linguistics corpus-tools speech-analysis conversational-speech transcription transcript spoken-language spect speech-corpus
Language:Praat 57
jcvasquezc / NeuroSpeech
Toolkit to asses speech impairments in patients with neurological disorders
speech-analysis parkinson-disease
Language:C++ 55
MontrealCorpusTools / PolyglotDB
PolyglotDB is a package for phonetic corpus storage and analysis
speech-analysis speech-processing database neo4j influxdb rest-api acoustics
Language:Python 48
msalhab96 / SNR-Estimation-Using-Deep-Learning
An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
snr signal-processing speech-recognition speec pytorch deep-learning speech-processing speech-analysis
Language:Jupyter Notebook 42
tabahi / WebSpeechAnalyzer
JS speech analyzer for fast speech analysis and labeling
speech signal-processing music music-information-retrieval music-visualizer speech-recognition formant-detection phonemes speech-processing speech-analysis spectrum-analyzer spectrum feature-extraction feature-engineering feature audio-processing audio-analysis
Language:JavaScript 37
HidekiKawahara / YANGstraight_source
Analytic signal-based source information analysis for YANGstraight and real-time interactive tools
gui-application speech-analysis matlab-realtime wavelet
Language:MATLAB 34
praaline / Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
visualisation annotations linguistics corpus corpus-linguistics corpus-tools corpus-builder speech-processing speech-analysis spoken-language-processing
Language:C 30
praweshd / speech_emotion_recognition
In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN).Conventional classifiers that uses machine learning algorithms has been used for decades in recognizing emotions from speech. However, in recent years, deep learning methods have taken the center stage and have gained popularity for their ability to perform well without any input hand-crafted features. Speech emotion on sets obtained from RAVDESS corpus is classified using a conventionally used Support Vector Machine (SVM) and its performance is compared to that of a bidirectional long short-term memory (LSTM).
machine-learning svm-classifier svm-model speech-recognition speech-analysis speech-emotion-recognition deep-learning deep-neural-networks recurrent-neural-networks lstm-neural-networks lstm-attention
Language:Jupyter Notebook 27
operrotin / GFM-IAIF
Glottal Flow Model-based Iterative Adaptive Inverse Filtering
speech-processing glottal-inverse-filtering glottal-flow glottal-vocoder spectral-models voice-quality speech-analysis linear-prediction-coefficients source-filter glottal-flow-model
Language:MATLAB 26
hcy71o / LPC_Speech_Synthesis
Speech synthesis using LPC
speech-synthesis linear-predictive-coding speech-analysis pitch-detection linear-prediction-coefficients
Language:Jupyter Notebook 23
LinkonBSMRSTU / Speech-To-Text-App-iOS
A simple iOS App that can convert speech/voice into text. Only English voice is supported for now. Used Swift 5, AVKit and Speech.
swift5 avkit speech-to-text speech-recognition xcode11 speech microphone speech-analysis voice-to-text speechrecognition ios xcode ios-app
Language:Swift 23
ringabout / scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
nim audio speech-recognition arraymancer mfcc speech-analysis wav speech-processing digital-signal-processing scientific-computing
Language:Nim 23

speech-analysis

jianchang512 / clone-voice

praat / praat.github.io

mmorise / World

haoheliu / voicefixer

DmitryRyumin / INTERSPEECH-2023-24-Papers

gemengtju / Tutorial_Separation

jcvasquezc / DisVoice

speechbrain / speechbrain.github.io

Shahabks / my-voice-analysis

haoheliu / voicefixer_main

Shahabks / myprosody

HidekiKawahara / legacy_STRAIGHT

at16k / at16k

philipperemy / tensorflow-ctc-speech-recognition

Speech-Interaction-Technology-Aalto-U / itsp

LimingShi / Bayesian-Pitch-Tracking-Using-Harmonic-model

JusperLee / Calculate-SNR-SDR

alessandroragano / scoreq

google / localized-narratives

CSTR-Edinburgh / magphase

RichardHladik / outotune

hyeonsangjeon / computing-Korean-STT-error-rates

mjpyeon / wavenet-classifier

HidekiKawahara / SparkNG

lennes / spect

jcvasquezc / NeuroSpeech

MontrealCorpusTools / PolyglotDB

msalhab96 / SNR-Estimation-Using-Deep-Learning

tabahi / WebSpeechAnalyzer

HidekiKawahara / YANGstraight_source

praaline / Praaline

praweshd / speech_emotion_recognition

operrotin / GFM-IAIF

hcy71o / LPC_Speech_Synthesis

LinkonBSMRSTU / Speech-To-Text-App-iOS

ringabout / scim