bryceirvin

bryceirvin

Geek Repo

Github PK Tool:Github PK Tool

bryceirvin's starred repositories

simple-speaker-embedding

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:81Issues:0Issues:0

gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Language:PythonLicense:BSD-3-ClauseStargazers:138Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33551Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:51715Issues:0Issues:0

QEA-Sound-Reproduction

Attempting to create a voice identification system using eigen-based linear algebra and also create a voice synthesis system using a Hidden Markov Model (HMM) and Mel Log Spectrum Approximation Filtering.

Language:CSSStargazers:1Issues:0Issues:0

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonLicense:AGPL-3.0Stargazers:2453Issues:0Issues:0

non-parallel-rhythm-flexible-VC

PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

Language:PythonStargazers:11Issues:0Issues:0

ppg-vc

PPG-Based Voice Conversion

Language:PythonLicense:Apache-2.0Stargazers:321Issues:0Issues:0

AudioClassification

This software is a demonstration of Audio Signal Processing and Machine Learning using Python and Tensorflow. The software contains a GUI that can stream audio via webcams or external audio devices connected to the computer and process the audio in real time using a Convolutional and/or a Recurrent Neural Network in order to perform audio classification like speech recognition, music classification, etc. (Depending on how the network was trained). The data set can be arranged in directories where the name of a parent directory represents a classification class. In this way a single network can be trained for multiple types of binary independent audio data eventually building a complex neural network.

Language:PythonStargazers:9Issues:0Issues:0

kapre

kapre: Keras Audio Preprocessors

Language:PythonLicense:MITStargazers:918Issues:0Issues:0

accent-classification

Accent Classification in Speech

Language:PythonStargazers:25Issues:0Issues:0

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:3904Issues:0Issues:0