bryceirvin's starred repositories
simple-speaker-embedding
A speaker embedding network in PyTorch that is quick to set up and use for any purpose.
google-research
Google Research
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
QEA-Sound-Reproduction
An attempt to build a voice identification system using eigen-based linear algebra, and a voice synthesis system using a Hidden Markov Model (HMM) and Mel Log Spectrum Approximation (MLSA) filtering.
non-parallel-rhythm-flexible-VC
PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
AudioClassification
This software is a demonstration of audio signal processing and machine learning using Python and TensorFlow. It contains a GUI that can stream audio from webcams or external audio devices connected to the computer and process the audio in real time using a convolutional and/or recurrent neural network to perform audio classification such as speech recognition or music classification, depending on how the network was trained. The data set can be arranged in directories where the name of each parent directory represents a classification class. In this way, a single network can be trained on multiple types of binary independent audio data, eventually building a complex neural network.
accent-classification
Accent Classification in Speech