Beast code in Giters

bryceirvin's starred repositories

simple-speaker-embedding

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

Language:Jupyter NotebookNOASSERTION8100

gopt

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Language:PythonBSD-3-Clause13800

google-research

Google Research

Language:Jupyter NotebookApache-2.03355100

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION5171500

Attempting to create a voice identification system using eigen-based linear algebra and also create a voice synthesis system using a Hidden Markov Model (HMM) and Mel Log Spectrum Approximation Filtering.

Language:CSS100

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonAGPL-3.0245300

non-parallel-rhythm-flexible-VC

PyTorch implementation of: Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

Language:Python1100

ppg-vc

PPG-Based Voice Conversion

Language:PythonApache-2.032100

AudioClassification

This software is a demonstration of Audio Signal Processing and Machine Learning using Python and Tensorflow. The software contains a GUI that can stream audio via webcams or external audio devices connected to the computer and process the audio in real time using a Convolutional and/or a Recurrent Neural Network in order to perform audio classification like speech recognition, music classification, etc. (Depending on how the network was trained). The data set can be arranged in directories where the name of a parent directory represents a classification class. In this way a single network can be trained for multiple types of binary independent audio data eventually building a complex neural network.

Language:Python900

bryceirvin

bryceirvin's starred repositories

simple-speaker-embedding

gopt

google-research

Real-Time-Voice-Cloning

QEA-Sound-Reproduction

aeneas

non-parallel-rhythm-flexible-VC

ppg-vc

AudioClassification

kapre

accent-classification

rnnoise