Beast code in Giters

Victor Costa Beraldo's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonNOASSERTION51909 936 1077

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language:PythonMIT18496 215 747

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT5764 70 981

machine-learning

:earth_americas: machine learning tutorials (mainly in Python3)

Language:HTMLMIT3166 129 6

Resemblyzer

A python package to analyze and compare voices with deep learning

Language:PythonApache-2.02713 74 82

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause2468 73 928

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.02190 45 393

repo2docker

Turn repositories into Jupyter-enabled Docker images

Language:PythonBSD-3-Clause1611 47 533

awesome-python-scientific-audio

Curated list of python software and packages related to scientific research in audio

1541 78 50

jupyterlab-vim

:neckbeard: Vim notebook cell bindings for JupyterLab

Language:TypeScriptMIT970 19 103

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonMIT916 11 105

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language:PythonBSD-3-Clause575 18 74

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language:Python362 17 63

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonMIT303 8 15

You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

Language:Jupyter Notebook150 6 9

NeuralPlda

Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)

Language:Python98 7 8

SpeakerEmbeddingLossComparison

Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020

Language:Jupyter NotebookMIT59 7 5

The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architecture SincNet and the additive margin softmax (AM-Softmax) loss function. It uses the architecture of the SincNet, but with an improved AM-Softmax layer.

Language:Python43 4 1

VictorBeraldo

Victor Costa Beraldo's starred repositories

build-your-own-x

Real-Time-Voice-Cloning

crewAI

pyannote-audio

machine-learning

Resemblyzer

audio

s3prl

repo2docker

awesome-python-scientific-audio

jupyterlab-vim

torch-audiomentations

PyTorch_Speaker_Verification

VGG-Speaker-Recognition

pytorch_xvectors

wav2vec2-sprint

You-Only-Speak-Once

NeuralPlda

SpeakerEmbeddingLossComparison

AM-SincNet

agevoxceleb

feerci

banknoteBrazil

voice-unlocker

training_data_speech2text

speech-emotion-recogntion-ser2022

open-speech-analytics