Tarida George Cristian's repositories
articulate
A platform for building conversational interfaces with intelligent agents (chatbots)
copier-poetry
Copier template for Python projects managed by Poetry.
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
diff-match-patch-typescript
🚁 TypeScript port of diff-match-patch.
explainshell
match command-line arguments to their help text
frontend-test
frontend-test
gripmock
gRPC Mock Server
java-vosk-grpc-test
A simple test connecting from Java to vosk-server via gRPC
MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
Multimodal-Emotion-Recognition
A real-time Multimodal Emotion Recognition web app for text, sound and video inputs
ngx-text-diff
A Text Diff component for Angular
nlp.js
An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
redact-pii
Remove personally identifiable information from text.
rnnoise
Recurrent neural network for audio noise reduction
self-supervised-speech-recognition
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
sftp
Securely share your files
spring-data-jpa-issue-2441
Sample code to reproduce spring-data-jpa issue 2441
sqlalchemy-sample-project
sqlalchemy-sample-project
sshfs-win
SSHFS For Windows
tusd
Reference server implementation in Go of tus: the open protocol for resumable file uploads
voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
wav2train
automatically align transcribed audio and generate a wav2letter training corpus
xla
Enabling PyTorch on Google TPU