kukum's repositories
Age-Gender_Estimation_TF-Android
Age + Gender Estimation on Android with TensorFlow Lite
athena
an open-source implementation of sequence-to-sequence based speech processing engine
awesome-android-ui
A curated list of awesome Android UI/UX libraries
awesome-voiceprint
A curated list of awesome Voiceprint Recognition papers
cfr
This is the public repository for the CFR Java decompiler
Discriminative-Speaker-Embedding
Learning Discriminative Speaker Embedding by Improving Aggregation Strategy and Loss Function for Speaker Verification
docs
Rokid 语音开放平台,包含技能开发、语音设备接入及智能家居接入的文档、SDK 及示例代码
idlak
Official home of the Idlak Speech Synthesis Toolkit
kaldi-serve
Server framework for Kaldi ASR Toolkit
kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
kaldi_lre
My language recognition system based on Kaldi, using CommonVoice dataset.
kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
language-ident-from-speech
MInf project focusing on language identification from speech using neural nets, Kaldi and other tools.
mandarin-tts
Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets
MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.
PCAPdroid
No-root network monitor, firewall and PCAP dumper for Android
PPSpeakerRecognition
Privacy-Preserving Speaker Recognition System
prosodic-lid-globalphone
MInf project exploring the use of prosodic information in language identification from speech, using the x-vector architecture in Kaldi, on the GlobalPhone dataset.
record_what_i_read
AI Model Security Reading Notes
speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and per
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021