conformer

There are 4 repositories under conformer topic.

modelscope / FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
conformer pytorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection whisper dfsmn vad speechgpt speechllm
Language:Python 12594
PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr kws speech-recognition sound-classification voice-cloning vocoder voice-recognition self-supervised-learning wav2vec2 whisper code-switch
Language:Python 12228
wenet-e2e / wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper
Language:Python 4800
FireRedTeam / FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
asr industrial-grade llm multimodal-llm open-source speech-recognition automatic-speech-recognition conformer speechllm transformer
Language:Python 1343
sooftware / conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
conformer transformer cnn transformer-xl asr speech-recognition pytorch conv convolution augmented speech recognition
Language:Python 1071
TensorSpeech / TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
automatic-speech-recognition deepspeech2 speech-recognition speech-to-text tensorflow2 rnn-transducer conformer tflite tflite-model tflite-convertion ctc tensorflow subword-speech-recognition end2end contextnet jasper streaming-transducer
Language:Python 995
yeyupiaoling / PPASR
基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
asr paddlepaddle deep-learning chinese speech-to-text speech speech-recognition streaming-asr conformer squeezeformer deepspeech2
Language:Python 865
yeyupiaoling / MASR
Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。
deepspeech pytorch asr deep-learning speech-recognition speech-to-text speech conformer squeezeformer
Language:Python 704
eeyhsong / EEG-Conformer
EEG Transformer 2.0. i. Convolutional Transformer for EEG Decoding. ii. Novel visualization - Class Activation Topography.
eeg transformer conformer activation-map eeg-transformer eeg-visualization attention
Language:Python 618
sooftware / kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
asr attention-is-all-you-need conformer e2e-asr end-to-end jasper korean-speech ksponspeech las las-models pytorch seq2seq speech-recognition transformer
Language:Python 618
liusongxiang / ppg-vc
PPG-Based Voice Conversion
conformer one-shot phonetic-posteriorgram ppg ppg-vc speech-synthesis voice-conversion
Language:Python 335
tuanio / noisy-student-training-asr
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
conformer noisy-student nst pretrained pytorch semi-supervised-learning wav2vec2 aped data-augmentation deep-learning machine-learning speech-recognition
Language:Python 97
hyperion-ml / hyperion
Python toolkit for speech processing
speaker-recognition adversarial-attacks x-vectors nist-sre cifar mnist voxceleb pytorch plda calibration vq-vae vae sre19-cts resnet efficientnet transformer conformer sre19-av sre21 sre20-cts
Language:Python 68
MinkaiXu / CGCF-ConfGen
:test_tube: Learning Neural Generative Dynamics for Molecular Conformation Generation (ICLR 2021)
molecule conformation-generation iclr iclr2021 conformer conformation pytorch
Language:Python 46
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
asr conformer hydra pytorch-lightning speech-recognition
Language:Python 45
Rishit-dagli / Conformer
An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras
artificial-intelligence attention-mechanism conformer convolutional-neural-networks deep-learning keras machine-learning speech-recognition tensorflow transformers
Language:Python 44
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
aishell asr asr-tasks conformer speech-recognition tensorflow tensorflow2 transformer
Language:Python 44
Audio-WestlakeU / SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]
acoustic-parameters self-supervised-learning spatial-acoustic-representation array-signal-processing audio-pretraining conformer downstream-tasks fine-tuning pretext-task tdoa real-world-data room-acoustics microphone-array multi-channel
Language:Python 36
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zhangyang Wang
pytorch neural-network-compression lottery-ticket-hypothesis speech-recognition ctc conformer
Language:Python 30
jreremy / conformer
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
conformer pytorch librispeech librispeech-dataset machine-learning speech-recognition asr
Language:Python 27
RDMC
xiaoruiDong / RDMC
Reaction Data and Molecular Conformers (RDMC) is a package dealing with reactions, molecules, conformers, majorly in 3D.
conformer molecule rdkit reaction
Language:Jupyter Notebook 26
DataXujing / ASR-paper
:fire: ASR教程: https://dataxujing.github.io/ASR-paper/
asr fbank mfcc gmm-hmm tandem dnn-hmm las ctc rnn-t neural-transducer mocha conformer transformer-transducer quartznet jasper citrinet contextnet wfst speech-transformer squeezeformer
24
UnixJunkie / smi2sdf3d
3D diverse conformers generation using rdkit
rdkit python-script conformer ligand chemoinformatics 3d smiles sdf smi
Language:Python 23
ahmed-alllam / Brain-EEG-Emotion-Classifier
Emotion classification from Brain EEG signals using a hybrid CNN-Transformer model and various ML algorithms.
brain-computer-interface conformer deep-learning eeg neural-decoding
Language:Jupyter Notebook 19
msalhab96 / Conformer
An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper
asr automatic-speech-recognition conformer speech-recognition speech-to-text transformer
Language:Python 18
manhph2211 / ViSTT
I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...
vietnamese-speech-recognition vietnames-asr rnnt speech-to-text sst tranducer vivos flask nginx pytorch uwsgi pytorch-lightning vietnamese-speech-to-text aws-deploy hydra conformer
Language:Python 17
jaketae / conformer
PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
conformer speech-recognition transformer convolution
Language:Python 14
tuanio / conformer-rnnt
Conformer RNN-Transducer
conformer python rnnt speech-recognition
Language:Python 14
ADicksonLab / AGDIFF
Implementation of AGDIFF: Attention-Enhanced Diffusion for Molecular Geometry Prediction
attention conformer diffusion-models generative-ai gnn graph-neural-networks machine-learning structure
Language:Python 11
lucadellalib / ts-asr
Target speaker automatic speech recognition (TS-ASR)
conformer pytorch rnn speech-recognition speechbrain transducer asr
Language:Python 11
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
asr augmented cnn conformer convnet convnext convolution pytorch recognition speech speech-recognition speech-to-text transformer
Language:Python 11
hoangtuanvu / conformer_ocr
Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This project only focused on variants of vanilla Transformer (Conformer) and Feature Extraction (CNN-based approach).
ocr optical-character-recognition conformer transformer-encoder vietnamese-ocr
Language:Python 10
LuluW8071 / Conformer
End-to-End Speech Recognition Training with Conformer CTC using PyTorch Lightning⚡
conformer lightning-ai pytorch comet-ml common-voice-dataset sox asr gradio-interface
Language:Jupyter Notebook 10
danieleninni / small-footprint-keyword-spotting
Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting
attention-mechanism audio-classification cnn conformer data-science deep-learning keyword-spotting machine-learning resnet rnn speech-commands speech-recognition
Language:Python 9
tuanio / asr-toolkit
E2E Speech Recognition Toolkit with Hydra and Pytorch Lightning
conformer ctc lstm pytorch rnn speech-recognition speech-to-text toolkit transformer vietnamese-speech-recognition
Language:Python 7
LENSS / EMSAssist
This is the official artifact for EMSAssist paper on MobiSys'23. EMSAssist: An End-to-End Mobile Voice Assistant at the Edge for Emergency Medical Services
bert conformer edge-computing emergency-medical-services healthcare human-computer-interaction mobile-computing mobilebert speech-recognition text-classificaiton-with-bert voice-assistant
Language:Python 6

conformer

modelscope / FunASR

PaddlePaddle / PaddleSpeech

wenet-e2e / wenet

FireRedTeam / FireRedASR

sooftware / conformer

TensorSpeech / TensorFlowASR

yeyupiaoling / PPASR

yeyupiaoling / MASR

eeyhsong / EEG-Conformer

sooftware / kospeech

liusongxiang / ppg-vc

tuanio / noisy-student-training-asr

hyperion-ml / hyperion

MinkaiXu / CGCF-ConfGen

sooftware / lightning-asr

Rishit-dagli / Conformer

TeaPoly / Conformer-Athena

Audio-WestlakeU / SAR-SSL

VITA-Group / Audio-Lottery

jreremy / conformer

xiaoruiDong / RDMC

DataXujing / ASR-paper

UnixJunkie / smi2sdf3d

ahmed-alllam / Brain-EEG-Emotion-Classifier

msalhab96 / Conformer

manhph2211 / ViSTT

jaketae / conformer

tuanio / conformer-rnnt

ADicksonLab / AGDIFF

lucadellalib / ts-asr

tuanio / nextformer

hoangtuanvu / conformer_ocr

LuluW8071 / Conformer

danieleninni / small-footprint-keyword-spotting

tuanio / asr-toolkit

LENSS / EMSAssist