There are 6 repositories under speech-translation topic.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
The dataset of Speech Recognition
Tracking the progress in end-to-end speech translation
Integrated speech toolset designed to be accessible to end-users. Fully open-source.
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.
Repository containing the open source code of works published at the FBK MT unit.
PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.
List of direct speech-to-speech translation papers.
Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
Revisiting End-to-End Speech-to-Text Translation From Scratch
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
๐ Seamlessly fine-tune and deploy Whisper model.
๐๏ธ This project combines multiple operations in Microsoft Azure Cognitive Services into one GUI, including QnA Maker, LUIS, Computer Vision, Custom Vision, Face, Form Recognizer, Text To Speech, Speech To Text and Speech Translation. It's very user-friendly for users to implement any operation mentioned above.
Systems submitted to IWSLT 2022 by the MT-UPC group.
Code for the paper "Does Joint Training Really Help Cascaded Speech Translation?" (EMNLP 2022)
Speech-To-Text is a C# desktop app that uses Azure Cognitive Services to convert and translate speech. You can copy or show the text on the screen, and choose the language of the speech or the translation.