Md Ataullha's starred repositories
rnnoise_wasm
RNNoise for WASM
whisper-asr-webservice
OpenAI Whisper ASR Webservice API
VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
transcriber_app
Real time speech to text transcription app.
whisper_real_time
Real time transcription with OpenAI Whisper.
webrtc-speech-to-text
Speech transcription on the browser using WebRTC and Google Speech
insanely-fast-whisper-cli
The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
WhisperLive
A nearly-live implementation of OpenAI's Whisper.
pyannote-pipeline
Tunable pipelines
pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
kaggle-bengali-speech-2nd-place
2nd place solution for Kaggle Bengali.AI Speech Recognition
LiveASREngine
LiveASREngine using whisper
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech representations
Mediapipe-Virtual-Backgrounds
Adding custom virtual backgrounds to video stream
PINTO_model_zoo
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.
OpenvinoOnMeetmodel
C++ project with openvino to optimize performance in intel x64 machine using google meet segment model (share memory to outapp processing realtime like zoom meeting)
RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
ComfyUI-Video-Matting
A minimalistic implementation of Robust Video Matting (RVM) and BRAIAI-RVMBG v1.4 in ComfyUI