Ataullha

followers

following

stars

Synesis IT PLC

Dhaka,Bangladesh

https://ataullha.github.io/

Md Ataullha's starred repositories

samejs

[WIP] Streaming Audio Models Examples in JS

Language:JavaScriptMIT700

rnnoise_wasm

RNNoise for WASM

Language:JavaScriptMIT5000

skynet

AI core services for Jitsi

Language:PythonApache-2.02100

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language:PythonMIT190800

VoiceStreamAI

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Language:PythonMIT58600

transcriber_app

Real time speech to text transcription app.

Language:Python36500

whisper_real_time

Real time transcription with OpenAI Whisper.

Language:Python212400

webrtc-speech-to-text

Speech transcription on the browser using WebRTC and Google Speech

Language:GoMIT11800

insanely-fast-whisper-cli

The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️

Language:PythonMIT29300

insanely-fast-whisper

Language:Jupyter NotebookApache-2.0705700

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

MIT100

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:Jupyter NotebookApache-2.0749200

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonMIT152400

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language:PythonMIT161000

pyannote-pipeline

Tunable pipelines

Language:PythonNOASSERTION2500

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonMIT20000

bpe.c

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

Language:CMIT10100

kaggle-bengali-speech-2nd-place

2nd place solution for Kaggle Bengali.AI Speech Recognition

Language:PythonMIT700

LiveASREngine

LiveASREngine using whisper

Language:PythonMIT100

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Language:PythonMIT30800

CCC-wav2vec-2.0

Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech representations

Language:PythonMIT1400

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellNOASSERTION1398900

julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Language:CBSD-3-Clause181000

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonAGPL-3.0859000

Mediapipe-Virtual-Backgrounds

Adding custom virtual backgrounds to video stream

Language:JavaScriptMIT1300

PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8), EdgeTPU, CoreML.

Language:PythonMIT343700

OpenvinoOnMeetmodel

C++ project with openvino to optimize performance in intel x64 machine using google meet segment model (share memory to outapp processing realtime like zoom meeting)

Language:C++Apache-2.0400

RobustVideoMatting

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Apache-2.01300

ComfyUI-Video-Matting

A minimalistic implementation of Robust Video Matting (RVM) and BRAIAI-RVMBG v1.4 in ComfyUI

Language:PythonGPL-3.015800

BackgroundMattingV2

Language:PythonMIT100