MahmoudAshraf97

followers

following

stars

FAO

Mahmoud Ashraf's repositories

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookBSD-2-Clause2552 46 154

ctc-forced-aligner

Text to speech alignment using CTC forced alignment

Language:Python58 4 8

AutoencoderCompression

Learned Image Compression Using Autoencoder Architecture

Language:Python16 2 2

whisper-serverless-template

Language:PythonMIT1 20

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-4-Clause1 10

Aligner-SUPERB

Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark

Language:PythonApache-2.0000

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause000

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT000

dl-challenge-2

Deep Learning challenge for Full Time Junior Deep Learning Engineer.

Language:PythonMIT010

Easy-Miner-Setup

010

ivy

The Unified AI Framework

Language:PythonNOASSERTION010

OpenCL-Matrix-Multiplication

A simple program to implement matrix multiplication on GPU using OpenCL

Language:C++020

OpenMP-KMeans-Clustering

A simple implementation for K Means Clustering using OpenMP written in C++

Language:C020

perceptual-quality

Perceptual quality metrics for TensorFlow

Language:PythonNOASSERTION010

Pthreads-Matrix-Multiplication

This is an example program to parallelize matrix multiplication using POSIX threads written in C

Language:C020

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT000

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.0000

nemo-serverless

Language:PythonMIT020

open_asr_leaderboard

Language:PythonApache-2.0000

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.0000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000