Mahmoud Ashraf's repositories
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
ctc-forced-aligner
Text to speech alignment using CTC forced alignment
AutoencoderCompression
Learned Image Compression Using Autoencoder Architecture
Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
dl-challenge-2
Deep Learning challenge for Full Time Junior Deep Learning Engineer.
OpenCL-Matrix-Multiplication
A simple program to implement matrix multiplication on GPU using OpenCL
OpenMP-KMeans-Clustering
A simple implementation for K Means Clustering using OpenMP written in C++
perceptual-quality
Perceptual quality metrics for TensorFlow
Pthreads-Matrix-Multiplication
This is an example program to parallelize matrix multiplication using POSIX threads written in C
faster-whisper
Faster Whisper transcription with CTranslate2
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.