Mohannad Ehab Barakat (MohannadEhabBarakat)

Location: Erlangen, Nuremberg

Organizations
tmontaj

Mohannad Ehab Barakat's starred repositories

txtsplit

A simple text splitter based on Tortoise for use in text-to-speech applications

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: GPL-3.0 | Stargazers: 2 | Issues: 0

OpenPhonemizer

An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPL phonemizer.

Language: Python | License: BSD-3-Clause-Clear | Stargazers: 72 | Issues: 0

RAVE

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - Official Repo

Language: Python | License: MIT | Stargazers: 3 | Issues: 0

voice-cloning-training

Voice data <= 10 mins can also be used to train a good VC model!

Language: Python | License: MIT | Stargazers: 11 | Issues: 0

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

Language: C | License: GPL-3.0 | Stargazers: 3923 | Issues: 0

github-profile-trophy

🏆 Add dynamically generated GitHub Stat Trophies on your readme

Language: TypeScript | License: MIT | Stargazers: 4966 | Issues: 0

RVC-inference

High performance RVC inferencing, intended for multiple instances in memory at once. Also includes the latest pitch estimator RMVPE, Python 3.8-3.11 compatible, pip installable, memory + performance improvements in the pipeline and model usage.

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

cog

Containers for machine learning

Language: Python | License: Apache-2.0 | Stargazers: 7511 | Issues: 0

voice-changer

Realtime Voice Changer

Language: Python | License: NOASSERTION | Stargazers: 15417 | Issues: 0

HierSpeechpp

The official implementation of HierSpeech++

Language: Python | License: MIT | Stargazers: 1139 | Issues: 0

piper

A fast, local neural text to speech system

Language: C++ | License: MIT | Stargazers: 5193 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4461 | Issues: 0

tortoise-tts-fastest

Faster Tortoise inference than Tortoise Fast Fork

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 121 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12497 | Issues: 0

ultimatevocalremover_api

API for a Vocal Remover that uses Deep Neural Networks.

Language: Python | License: MIT | Stargazers: 52 | Issues: 0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 686 | Issues: 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | Stargazers: 10135 | Issues: 0

docker-in-colab

Run Docker inside Google Colab

License: Apache-2.0 | Stargazers: 63 | Issues: 0

globox

A package to read and convert object detection datasets (COCO, YOLO, PascalVOC, LabelMe, CVAT, OpenImage, ...) and evaluate them with COCO and PascalVOC metrics.

Language: Python | License: MIT | Stargazers: 164 | Issues: 0

material-tailwind

@material-tailwind is an easy-to-use components library for Tailwind CSS and Material Design.

Language: TypeScript | License: MIT | Stargazers: 3550 | Issues: 0

HairCLIP

[CVPR 2022] HairCLIP: Design Your Hair by Text and Reference Image

Language: Python | License: LGPL-2.1 | Stargazers: 514 | Issues: 0

Journal-Club

The RISE Journal Club aims to create a friendly environment to discuss the latest state-of-the-art papers in the areas of medical image analysis, AI and computer vision. The moderators will briefly introduce the paper and then moderate a discussion where everyone is welcome to provide their thoughts and ask any questions on the paper.

Stargazers: 63 | Issues: 0

CTooth

This is the official link to request CTooth.

Language: Python | Stargazers: 91 | Issues: 0

Yet-Another-Openpose-Implementation

This project reimplements the OpenPose paper (Cao et al., 2018) from scratch, using TensorFlow 2.1 with optional TPU-powered training.

Language: Jupyter Notebook | License: MPL-2.0 | Stargazers: 92 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 55 | Issues: 0
Language: JavaScript | Stargazers: 162 | Issues: 0

learn-an-effective-lip-reading-model-without-pains

The PyTorch code and model for "Learn an Effective Lip Reading Model without Pains" (https://arxiv.org/abs/2011.07557), which reaches state-of-the-art performance on the LRW-1000 dataset.

Language: Python | Stargazers: 150 | Issues: 0

AI-Expert-Roadmap

Roadmap to becoming an Artificial Intelligence Expert in 2022

Language: JavaScript | License: MIT | Stargazers: 28784 | Issues: 0

PerceptiLabs

PerceptiLabs main repository

Stargazers: 387 | Issues: 0