falhafizh's repositories
audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
bark
🔊 Text-Prompted Generative Audio Model
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
falhafizh
Config files for my GitHub profile.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
indonesian-tts
Indonesian TTS (text-to-speech) using Coqui TTS
InternGPT
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite