Yunus Parvej Faniband's repositories
video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos. It generates subtitle-free and watermark-free image/video files at the original resolution, and runs locally with no third-party API required.
twitter-video-dl
Download twitter videos as mp4 files
twitter-media-downloader
twmd: CLI/GUI API-less Twitter downloader. Download media from a single tweet or a whole profile.
Chitralekha
Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over
Shorts-Maker
Create high-quality vertical quote videos (1920x1080 - Perfect for all social media) in about 15 seconds per video!
Image-Quote-Generator
Create high-quality images with quotes (Perfect for Instagram and Pinterest) in less than 5 seconds per 100+ images!
lumentis
AI-powered one-click comprehensive docs from transcripts and text.
diffused-heads
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
lecture2
lecture 2 - 2024-01-20
resource-stream
CUDA related news and material links
text2sdg
Detect UN Sustainable Development Goals in Text
word_cloud
A little word cloud generator in Python
UnTube
A simple, comprehensive YouTube playlist manager web app powered by YouTube Data API V3. Built with ❤ using Django, htmx and Bootstrap.
Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E, including jump-starting GPT-4, speech-to-text, text-to-speech, text-to-image generation with DALL-E, Google Cloud AI, HuggingGPT, and more
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audiotext
A desktop application that transcribes audio from a file or microphone in any supported language using WhisperX or Google Speech-to-Text API.
mlx-examples
Examples in the MLX framework
mlx
MLX: An array framework for Apple silicon
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
face-alignment
🔥 2D and 3D face alignment library built using PyTorch
bark
🔊 Text-Prompted Generative Audio Model
dev-chatgpt-prompts
📚 Personal collection of ChatGPT prompts for developers!
Indic-TTS
Text-to-Speech for languages of India
gtc2017-numba
Numba tutorial for GTC 2017 conference
ScientoPy
ScientoPy is an open-source, Python-based scientometric analysis tool
ttsmms
TTS with The Massively Multilingual Speech (MMS) project