Imtiyaz Momin (imomin)

Company: iAstute

Location: Houston, TX

Home Page: http://www.iastute.com

Imtiyaz Momin's repositories

Language: TypeScript | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

bark-TTS

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

calendso

The open-source Calendly alternative.

Language: TypeScript | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

ChatGPTFromKB

Intelligent customer support bot

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

cosmic-media-extension

Search millions of high-quality royalty-free stock photos, images, and videos from popular online media services.

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DPE

[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

Stargazers: 0 | Issues: 0 | Issues: 0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

llm-answer-engine

Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper

Stargazers: 0 | Issues: 0 | Issues: 0

OpenVoice

Instant voice cloning by MyShell

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pywinassistant

The first open-source Large Action Model generalist Artificial Narrow Intelligence that controls human user interfaces entirely through natural language. PyWinAssistant uses the technique described in "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models."

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Real-Time-Accent-Conversion

Real Time Foreign Accent Conversion

Language: Python | License: GPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

roomGPT

Upload a photo of your room to generate your dream room with AI.

Language: TypeScript | Stargazers: 0 | Issues: 0 | Issues: 0

roop

one-click deepfake (face swap)

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SadTalker-Video-Lip-Sync

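This project builds on SadTalker to implement Wav2Lip-style lip synthesis for video. It takes a video file as input and generates speech-driven lip shapes, then applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It also uses the DAIN deep-learning frame-interpolation algorithm to fill in frames in the generated video, smoothing the motion between frames so that the synthesized lips look more fluid, realistic, and natural.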

Stargazers: 0 | Issues: 0 | Issues: 0

Scrapegraph-ai

Python scraper based on AI

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ShortGPT

AI framework for automating video and short content creation

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

spleeter

Deezer source separation library including pretrained models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Storyblocks

✨ Experience the enchantment of Story Block: an open-source project merging AI text generation and image synthesis to create captivating video narratives. 📚🎥 Watch as your text prompts come to life with stunning visuals, exploring new frontiers in storytelling!

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

storyteller

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

stripe-sync-engine

Sync your Stripe account to your Postgres database.

Language: TypeScript | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

text2cinemagraph

Official PyTorch implementation of Text2Cinemagraph: Synthesizing Artistic Cinemagraphs from Text

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language: C | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0