Sakib Ahamed (zsxkib)

zsxkib

Company: Replicate

Location: Edinburgh

Home Page: https://zsxkib.github.io

Twitter: @zsakib_

Sakib Ahamed's repositories

InstantID

Replicate Repo for InstantID: Instant Faceswap AI Avatars in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 34 | Issues: 0 | Issues: 0

playground-v2-1024px-aesthetic

Playground v2 is a diffusion-based text-to-image generative model. The model was trained from scratch by the research team at Playground.

Language: Python | License: NOASSERTION | Stargazers: 33 | Issues: 3 | Issues: 2

AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Language: Python | License: MIT | Stargazers: 22 | Issues: 2 | Issues: 0

voice-cloning-create-dataset

Create your own RVC v2 dataset from a YouTube video

Language: Python | License: MIT | Stargazers: 11 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 10 | Issues: 0 | Issues: 0

voice-cloning-training

Voice data <= 10 mins can also be used to train a good VC model!

Language: Python | License: MIT | Stargazers: 9 | Issues: 0 | Issues: 0

cog-parakeet-rnnt-1.1b

nvidia/parakeet-rnnt-1.1b running in Replicate Cog container ⚙️

Language: Python | License: CC-BY-4.0 | Stargazers: 8 | Issues: 0 | Issues: 0

cog-uform-gen

Cog wrapper for unum-cloud/uform-gen (Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️)

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 2 | Issues: 0

TTDS-G35-CW3

TTDS Group Project: Video Games Search Engine. Sakib Ahamed, Dan Buxton, Kenza Amira, Wini Lau, Mansoor Ahmad

Language: Python | Stargazers: 4 | Issues: 2 | Issues: 0

cog-aya-101

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

frame-interpolation

FILM: Frame Interpolation for Large Motion, in ECCV 2022.

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

PatchFusion

An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

smoothee

Video frame-interpolation Python package utilizing Replicate models

Language: Python | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 1

conda-envs-in-cog

How to use Conda with Replicate Cog to easily manage packages in your projects. Step-by-step examples included!

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Stargazers: 1 | Issues: 0 | Issues: 0

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

trocr-base-handwritten

🖋️➡️📱 Converts handwritten text images into digital text

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

voice-cloning

voice-to-voice generation (change your voice)

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

animatediff-cli-prompt-travel

animatediff prompt travel

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Moore-AnimateAnyone

Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

YOLO-World

Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0