zsxkib

Sakib Ahamed's repositories

InstantID

Replicate repo for InstantID: Instant Faceswap AI Avatars in Seconds 🔥

Language: Python · License: Apache-2.0 · 3900

PuLID

cog-flux-dev-inpainting

🎨 Fill in masked parts of images with FLUX.1-dev 🖌️

Language: Python · License: MIT · 1100

sd3-on-apple-silicon

Run Stable Diffusion on Apple Silicon

Language: Python · License: MIT · 9 1 1

IC-Light

More relighting!

Language: Python · License: Apache-2.0 · 800

FlashFace

cog-aura-sr

AuraSR: GAN-based Super-Resolution for real-world

Language: Python · License: MIT · 4 3 1

cog-idefics3

Idefics3-8B-Llama3: A powerful multimodal AI model by Hugging Face that integrates image and text inputs to enhance visual reasoning and text generation

Language: Python · License: MIT · 400

MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Language: Python · License: NOASSERTION · 400

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · License: Apache-2.0 · 401

cog-flux-schnell-inpainting

🎨 Fill in masked parts of images with FLUX.1-schnell 🖌️

License: MIT · 200

FLUX-Controlnet-Inpainting

200

FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

License: NOASSERTION · 200

hololive-style-bert-vits2

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Language: Python · License: AGPL-3.0 · 2 20

TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Language: Jupyter Notebook · License: MIT · 200

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Language: Python · 200

Arc2Face

Arc2Face: A Foundation Model of Human Faces

Language: Python · License: MIT · 100

cog-aura-sr-v2

AuraSR v2: Second-gen GAN-based Super-Resolution for real-world applications

Language: Python · License: MIT · 100

cog-blip-3

Language: Python · 1 10

cog-comfyui

Run ComfyUI with an API

Language: Python · License: MIT · 100

cog-molmo-7b-d

Replicate Cog for allenai/Molmo-7B-D-0924

Language: Python · License: MIT · 100

conda-envs-in-cog

How to use Conda with Replicate Cog to easily manage packages in your projects. Step-by-step examples included!

Language: Python · 1 20

animate-diff-scene-assembler

Dkamacho’s Scene Assembler

Language: Python · License: MIT · 010

cog-qwen-2

Attempt at cog wrapper for QwenLM/Qwen2

License: MIT · 010

cog-stable-diffusion-3-with-instantx-controlnets

A template for running Stable Diffusion 3 InstantX/SD3-Controlnet-Canny with Cog

Language: Python · License: Apache-2.0 · 000

cog-wd-tagger

Language: Python · License: Apache-2.0 · 010

EvTexture

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Language: Python · License: Apache-2.0 · 000

grounded-segmentation

A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.

License: GPL-3.0 · 000

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook · License: Apache-2.0 · 000

ToonCrafter

a research paper for generative cartoon interpolation

Language: Python · License: Apache-2.0 · 000
