Beast code in Giters

svjack's repositories

1Prompt1Story

(ICLR 2025) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt

Language:PythonMIT000

CartoonSegmentation

Instance segmentation for cartoon/anime characters and some visual techniques building around it.

Language:Jupyter Notebook000

DiffusionAsShader

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Language:PythonApache-2.0000

FastVideo

FastVideo is a lightweight framework for accelerating large video diffusion models.

Language:PythonApache-2.0000

HunyuanVideo-I2V

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Language:PythonNOASSERTION000

HunyuanVideoGP

HunyuanVideo GP: Large Video Generation Model - GPU Poor version

Language:PythonNOASSERTION000

joycaption

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

Language:PythonApache-2.0000

leapfusion-hunyuan-image2video

A novel approach to hunyuan image-to-video sampling

Language:PythonApache-2.0000

Light-A-Video

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Language:Python000

LTX-Video

Official repository for LTX-Video

Language:PythonApache-2.0000

magi

Generate a transcript for your favourite Manga: Detect manga characters, text blocks and panels. Order panels. Cluster characters. Match texts to their speakers. Perform OCR.

000

midi-model

Midi event transformer for symbolic music generation

Language:PythonApache-2.0000

MotionClone

[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Language:Python000

musiclang_predict

AI Prediction api of the MusicLang package

Language:PythonGPL-3.0000

Show-o

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Language:PythonApache-2.0000

SkyReels-V1

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Language:PythonNOASSERTION000

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.

Language:PythonApache-2.0000

svjack

svjack's repositories

1Prompt1Story

CartoonSegmentation

cogvideox-2b-img2vid

DiffuEraser

diffusion-pipe-2025-2-16

DiffusionAsShader

FastVideo

HunyuanVideo-I2V

HunyuanVideo-Training

HunyuanVideoGP

joycaption

leapfusion-hunyuan-image2video

Light-A-Video

LLaVA-NeXT

LTX-Video

Lumina-Video

magi

midi-model

MotionClone

musiclang

musiclang_predict

musubi-tuner

personalize-anything

Show-o

SkyReels-V1

star-vector

svjack

VideoModelStudio

Wan2GP

YuE