Yash Kant's starred repositories
sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
DynamiCrafter
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions" (CVPR 2024)
ziplora-pytorch
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
RayDiffusion
Code for "Cameras as Rays"
conceptual-12m
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
T2I-CompBench
[NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
FusionVision
Official implementation of the paper "FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything"
perspective-enhanced-diffusion
Enhancing Diffusion Models with 3D Perspective Geometry Constraints (SIGGRAPH Asia 2023)