Yash Kant (yashkant)

yashkant

Geek Repo

Company:University of Toronto

Location:Toronto, Ontario

Home Page:yashkant.github.io

Twitter:@yash2kant

Github PK Tool:Github PK Tool


Organizations
ArIESIITRoorkee
batra-mlp-lab
counselling-cell-iitr

Yash Kant's starred repositories

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23932Issues:191Issues:3760

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20243Issues:176Issues:353

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14926Issues:103Issues:948

colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

Language:C++License:NOASSERTIONStargazers:7110Issues:172Issues:1939

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6780Issues:59Issues:137

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4036Issues:47Issues:249

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3642Issues:35Issues:90

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:2059Issues:25Issues:100

DPT

Dense Prediction Transformers

Language:PythonLicense:MITStargazers:1901Issues:42Issues:80

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1443Issues:28Issues:81

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1063Issues:44Issues:26

rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Language:PythonLicense:MITStargazers:772Issues:7Issues:33

momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Language:PythonLicense:MITStargazers:691Issues:28Issues:54

ziplora-pytorch

Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"

Language:PythonLicense:MITStargazers:476Issues:11Issues:19

RayDiffusion

Code for "Cameras as Rays"

Language:PythonLicense:MITStargazers:454Issues:12Issues:22

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

bgpt

Beyond Language Models: Byte Models are Digital World Simulators

Language:PythonLicense:MITStargazers:294Issues:4Issues:1

Dataset

News: the 7k dataset is ready for download.

Language:HTMLLicense:NOASSERTIONStargazers:252Issues:13Issues:22

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:159Issues:2Issues:18

NaViT

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Language:PythonLicense:MITStargazers:143Issues:7Issues:3

spad

Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024

calico

code for: Calibration of Asynchronous Camera Networks: CALICO

Language:C++License:MITStargazers:74Issues:7Issues:4

geneval

GenEval: An object-focused framework for evaluating text-to-image alignment

Language:HTMLLicense:MITStargazers:50Issues:1Issues:5

housekeep

Official code for the paper "Housekeep: Tidying Virtual Households using Commonsense Reasoning" published at ECCV, 2022

Language:PythonLicense:MITStargazers:45Issues:6Issues:6

FusionVision

Official implementation of the paper " FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything "

Language:Jupyter NotebookLicense:UnlicenseStargazers:29Issues:2Issues:2
Language:PythonStargazers:20Issues:1Issues:0
Language:PythonLicense:NOASSERTIONStargazers:17Issues:1Issues:1

perspective-enhanced-diffusion

Enhancing Diffusion Models with 3D Perspective Geometry Constraints (SIGGRAPH Asia 2023)

Language:Jupyter NotebookLicense:MITStargazers:2Issues:0Issues:0