Sumith Kulal (Sumith1896)

Sumith1896

Geek Repo

Company:@Stability-AI

Location:Stanford, CA

Home Page:https://cs.stanford.edu/~sumith/

Github PK Tool:Github PK Tool


Organizations
cs231n
sympy

Sumith Kulal's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:65986Issues:555Issues:696

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:62388Issues:527Issues:0

rclone

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:36821Issues:437Issues:285

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:36433Issues:294Issues:2279

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:32665Issues:347Issues:294

ControlNet

Let us control diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:28383Issues:214Issues:519

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:22808Issues:243Issues:263

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:6866Issues:58Issues:184

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:6848Issues:53Issues:1457

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6695Issues:59Issues:137
Language:PythonLicense:NOASSERTIONStargazers:6021Issues:69Issues:114

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5359Issues:46Issues:73
Language:PythonLicense:NOASSERTIONStargazers:3151Issues:159Issues:111

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2181Issues:49Issues:128

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonLicense:MITStargazers:2133Issues:41Issues:62

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2081Issues:23Issues:19

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language:JavaLicense:MITStargazers:1682Issues:28Issues:114

invisible-watermark

python library for invisible image watermark (blind image watermark)

Language:PythonLicense:MITStargazers:1495Issues:14Issues:27

edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Language:PythonLicense:NOASSERTIONStargazers:1121Issues:29Issues:23

4D-Humans

4DHumans: Reconstructing and Tracking Humans with Transformers

Language:PythonLicense:MITStargazers:1075Issues:22Issues:120

clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Language:PythonLicense:MITStargazers:881Issues:9Issues:47

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonLicense:MITStargazers:821Issues:34Issues:36

BlenderToolbox

Some simple Blender scripts for rendering paper figures

Language:PythonLicense:Apache-2.0Stargazers:538Issues:7Issues:14

video2dataset

Easily create large video dataset from video urls

Language:PythonLicense:MITStargazers:470Issues:9Issues:153

aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

Language:Jupyter NotebookLicense:MITStargazers:397Issues:12Issues:6

maskgit

Official Jax Implementation of MaskGIT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:387Issues:17Issues:12
Language:PythonLicense:Apache-2.0Stargazers:363Issues:26Issues:2

recurrent-interface-network-pytorch

Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch

Language:PythonLicense:MITStargazers:188Issues:11Issues:16

fastGPT

Fast GPT-2 inference written in Fortran

Language:FortranLicense:MITStargazers:177Issues:7Issues:19