Samit's repositories
AnimateDiff
Official implementation of AnimateDiff.
awesome-huge-models
A collection of AWESOME things about HUGE AI models.
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
generative-models
Generative Models by Stability AI
magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
magvit2-pytorch
Implementation of MagViT2 Tokenizer in PyTorch
mindocr-1
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
mindone
one for all, Optimal generator with No Exception
mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We sincerely hope the whole open-source community can contribute to this project.
open_clip
An open source implementation of CLIP.
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
stable-diffusion
A latent text-to-image diffusion model
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
styleguide
Style guides for Google-originated open-source projects
videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors