SamitHuang

followers

following

stars

Samit's repositories

mindcv

A toolbox of vision models and algorithms based on MindSpore

Language:PythonApache-2.0100

mindocr

A toolbox of OCR models, algorithms, and pipelines based on MindSpore

Language:PythonApache-2.0100

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonApache-2.0000

awesome-huge-models

A collection of AWESOME things about HUGE AI models.

000

deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods.

Language:Jupyter NotebookApache-2.0000

diff_wk_sd2

Language:Python000

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Apache-2.0000

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

NOASSERTION000

generative-models

Generative Models by Stability AI

MIT000

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Apache-2.0000

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

MIT000

mindcv2301

Language:Python000

mindcv_debug

Language:Jupyter NotebookApache-2.0000

mindocr-1

A toolbox of OCR models, algorithms, and pipelines based on MindSpore

Language:PythonApache-2.0000

mindocr_test

Language:PythonApache-2.0010

mindone

one for all, Optimal generator with No Exception

Language:PythonApache-2.0000

mmclassification

OpenMMLab Image Classification Toolbox and Benchmark

Language:PythonApache-2.0000

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Apache-2.0000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.0000

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

Language:PythonApache-2.0000

open_clip

An open source implementation of CLIP.

NOASSERTION000

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

NOASSERTION000

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language:PythonApache-2.0000

stable-diffusion

A latent text-to-image diffusion model

NOASSERTION000

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

MIT000

styleguide

Style guides for Google-originated open-source projects

Apache-2.0000

video_recaption

Language:PythonApache-2.0000

videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

MIT000

yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

GPL-3.0000

yolov7_mindspore

Language:PythonMIT000