KevinPanJun's starred repositories
stable-diffusion
A latent text-to-image diffusion model
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
whisper.cpp
Port of OpenAI's Whisper model in C/C++
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
open_flamingo
An open-source framework for training large multimodal models.
tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
DiffusionDet
[ICCV2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Diffusion-LM
Diffusion-LM
InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
object-centric-ovd
[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".
roboflow-100-benchmark
Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets