Fronk Supakorn's repositories
Anything2Image
Generate image from anything with ImageBind and Stable Diffusion
ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
DCI-VTON-Virtual-Try-On
[ACM Multimedia 2023] Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow.
DiffFace
DiffFace: Diffusion-based Face Swapping with Facial Guidance
DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
DM-VTON
👗 DM-VTON: Distilled Mobile Real-time Virtual Try-On
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
IP_LAP
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
LatentAvatar
A PyTorch implementation of "LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar"
liver-lesions-detection-2023
1st place solution for the Liver Lesions Detection based on Ultrasound Image Hackathon (12-14 Aug, 2023)
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
MultiDiffusion
Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)
NeuralPreset
AI-Generated Presets for Faithful 4K Color Style Transfer in Real Time [CVPR 2023]
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
PatchTST
An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730
RevIN
RevIN: Reversible Instance Normalization For Accurate Time-series Forecasting Against Distribution Shift
stable-diffusion-webui
Stable Diffusion web UI
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
STAEformer
[CIKM'23] Official code for our paper "Spatio-Temporal Adaptive Embedding Makes Vanilla Transformer SOTA for Traffic Forecasting".
StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
StyleSync
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
StyleSync_PyTorch
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models