Fronk Supakorn's repositories
ai-getting-started
A JavaScript AI getting-started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs
anylabeling
Effortless data labeling with AI support from Segment Anything and YOLO!
Axial-LOB-High-Frequency-Trading-with-Axial-Attention
PyTorch implementation of Axial-LOB from 'Axial-LOB: High-Frequency Trading with Axial Attention'
colab-connect
Connect to a Google Colab VM locally from VS Code
DeepDanbooru
AI-based multi-label girl image classification system, implemented using TensorFlow.
DeepFaceLive
Real-time face swap for PC streaming or video calls
DeOldify
A Deep Learning based project for colorizing and restoring old images (and video!)
DiffTalk
[CVPR 2023] The implementation of "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
fashion-clip
FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
ImageBind
ImageBind: One Embedding Space to Bind Them All
large-language-models
Notebooks for Large Language Models (LLMs) Specialization
machine-learning-for-trading
Code for Machine Learning for Algorithmic Trading, 2nd edition.
maxim-pytorch
[CVPR 2022 Oral] PyTorch re-implementation for "MAXIM: Multi-Axis MLP for Image Processing", with *training code*. Official Jax repo: https://github.com/google-research/maxim
MetaPortrait
[CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
plug-and-play
Official PyTorch implementation of “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
sd-webui-controlnet
WebUI extension for ControlNet
stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
super-gradients
Easily train or fine-tune SOTA computer vision models with one open-source training library. The home of YOLO-NAS.
TextRecognitionDataGenerator
A synthetic data generator for text recognition