hajungong007's repositories
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
ControlNet
Let us control diffusion models
distill-sd
Segmind Distilled diffusion
DualStyleGAN
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
GFPGAN-1024
GFPGAN 1024
gpupixel
Cross-Platform AI Beauty Effects Library, Achieving Commercial-Grade Beauty Effects. Written in C++11, Based on OpenGL/ES and VNN.
Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
img2img-turbo
One-Step Image-to-Image with SD-Turbo
mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Medical-Image-Segmentation
MedSeg: Medical Image Segmentation GUI Toolbox 可视化医学图像分割工具箱
MedicalGPT-zh
MedicalGPT-zh:一个基于ChatGLM的在高质量指令数据集微调的中文医疗对话语言模型
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
NeRF-Factory
An awesome PyTorch NeRF library
OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
OpenGlass
Turn any glasses into AI-powered smart glasses
smirk
Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)
SonarSAM
Segment Anything Model, SAM, Sonar images
Stable-Diffusion-Inpaint
Stable diffusion for inpainting
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Windrecorder
MacOS App Rewind's alternative on Windows platform, your personal memorize search engine. It can continuously recording your screen locally in small file size, and OCR the content so you can backtrack and query memory any time.
wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
xrslam
OpenXRLab Visual-inertial SLAM Toolbox and Benchmark