Sanny Liu's repositories
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
faiss
A library for efficient similarity search and clustering of dense vectors.
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
inpaint-web
A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。| 基于 Webgpu 技术和 wasm 技术的免费开源 inpainting & image-upscaling 工具, 纯浏览器端实现。
insightface
State-of-the-art 2D and 3D Face Analysis Project
json
JSON for Modern C++
labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
lora-scripts
LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
open_clip
An open source implementation of CLIP.
opencv-mobile
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, WebAssembly
sd-webui-controlnet
WebUI extension for ControlNet
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-docker
Easy Docker setup for Stable Diffusion with user-friendly UI
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Vary
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
YOLO-FaceV2
YOLO-FaceV2: A Scale and Occlusion Aware Face Detector
yolov8-face
yolov8 face detection with landmark
YOLOv8API
Flask API for object detection and instance segmentation using YOLOv8