GaoCode's repositories
awesome-openai-vision-api-experiments
Must-have resource for anyone who wants to experiment with and build on the OpenAI Vision API 🔥
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
binary-to-coco-json-converter
Convert segmentation binary mask images to COCO JSON format.
cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
Deep-Bushfire-Detection
Smoke detection with deep learning.
efficientnet
Implementation of EfficientNet model. Keras and TensorFlow Keras.
EfficientNet-PyTorch
A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
leetcode
All Python solutions for Leetcode
lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
lora-scripts
LoRA training scripts and GUI for diffusion models, using kohya-ss's trainer.
mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
MetaCLIP
Everything about MetaCLIP: curation/training code, metadata, distribution and pre-trained models.
mmdetection
OpenMMLab Detection Toolbox and Benchmark
models
Models and examples built with TensorFlow
Paint-by-Example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
review_object_detection_metrics
Object Detection Metrics. 14 object detection metrics, including mean Average Precision (mAP), Average Recall (AR), and Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding box formats, such as those used in COCO, PASCAL, ImageNet, etc.
sahi
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
stable-diffusion-webui
Stable Diffusion web UI
supervision
We write your reusable computer vision tools. 💜
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
TinyGPT-V
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
tqdm
⚡ A Fast, Extensible Progress Bar for Python and CLI
Transmission-BVM
Dataset and code of our AAAI2022 paper "Transmission-Guided Bayesian Generative Model for Smoke Segmentation"
WEDGE
WEDGE: A multi-weather autonomous driving dataset built from generative vision-language models