Dr. Zhang's repositories
pytorch-grad-cam
Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Examples for classification, object detection, segmentation, embedding networks and more. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
annotated_deep_learning_paper_implementations
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
census
China county-level population data (census)
chapyter
Chapyter: ChatGPT Code Interpreter in Jupyter Notebooks
City-Camera-Trajectory-Data
City-scale Vehicle Trajectory Data From Traffic Camera Videos
DeepForest
Python Package for Tree Crown Detection in Airborne RGB imagery
dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
GPBoost
Combining tree-boosting with Gaussian process and mixed effects models
GPS-GLASS
GPS-GLASS: Learning Nighttime Semantic Segmentation Using Daytime Video and GPS data
IA-Seg
The code for "Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters".
LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
optuna
A hyperparameter optimization framework
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
satellite-image-deep-learning
Deep learning with satellite & aerial imagery
Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
SHAP_spatial_data_paper
Code and data repository for the paper
SIHE
Estimation of building heights with single street view images
torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
VLM_survey
Vision-Language Models for Vision Tasks: A Survey
Wavelet-transform-fusion-for-spatial-data
This is the code for wavelet transform fusion.