Zilun Zhang's repositories
AidLearning-FrameWork
🔥🔥🔥 AidLearning is a powerful AIoT development platform. It builds a Linux environment on Android that supports a GUI, deep learning, and a visual IDE. Aid now supports CPU+GPU+NPU inference with high-performance acceleration. Linux on Android or HarmonyOS.
AMFMN
The source code of AMFMN and the dataset RSITMD
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
ColossalAI
Making big AI models cheaper, easier, and scalable
Counting-from-Sky-A-Large-scale-Dataset-for-Remote-Sensing-Object-Counting-and-A-Benchmark-Method
Counting from Sky: A Large-scale Dataset for Remote Sensing Object Counting and A Benchmark Method
diffusiondb
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
earthengine-api
Python and JavaScript bindings for calling the Earth Engine API.
fastdup
FastDup is a tool for gaining insights from a large image collection. It can find anomalies, duplicate and near-duplicate images, and clusters of similarity, and learn the normal behavior and temporal interactions between images. It can be used for smart subsampling of a higher quality dataset, outlier removal, novelty detection of new information to be
GaLR
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
geo-bench
GEO-Bench: Toward Foundation Models for Earth Monitoring
laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
markdown_readme
Markdown - you can mark up titles, lists, tables, etc., in a much cleaner, more readable and accurate way than if you do it with HTML.
OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
RSVLM_papers
Remote Sensing Vision Language Model Paper
satellite-image-deep-learning
Deep learning with satellite & aerial imagery
segment-geospatial
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
SemanticLocalizationMetrics
The first research for semantic localization
SODA-mmrotate
OpenMMLab Rotated Object Detection Toolbox and Benchmark
Transformer-in-Vision
Recent Transformer-based CV and related works.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.