bycnfz / weather2k

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Logo

Weather2K: A Multivariate Spatio-Temporal Benchmark Dataset for Meteorological Forecasting Based on Real-Time Observation Data from Ground Weather Stations

【Accepted】by the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023

https://aistats.org/aistats2023/accepted.html

This repository provides the dataset used in the paper. We hope that the release of Weather2K can provide a foundation for accelerated research in this area and foster collaboration between atmospheric and data scientists.

Download the dataset

Google Drive : https://drive.google.com/drive/folders/1rTabWIuAarDfSkUROK2rzEYJEqP338UM?usp=share_link

If you have any quesetions about the data download, please contact wuming@bupt.edu.cn

The data is hosted in the upper links with the following directory structure :

.
|-- Weather2K-R
|   `-- data.npy
|-- Weather2K-S
|   |-- 79.csv
|   |-- 114.csv
|   |-- 209.csv
|   |-- 259.csv
|   |-- 532.csv
|   |-- 546.csv
|   |-- 850.csv
|   |-- 852.csv
|   |-- 947.csv
|   |-- 1201.csv
|   |-- 1272.csv
|   |-- 1328.csv
|   |-- 1527.csv
|   |-- 1691.csv
|   `-- 1796.csv
`-- idx2stat.txt

The shape of the numpy file of Weather2K-R is (1866, 23, 40896), which means 1,866 groud weather stations, 3 constants for position information and 20 meteorological factors, and 40,896 time steps.

Weather2K-S contains 15 representative and geographically diverse ground weather stations distributed throughout China. They are csv files named after the index used in the numpy file.

idx2stat.txt shows the correspondence bewteen the numpy index and the original station number.

Definitions of the variables in the dataset

Numpy Index Long Name Short Name Unit
0 Latitude lat (°)
1 Longitude lon (°)
2 Altitude alt (m)
3 Air pressure ap hpa
4 Air Temperature t (°C)
5/6 Maximum / Minimum temperature mxt / mnt (°C)
7 Dewpoint temperature dt (°C)
8 Relative humidity rh (%)
9 Water vapor pressure wvp hpa
10/11/12/13/14 Precipitation in 1h / 3h / 6h / 12h / 24h p1 / p2 / p3 / p4 / p5 (mm)
15 Wind direction wd (°)
16 Wind speed ws (ms-1)
17 Maximum wind direction mwd (°)
18 Maximum wind speed mws (ms-1)
19 Land surface temperature st (°C)
20/21 Horizontal visibility in 1 min / 10 min hv1 / hv2 (m)
22 Vertical visibility vv (m)

Cite

If you are using this dataset please cite

Zhu X, Xiong Y, Wu M, et al. Weather2K: A Multivariate Spatio-Temporal Benchmark Dataset for Meteorological Forecasting Based on Real-Time Observation Data from Ground Weather Stations[C]//International Conference on Artificial Intelligence and Statistics. PMLR, 2023: 2704-2722.

About