
mmWave_cross_domain_gesture_dataset (MCD-Gesture Dataset)

MCD-Gesture is an open-source mmWave gesture dataset collected across multiple domains (i.e., environments, users, and locations). It can be used to develop domain-independent gesture recognition systems based on mmWave radar. The total size of the processed dataset is 5.3 GB. In the following, we introduce the composition and implementation details of the dataset.

Dataset Introduction

Data collection

  • 750 domains: 6 environments x 25 volunteers x 5 locations
  • 6 environments: meeting room, living room, bedroom, laboratory, and 2 office rooms
  • 25 volunteers: 25 users of different sexes, ages, heights, and weights
  • 5 locations: 5 anchor locations at different distances and angles from the radar, ranging from 0.6 m to 1 m and from -30° to 30°
  • 13 gestures: 6 predefined gestures (push, pull, slide left, slide right, clockwise turning, counterclockwise turning) and 7 other actions as negative samples (lifting the right arm, lifting the left arm, sitting down, standing up, waving a hand, turning around, walking)
  • 24050 samples: 10650 gesture samples + 13400 negative samples, with 695193 radar frames in total

Dataset Implementation

Hardware Configuration

  • This dataset was collected with a TI AWR1843 mmWave radar (left) and a DCA1000 real-time data acquisition board (right).

[Figure: AWR1843 mmWave radar and DCA1000 data acquisition board]

  • The parameters of the radar are set as follows:

| Parameter       | Value         | Parameter           | Value |
|-----------------|---------------|---------------------|-------|
| Start frequency | 77 GHz        | Sample points       | 128   |
| Frequency slope | 99.987 MHz/µs | Sample rate         | 4 MHz |
| Idle time       | 340 µs        | Chirps in one frame | 128   |
| Ramp end time   | 40 µs         | Frame periodicity   | 50 ms |

Under these settings, the radar achieves a frame rate of 20 fps, a range resolution of 0.047 m, and a velocity resolution of 0.039 m/s. The numbers of activated transmit and receive antennas are 2 and 4, respectively.
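These derived quantities follow from the configuration table. Below is a minimal sanity-check sketch (the constant names are ours, and using the chirp center frequency for the wavelength is our assumption):

```python
# Sanity check: reproduce the derived radar quantities from the table above.
C = 3e8                      # speed of light, m/s

START_FREQ = 77e9            # Hz
SLOPE = 99.987e12            # Hz/s (99.987 MHz/us)
IDLE_TIME = 340e-6           # s
RAMP_END_TIME = 40e-6        # s
SAMPLE_POINTS = 128
SAMPLE_RATE = 4e6            # Hz
CHIRPS_PER_FRAME = 128
FRAME_PERIOD = 50e-3         # s

# Frame rate follows directly from the frame periodicity.
frame_rate = 1 / FRAME_PERIOD                      # 20 fps

# Range resolution c / (2B), where B is the bandwidth actually sampled:
# slope times the ADC sampling duration.
sampling_time = SAMPLE_POINTS / SAMPLE_RATE        # 32 us
bandwidth = SLOPE * sampling_time                  # ~3.2 GHz
range_resolution = C / (2 * bandwidth)             # ~0.047 m

# Velocity resolution lambda / (2 * N * Tc), with N chirps per frame and
# chirp repetition time Tc = idle time + ramp end time.
center_freq = START_FREQ + bandwidth / 2           # assumption: use center freq
wavelength = C / center_freq
chirp_time = IDLE_TIME + RAMP_END_TIME             # 380 us
velocity_resolution = wavelength / (2 * CHIRPS_PER_FRAME * chirp_time)

print(f"{frame_rate:.0f} fps, {range_resolution:.3f} m, {velocity_resolution:.3f} m/s")
# -> 20 fps, 0.047 m, 0.039 m/s
```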

Data preprocessing

The raw signals are processed into Dynamic Range Angle Image (DRAI) sequences through a 3D-FFT and noise elimination. A DRAI depicts the Doppler power distribution over spatial positions while people perform gestures: the pixel intensity corresponds to Doppler power, the horizontal axis is the angle of arrival, and the vertical axis is range. For example, the following figure shows a DRAI series for a user performing the gesture "push". When a user pushes, the brightest spot moves vertically, reflecting the changing distance of the hand.

[Figure: DRAI sequence of the gesture "push"]
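The exact preprocessing pipeline is described in the paper; the sketch below only illustrates the general 3D-FFT idea on a raw radar cube. All shapes, the axis ordering, and the simple mean-subtraction form of noise elimination are our assumptions, not the authors' exact implementation:

```python
import numpy as np

def raw_cube_to_drai(cube: np.ndarray) -> np.ndarray:
    """Turn one raw radar frame into a Dynamic Range Angle Image (DRAI).

    cube: complex ADC data with shape (chirps, virtual_channels, samples),
          e.g. (128, 8, 128) for the 2 TX x 4 RX virtual array here.
    Returns a (range, angle) map of Doppler power, cropped to 32 x 32
    to match the dataset's DRAI size.
    """
    # 1) Range FFT along fast time (ADC samples).
    rng = np.fft.fft(cube, axis=2)

    # 2) Static-clutter removal (one simple form of noise elimination):
    #    subtract the per-range mean over chirps, then Doppler FFT
    #    along slow time.
    rng = rng - rng.mean(axis=0, keepdims=True)
    dop = np.fft.fftshift(np.fft.fft(rng, axis=0), axes=0)

    # 3) Angle FFT across the virtual antenna array, zero-padded to
    #    32 angle bins.
    ang = np.fft.fftshift(np.fft.fft(dop, n=32, axis=1), axes=1)

    # 4) DRAI: sum Doppler power over all Doppler bins for each
    #    (range, angle) cell, keeping the nearest 32 range bins.
    power = np.abs(ang) ** 2               # (doppler, angle, range)
    drai = power.sum(axis=0).T[:32, :]     # -> (range, angle), 32 x 32
    return drai
```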

Dataset Structure

  • The DRAI sequence of each gesture sample is saved as a NumPy array with 3 dimensions, T x 32 x 32, where the first dimension is the frame length of the DRAI sequence and the last two dimensions are the size of a single DRAI frame. Each .npy filename has the format y/n_GestureName_EnvironmentLabel_UserLabel_PositionLabel_SampleLabel.npy, where the first character indicates whether it is a predefined gesture (y) or a negative sample (n). For example, the filename "y_SlideRight_e6_u21_p5_s4" denotes the 4th sample of the predefined gesture "SlideRight" performed by user21 at location5 in environment6. A loading sketch is given after this list.

  • An example video showing how to perform the predefined gestures can be viewed here.

  • The number of samples collected from each volunteer is as follows (40 negative samples were recorded per domain, consistent with the row totals and the 13400 negative samples above):

| User                | Samples |
|---------------------|---------|
| User A - User G (7) | 7 users x 5 rooms x 5 locations x (6 gestures x 5 instances + 40 negative samples) = 12250 samples |
| User H - User I (2) | 2 users x 4 rooms x 5 locations x (6 gestures x 5 instances + 40 negative samples) = 2800 samples |
| User J - User L (3) | 3 users x 3 rooms x 5 locations x (6 gestures x 5 instances + 40 negative samples) = 3150 samples |
| User M - User N (2) | 2 users x 2 rooms x 5 locations x (6 gestures x 5 instances + 40 negative samples) = 1400 samples |
| User O - User R (4) | 4 users x 1 room x 5 locations x (6 gestures x 10 instances + 40 negative samples) = 2000 samples |
| User S - User Y (7) | 7 users x 1 room x 5 locations x (6 gestures x 5 instances + 40 negative samples) = 2450 samples |
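A minimal loading sketch based on the filename format above (the dataset root "MCD-Gesture" and the parse_name helper are hypothetical; adjust to however the delivered archive is laid out):

```python
import numpy as np
from pathlib import Path

def parse_name(path: Path) -> dict:
    """Parse a filename such as y_SlideRight_e6_u21_p5_s4.npy."""
    flag, gesture, env, user, pos, sample = path.stem.split("_")
    return {
        "is_gesture": flag == "y",   # 'y' = predefined gesture, 'n' = negative
        "gesture": gesture,
        "environment": env,          # e.g. e6
        "user": user,                # e.g. u21
        "position": pos,             # e.g. p5
        "sample": sample,            # per-domain sample index
    }

# Hypothetical dataset root; point this at the unpacked archive.
for path in sorted(Path("MCD-Gesture").glob("**/*.npy")):
    drai_seq = np.load(path)         # shape (T, 32, 32)
    meta = parse_name(path)
    print(path.name, drai_seq.shape, meta["gesture"])
```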

How to access the dataset

To obtain the dataset, please sign the agreement, scan it, and send it to yadongli@mail.ustc.edu.cn. Within three days, you will receive a notification email containing the download links for the dataset.

Citation

If you use this dataset, please cite the following paper:

Y. Li et al., "Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave Signal," in IEEE Transactions on Mobile Computing, 2022, doi: 10.1109/TMC.2022.3207570.

You may also be interested in HIBER, the Human Indoor Behavior Exclusive RF dataset:

Z. Wu et al., "RFMask: A Simple Baseline for Human Silhouette Segmentation With Radio Signals," in IEEE Transactions on Multimedia, 2022, doi: 10.1109/TMM.2022.3181455.

C. Yu et al., "RFGAN: RF-Based Human Synthesis," in IEEE Transactions on Multimedia, 2022, doi: 10.1109/TMM.2022.3153136.
