radaimi / leveraging-AL-and-CMI-in-HAR

Code for "Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition

This repo contains code accompanying the paper, "Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition".

Datasets

Scripts

Sampling Scripts

The informative and diverse pool-based sampling is based on https://github.com/google/active-learning with modifications that made it faster with large datasets (ExtraSensory) and made it work with matlab scripts (Opportunity)

PAMAP2-related Scripts

ExtraSensory-related Scripts

Reference

Rebecca Adaimi and Edison Thomaz. 2019. Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 3, 3, Article 70 (September 2019), 23 pages.

Download paper here

Bibtex Reference:

@article{10.1145/3351228,
author = {Adaimi, Rebecca and Thomaz, Edison},
title = {Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition},
year = {2019},
issue_date = {September 2019},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
volume = {3},
number = {3},
url = {https://doi.org/10.1145/3351228},
doi = {10.1145/3351228},
abstract = {A difficulty in human activity recognition (HAR) with wearable sensors is the acquisition of large amounts of annotated data for training models using supervised learning approaches. While collecting raw sensor data has been made easier with advances in mobile sensing and computing, the process of data annotation remains a time-consuming and onerous process. This paper explores active learning as a way to minimize the labor-intensive task of labeling data. We train models with active learning in both offline and online settings with data from 4 publicly available activity recognition datasets and show that it performs comparably to or better than supervised methods while using around 10% of the training data. Moreover, we introduce a method based on conditional mutual information for determining when to stop the active learning process while maximizing recognition performance. This is an important issue that arises in practice when applying active learning to unlabeled datasets.},
journal = {Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.},
month = sep,
articleno = {70},
numpages = {23},
keywords = {Stopping Criterion, Conditional Mutual Information, Active Learning, Human Activity Recognition, Data Annotation}
}

About

Code for "Leveraging Active Learning and Conditional Mutual Information to Minimize Data Annotation in Human Activity Recognition"


Languages

Language:Python 100.0%