MarshaGomez / Multilabel-Classification

The dataset is made up of WikiHow articles. We propose a unified multi-class active learning approach for automatically labeling articles. The experimental results show that the proposed approach works effectively even with a significantly reduced amount of labeled data.

Home Page:https://github.com/MarshaGomez/Multilabel-Classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Mining 2020 - Classification of WikiHow pages

The aim is to build a multi-class classifier of Wikihow pages. A Wikihow page pertains to one and only one of 19 macro-categories. Right now resources are labeled by hand by the creator of the resource, but our aim is to build an automatic tool for the task, based on the article's text and summary.

The report can be found here.

About

The dataset is made up of WikiHow articles. We propose a unified multi-class active learning approach for automatically labeling articles. The experimental results show that the proposed approach works effectively even with a significantly reduced amount of labeled data.

https://github.com/MarshaGomez/Multilabel-Classification


Languages

Language:Java 90.2%Language:Python 9.8%