qianliu0708 / 5AbstractsGroup

A dataset for text classification

# 5AbstractsGroup

Dataset used in "Leveraging Pattern Associations for Word Embedding Models", COLING 2018.

The dataset contains academic papers from five domains, collected from the Web of Science: business, artificial intelligence, sociology, transport, and law. Each line is one document, consisting of the title and abstract of a single paper.
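
Because each line holds one document, the data can be read with a plain line-by-line loader. The sketch below is a minimal illustration, not part of the repository: the function name `load_documents`, the UTF-8 encoding, and the idea that each domain's documents sit in their own text file are all assumptions.

```python
from pathlib import Path


def load_documents(path, label):
    """Read one document per line and pair each with the given label.

    Assumptions (not confirmed by the repository): the file is UTF-8
    plain text, and every non-blank line is one title-plus-abstract document.
    """
    documents = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            text = line.strip()
            if text:  # skip blank lines
                documents.append((text, label))
    return documents
```

For example, `load_documents("business.txt", "business")` would return a list of `(text, label)` pairs, where `business.txt` is a hypothetical file name standing in for whichever file holds the business documents.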
