evijit / PublicSphere2

Public Sphere 2.0: Paragraph-Comment Dataset

This dataset is drawn from articles published by two news organizations, the Guardian and the New York Times. Each row has the following three tab-separated columns:

Paragraph Text, Comment Text, Relevance score of paragraph and comment

Paragraph text refers to the content of a single paragraph in an article A.

Comment text refers to the content of a single comment of the same article A.

The relevance score rates how relevant the comment text is to the paragraph text. It ranges from 1 to 5, with 5 being the highest relevance and 1 the lowest.
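The three-column layout described above could be read along the following lines. This is only a minimal sketch: the names `Pair` and `read_pairs` are hypothetical, and it assumes that rows contain no quoted fields or embedded tabs and that the score is numeric. Rows that do not fit (for example a header row, if the file has one) are skipped rather than treated as errors.

```python
import csv
from typing import Iterable, Iterator, NamedTuple


class Pair(NamedTuple):
    """One dataset row (hypothetical container, not part of the dataset)."""
    paragraph: str
    comment: str
    relevance: float


def read_pairs(lines: Iterable[str]) -> Iterator[Pair]:
    """Yield well-formed rows from tab-separated lines.

    Assumption: fields are not quoted, so quoting is disabled and
    quotation marks inside the text are kept literally.
    """
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 3:
            continue  # skip blank or malformed lines
        paragraph, comment, raw_score = row
        try:
            score = float(raw_score)
        except ValueError:
            continue  # skip non-numeric scores, e.g. a header row
        if 1 <= score <= 5:  # documented score range
            yield Pair(paragraph, comment, score)
```

To use it on a file, pass an open file object, for example `open(path, encoding="utf-8", newline="")`, where `path` points to wherever the dataset file is stored locally.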
