nlp-benchmark
Datasets:
Dataset |
Classes |
Train samples |
Test samples |
source |
Imdb |
2 |
25 000 |
25 000 |
link |
AG’s News |
4 |
120 000 |
7 600 |
link |
Sogou News |
5 |
450 000 |
60 000 |
link |
DBPedia |
14 |
560 000 |
70 000 |
link |
Yelp Review Polarity |
2 |
560 000 |
38 000 |
link |
Yelp Review Full |
5 |
650 000 |
50 000 |
link |
Yahoo! Answers |
10 |
1 400 000 |
60 000 |
link |
Amazon Review Full |
5 |
3 000 000 |
650 000 |
link |
Amazon Review Polarity |
2 |
3 600 000 |
400 000 |
link |
Models:
- [1]: CNN: Character-level convolutional networks for text classification (paper, code)
- [2]: VDCNN: Very deep convolutional networks for text classification (paper, code)
Experiments:
Results are reported as follows: (i) / (ii)
- (i): Test set accuracy reported by the paper
- (ii): Test set accuracy reproduced here
|
imdb |
ag_news |
sogu_news |
db_pedia |
yelp_polarity |
yelp_review |
yahoo_answer |
amazon_review |
amazon_polarity |
CNN small |
|
84.35 / 87.10 |
91.35 / 93.53 |
98.02 / 98.15 |
|
|
|
|
|
VDCNN (9 layers, k-max-pooling) |
|
90.17 / 89.22 |
96.30 / 93.50 |
98.75 / 98.35 |
94.73 / 93.97 |
61.96 / 61.18 |
|
|
|
VDCNN (17 layers, k-max-pooling) |
|
90.61 / 90.00 |
-/ |
- / |
94.95 / 94.73 |
62.59 / |
|
|
|
VDCNN (29 layers, k-max-pooling) |
|
91.33 / 91.22 |
-/ |
- / |
95.37 / 94.82 |
63.00 / |
|
|
|
HAN |
|
|
|
|
|
|
|
|
|