KyoungaRyu / rnn-pytorch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Naver Sentiment Movie Corpus

RNN(Recurrent Neural Network)/ PyTorch

This is a sentiment analysis of Naver Movie Reviews using PyTorch.

To analyze movie corpus, Konlpy needs to be installed with Korean tokenizer(twitter- Open Korean Text).

1. Dataset

Data from Naver Movie Review Dataset.

Tabular dataset includes id, document, and label, which is 'tsv' format. ('tsv' format delimits columns with tabs.)

2. Model

Recurrent Neural Network(RNN)

The model uses the nn.LSTM module to get better results.

3. Architecture

Long-Short-Term-Memory (LSTM) Architecture

Reference

LSTM architecture usually consists of three regularators, called gates, of the flow of information inside the LSTM unit: an input gate, an output gate and a forget gate.

4. Conclusion

This RNN model of validation dataset shows 85% of accuracy as the results.

About


Languages

Language:Jupyter Notebook 100.0%