vintersnow / rnn-language-model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pytorch language model

Language model implimented in pytorch

Requirements

  • pytorch

How to run

Data sets

  • train data
  • vocabulary file

See this repo

Train data

format: Tokenized by nltk.tokenized.sent_tokenize

Example:

Tokenized Text .
Seconed Text .

Vocabulary file

Fromat: {word} {frequency}

Execute

mkdir ckpt runs
python train.py --num_iters 100 --store_summary --data_path 'path/to/data*' --vocab_file 'path/to/vocab'

About


Languages

Language:Python 99.2%Language:Shell 0.8%