Zminghua / SentEncoding

Sentence encoder and training code for Mean-Max AAE

BookCorpus dataset not available

jingjojo opened this issue

Hello,

I went to http://yknzhu.wixsite.com/mbweb, but the BookCorpus dataset is no longer available there. Could you let me know what other dataset I can use for training? When I run run_train.py, I get this error:

FileNotFoundError: [Errno 2] No such file or directory: '/content/SentEncoding/data/bookcorpus/books.vocab'

Thanks.