shunkakinoki / now_eminem

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

deep-elon-tweet-generator

Elon Musk-like tweets generated with a recurrent neural network (RNN).

This project was built with the fastai library (built on Pytorch) and the language model used to generate the tweets is an AWD-QRNN (ASGD Weight-Dropped Quasi-Recurrent Neural Network).

The task involves predicting the next token (i.e. a word, a character, a punctuation mark, etc.) in a sequence given the n preceding tokens.

It was trained with the WikiText-103 dataset (https://www.sysml.cc/doc/50.pdf) and then fine-tuned with Elon's tweets using transfer learning.

  • The WikiText language modeling dataset is a collection of over 100 million tokesn extracted from Wikipedia.
  • Elon's tweets are from 2010 to 2018 and contain over 6,000 tweets

All the processing steps are in the jupyter notebook. The other fields are used to build a web app that generates Musk-like tweets on demand. It is hosted using Zeit.

The app can be seen live here: https://deepelon.com/

Inference (generated tweets) can take a couple of minutes and the generated tweets are still prone to errors depending on which root word(s) you use to generate the tweet.

About

License:Apache License 2.0


Languages

Language:Jupyter Notebook 81.7%Language:Python 5.3%Language:CSS 4.9%Language:HTML 4.6%Language:JavaScript 3.0%Language:Dockerfile 0.5%