levindoneto / lanGen

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

lanGen | Natural language generation using an n-gram language model

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

Author: Levindo Gabriel Taschetto Neto.

How to deploy

$ pip install docopt
$ pip install nltk

Requirements

  • Operating system: Windows.
  • Python: 3.0 or superior.

How to use

$ python lmgen.py InputFile

InputFile is optional. If no input is given, this default corpus is used.

License

MIT License. Click here for more information about this license.

About

N-Gram language model that learns n-gram probabilities from a given corpus and generates new sentences from it based on the conditional probabilities from the generated words and phrases.

License:MIT License


Languages

Language:Python 100.0%