Tokenize the Gutenberg corpus, handling the exceptions. Build a language model based on N-gram prediction. Use the language model to score a sentence on its grammatical features, including punctuation. Finally, score the spelling of a word without using a dictionary lookup.
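As a rough illustration of the tasks above, here is a minimal sketch of a bigram model with add-one smoothing. It is not the notebook's implementation: the names `tokenize` and `BigramModel`, the regular expression, the choice of bigrams, and the smoothing are all assumptions made for this example. The same class is reused at the word level (sentence scoring, with punctuation kept as tokens) and at the character level (spelling scoring without a dictionary).

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase the text, keep words (with an
    # internal apostrophe, e.g. "don't") and emit punctuation as tokens.
    return re.findall(r"[a-z]+(?:'[a-z]+)?|[.,;:!?]", text.lower())


class BigramModel:
    """Bigram model with add-one smoothing over arbitrary token sequences."""

    def __init__(self, sequences):
        self.contexts = Counter()  # how often each token occurs as a context
        self.bigrams = Counter()   # how often each (previous, current) pair occurs
        self.vocab = set()
        for seq in sequences:
            padded = ["<s>"] + list(seq) + ["</s>"]
            self.vocab.update(padded)
            self.contexts.update(padded[:-1])
            self.bigrams.update(zip(padded, padded[1:]))

    def score(self, seq):
        # Average log-probability per transition; higher means the
        # sequence looks more like the training data.
        padded = ["<s>"] + list(seq) + ["</s>"]
        v = len(self.vocab)
        total = 0.0
        for prev, cur in zip(padded, padded[1:]):
            p = (self.bigrams[(prev, cur)] + 1) / (self.contexts[prev] + v)
            total += math.log(p)
        return total / (len(padded) - 1)


# Word-level model: score sentences, punctuation included.
corpus = ["The cat sat on the mat.", "The dog sat on the rug."]
sentence_model = BigramModel(tokenize(s) for s in corpus)

# Character-level model: score spellings without a dictionary lookup.
words = ["the", "cat", "sat", "mat", "hat", "that"]
spelling_model = BigramModel(words)
```

With this toy data, `sentence_model.score(tokenize("The cat sat on the mat."))` is higher than the score of the same tokens in reverse order, and `spelling_model.score("chat")` is higher than `spelling_model.score("tcha")`. On the real corpus, a higher-order model and a better smoothing scheme would likely be needed.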
Language: the project is written in Python 3 (v3.7.3).
The project is run as a Jupyter Notebook.
- Clone this repository
$ git clone https://github.com/AlokDebnath/LanguageModel.git
- Run the following command
$ jupyter notebook Project\ 1\ Task\ 1
- Press Shift+Enter in each code cell to run it