mcanalesmayo / LanguageModel-NGram

Measuring the performance of bigrams and trigrams in language modeling problems

LanguageModel-NGram

Description

A generalized N-gram model has O(K^N) storage complexity, where K is the number of words in the vocabulary and N is the length of the n-gram (an N-gram model corresponds to a Markov chain of order N - 1). Therefore:

  • The ideal solution has O(K^K) complexity, which makes it computationally intractable
  • Bigrams have O(K^2) complexity
  • Trigrams have O(K^3) complexity
  • ... and so on
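To make the growth concrete, here is a minimal sketch in Python (not the repository's MATLAB code) that prints the worst-case number of table entries for a few n-gram sizes. The vocabulary size of 10,000 words and the helper name `ngram_table_size` are assumptions chosen only for illustration.

```python
def ngram_table_size(vocab_size: int, n: int) -> int:
    """Worst-case number of distinct n-grams over a vocabulary: K^N."""
    return vocab_size ** n


# Hypothetical vocabulary size, chosen only to illustrate the growth.
K = 10_000

for n, name in [(1, "unigram"), (2, "bigram"), (3, "trigram")]:
    # Each extra word of context multiplies the table size by K.
    print(f"{name}: {ngram_table_size(K, n):.3e} entries")
```

In practice most of these entries are never observed in a corpus, so counts are usually stored sparsely; the bound still shows why each additional word of context is expensive.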

The aim of this program is to measure the performance of bigrams and trigrams in language modeling problems.
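One common way to compare n-gram models is perplexity on held-out text. The sketch below, in Python rather than the repository's MATLAB, is an illustration under stated assumptions: the page does not say which metric the program uses, and the add-one smoothing, the toy corpus, and the function names `train_ngram` and `perplexity` are all hypothetical.

```python
import math
from collections import Counter


def train_ngram(tokens, n):
    """Count n-grams and the (n-1)-word contexts that precede each final word."""
    positions = range(len(tokens) - n + 1)
    ngrams = Counter(tuple(tokens[i:i + n]) for i in positions)
    contexts = Counter(tuple(tokens[i:i + n - 1]) for i in positions)
    return ngrams, contexts


def perplexity(tokens, n, ngrams, contexts, vocab_size):
    """Perplexity of `tokens` under an n-gram model with add-one smoothing.

    Add-one smoothing is an assumption made here so that unseen n-grams
    do not receive zero probability.
    """
    log_prob = 0.0
    count = 0
    for i in range(len(tokens) - n + 1):
        gram = tuple(tokens[i:i + n])
        p = (ngrams[gram] + 1) / (contexts[gram[:-1]] + vocab_size)
        log_prob += math.log(p)
        count += 1
    return math.exp(-log_prob / count)


# Toy data, invented for illustration only.
train = "the cat sat on the mat the cat ate".split()
test = "the cat sat".split()
vocab_size = len(set(train))

for n in (2, 3):
    ngrams, contexts = train_ngram(train, n)
    print(n, perplexity(test, n, ngrams, contexts, vocab_size))
```

Lower perplexity means the model assigns higher probability to the held-out text. On a corpus this small the numbers say little; the point is only the shape of the comparison between N = 2 and N = 3.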

References

If you use this code, please reference: Javier Civera - jcivera@unizar.es

About

License: MIT License


Languages

Language: MATLAB 100.0%