There are 0 repository under word-level-language-model topic.
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
In this project, I worked with a small corpus consisting of simple sentences. I tokenized the words using n-grams from the NLTK library and performed word-level and character-level one-hot encoding. Additionally, I utilized the Keras Tokenizer to tokenize the sentences and implemented word embedding using the Embedding layer. For sentiment analysis