Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.
Home Page:https://tatianashavrina.github.io/taiga_site/downloads
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool