dbklim / Russian_subtitles_dataset

Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.

Home Page:https://tatianashavrina.github.io/taiga_site/downloads

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

dbklim/Russian_subtitles_dataset Issues

No issues in this repository yet.