Tsolak Ghukasyan's repositories
arpa-paraphrase-corpus
Sentential paraphrase datasets and BERT-based paraphrase detection models for the Armenian language.
word-embeddings-eval-hy
Pre-trained fastText, word2vec, GloVe embeddings for the Armenian language and datasets for their intrinsic and extrinsic evaluation
babylondigger
Toolkit for text segmentation, part-of-speech tagging, lemmatization and dependency parsing
Language:PythonGPL-3.0000
Language:Python000
Language:AutoIt000
Language:Jupyter Notebook000
mlevn.github.io
ML EVN - Yerevan machine learning community
Language:HTML000
nltk_data
NLTK Data
000
Language:Jupyter Notebook000
spaCy
đź’« Industrial-strength Natural Language Processing (NLP) with Python and Cython
Language:PythonMIT000
style-change-analysis-1
Datasets and resources for stylometry-based intrinsic plagiarism detection research for the Armenian language.
000
Language:Python000
000
Language:Jupyter Notebook000