BlackKakapo's repositories

Romanian-Word-Embeddings

Romanian Word Embeddings. Here you can find pre-trained word embeddings for Romanian. Current methods: CBOW, Skip-Gram, FastText (from the Gensim library). The .vec and .model files are available for download (all in one archive).
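As a rough illustration of what a .vec file contains, here is a minimal sketch that parses the plain-text word2vec format (a header line with the vocabulary size and dimension, then one word and its vector per line) and compares two words by cosine similarity. The sample data and the helper names `load_vec` and `cosine` are made up for this example and are not taken from the repository.

```python
import io
import math

def load_vec(stream):
    """Parse word2vec text format: a 'count dim' header, then 'word v1 ... vd' per line."""
    header = stream.readline().split()
    dim = int(header[1])
    vectors = {}
    for line in stream:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return dim, vectors

def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Tiny invented sample standing in for a real .vec file (hypothetical values).
SAMPLE = "3 2\ncasă 1.0 0.0\nlocuință 0.9 0.1\nmunte 0.0 1.0\n"

dim, vectors = load_vec(io.StringIO(SAMPLE))
similar = cosine(vectors["casă"], vectors["locuință"])
different = cosine(vectors["casă"], vectors["munte"])
```

For a real file, the same function can be given `open(path, encoding="utf-8")`, where `path` is wherever the downloaded archive was extracted. In practice, Gensim's `KeyedVectors.load_word2vec_format` is the usual way to load this format and is much faster for large vocabularies.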

Icelandic-Word-Embedding

Icelandic Word Embeddings. Here you can find pre-trained word embeddings for Icelandic. Current methods: CBOW, Skip-Gram, FastText (from the Gensim library). The .model files are available for download.

License: Apache-2.0 | Stargazers: 4 | Issues: 2 | Issues: 0

dexonline-API

A simple API that queries the dexonline.ro site and returns the definitions of words. The API is written for Python.

Language: Python | Stargazers: 3 | Issues: 2 | Issues: 0

BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

bert

TensorFlow code and pre-trained models for BERT

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Datasets

Machine learning datasets used in tutorials on MachineLearningMastery.com

Stargazers: 0 | Issues: 1 | Issues: 0

Icelandic

An old (2017) short description of how to apply word2vec to a small Icelandic corpus and explore embedding similarities and common features of the word2vec library. More of a conceptual demonstration.

Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0
Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0

RoWordNet

Romanian WordNet (Data + API for Python)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0