Fanchouille / ovnlp

OV Toolbox for NLP

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ovnlp

A toolkit to download, train, use fastText word vectors on text data. Also lets you deduplicate data based on TF IDF representation (see txtMatcher) Developed under MIT license by Openvalue : http://openvalue.co

Fasttext

Installation

OVNLP runs on Python 3.6 ONLY.

Just run

> pip install ovnlp

Usage

See demo_notebook.ipynb for usage examples

FT Weights source

Pretrained weights from FB :

Feel free to change weightsource.json to add data sources if needed.

About

OV Toolbox for NLP

License:MIT License


Languages

Language:Python 59.2%Language:Jupyter Notebook 40.8%