Word Embedding with (John Snow Lab) NLU

NLU has created a powerful API for embeddings (and even some NLP downstream-task like sarcasm detection or sentiment classification) in 1-liner of code. However, at the beginning of each NLP projects, you are facing the issue of selecting the model that fits best to your data structure. This app is designed for selecting and comparing pre-trained NLP models from NLU (John Snow Lab) with own data. Whether your project has word, sentence or document embeddings: upload the data, select some pre-trained models and download the embeddings.

This app was build with Streamlit and has the sections:

Word embbeding

Sentance or Document embedding

Fake News Classifier

Get start on your local machine

Getting start with the description below on your go to John White Lab Installation

1. Java 8

You only need to configure Java 8 on your machine and are good to go! Unless you are on Windows, which requires 1 additional step.

Check you java version

$ java -version
# should be Java 8 (Oracle or OpenJDK)

2. Windows Specific Prerequisites

Download winutils.exe
Create folder C:\winutils\bin
Copy winutils.exe inside C:\winutils\bin
Set environment variable HADOOP_HOME to C:\winutils

3 Install NLU

Install PySpark based NLU form pip.

Note: For nlu<=1.0.2 please use a Python version with version number SMALLER than 3.8

$ pip install nlu

Credentials and links

About

Languages

Language:Python 100.0%