lkafle / nlp_model_selection_app

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Word Embedding with (John Snow Lab) NLU

NLU has created a powerful API for embeddings (and even some NLP downstream-task like sarcasm detection or sentiment classification) in 1-liner of code. However, at the beginning of each NLP projects, you are facing the issue of selecting the model that fits best to your data structure. This app is designed for selecting and comparing pre-trained NLP models from NLU (John Snow Lab) with own data. Whether your project has word, sentence or document embeddings: upload the data, select some pre-trained models and download the embeddings.

This app was build with Streamlit and has the sections:

Word embbeding

Sentance or Document embedding

Fake News Classifier

Get start on your local machine

Getting start with the description below on your go to John White Lab Installation

1. Java 8

You only need to configure Java 8 on your machine and are good to go! Unless you are on Windows, which requires 1 additional step.

Check you java version

$ java -version
# should be Java 8 (Oracle or OpenJDK)

2. Windows Specific Prerequisites

  • Download winutils.exe
  • Create folder C:\winutils\bin
  • Copy winutils.exe inside C:\winutils\bin
  • Set environment variable HADOOP_HOME to C:\winutils

3 Install NLU

Install PySpark based NLU form pip.

Note: For nlu<=1.0.2 please use a Python version with version number SMALLER than 3.8

$ pip install nlu

Credentials and links

About


Languages

Language:Python 100.0%