VirtualRoyalty / spark-nlp-project

Micro project on big data technologies via spark

Repository from Github https://github.comVirtualRoyalty/spark-nlp-projectRepository from Github https://github.comVirtualRoyalty/spark-nlp-project

Russian language processing via Spark(NLP) 🔥

Go to colab

Micro project on big data technologies via spark

Content:

  1. Colab-Spark setup

  2. Data loading

  3. EDA & Preprocessing

  4. Pipelines & Experiments

  5. Text preprocessing

  6. Text classification

    • BoW models + LogReg
    • Transfer Learning (at least an attempt 😀)
  7. Entity Recgnition & Entity Linking

Tech stack:

...and much more 🤘

About

Micro project on big data technologies via spark


Languages

Language:Jupyter Notebook 100.0%