There are 2 repositories under spark-nlp topic.
Models and Pipelines for the Spark NLP library
Tutorial for Topic Modelling using PySpark and Spark NLP
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
Build and publish Spark NLP to Anaconda Cloud
NLP functions with John Snow's Spark NLP in the Java language
Miscellaneous codes and writings for MLOps
This is the repository for all of my Spark projects, which include Spark NLP & Computer Vision projects.
Python scripts to process, and analyze log files using PySpark.
Compilation of NLP notebooks from various sources that address several technical challenges.
contains notebooks on topic modeling, spark and pandas implementation
Final project of "Big Data Analytics and Business Intelligence" course.
Testing and benchmarking some of the existing NLP libraries in Apache Spark
Benchmark inference of custom and pre-trained NLP models with Spark NLP.
SparkNLP and Healthcare SparkNLP based analysis of scientific literature on equine colic.
Project that captures information about all Dark Souls 3 (DS3) weapons and performs textual analysis on.
Big data project to analyze (Subreddit : NoStupidQuestions) comments
NHL event detection project using 500GB+ of Twitter and Reddit data.
Final Project for Harvard's Scala for Big Data Systems course
This project is a Spark ML pipeline using Pyspark for NLP, using annotators: DocumentAssembler, Tokenizer, WordEmbeddingsModel, PerceptronModel & NerCrfModel. It prints a transformed DataFrame showing POS & NER columns, and analyzes any relationship between found entities & their POS attributes. Hands-on experience with Spark, Pyspark & Spark-NLP.
Text summarization algorithms using PySpark
O projeto mostra a utilização de alguns métodos da biblioteca Spark NLP que são utilizados para processamento de linguagem natural.
Spark training examples of Machine Learning and Natural Language Processing, using Spark ML and Spark NLP libraries.
An implementation of NLP Sandbox PHI Annotator API based on Spark NLP
Applying Spark NLP models to Enron datasets with Postgres