There are 5 repositories under spark-structured-streaming topic.
Apache Spark™ and Scala Workshops
Kinesis Connector for Structured Streaming
Custom state store providers for Apache Spark
A real-time reddit data streaming pipeline for sentiment analysis of various subreddits
A curated list of Pulsar tools, integrations and resources.
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Azure Databricks - Advent of 2020 Blogposts
Extensible streaming ingestion pipeline on top of Apache Spark
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
Spark structured streaming examples with using of version 3.5.1
spark structured streaming via HTTP communication
This repository includes supervised and unsupervised machine learning methods which are used to detect anomalies on network datasets. Decision Tree, Random Forest, Gradient Boost Tree, Naive Bayes, and Logistic Regression were used for supervised learning. K-Means was used for unsupervised learning.
Spark1.6和spark2.2的示例,包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps
An naive anomaly detection and data visualization tool for F1 on board telemetry data.
Spark Structured Streaming data pipeline that processes movie ratings data in real-time.
Real-time streaming data pipeline for Twitter Tweets
Samples for using Kafka within Spark Streaming and Akka Actors, Akka Streams
A library having Java and Scala examples for Spark 2.x
Analytics for IoT devices using Apache Spark Structured Streaming 2.4.0
cpu anomaly detection with spark
Spark Structured Streaming with State Store
Elastic scaling is a library that allows to control the number of resources (executors or workers) instantiated by a Spark Structured Streaming Job in order to optimize the effective microbatch duration.
Design and proof-of-concept for a Broker for astronomy using Apache Spark
Spark Examples
Repositorio para la clase de UAM, Máster en Business Intelligence, PARALELIZACIÓN DE DATOS, Modulo de Streaming
This Spark Java project serves as a demonstration of Gradle Spark configuration, specifically focusing on utilizing the MemoryStream class as the streaming source.
Structured Streaming Log Analysis
Kafka streaming job from iomete. This streaming job copies data from Kafka to Iceberg.
NYC Taxi & Limousine Commission's open data with Spark Streaming 3.0.0
🎓 Repositório com a solução de IoT Analytics desenvolvida como parte do Trabalho de Conclusão de Curso (TCC) do curso de Ciência da Computação da Universidade Federal de Campina Grande (UFCG)
Research on legacy and structured streaming with Spark