There are 2 repositories under the spark-streaming-kafka topic.
Code for the book 《Kafka技术内幕》 (Kafka Internals)
One-click docker-compose deployment of Kafka, Spark Streaming, Zeppelin UI, and monitoring (Grafana + Kafka Manager)
Spark with Scala example projects
Samples for using Kafka within Spark Streaming and Akka Actors, Akka Streams
These are a select few projects related to Big Data Analytics and Management. The list mixes small and large projects, all of them interesting.
End-to-end Kafka Streaming Examples on Databricks with Evolving Avro Schemas.
An ETL application built with Quarkus, Spark SQL Streaming, Neo4j, and various types of databases and stores. It also covers DevOps tooling such as Jenkins CI/CD, Docker, and Kubernetes.
Design data streaming architecture and API for a real-life application called the Step Trending Electronic Data Interface (STEDI). It is a working application used to assess fall risk for seniors. When a senior takes a test, they are scored using an index which reflects the likelihood of falling, and potentially sustaining an injury in the course of walking. STEDI uses a Redis datastore for risk score and other data. The Data Science team has completed a working graph for population risk at a STEDI clinic. The problem is the data is not populated yet. You will work with Kafka Connect Redis Source events and Business Events to create a Kafka topic containing anonymized risk scores of seniors in the clinic.
Spark Examples
Structured Streaming Log Analysis
Research on legacy and structured streaming with Spark
A working example of Twitter -> Kafka -> Spark Streaming integration by a beginner
The core objective of this project is to build an end-to-end data streaming pipeline that processes this dataset in real-time. By leveraging modern data engineering tools and techniques, we aim to connect, buffer, process, store, and visualize streaming data. This allows for better understanding of data flows, handling of large-scale real-time data
Project to compare Apache Spark Streaming vs Apache Flink.
Beginner-friendly SparkStreaming template that simplifies SparkStreaming development
Near real-time streaming using Apache Spark and Apache Kafka
How to get closer to the audience using Twitter: a use case following the France football team's run during the 2022 World Cup
Playstore apps rating analysis - Machine Learning on Big Data - Loading streaming data using Kafka and Flume
Repository for Spark structured streaming use case implementations.
In-Stream final project
Intro to streaming data with Kafka, Spark and AWS Glue
spark-kafka-integration
Data processing and ingestion backend for ViyaDB based on Spark Streaming
Mata Elang | Data Preprocessing using Scala and Spark
Using various data processing tools for a real-time data pipeline with Kafka
Projects related to Big Data technologies
substance effects on reflexes
Streaming component of the project, which is written with Spark Streaming.
Collection of spark-components functions for big-data processing
Batch processing and real-time train (railway) data analysis to help Station Masters, refreshing every 20 seconds
DCL-700: Big Data Essentials
Design a data streaming pipeline around Apache Spark, Kafka, and Redis for a real-time application