There are 3 repositories under sparkstreaming topic.
:boom: :rocket: 封装sparkstreaming动态调节batch time(有数据就执行计算);:rocket: 支持运行过程中增删topic;:rocket: 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Real-time Machine Learning with Apache Spark on Twitter Public Stream
全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn、hbase、kafka、scala、sparkcore、sparkstreaming、sparksql。教程包含所有的源代码演示以及在线文档说明。
SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失
:boom: :alien: :hotsprings::rocket:Encapsulated spark 与其他组件的结合api,方便使用,例如 es,hbase,kudu,kafka,mq等
Spark 2.x 案例操作:Scala版本与 Java1.8lambda版代码示例。涵盖Spark核心技术操作SparkCore、SparkSql、SparkStreaming。同时提供了Spark高级性能优化、序列化、广播变量、数据倾斜、算子优化、JVM优化、troubleshooting、数据倾斜解决方案。是多年来根据工作积累整理出来!
电影推荐系统,包括基于ALS、LFM的离线推荐、实时推荐,基于Spark
Spark in Action, 2e - chapter 10 - Ingestion through structured streaming
A Simple Real-Time Detector of DDOS Attacks with Apache Kafka And Spark Streaming
The project aims to design and implement a real-time movie recommendation system using the EK Stack (Elasticsearch and Kibana), Kafka, and a personalized recommendation API to enhance the user experience on Jay-Zz Entertainment's streaming platform.
A Realtime Stock Portfolio Manager built using Apache Distributed Technologies!
Apache Spark machine learning project using pyspark
Big Data Project - SSML - Spark Streaming for Machine Learning
SparkStreaming新手友好向模板,简化SparkStreaming开发
Full poc on spark 2, Spark RDD, Spark DStream, Spark SQL, Spark Datasets & DataFrames & Spark Structured Streaming [SCALA][SPARK]
Track trending hashtags on Twitter in real time.
This is a repository i have created to put up some of the knowledge i have gained around Big Data Technologies especially Spark, GraphX etc.
This repository contains files, codes and markdown documents for "big data from scratch" writings on my blog (z-ing.net)
Big data computing tasks conducted with PySpark. The problems involve MapReduce and Streaming algorithms.
This project gets data from Spotify API , ingests into kafka for streaming and processes it through spark streaming. All this is done on Azure.
This project contains snippets of Scala code for illustrating various Apache Spark concepts. I write all these code when i'm learning spark.
a streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.
Developed a real-time streaming analytics pipeline using Apache Spark to calculate and store KPIs for e-commerce sales data, including total volume of sales, orders per minute, rate of return, and average transaction size. Used Spark Streaming to read data from Kafka, Spark SQL to calculate KPIs, and Spark DataFrame to write KPIs to JSON files.
sparkML智能客户系统项目实战-全套笔记,详细记录学习过程
A real-time sales data analysis Application using Spark Structured Streaming, Kafka as a messaging system, PostgreSQL as a storage for processed data, and Superset for creating a dashboard.
This Code helps take a close look of spark Data Streaming Structure
BigDataBusinessRuleEngine in Spark, scala, Drools.
Big Data Spark Hadoop Kafka Flink Spark Streaming