There are 2 repositories under azkaban topic.
大数据入门指南 :star:
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Schedulis is a high performance workflow task scheduling system that supports high availability and multi-tenant financial level features, Linkis computing middleware, and has been integrated into data application development portal DataSphere Studio
最好的大数据项目。《Titan数据运营系统》,本项目是一个全栈闭环系统,我们有用作数据可视化的web系统,然后用flume-kafaka-flume进行日志的读取,在hive设计数仓,编写spark代码进行数仓表之间的转化以及ads层表到mysql的迁移,使用azkaban进行定时任务的调度,使用技术:Java/Scala语言,Hadoop、Spark、Hive、Kafka、Flume、Azkaban、SpringBoot,Bootstrap, Echart等;
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步
基于大数据的图书推荐系统
Apache DolphinScheduler Kubernetes Operator.
Ambari service for Azkaban
:file_folder: Extract, Transform, Load (ETL) :construction_worker: refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
基于Spark的电影推荐系统
Frontend pages of project `flink-platform-backend`
springboot-azkaban job地址https://github.com/poemp/azkaban-data-push-job
Generates Azkaban jobs in zip format by taking flows in xml file
Hogwarts is digitizing their sorting process. Create a python script to assign a new student to one of the following houses: Gryffindors, Hufflepuff, Ravenclaw and Slythering. Use the following steps: Design a questionnaire to assess the qualities of a student Compare her qualities with the chart.
使用python监控azkaban,使用官方提供的接口,解析日志,当执行次数或执行时间不达标,通过企业微信发送发送到微信群中;