ai-smalleryu's repositories
apache-shiro-tutorial-webapp
A step-by-step tutorial showing how to secure a web app with Apache Shiro
arctic-fork
Arctic is a streaming lake warehouse service open sourced by NetEase
calcite
Apache Calcite
data-algorithms-book-stu
MapReduce, Spark, Java, and Scala for Data Algorithms Book
data-datart
Datart is a next generation Data Visualization Open Platform
flink-cdc-connectors
CDC Connectors for Apache Flink®
flink-kubernetes-operator
Apache Flink Kubernetes Operator
flink-recommandSystem-demo
:helicopter::rocket:基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
flink-sql-lineage
FlinkSQL字段血缘解决方案及源码。FlinkSQL field lineage solution and source code, The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions. Then get the optimized logical paln through optimization stage, and finally call Calcite RelMetadataQuery to get the lineage relationship at the field level.
flink-sql-security-fork
FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Masking方案。
open-source-manual
A Ebook of Open Source Manual
studyandtest
学习使用
gpt4free
decentralising the Ai Industry, just some language model api's...
hudi
Upserts, Deletes And Incremental Processing on Big Data.
incubator-livy-spark-rest
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
incubator-streampark-quickstar-forkt
Apache StreamPark quickstart
incubator-streampark-sql-gateway
StreamPark, Make stream processing easier! easy-to-use streaming application development framework and operation platform
linkis-fork
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)