hackallan's repositories
airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
datasophon
The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`.
flink-streaming-platform-web
基于flink-sql的实时流计算web平台
incubator-doris
Apache Doris (Incubating)
incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
incubator-streampark-website
Apache streampark Website
jenkinsTest
test jenkins
openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
spark
Apache Spark - A unified analytics engine for large-scale data processing
sqoop
Mirror of Apache Sqoop