XiuhongTang's repositories
arctic
Arctic is a streaming lake warehouse service open sourced by NetEase
autocut
Edit videos with a text editor
CloudEon
A lightweight solution for managing big data clusters (Hadoop, Hive, Doris, etc.) on Kubernetes. A Kubernetes-based cloud-native big data platform dedicated to simplifying the operation and maintenance of big data clusters on k8s.
datahub
The Metadata Platform for the Modern Data Stack
datahub-helm
Repository of helm charts for deploying DataHub on a Kubernetes cluster
datasophon
It is committed to rapidly implementing the deployment, management, monitoring and automatic operation and maintenance of the big data cloud native platform, helping you quickly build a stable, efficient, elastic and scalable big data cloud native platform.
datavines
Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.
docker-android
🤖 A minimal and customizable Docker image running the Android emulator as a service.
dolphinscheduler
Apache DolphinScheduler is the modern data workflow orchestration platform with powerful user interface, dedicated to solving complex task dependencies in the data pipeline and providing various types of jobs available `out of the box`
doris-flink-connector
Flink Connector for Apache Doris
Douyin_TikTok_Download_API
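🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili. It supports API calls as well as online batch parsing and downloading.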
flink
Apache Flink
flink-connector-elasticsearch
Apache Flink connector for Elasticsearch
flink-connector-jdbc
Apache Flink
flink-sql-lineage
The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.
incubator-paimon
Apache Paimon (incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-streampark
StreamPark: make stream processing easier! An easy-to-use streaming application development framework and operation platform.
keda
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
luna-ai
Luna AI - a fully automated AI live-streaming system
scaleph
An open data platform based on Flink and Kubernetes. It supports web UI click-and-drop data integration with SeaTunnel on Flink and manages Flink JAR jobs on both YARN and Kubernetes. Scaleph is currently working on a Flink SQL online editor.
seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
tiktok_youtube_douyin_handling
Crawler visualization; TikTok videos; YouTube videos; Douyin videos; moving TikTok/YouTube videos to Douyin; moving Douyin videos to TikTok/YouTube; publishing videos with Selenium
TiktokAutomation
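In April and May 2023, on a whim, I wanted to get into TikTok (TK) and started this project to run a matrix of accounts, but for various reasons I could not continue. I am now making the project public in the hope that it helps others working in self-media. The project covers local proxy IP configuration, Outlook mailbox registration (image CAPTCHAs need to be handled manually), automatic reading of email verification codes, TK account registration and login (this also has a problem: a single attempt works, but a second one is flagged for too many attempts; see the README for details), simulated browsing of TK videos, TK video downloading, editing videos before reposting, and so on.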
TiktokDouyinCrawler
Crawler for TikTok (international) and Douyin (mainland China), with reverse engineering of the a-bogus and x-bogus algorithms
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
wechat-need-web
Make the web version of WeChat usable
windows-in-docker-container
Deploy and manage a Windows OS (x64) seamlessly using Vagrant VM, libvirt, and docker-compose. This innovative approach integrates smoothly into existing workflows, providing an efficient way of containerizing Windows OS for better resource allocation and convenience.