XiuhongTang (tgluon)

Company: 杭州数梦工场科技有限公司 (Hangzhou DtDream Technology Co., Ltd.)

Location: Hangzhou

XiuhongTang's repositories

License: Apache-2.0 | Stargazers: 0 | Issues: 0

datahub-helm

Repository of helm charts for deploying DataHub on a Kubernetes cluster

Language: Mustache | License: Apache-2.0 | Stargazers: 0 | Issues: 0

incubator-seatunnel-web

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

License: Apache-2.0 | Stargazers: 0 | Issues: 0

tiktok-scraper

TikTok Scraper. Download video posts, collect user/trend/hashtag/music feed metadata, sign URLs, etc.

Stargazers: 0 | Issues: 0

LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

cube-studio

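A cloud-native, one-stop machine learning platform: multi-tenancy, data assets, online notebook development, drag-and-drop task workflow orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, multi-cluster scheduling, multiple project groups and resource groups, edge computing, real-time training of large models, and an AI app store.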

License: NOASSERTION | Stargazers: 0 | Issues: 0

kubesphere

The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

Language: Go | License: Apache-2.0 | Stargazers: 0 | Issues: 0

xiaohongshu

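Xiaohongshu automation: automatic login, optional cookie-based login, and support for uploading image-and-text posts and videos and publishing them automatically.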

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

flink-table-store-101

Playground for Flink Table Store with use cases and performance features

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

License: MIT | Stargazers: 0 | Issues: 0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

arroyo

Arroyo is a distributed stream processing engine written in Rust

License: Apache-2.0 | Stargazers: 0 | Issues: 0

emqx

The most scalable open-source MQTT broker for IoT, IIoT, and connected vehicles

License: NOASSERTION | Stargazers: 0 | Issues: 0

the-algorithm

Source code for Twitter's Recommendation Algorithm

License: AGPL-3.0 | Stargazers: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0

God-Of-BigData

Focused on big data study and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

Stargazers: 0 | Issues: 0

chitu-sdp

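The Chitu real-time computing platform is an enterprise-grade, one-stop, high-performance, low-barrier real-time big data computing platform built on Apache Flink, broadly suited to developing streaming data applications.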

License: GPL-3.0 | Stargazers: 0 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0

docker-hadoop

Apache Hadoop docker image

Stargazers: 0 | Issues: 0

hadoop

Apache Hadoop

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

ddia

Chinese translation of "Designing Data-Intensive Applications" (DDIA)

Language: Python | License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

alldata

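💥🔥 To address the pain points enterprises face when building a big data platform, this project carries out secondary development and maintenance of many Apache big data components and delivers a general-purpose big data platform foundation. It focuses on the problems and challenges that arise in data collection, data storage, data computing, data development, and data operations. The original intent is to build an industry-leading open-source one-stop big data platform, empower the rapid business growth of thousands of small and medium-sized enterprises, and offer a range of solutions to developers who are passionate about big data.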

License: Apache-2.0 | Stargazers: 0 | Issues: 0

alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

docker-krb5-server

A Krb5Server Docker image that is simple and easy to use.

Stargazers: 0 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0

ozone

Scalable, redundant, and distributed object store for Apache Hadoop

License: Apache-2.0 | Stargazers: 0 | Issues: 0

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

flink-sql-security

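A row-level permission solution for FlinkSQL, with source code. It supports row-level data access control per user: a given user can access only the rows they have been authorized for, and unauthorized rows are hidden. This is a solution for Flink in the real-time domain, similar to the Ranger Row-level Filter approach used with Hive in offline data warehouses.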

License: Apache-2.0 | Stargazers: 0 | Issues: 0

mootdx

A simple, convenient wrapper for reading 通达信 (TongDaXin) data

License: MIT | Stargazers: 0 | Issues: 0