Beast code in Giters

charry's repositories

hera

hera: a distributed task scheduling system (built for data departments)

Language: Java | License: GPL-2.0 | 200

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python | License: Apache-2.0 | 100

DataX

DataX is the open-source version of Alibaba Cloud DataWorks Data Integration.

Language: Java | License: NOASSERTION | 100

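A secondary development of DataX: the data-push service is invoked over RPC by passing a JSON configuration, and a number of DataX bugs are fixed. The project uses Nacos as both the configuration center and the service registry, and adds the kafkawriter, rabbitmqwriter, esreader, and hivereader plugins. The HDFS plugin is enhanced to support pushing to partitioned tables and dynamic parameter passing (for example, time parameters that enable incremental extraction). See the example module for usage. The service has been running stably at a listed company for six months, with 100+ tasks in total and more than 1 billion records pushed per day. For usage details, plugin development, and DataX internals, see https://blog.csdn.net/xiaoyao1999hn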

Language: Java | License: NOASSERTION | 100
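The dynamic parameter passing mentioned for the DataX fork above can be illustrated with a minimal sketch. It is not taken from the repository: the `${start_date}` and `${end_date}` placeholder syntax, the `where` and `topic` parameter keys, and the `render_job_config` helper are all assumptions made for illustration. Only the `job.content[].reader/writer` layout and the plugin names follow the description.

```python
import json
from datetime import date, timedelta


def render_job_config(template: str, run_date: date) -> dict:
    """Fill hypothetical ${start_date}/${end_date} placeholders, then parse the JSON."""
    params = {
        # Extract the day before the run date: [run_date - 1 day, run_date)
        "start_date": (run_date - timedelta(days=1)).isoformat(),
        "end_date": run_date.isoformat(),
    }
    rendered = template
    for name, value in params.items():
        rendered = rendered.replace("${" + name + "}", value)
    return json.loads(rendered)


# Hypothetical job template; the parameter keys are illustrative only.
TEMPLATE = """
{
  "job": {
    "content": [{
      "reader": {"name": "hivereader",
                 "parameter": {"where": "dt >= '${start_date}' AND dt < '${end_date}'"}},
      "writer": {"name": "kafkawriter", "parameter": {"topic": "demo_topic"}}
    }]
  }
}
"""

config = render_job_config(TEMPLATE, date(2021, 3, 2))
where = config["job"]["content"][0]["reader"]["parameter"]["where"]
print(where)
```

Each scheduled run passes a later `run_date`, so the extraction window advances by one day without editing the stored configuration.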

dubbo-rest-example

dubbo rest filter

Language: JavaScript | License: MIT | 100

flink

Apache Flink

Language: Java | License: Apache-2.0 | 100

hugegraph

HugeGraph Database core component, including graph engine, API, and built-in backends

Language: Java | License: Apache-2.0 | 100

incubator-dolphinscheduler

Dolphin Scheduler is a distributed, easily extensible visual DAG workflow task scheduling system, dedicated to resolving complex dependencies in data processing so that the scheduling system works out of the box.

Language: Java | License: Apache-2.0 | 100

mybatis-3

MyBatis SQL mapper framework for Java

Language: Java | License: Apache-2.0 | 1 10

netty

Netty project - an event-driven asynchronous network application framework

Language: Java | License: Apache-2.0 | 100

SpringBoot-Simple-Demo

Development template. Development environment: IntelliJ IDEA, JDK 8, Maven 3.5.x, Lombok 1.16.18. Frameworks used: Spring Boot, Swagger2, Druid, Log4j2, MyBatis, MyBatis Plus, MySQL, H2, Thymeleaf.

Language: Java | 1 10

tunnel

PostgreSQL data synchronization tool (implemented in Java), with Hive support

Language: Java | License: Apache-2.0 | 1 10

xxl-rpc

Source code analysis (focusing on Netty in practice). A high-performance, distributed RPC framework (the XXL-RPC distributed service framework).

Language: Java | License: GPL-3.0 | 100

zeus

Taobao Zeus: supports Hadoop MR, Hive, and shell jobs. The front end is written in Java (using GWT, Google's rich-client toolkit). It has since been redeveloped as hera (https://github.com/scxwhite/hera).

Language: Java | 100

datax-distribute

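DataX distributed service: splits the job and the taskGroup into two separate processes that communicate over RPC, which gives DataX distributed capability and avoids the resource limits of a single process.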

010
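The job/taskGroup split described for datax-distribute can be sketched roughly as follows. This is an illustration under stated assumptions, not the repository's implementation: it uses Python's standard-library XML-RPC instead of whatever RPC framework the project uses, the names `run_task_group`, `start_worker`, and `run_job` are hypothetical, and the worker runs in a background thread here only to keep the example self-contained (in the described design it would be a separate process, possibly on another machine).

```python
import threading
from xmlrpc.client import ServerProxy
from xmlrpc.server import SimpleXMLRPCServer


def run_task_group(group_id, records):
    """Worker side: pretend to transfer the records of one task group."""
    return {"group_id": group_id, "transferred": len(records)}


def start_worker():
    """Start an RPC worker on a free local port and return (server, url)."""
    server = SimpleXMLRPCServer(("127.0.0.1", 0), logRequests=False, allow_none=True)
    server.register_function(run_task_group)
    # A thread stands in for the separate taskGroup process in this sketch.
    threading.Thread(target=server.serve_forever, daemon=True).start()
    host, port = server.server_address
    return server, f"http://{host}:{port}"


def run_job(worker_url, records, group_size):
    """Job side: split records into task groups and dispatch each one over RPC."""
    groups = [records[i:i + group_size] for i in range(0, len(records), group_size)]
    with ServerProxy(worker_url, allow_none=True) as worker:
        return [worker.run_task_group(i, group) for i, group in enumerate(groups)]


server, url = start_worker()
results = run_job(url, list(range(10)), group_size=4)
server.shutdown()
server.server_close()
print(results)
```

Because the job side only needs the worker's URL, additional workers could be added and task groups spread across them, which is the point of moving the taskGroup out of the job's process.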

streamx

Make stream processing easier! A Flink & Spark development scaffold. The original intention of StreamX is to make Flink development easier. StreamX focuses on managing development phases and tasks. Our ultimate goal is to build a one-stop big data solution integrating stream processing, batch processing, data warehouse, and data lake.

Language: Scala | License: Apache-2.0 | 000