CrazyPig's starred repositories
unitycatalog
Open, Multi-modal Catalog for Data & AI
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
tugraph-analytics
TuGraph Analytics is a distributed graph compute engine.
facebook-hive-udfs
Facebook's Hive UDFs
hive-extension-examples
Examples for extending hive
fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
spring-hadoop
Spring for Apache Hadoop is a framework for application developers to take advantage of the features of both Hadoop and Spring.
excel-streaming-reader
An easy-to-use implementation of a streaming Excel reader using Apache POI
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
riskcontrol
轻量级JAVA实时业务风控系统框架
mysql-binlog-connector-java
MySQL Binary Log connector
open-monitor
Distributed monitoring system based on Prometheus
DataDefender
Sensitive Data Management: Data Discovery and Anonymization toolkit
chlorine-finder
A Java Library to detect and mask sensitive data
nv-websocket-client
High-quality WebSocket client implementation in Java.
sofa-jarslink
Jarslink is a sofa ark plugin used to manage multi-application deployment
technology-talk
【大厂面试专栏】一份Java程序员需要的技术指南,这里有面试题、系统架构、职场锦囊、主流中间件等,让你成为更牛的自己!
grpc-by-example-java
A collection of useful/essential gRPC Java Examples
binlog2sql
Parse MySQL binlog to SQL you want