brucemen711

brucemen711

Geek Repo

0

followers

0

following

0

stars

Github PK Tool:Github PK Tool

brucemen711's repositories

BigData-Notes

大数据入门指南 :star:

Language:JavaStargazers:1Issues:0Issues:0

airbyte

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:SmartyStargazers:0Issues:1Issues:0

awesome-datascience

:memo: An awesome Data Science repository to learn and apply for real world problems.

License:MITStargazers:0Issues:1Issues:0

awesome-sysadmin

A curated list of amazingly awesome open source sysadmin resources inspired by Awesome PHP.

License:NOASSERTIONStargazers:0Issues:1Issues:0

book-notes

Notes from books and other interesting things that I've read. Table of contents at the end 👇

Stargazers:0Issues:1Issues:0

citus

Scalable PostgreSQL for multi-tenant and real-time analytics workloads

Language:CLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

dag-factory

Dynamically generate Apache Airflow DAGs from YAML configuration files

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

License:Apache-2.0Stargazers:0Issues:0Issues:0

dbt-spark

dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks

License:Apache-2.0Stargazers:0Issues:0Issues:0

debezium-examples

Examples for running Debezium (Configuration, Docker Compose files etc.)

License:Apache-2.0Stargazers:0Issues:0Issues:0

hive-metastore-docker

Example for article Running Spark 3 with standalone Hive Metastore 3.0

Stargazers:0Issues:0Issues:0

iceberg

Apache Iceberg

License:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-dolphinscheduler

Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

integrations-extras

Community developed integrations and plugins for the Datadog Agent.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:2Issues:0

kamu-cli

New generation decentralized data warehouse and streaming data pipeline

License:NOASSERTIONStargazers:0Issues:0Issues:0

ML-From-Scratch

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

mml-book.github.io

Companion webpage to the book "Mathematics For Machine Learning"

Stargazers:0Issues:0Issues:0

nlp_course

YSDA course in Natural Language Processing

License:MITStargazers:0Issues:0Issues:0

presto

Official home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

puppet-clickhouse-1

Install and manage ClickHouse DBMS Requires for xml-simple ruby gem to be installed

License:MITStargazers:0Issues:0Issues:0

ranger

Mirror of Apache Ranger

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

react-flow

Highly customizable library for building interactive node-based UIs, editors, flow charts and diagrams

License:MITStargazers:0Issues:0Issues:0

spark

Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

the_silver_searcher

A code-searching tool similar to ack, but faster.

License:Apache-2.0Stargazers:0Issues:0Issues:0

verdict

Interactive-Speed Analytics: 200x Faster, 200x Fewer Cluster Resources, Approximate Query Processing

License:Apache-2.0Stargazers:0Issues:0Issues:0

vitess

Vitess is a database clustering system for horizontal scaling of MySQL.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0

wormhole

Wormhole is a SPaaS (Stream Processing as a Service) Platform

License:Apache-2.0Stargazers:0Issues:0Issues:0