Bach Chu's starred repositories

unitycatalog

Open, Multi-modal Catalog for Data & AI

Language:JavaLicense:Apache-2.0Stargazers:1960Issues:0Issues:0

polaris

Polaris Catalog is an open source catalog for Apache Iceberg

License:Apache-2.0Stargazers:331Issues:0Issues:0

spark-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Language:GoLicense:Apache-2.0Stargazers:2702Issues:0Issues:0

ethereum-etl

Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ

Language:PythonLicense:MITStargazers:2879Issues:0Issues:0

ccxt

A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges

Language:PythonLicense:MITStargazers:32043Issues:0Issues:0

xg2xg

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

Stargazers:14404Issues:0Issues:0

nessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Language:JavaLicense:Apache-2.0Stargazers:920Issues:0Issues:0

localstack

đź’» A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

Language:PythonLicense:NOASSERTIONStargazers:53077Issues:0Issues:0

data-contract-template

Template for a data contract used in a data mesh.

License:Apache-2.0Stargazers:455Issues:0Issues:0

imessage-exporter

Export iMessage data + run iMessage Diagnostics

Language:RustLicense:GPL-3.0Stargazers:2670Issues:0Issues:0

notes

Notes from tech books I'm reading.

License:UnlicenseStargazers:55Issues:0Issues:0

ruff

An extremely fast Python linter and code formatter, written in Rust.

Language:RustLicense:MITStargazers:29129Issues:0Issues:0
Language:ScalaStargazers:71Issues:0Issues:0

winutils

winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows

Language:ShellLicense:Apache-2.0Stargazers:252Issues:0Issues:0

git-changelog

Automatic Changelog generator using Jinja2 templates.

Language:PythonLicense:ISCStargazers:131Issues:0Issues:0

datahub

The Metadata Platform for your Data Stack

Language:JavaLicense:Apache-2.0Stargazers:9474Issues:0Issues:0

marquez

Collect, aggregate, and visualize a data ecosystem's metadata

Language:JavaLicense:Apache-2.0Stargazers:1693Issues:0Issues:0

OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

Language:TypeScriptLicense:Apache-2.0Stargazers:4849Issues:0Issues:0

dvc

🦉 ML Experiments and Data Management with Git

Language:PythonLicense:Apache-2.0Stargazers:13410Issues:0Issues:0

piicatcher

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Language:PythonLicense:Apache-2.0Stargazers:264Issues:0Issues:0

data-lineage

Generate and Visualize Data Lineage from query history

Language:PythonLicense:MITStargazers:304Issues:0Issues:0

sqllineage

SQL Lineage Analysis Tool powered by Python

Language:PythonLicense:MITStargazers:1223Issues:0Issues:0

avro-schema-viewer

Visualizer for Avro Schemas (.avsc) - Try it yourself at:

Language:TypeScriptLicense:Apache-2.0Stargazers:27Issues:0Issues:0

autometrics-rs

Easily add metrics to your code that actually help you spot and debug issues in production. Built on Prometheus and OpenTelemetry.

Language:RustLicense:Apache-2.0Stargazers:785Issues:0Issues:0

spark-xml

XML data source for Spark SQL and DataFrames

Language:ScalaLicense:Apache-2.0Stargazers:496Issues:0Issues:0

linearmouse

The mouse and trackpad utility for Mac.

Language:SwiftLicense:MITStargazers:3563Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:34538Issues:0Issues:0

meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Language:PythonLicense:MITStargazers:1702Issues:0Issues:0

autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Language:PythonLicense:MITStargazers:6078Issues:0Issues:0

roapi

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

Language:RustLicense:Apache-2.0Stargazers:3154Issues:0Issues:0