Kevin Liu's repositories
iceberg-rest-catalog
Pythonic Iceberg REST Catalog
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
coinbase-order-book-pipeline
This project demonstrates a data pipeline using the Coinbase websocket feed to display and analyze their order book in real time.
evidence
Business intelligence as code: build polished data products with SQL and markdown
FastUI
Build better UIs faster.
flink-iceberg-minio-trino
This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics and machine learning workloads.
glaredb
GlareDB: An analytics DBMS for distributed data
iceberg
Apache Iceberg
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
gluten
Gluten: Plugin to Double SparkSQL's Performance
gravitino
World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.
gravitino-playground
A playground to experience Gravitino
harlequin
The SQL IDE for Your Terminal.
hive-metastore
Apache Hive Metastore as a Standalone server in Docker
iceberg-openapi-web-doc
Visualize Iceberg REST Catalog OpenAPI Spec
iceberg-python
Apache PyIceberg
iceberg-rest-image
Simple project to expose a catalog over REST using a Java catalog backend
jupyterlite
Wasm powered Jupyter running in the browser 💡
onetable
OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.
paradedb
Postgres for Search and Analytics
polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
pymetastore
A Python Client for Hive Metastore
ScratchDB
Scratch is an open-source alternative to BigQuery, Redshift, and Snowflake. Runs on Clickhouse.
teable
✨ A Super fast, Real-time, Professional, Developer friendly, No code database
trino-go-client
Go client for Trino
trino.io
Trino website
zed
A novel data lake based on super-structured data
zui
Zui is a powerful desktop application for exploring and working with data. The official front-end to the Zed lake.