lingo-xp's repositories
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
ByConity
ByConity is an open source cloud-native data warehouse
calcite
Apache Calcite
chdb
chDB is an embedded OLAP SQL Engine powered by ClickHouse
ClickHouse
ClickHouse® is a free analytics DBMS for big data
cockroach
CockroachDB - the open source, cloud-native distributed SQL database.
CRoaring
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations
databend
An elastic and reliable Serverless Data Warehouse, offers Blazing Fast Query and combines Elasticity, Simplicity, Low cost of the Cloud, built to make the Data Cloud easy
docker-hadoop
Apache Hadoop docker image
docker-hive-metastore-postgresql
Postgresql configured to work as metastore for Hive.
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
duckdb
DuckDB is an in-process SQL OLAP Database Management System
hive
Apache Hive
hudi
Upserts, Deletes And Incremental Processing on Big Data.
JavaGuide
「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!
polardbx-sql
PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.
presto
The official home of the Presto distributed SQL query engine for big data
protobuf
Protocol Buffers - Google's data interchange format
sqlglot
Python SQL Parser and Transpiler
styleguide
Style guides for Google-originated open-source projects
substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
velox
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.