Yijie Shen's starred repositories

spark-fast-tests

Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)

Language: Scala | License: MIT | Stargazers: 426 | Issues: 0

prollytree

A prolly tree (probabilistic tree) is a data structure designed to provide efficient storage, retrieval, and modification of ordered data with integrity guarantees.

Language: Rust | License: Apache-2.0 | Stargazers: 3 | Issues: 0

unitycatalog

Open, Multi-modal Catalog for Data & AI

Language: Java | License: Apache-2.0 | Stargazers: 2152 | Issues: 0

optd

CMU-DB's Cascades optimizer framework

Language: Rust | License: MIT | Stargazers: 340 | Issues: 0

CacheLib

Pluggable in-process caching engine to build and scale high performance services

Language: C++ | License: Apache-2.0 | Stargazers: 1168 | Issues: 0

foyer

Hybrid in-memory and disk cache in Rust

Language: Rust | License: Apache-2.0 | Stargazers: 125 | Issues: 0

libpq.rs

Rust safe binding for libpq

Language: Rust | License: MIT | Stargazers: 15 | Issues: 0

arrow-udf

A User-Defined Function Framework for Apache Arrow.

Language: Rust | License: Apache-2.0 | Stargazers: 57 | Issues: 0

cudarc

Safe Rust wrapper around the CUDA toolkit

Language: Rust | License: Apache-2.0 | Stargazers: 561 | Issues: 0
Language: Rust | License: Apache-2.0 | Stargazers: 26 | Issues: 0

write-you-a-vector-db

A Vector Database Tutorial (over CMU-DB's BusTub system)

Language: C++ | Stargazers: 610 | Issues: 0

fuzz-testing-for-spark

[WIP] Run SQL-aware fuzz tests for the Catalyst optimizer in Apache Spark

Language: C++ | License: Apache-2.0 | Stargazers: 6 | Issues: 0

influxdb

Scalable datastore for metrics, events, and real-time analytics

Language: Rust | License: Apache-2.0 | Stargazers: 28519 | Issues: 0

vector

A high-performance observability data pipeline.

Language: Rust | License: MPL-2.0 | Stargazers: 17369 | Issues: 0

join-order-benchmark

Join Order Benchmark (JOB)

Stargazers: 281 | Issues: 0

datafusion-duckdb-benchmark

Comparing DataFusion with DuckDB based on ClickBench, H2O, and TPC-H

Language: Python | Stargazers: 4 | Issues: 0

iceberg-rust

Apache Iceberg

Language: Rust | License: Apache-2.0 | Stargazers: 571 | Issues: 0

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language: TypeScript | License: GPL-3.0 | Stargazers: 31208 | Issues: 0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 166119 | Issues: 0

gpt-4-search

A command line GPT-4 REPL with Google search in 200 lines of code

Language: Python | License: MIT | Stargazers: 362 | Issues: 0

spaceman

A gRPC client from another world

Language: Rust | License: MIT | Stargazers: 366 | Issues: 0
Language: C++ | License: MIT | Stargazers: 461 | Issues: 0

openraft

Rust Raft with improvements

Language: Rust | License: Apache-2.0 | Stargazers: 1350 | Issues: 0

dilu

A colorful CLI client with icons for accessing data via OpenDAL

Language: Rust | License: Apache-2.0 | Stargazers: 34 | Issues: 0

human-panic

Panic messages for humans.

Language: Rust | License: Apache-2.0 | Stargazers: 1621 | Issues: 0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10808 | Issues: 0

chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Language: Python | License: MIT | Stargazers: 20998 | Issues: 0

opendal

Apache OpenDAL: access data freely.

Language: Rust | License: Apache-2.0 | Stargazers: 3197 | Issues: 0

bob-plugin-openai-polisher

A Bob plugin that uses the OpenAI API to polish text and correct grammar! A perfect replacement for Grammarly! Licensed under CC BY-NC-SA 4.0

Language: TypeScript | License: NOASSERTION | Stargazers: 649 | Issues: 0

duckdb

DuckDB is an analytical in-process SQL database management system

Language: C++ | License: MIT | Stargazers: 22155 | Issues: 0