cchenax

cchenax's repositories

milvus

A cloud-native vector database, storage for next generation AI applications

License: Apache-2.0, Stargazers: 0, Issues: 0

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

License: MPL-2.0, Stargazers: 0, Issues: 0

redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

Stargazers: 0, Issues: 0

litdata

Transform datasets at scale. Optimize datasets for fast AI model training.

License: Apache-2.0, Stargazers: 0, Issues: 0

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

License: Apache-2.0, Stargazers: 0, Issues: 0

hops

Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.

License: Apache-2.0, Stargazers: 0, Issues: 0

cockroach

CockroachDB - the open source, cloud-native distributed SQL database.

Language: Go, License: NOASSERTION, Stargazers: 0, Issues: 0

minio

The Object Store for AI Data Infrastructure

Language: Go, License: AGPL-3.0, Stargazers: 0, Issues: 0

ClickHouse

ClickHouse® is a free analytics DBMS for big data

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0

ozone

Scalable, redundant, and distributed object store for Apache Hadoop

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0

system-design

Learn how to design systems at scale and prepare for system design interviews

License: NOASSERTION, Stargazers: 0, Issues: 0

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

License: NOASSERTION, Stargazers: 0, Issues: 0

pinot

Apache Pinot - A realtime distributed OLAP datastore

License: Apache-2.0, Stargazers: 0, Issues: 0

ambry

Distributed object store

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0

airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

License: Apache-2.0, Stargazers: 0, Issues: 0

seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.

License: Apache-2.0, Stargazers: 0, Issues: 0

flink

Apache Flink

License: Apache-2.0, Stargazers: 0, Issues: 0

cassandra

Mirror of Apache Cassandra

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0

repairboost-code

This is the implementation of RepairBoost, described in our paper "Boosting Full-Node Repair in Erasure-Coded Storage", which appeared in USENIX ATC'21.

Stargazers: 0, Issues: 0

ecwide

USENIX FAST 2021, "Exploiting Combined Locality for Wide-Stripe Erasure Coding in Distributed Storage"

Stargazers: 0, Issues: 0

incubator-ratis

Open source Java implementation for Raft consensus protocol.

Language: Java, License: Apache-2.0, Stargazers: 0, Issues: 0

educative.io_courses

PDF downloads of all educative.io courses from the free student subscription in the GitHub student pack

Stargazers: 0, Issues: 0

Grokking-the-System-Design

Grokking the system design interview course materials

Stargazers: 0, Issues: 0

Copysets

CS 244 Reproduction of Copysets

Stargazers: 0, Issues: 0

raft-zh_cn

Chinese translation of the Raft consensus algorithm paper

Stargazers: 0, Issues: 0

task-scheduler

A fault tolerant distributed task scheduler simulation

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

hadoop-20

Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append

License: Apache-2.0, Stargazers: 0, Issues: 0