tekjar / tensorbase

TensorBase is a new big data warehousing with modern efforts.

Home Page:https://tensorbase.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What is TensorBase

TensorBase is a new big data warehousing with modern efforts.

TensorBase is building on top of Rust, Apache Arrow/DataFusion.

TensorBase hopes to change the status quo of bigdata system as follows:

  • low efficient (in the name of 'scalable')
  • hard to use (for end users) and understand (for developers)
  • not evolve with modern infrastructures (OS, hardware, engineering...)

Features

  • Out-of-the-box to play ( get started just now )
  • Lighting fast architectural performance In Rust ( real-world benchmarks )
  • Modern redesigned columnar storage
  • Top performance network transport server
  • ClickHouse compatible syntax
  • Green installation with DBA-Free ops
  • Reliability and high availability (WIP)
  • Cluster (TBD)
  • Cloud neutral and cloud native (TBD)
  • Arrow dataLake (...)

Architecture (in 10,000 meters altitude)

arch_base

Quick Start

play_out_of_the_box

Benchmarks

For query, TensorBase is faster in simple aggregation, but soon slower in more complex cases. Great start!

Query ClickHouse (v21.2.5.5) TensorBase (main branch) Speedup Ratio of TB
select sum(trip_id) from trips_lite 0.248 sec 0.079 sec 3.1 (TB is faster)
select date_part('year',pickup_datetime), count(1) from trips_lite group by date_part('year',pickup_datetime)* 0.514 sec 3.375 sec 0.15 (TB is slower)

More detail about this benchmark seen in benchmarks.

Roadmap

Milestones

Dev Meeting

We setup an online dev meeting on Zoom at Wednesday 7:00pm (UTC+8) or at Tencent Meeting at Friday 7:00pm (UTC+8). The meeting url will be shared before the start time in Discord server and Slack Channel for Zoom, or Wechat group for Tencent Meeting.

(The current time is flexible now, if you want to have a talk but in another timezone just leave a message in any way.)

Working Groups

Working Group - Engineering

This is a wg for engineering related topics, like codes or features.

Working Group - Database

This is a higher kind wg for database related topics, like ideas from papers.

Join these working groups on the Discussions or on Discord server.

Communications

Wechat group or other more are on community

Contributing

Read the Contributing.

Documents (WIP)

More documents will be prepared soon.

Read the Documents.

License

TensorBase is distributed under the terms of the Apache License (Version 2.0), which is a commercial-friendly open source license.

It is greatly appreciated that,

  • you could give this project a star, if you think these got from TensorBase are helpful.
  • you could indicate yourself in Who is Using TensorBase, if you are using TensorBase in any project, product or service.
  • you could contribute your changes back to TensorBase, if you want your changes could be helpful for more people.

Your encouragements and helps can make more people realize the value of the project, and motivate the developers and contributors of TensorBase to move forward.

See LICENSE for details.

About

TensorBase is a new big data warehousing with modern efforts.

https://tensorbase.io/

License:Apache License 2.0


Languages

Language:Rust 99.9%Language:Shell 0.1%Language:Dockerfile 0.0%