fjall-rs / fjall

LSM-based embeddable key-value storage engine written in safe Rust

Home Page: https://fjall-rs.github.io/


[Tracking] Breaking changes in V2

marvin-j97 opened this issue · comments

API

  • Remove FlushMode alias
  • Enable bloom feature by default

Data format

  • Start journal markers at 1, to prevent the zeroed pre-allocated bytes from matching the start-marker tag, which causes unnecessary logging of an unfinished batch at the journal tail #53
  • PartitionCreateOptions needs to be stored so it can be recovered #58
  • Fix journal length of values #68
  • Set max value length to u32
  • Key-Value separation #34
  • Correctly track lowest closed instant/snapshot seqno #61
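The first item above (journal markers starting at 1) can be illustrated with a minimal sketch. The names here are hypothetical, not fjall's actual ones; the point is that a pre-allocated journal file is zero-filled, so a marker tag of 0 would be indistinguishable from untouched tail bytes:

```rust
// Sketch (illustrative names): a journal file is pre-allocated and
// zero-filled, so any marker tag that is 0 would look identical to the
// untouched tail bytes, causing a spurious "unfinished batch" at recovery.

const BATCH_START: u8 = 1; // starting tags at 1 keeps them distinct from zeroed bytes

fn has_batch_at(journal: &[u8], pos: usize) -> bool {
    // A zero byte at `pos` means we reached the pre-allocated tail,
    // not a real (possibly unfinished) batch.
    journal.get(pos).map_or(false, |&b| b == BATCH_START)
}

fn main() {
    let mut journal = vec![0u8; 64]; // zero-filled pre-allocation
    journal[0] = BATCH_START;        // one real batch at the start
    assert!(has_batch_at(&journal, 0));
    assert!(!has_batch_at(&journal, 1)); // tail zeros are not a batch
    println!("ok");
}
```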

I hope that fixed-length keys and values can be considered when designing the format. In many cases, keys and values are fixed-length (such as a u64 id mapped to a file hash). I believe fixed-length fields could be optimized a lot.

I think you can refer to DuckDB and consider writing data to the log regularly and compacting it into Parquet format:
https://duckdb.org/docs/data/parquet/overview.html
https://parquet.apache.org

I believe this format applies a lot of optimizations to the data.

You can use this library to read and write it: https://docs.rs/parquet/latest/parquet/

> I hope that fixed-length keys and values can be considered when designing the format. In many cases, keys and values are fixed-length (such as a u64 id mapped to a file hash). I believe fixed-length fields could be optimized a lot.

I'm not sure fixed lengths can really be optimized in block-based tables. You would save at most 3 bytes per K-V pair, for a lot of added complexity. It could save some decent space for huge data sets, but not in block-based tables, and right now I don't plan on adding other types of tables.
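To make the "at most a few bytes per pair" estimate concrete, here is a small sketch of the per-entry overhead in a length-prefixed block. This is illustrative, not fjall's actual on-disk format; it assumes LEB128-style varint length prefixes for key and value, which is what fixed-length entries would let you drop:

```rust
// Illustrative: each entry in a length-prefixed block stores a varint
// length for the key and one for the value. Fixed-length entries would
// save exactly these prefix bytes.

fn varint_len(mut n: u64) -> usize {
    // number of bytes a LEB128-style varint needs to encode `n`
    let mut bytes = 1;
    while n >= 0x80 {
        n >>= 7;
        bytes += 1;
    }
    bytes
}

fn main() {
    let key_len = 8u64;    // a u64 id
    let value_len = 32u64; // e.g. a 32-byte file hash
    let overhead = varint_len(key_len) + varint_len(value_len);
    // both lengths fit in a single varint byte each -> 2 bytes per pair
    assert_eq!(overhead, 2);
    println!("per-pair length overhead: {overhead} bytes");
}
```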

> compressing it into parquet format.

Parquet is a column-based format with row groups. There is no notion of columns or rows here, so I'm not sure there is an advantage over packed K-V blocks. I do have some interest in implementing an alternative block format that is row-group based: the current blocks are laid out KVKVKVKV, but an alternative Parquet-esque format could be KKKKVVVV, which would allow for better compression, depending on the values.
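The two layouts above can be sketched as follows. This is a hypothetical illustration, not fjall's format: the grouped layout stores the same bytes in a different order, placing highly similar keys (and values) next to each other so a general-purpose compressor (lz4, zstd, ...) can exploit shared prefixes:

```rust
// Interleaved KVKVKVKV vs column-grouped KKKKVVVV block layouts (sketch).

fn interleaved(pairs: &[(Vec<u8>, Vec<u8>)]) -> Vec<u8> {
    let mut out = Vec::new();
    for (k, v) in pairs {
        out.extend_from_slice(k); // key, then its value, then the next key...
        out.extend_from_slice(v);
    }
    out
}

fn grouped(pairs: &[(Vec<u8>, Vec<u8>)]) -> Vec<u8> {
    let mut out = Vec::new();
    for (k, _) in pairs {
        out.extend_from_slice(k); // all keys first...
    }
    for (_, v) in pairs {
        out.extend_from_slice(v); // ...then all values
    }
    out
}

fn main() {
    let pairs = vec![
        (b"user:0001".to_vec(), b"{\"name\":\"a\"}".to_vec()),
        (b"user:0002".to_vec(), b"{\"name\":\"b\"}".to_vec()),
    ];
    // Same total size, different order; only the grouped layout keeps
    // the near-identical keys adjacent, which compresses better.
    assert_eq!(interleaved(&pairs).len(), grouped(&pairs).len());
    println!("ok");
}
```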