YAK - yet another kafka

Yet another Kafka is a personal undertaking to replace a push-based architecture in a software I use with a pull-based architecture. The motivation for the project is as follows:

Push-based architecture causes queueing on the overall system, which ultimately halts the software
Disk can be sacrificed in cases where memory queues can become expensive.
Dedicated disk I/O with sequential files will be faster than random disk seeks when queues are flushed as pages to disk (random)

The architecture works as follows:

YAK server is a single broker that listens to messages, and responds to message requests.
Each message has the following - a key (defaults to timestamp when not supplied)
The key is used to maintain an in-memory B-Tree
Consumers can request for a range of keys to consume. The messages are buffered in memory for some time, after which they are flushed. The message sending happens with sendfile api.
After clients "ack" a message key range, the messages are flushed from memory.
Any un-flushed messages are stored in the memory and can be replayed using a different script.
Each message is associated with a Topic. If the topic does not exist, then it is created.
Each topic is associated with a LSM Tree in memory. s

0x0decaf / yak

YAK - yet another kafka

About