hippalus / awesome-kafka-streams

A collection of examples of Kafka Streams being used in production. Contributions welcome!

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Kafka Streams in the wild

Here is a collection of known uses of Kafka Streams in production. This is based only on public information. If you have a use case you'd like to add, please contribute!

Biotech

Digital Natives

  • Airbnb uses Kafka Streams to power their actor-based merge queue with serializablle consistenty to manage merges in their large monolithic repositories.
  • Expedia built their real time conversations platform, which answers questions like "how long is my layover", etc., around Kafka Streams. Some more details of the use case can be found in this paper.
  • Indeed powers their microservices architecture with Kafka Streams.
  • ironSource uses Kafka Streams API to handle multiple real-time use cases, such as budget management, monitoring and alerting that run through their game growth platform.
  • Pinterest used Kafka Streams for their real-time ads platform. Also more details can be found in this talk.
  • Salesforce uses Kafka Streams to generate real time insights which are used to score and rank emails for boosting sales efficency.
  • Trivago uses Kafka Streams for stream processing of application logs, such as data cleaning and enrichment, sessionization, etc.
  • Twitter uses Kafka Streams extensively, including in their real time ML training pipeline which updates the model that ranks tweets for the 'For You' timeline.

Ecommerce and Retail

  • BigCommerce uses Kafka streams to power their real time data platform which serves insights to the merchants using their platform for ecommerce.
  • Lifull Connect uses Kafka Streams to power their platform for real estate listings.
  • Nuuly relies on Kafka Streams and Kafka Connect, coupled with data science and machine learning to provide in-the-moment business intelligence and to tailor a personalized rental experience for their customers.
  • Walmart:
  • Stubhub uses Kafka Streams for real time fraud detection: unfortunately there seems no presentation materials left, but it was primarily a remote call to other fact stores to check certain fraud indication rules (barely remember the details).
  • Zalando uses Kafka Streams to rank fashion websites in real-time.

Finance and Fintech

Manufacturing and Logistics

  • Altair Panopticon incorporates Kafka Streams framework into its stream processing engine provided to customers.
  • Deutsche Bahn uses Kafka Streams to make announcements of train arrivals at stations.
  • Intel uses Kafka Streams as part of their real time cyber intelligence platform.
  • Michelin uses Kafka Streams to power their in-house ERP system.

Miscellaneous

  • k2data, a data platform solution provider focused on industrial markets, uses Kafka Streams for their client's real-time scenarios, such as anomaly detection and data cleaning.
  • New York Times manages their entire article publishing pipeline using Kafka Streams.

Telecoms and Communications

  • AVSystem uses Kafka Streams for real-time analytics on IoT data at scale.
  • LineCorp uses Kafka Streams to implement a task dispatching system and also for performing streaming operations on their Kafka topics.
  • Twilio's Enterprise Insights team uses Kafka Streams to power their alerting engine based on real-time metrics.

Security

  • Imperva uses Kafka Streams to build a highly-available and fault-tolerant distributed data store for their microservices.

About

A collection of examples of Kafka Streams being used in production. Contributions welcome!

License:Apache License 2.0