SandishKumarHN

SandishKumarHN's repositories

alpakka

Reactive Enterprise Integration — Alpakka

Language: Scala · NOASSERTION · 020

awesome-consensus

Awesome list for Paxos and friends

000

datacollector

StreamSets Data Collector - Continuous big data and cloud platform ingest infrastructure

Language: Java · Apache-2.0 · 000

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala · Apache-2.0 · 000

flink

Mirror of Apache Flink

Language: Java · Apache-2.0 · 000

fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Language: Python · Apache-2.0 · 000

incubator-druid

Apache Druid (Incubating) - Column oriented distributed data store ideal for powering interactive applications

Language: Java · Apache-2.0 · 000

kudu

Mirror of Apache Kudu

Language: C++ · Apache-2.0 · 000

logging-log4j2

Mirror of Apache Logging Log4J2

Language: Java · Apache-2.0 · 010

nifi

Mirror of Apache NiFi

Language: Java · Apache-2.0 · 000

github-api

Language: Scala · 000

HTTP-Octopus

Language: XSLT · 020

incubator-gearpump

Mirror of Apache Gearpump (Incubating)

Language: Scala · Apache-2.0 · 020

incubator-pinot

Apache Pinot (Incubating) - A realtime distributed OLAP datastore

Language: Java · Apache-2.0 · 000

jvm-readings

JVM readings

000

log4j2-charts

Language: HTML · 000

loganalytics

Language: Shell · 020

LoveIt

❤️ A clean, elegant but advanced blog theme for Hugo (a simple, elegant, and efficient Hugo theme)

MIT · 000

oozie

Mirror of Apache Oozie

Language: Java · Apache-2.0 · 000

papers-we-love

Papers from the computer science community to read and discuss.

000

pipewrench-docker-demo

Language: Python · 020

polynote

A better notebook for Scala (and more)

Apache-2.0 · 000

presto

The official home of the Presto distributed SQL query engine for big data

Apache-2.0 · 000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

NOASSERTION · 000

resources

Language: Java · 000

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

MIT · 000

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala · Apache-2.0 · 000

spring-hadoop

Spring for Apache Hadoop is a framework for application developers to take advantage of the features of both Hadoop and Spring.

Language: Java · 010

sqoop

Mirror of Apache Sqoop

Language: Java · Apache-2.0 · 000

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

NOASSERTION · 000