Anurag870's starred repositories

n8n

Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

Language:TypeScriptLicense:NOASSERTIONStargazers:41382Issues:335Issues:1598

ClickHouse

ClickHouse® is a real-time analytics DBMS

Language:C++License:Apache-2.0Stargazers:34748Issues:686Issues:19767

OpenAPI-Specification

The OpenAPI Specification Repository

Language:MarkdownLicense:Apache-2.0Stargazers:28361Issues:848Issues:2192

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.

Language:PythonLicense:NOASSERTIONStargazers:17707Issues:98Issues:5472

rrweb

record and replay the web

Language:TypeScriptLicense:MITStargazers:15701Issues:194Issues:842

prefect

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

Language:PythonLicense:Apache-2.0Stargazers:14876Issues:160Issues:4970

airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Language:PythonLicense:NOASSERTIONStargazers:14356Issues:177Issues:13690

pulsar

Apache Pulsar - distributed pub-sub messaging system

Language:JavaLicense:Apache-2.0Stargazers:13820Issues:402Issues:6788

zuul

Zuul is a gateway service that provides dynamic routing, monitoring, resiliency, security, and more.

Language:JavaLicense:Apache-2.0Stargazers:13235Issues:902Issues:551

shepherd

Guide your users through a tour of your app

Language:TypeScriptLicense:MITStargazers:12432Issues:99Issues:559

dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Language:JavaLicense:Apache-2.0Stargazers:12229Issues:330Issues:7282

engineer-manager

A list of engineering manager resource links.

iceberg

Apache Iceberg

Language:JavaLicense:Apache-2.0Stargazers:5644Issues:155Issues:3134

scribejava

Simple OAuth library for Java

Language:JavaLicense:MITStargazers:5421Issues:305Issues:626

noria

Fast web applications through dynamic, partially-stateful dataflow

Language:RustLicense:Apache-2.0Stargazers:4944Issues:114Issues:79

atlas

In-memory dimensional time series database.

Language:ScalaLicense:Apache-2.0Stargazers:3395Issues:523Issues:192

dgs-framework

GraphQL for Java with Spring Boot made easy.

Language:KotlinLicense:Apache-2.0Stargazers:3005Issues:229Issues:436

gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Language:JavaLicense:Apache-2.0Stargazers:2196Issues:166Issues:0

pravega

Pravega - Streaming as a new software defined storage primitive

Language:JavaLicense:Apache-2.0Stargazers:1970Issues:107Issues:4018

bookkeeper

Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads

Language:JavaLicense:Apache-2.0Stargazers:1861Issues:107Issues:1265

secor

Secor is a service implementing Kafka log persistence

Language:JavaLicense:Apache-2.0Stargazers:1837Issues:70Issues:275

querybook

Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.

Language:TypeScriptLicense:Apache-2.0Stargazers:1775Issues:34Issues:213

talkyard

A community discussion platform: Brings together the main features from StackOverflow, Slack, Discourse, Reddit, and Disqus blog comments.

Language:TypeScriptLicense:AGPL-3.0Stargazers:1677Issues:28Issues:0

marquez

Collect, aggregate, and visualize a data ecosystem's metadata

Language:JavaLicense:Apache-2.0Stargazers:1640Issues:47Issues:758

kylo

Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.

Language:JavaLicense:Apache-2.0Stargazers:1093Issues:115Issues:0

titanoboa

Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.

Language:ClojureLicense:AGPL-3.0Stargazers:908Issues:21Issues:28

pipelinewise

Data Pipeline Framework using the singer.io spec

Language:PythonLicense:Apache-2.0Stargazers:606Issues:90Issues:0

api-covid19-in

COVID Rest API for India data, using Cloudflare Workers

Language:JavaScriptLicense:Apache-2.0Stargazers:323Issues:19Issues:41

dbnd

DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.

Language:PythonLicense:Apache-2.0Stargazers:249Issues:16Issues:5

JedAIToolkit

An open source, high scalability toolkit in Java for Entity Resolution.

Language:JavaLicense:Apache-2.0Stargazers:201Issues:26Issues:41