Shubham Mishra's repositories
tariff-mgmt-system
Flask-based tariff management system with a RESTful API
dbt-cicd-sflk
dbt-core CI/CD using Azure DevOps & Azure Pipelines
flink-event-processor
Kafka producer & Flink event processor to calculate web event metrics
hiring-mgmt-sys
A MERN-based hiring system implemented using the REST architectural pattern
saas-account-analytics
Cohort modelling, retention, and 7d_rolling_active_usrs on BigQuery & dbt
100DaysOfCode
#100DaysOfCode - Learn by developing 100 unique apps to explore exciting tech stacks
apache-spark-internals
The Internals of Apache Spark
awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
AWS-Guide
Amazon Web Services (AWS) Guide. Learn all about Amazon Web Services Tools, Services, and Certifications.
bq-lineage-tool
BigQuery Column Lineage parser
datenlord
DatenLord, Computing Defined Storage, an application-orientated, cloud-native distributed storage system
db-readings
Readings in Databases
delta-lake-internals
The Internals of Delta Lake
docker-cri-k8
Cheat Sheet
flink-learning
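Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PVUV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Your support for my column 《大数据实时计算引擎 Flink 实战与性能优化》 ("Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization") is welcome.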
fluvio
Lean and mean distributed stream processing system written in Rust and WebAssembly.
google-summer-of-code
Rust project ideas for Google Summer of Code
incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Kubernetes-Guide
Kubernetes Guide. Learn all about Kubernetes monitoring, networking, and containers, whether you're running Kubernetes locally or in the cloud (Azure, AWS, and GCP).
learning-llms-and-genai-for-dev-sec-ops
A set of lessons aimed at anyone learning LLM and generative AI concepts, with sections on operations and security, as well as development.
Linux-Guide
Linux Guide. Learn about Linux Hardware vendors, Linux in the Cloud, Desktop Environments, Window Managers, Linux Distributions, Linux Security, Graphics (AMD/NVIDIA/Intel ARC), and Software Apps.
pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Quantum-Computing-Guide
Quantum Computing Guide
Self-Hosting-Guide
Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization, including Cloud, LLMs, WireGuard, Automation, Home Assistant, and Networking.
SparkInternals
Notes on the design and implementation of Apache Spark
stratosphere
Stratosphere is now Apache Flink.
upstream-prod
A dbt package for easily using production data in a development environment.
Xline
A geo-distributed KV store for metadata management