Maheedhar Reddy Chappidi's repositories
spark-kinesis-streaming-df2s3
Spark Streaming example that creates a DataFrame and writes it to S3
algorithm-complexity-and-big-o
[big o notation, algorithm, interview]
aws-glue-samples
AWS Glue code samples
aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
coding-interview-university
A complete computer science study plan to become a software engineer.
dbt-glue
This repository contains the dbt-glue adapter
deltacat
A Pythonic Data Catalog powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
free-programming-books
Freely available programming books
enhancements
Tracking Ray Enhancement Proposals
iceberg
Apache Iceberg
incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
incubator-wayang
Apache Wayang (incubating) is the first cross-platform data processing system.
modin
Modin: Scale your pandas workflows by changing a single line of code
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
spark
Apache Spark - A unified analytics engine for large-scale data processing