Krishnan Paranji Ravi's repositories
BookSummaries
A summary of my favorite books. Follow for updates.
000
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Language:ScalaApache-2.0000
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:PythonApache-2.0000
flan-quickstart
flan-quickstart
Apache-2.0000
datacompy
Pandas and Spark DataFrame comparison for humans and more!
Language:PythonApache-2.0000
emr-dynamodb-connector
Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
Apache-2.0000
stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
MIT000