Mi ;'s repositories
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
aws-glue-samples
AWS Glue code samples
data-engineering-zoomcamp
Free Data Engineering course!
first-contributions
🚀✨ Help beginners to contribute to open source projects
github-slideshow
A robot powered training repository :robot:
hacker-laws
💻📖 Laws, Theories, Principles and Patterns that developers will find useful. #hackerlaws
interviews
Everything you need to know to get the job.
kafka-tutorials
Kafka Tutorials microsite
log4jscanner
A log4j vulnerability filesystem scanner and Go package for analyzing JAR files.
Projects-Solutions
:pager: Links to others' solutions to Projects (https://github.com/karan/Projects/)
pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
sagemaker-python-sdk
A library for training and deploying machine learning models on Amazon SageMaker
spark-daria
Essential Spark extensions and helper methods ✨😲
Sudoku-GUI-Solver
This is a sudoku solver using the backtracking algorithm. It includes a graphical GUI as well as a text based version.
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
atom
:atom: The hackable text editor
awesome-systematic-trading
A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
hops
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
joblib-spark
Joblib Apache Spark Backend
spark-essentials
The official repository for the Rock the JVM Spark Essentials with Scala course
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
winutils
winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows