Mi ; (mithunonline)

Location: India

Mi ;'s repositories

awesome-python

A curated list of awesome Python frameworks, libraries, software and resources

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

aws-glue-samples

AWS Glue code samples

Language: Python | License: MIT-0 | Stargazers: 1 | Issues: 0 | Issues: 0

data-engineering-zoomcamp

Free Data Engineering course!

Stargazers: 1 | Issues: 0 | Issues: 0

github-slideshow

A robot powered training repository :robot:

Language: Ruby | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

interviews

Everything you need to know to get the job.

Language: Java | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Projects-Solutions

:pager: Links to others' solutions to Projects (https://github.com/karan/Projects/)

Stargazers: 1 | Issues: 0 | Issues: 0

pyspark-cheatsheet

PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster

License: CC0-1.0 | Stargazers: 1 | Issues: 0 | Issues: 0

sagemaker-python-sdk

A library for training and deploying machine learning models on Amazon SageMaker

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

spark-daria

Essential Spark extensions and helper methods ✨😲

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

atom

:atom: The hackable text editor

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-systematic-trading

A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.

Stargazers: 0 | Issues: 0 | Issues: 0

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers: 0 | Issues: 0 | Issues: 0

Databricks-Certified-Data-Engineer-Professional

The resources of the preparation course for Databricks Data Engineer Professional certification exam

Stargazers: 0 | Issues: 0 | Issues: 0

DataGristle

Tough and flexible tools for data analysis, transformation, validation and movement.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hadoop

Public hadoop release repository

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Hash-Buster

Crack hashes in seconds.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

hops

Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

joblib-spark

Joblib Apache Spark Backend

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pyspark-tutoriallll

PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like Spark Introduction, Spark Installation, Spark RDD Transformations and Actions, Spark DataFrame, Spark SQL, and more. It is completely free on YouTube and is beginner-friendly without any prerequisites.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

spark-essentials

The official repository for the Rock the JVM Spark Essentials with Scala course

Stargazers: 0 | Issues: 0 | Issues: 0

sparkMeasure

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.

Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

winutils

winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows

Stargazers: 0 | Issues: 0 | Issues: 0