Sundar Shankar's repositories
coding-interview-university
A complete computer science study plan to become a software engineer.
ScalaPB
Protocol buffer compiler for Scala.
marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
Scala
All Algorithms implemented in Scala
spark-testing-base
Base classes to use when writing tests with Spark
spark2.0-examples
Examples of Spark 2.0
sparkMeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark workload metrics data.
Databricks-Apache-Spark-2X-Certified-Developer
Databricks - Apache Spark™ - 2X Certified Developer
uberscriptquery
UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
2018-cycle-2
Contents covered in sessions of AI Saturdays (cycle 2) as well as relevant material for further study.
envelope
Build configuration-driven ETL pipelines on Apache Spark
snowflake
Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
Bidirectional-LSTM-CRF-for-Clinical-Concept-Extraction
Bidirectional LSTM-CRF for Clinical Concept Extraction using i2b2-2010 data
structured_data_processing_spark_sql
Code and setup information for the Structured data processing with Spark SQL session