Rajesh Santha's repositories
DataStructuresAndAlgorithmsInScala
This project contains Scala snippets for various problems from LeetCode and HackerRank, along with implementations of the data structures and algorithms required to solve them.
Spark_Incremental_Load_Automated_POC
This repository contains a proof of concept for automated incremental data ingestion with Spark, from a file system to HDFS. The inbound folder contains the input CSV files. When you trigger the Spark job, the following steps take place. Spark automatically picks the most recently arrived file in the inbound folder, then validates, processes, and ingests it into HDFS. If validation finds that the file has already been loaded to HDFS, you can request a new load through optional spark-submit parameters, which are parsed with Scala's scopt library. When the new-load flag is set, the Scala script fetches a new file from an external location (as this is a POC, the external location is simulated by a directory other than inbound on the same file system) into the inbound folder and loads that file into the HDFS table. Once the data is read and validated, it is inserted into the Avro table given as a parameter, overwriting the table if it already exists.
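The file-selection step described above can be sketched roughly as follows. This is a plain-Python illustration of the logic only, not the repository's Scala/Spark code: the function names, the `loaded` set that stands in for the "already loaded to HDFS" check, and the `new_load` flag are all hypothetical, and "latest arrived" is assumed to mean the most recent modification time.

```python
import shutil
from pathlib import Path
from typing import Optional, Set


def latest_csv(inbound: Path) -> Optional[Path]:
    """Return the most recently modified CSV in `inbound`, or None if there is none."""
    files = [p for p in inbound.glob("*.csv") if p.is_file()]
    return max(files, key=lambda p: p.stat().st_mtime, default=None)


def choose_file(inbound: Path, external: Path,
                loaded: Set[str], new_load: bool) -> Optional[Path]:
    """Pick the file to ingest (hypothetical helper).

    `loaded` holds the names of files already ingested; in the real job this
    check would be made against HDFS. `new_load` mirrors the optional flag.
    """
    candidate = latest_csv(inbound)
    if candidate is not None and candidate.name not in loaded:
        return candidate  # the latest inbound file has not been loaded yet
    if not new_load:
        return None  # nothing new, and no new load was requested
    # New load requested: copy the first not-yet-loaded file from the
    # simulated external location into the inbound folder.
    for src in sorted(external.glob("*.csv")):
        if src.name not in loaded:
            return Path(shutil.copy2(src, inbound / src.name))
    return None
```

A call such as `choose_file(Path("inbound"), Path("external"), loaded_names, new_load=True)` returns the path to validate and ingest, or `None` when there is nothing to load.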
databricks-crt020-notes
Docs, code, and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certification
leetcode-patterns
A pattern-based approach for learning technical interview questions
MonitoredStructuredStreaming
Repository for Spark structured streaming use case implementations.
Analyze-sentiment-FLUME-HIVE
Twitter sentiment analysis using Flume
awesome-github-wiki
:neckbeard: Awesome list GitHub Wikis
awesome-guidelines
A curated list of high quality coding style conventions and standards.
coding-interview-university
A complete computer science study plan to become a software engineer.
computer-science
:mortar_board: Path to a free self-taught education in Computer Science!
covid19
Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy
Data-Science--Cheat-Sheet
Cheat Sheets
DatabricksIntegration
This repo is integrated with Databricks as part of course learning
fpinscala
Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"
free-programming-books
:books: Freely available programming books
githubDemo
It's a demo repository to practice advanced Git commands.
gitPracticeDir
delete later
InputData
This repo contains sample datasets and other data required for POCs and assignments
LearningScala
My journey to learn Scala.
professional-programming
A collection of full-stack resources for programmers.
Spark-practice-
Itversity problems 1 to 20
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
terraform-azure
terraform - azure - build