shridharreddy1's repositories
Data-Science--Cheat-Sheet
Cheat Sheets
AbstractActual
Incremental Import to Hive by using Sqoop
awesome-deep-learning
A curated list of awesome Deep Learning tutorials, projects and communities.
BigData-Ecosystem-Architecture
Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.
data-pipeline-samples
This repository hosts sample pipelines
Databricks-Apache-Spark-2X-Certified-Developer
Databricks - Apache Spark™ - 2X Certified Developer
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
exercises-scalatutorial
Exercises for the "Functional Programming Principles in Scala", part of the FP in Scala specialized program by EPFL.
HBASE
NOSQL DATABASE
Hcatalog_Incremetal_Import
Importing from DB2 to HCatalog table incrementally without Duplicates
Java
All Algorithms implemented in Java
java-best-practices
Best practices in Coding, Designing and Architecting Java Applications
java-cheat-sheet
Java Tutorial For Beginners - Companion Reference
JavaInterviewQuestionsAndAnswers
Java Interview Questions and Answers
json4s
A single AST to be used by other scala json libraries
JsonPath
Java JsonPath implementation
mastering-spark-sql-book
The Internals of Spark SQL
spark-csv
CSV Data Source for Apache Spark 1.x
spark-redshift
Redshift data source for Apache Spark
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
spark-workshop
Apache Spark™ and Scala Workshops
sqoop_inc_import
Apache Sqoop Incremental Import Feature