jasjyotsinghjaswal's repositories
DataQualityOnNB
This framework on JUPYTER allows to validate different columns with the help of resusable rulesets to mark records as PASS , FAIL or REJECT configurable using a DQ Bible which can be used to activate or inactivate a rule without making code changes
BIgDataJunNWAssign
Project Contains assignments for interview
BigDataOrclProject
This Respository Contains PySpark example from Beginner to Advanced
BigDataPOC
BigDataPOC in Java
cdh_exam_prep
Repository for exam preparation
HadoopPratice
Project for alll Hadoop & other technical code for POC
HelloWorldMaven
Contains sample Maven Project
MachineLearningBeginners
GitHub repository for self tutorial for machine learning
MyCollabNotebooks
This will have all my collab notebooks specially for Data science along with word docs
pyinstrument
š“Ā Call stack profiler for Python. Shows you why your code is slow!
pyspark-datacol-diff
PySpark utility created to quickly provide details regarding which attributes differ between 2 dataframes with same schema and primary key
ScalaTutorials
This Repository will contain Tutorials for Scala in JupyterNotebooks