banarasi04's repositories

awesome-etl

A curated list of awesome ETL frameworks, libraries and software.

Stargazers:0Issues:0Issues:0

data-engineering-zoomcamp

Free Data Engineering course!

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Data-Science-with-Spark

Machine Learning and Data Analysis Case Studies using Spark.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DataStructureAndAlgorithmsMadeEasyInJava

Data Structure And Algorithms Made Easy In Java

Language:JavaStargazers:0Issues:0Issues:0

datawarehouse

Solution of Datawarehouse course

Stargazers:0Issues:0Issues:0

drunken-data-quality

Spark package for checking data quality

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Fake-Apache-Log-Generator

Generate a boatload of Fake Apache Log files very quickly

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HDFSChecksumForLocalfile

This program / jar creates checksum, with same algorithm that hadoop uses to create on hdfs files. So integrity of file can be verified on local and hadoop system. Can also, be used to check if file exist based on checksum, before uploading and cluttering hdfs with duplicate files.

Language:JavaStargazers:0Issues:0Issues:0

JustEnoughScalaForSpark

A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Machine-Learning-with-Python

Machine Learning Implementations in Python

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

Store

A sample online store web application built in eclipse.

Language:JavaStargazers:0Issues:0Issues:0

tpcc

Java implementation of TPC-C benchmark

Language:JavaStargazers:0Issues:0Issues:0

tpcds

Port of TPC-DS data generator to Java

Language:SmartyLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tpcds-gen

Wrap up TPC-DS dsgen into a map-reduce task

Language:JavaStargazers:0Issues:0Issues:0

tsdb

The Prometheus time series database layer.

License:Apache-2.0Stargazers:0Issues:0Issues:0