Shweta Tanwar (shwetatanwar13)

Company: Infosys

Location: Fremont, California

Shweta Tanwar's repositories

Kafka-Pyspark-Streaming

Kafka streaming using PySpark

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

CCA175-Hadoop-and-Spark-Developer

CCA 175 preparation scripts

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

Spark-and-Kafka_IoT-Data-Processing-and-Analytics

IoT project for the UCSC Internet of Things course: a data pipeline built with Spark Streaming and Kafka. JSON messages are simulated by a Python program. Data analysis is done with Spark SQL, and visualization is done in Tableau with Hive as the data source.

Language: HTML | Stargazers: 2 | Issues: 0 | Issues: 0
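
The description above mentions that JSON messages are simulated with a Python program. A minimal sketch of such a simulator might look like the following. The field names (`device_id`, `timestamp`, `temperature_c`), the value ranges, and the function names are assumptions made for illustration and are not taken from the repository. Sending to Kafka is left out so the example runs without a broker.

```python
import json
import random
from datetime import datetime, timezone


def simulate_reading(device_id, rng):
    """Build one simulated sensor reading and return it as a JSON string."""
    reading = {
        "device_id": device_id,  # hypothetical field name
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "temperature_c": round(rng.uniform(15.0, 35.0), 2),  # hypothetical field and range
    }
    return json.dumps(reading)


def simulate_stream(n_messages, n_devices=3, seed=0):
    """Yield n_messages JSON strings drawn from a small pool of device ids."""
    rng = random.Random(seed)
    for _ in range(n_messages):
        device_id = f"sensor-{rng.randrange(n_devices)}"
        yield simulate_reading(device_id, rng)


if __name__ == "__main__":
    for message in simulate_stream(5):
        # In a real pipeline each message would be published to a Kafka topic here.
        print(message)
```

Each yielded string is a single JSON object, so a downstream consumer can parse messages one at a time.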

Pyspark

Spark scripts written in Python for the UCSC Extension course IoT: Internet of Things. Also includes data analysis scripts I wrote for practice.

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

Assignment-6.2-HIVE-INTRODUCTION

Fetch the date and temperature from temperature_data where the zip code is greater than 300000 and less than 399999. Calculate the maximum temperature for every year in the temperature_data table. Calculate the maximum temperature for those years that have at least 2 entries in the table. Create a view on top of the last query and name it temperature_data_vw. Export the contents of temperature_data_vw to a file in the local file system, such that each file is '|' delimited.

Stargazers: 0 | Issues: 0 | Issues: 0
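
The assignment steps above can be sketched in Python, using the standard-library `sqlite3` module as a stand-in for Hive so that the example runs anywhere. This is not the repository's HiveQL. The column names (`date`, `zip`, `temperature`), the ISO date format, the sample rows, and the helper names are all assumptions for illustration. In Hive the year extraction and the export step would be written differently.

```python
import csv
import sqlite3

# Hypothetical sample rows: (date, zip, temperature)
ROWS = [
    ("1990-10-01", 123112, 10),
    ("1991-10-14", 283901, 11),
    ("1990-10-10", 381920, 15),
    ("1991-10-16", 302918, 22),
    ("1990-10-12", 384902, 9),
    ("1993-10-03", 345000, 30),
]


def build_db():
    """Create an in-memory temperature_data table and the temperature_data_vw view."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE temperature_data (date TEXT, zip INTEGER, temperature INTEGER)"
    )
    conn.executemany("INSERT INTO temperature_data VALUES (?, ?, ?)", ROWS)
    # View: maximum temperature for years that have at least 2 entries.
    conn.execute(
        """
        CREATE VIEW temperature_data_vw AS
        SELECT substr(date, 1, 4) AS year, MAX(temperature) AS max_temp
        FROM temperature_data
        GROUP BY substr(date, 1, 4)
        HAVING COUNT(*) >= 2
        """
    )
    return conn


def in_zip_range(conn):
    """Date and temperature where 300000 < zip < 399999."""
    return conn.execute(
        "SELECT date, temperature FROM temperature_data "
        "WHERE zip > 300000 AND zip < 399999 ORDER BY date"
    ).fetchall()


def max_per_year(conn):
    """Maximum temperature for every year."""
    return conn.execute(
        "SELECT substr(date, 1, 4) AS year, MAX(temperature) "
        "FROM temperature_data GROUP BY substr(date, 1, 4) ORDER BY year"
    ).fetchall()


def export_view(conn, out):
    """Write the view to a text file-like object, one '|'-delimited row per line."""
    writer = csv.writer(out, delimiter="|", lineterminator="\n")
    writer.writerows(
        conn.execute("SELECT year, max_temp FROM temperature_data_vw ORDER BY year")
    )


if __name__ == "__main__":
    conn = build_db()
    print(in_zip_range(conn))
    print(max_per_year(conn))
    with open("temperature_data_vw.txt", "w", newline="") as f:
        export_view(conn, f)
```

With the sample rows above, 1993 has only one entry, so it appears in the per-year maximum but not in the view or the exported file.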

bigdata-projects

Student projects in the Big Data field.

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

Complete-Python-3-Bootcamp

Course Files for Complete Python 3 Bootcamp Course on Udemy

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

Data-Analysis

Data Analysis of NYSE stock exchange data using Hive and Tableau

Stargazers: 0 | Issues: 0 | Issues: 0

Hive

Hive snippets for the UCSC Extension course Hadoop: Distributed Processing.

Stargazers: 0 | Issues: 0 | Issues: 0

python

Python programs using various Python libraries such as NumPy and Matplotlib.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

data_analytics

Assignments and case studies as part of pgdda-iiitb

Language: R | Stargazers: 0 | Issues: 0 | Issues: 0

Java-Programs

Java practice programs

Language: Java | Stargazers: 0 | Issues: 0 | Issues: 0

Kafka

Apache Kafka

Stargazers: 0 | Issues: 0 | Issues: 0

learning-spark

Example code from the Learning Spark book

Language: Java | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Million-Song-Dataset-HDF5-to-CSV

Million Song Dataset HDF5 to CSV Converter

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MongoDB

UCSC MongoDB Course

Stargazers: 0 | Issues: 0 | Issues: 0

PGDDA-Projects

A comprehensive 1-year program taught by industry experts and IIITB faculty; 7 case studies and projects; 400+ hours of academic learning and 30+ hours of industry mentoring

Language: R | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

python-for-everybody

Python For Everybody

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

Language: Scala | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Tableau

UCSC Extension Tableau practice

Stargazers: 0 | Issues: 0 | Issues: 0