himanitawade's repositories

ivy

The Unified AI Framework

License:NOASSERTIONStargazers:0Issues:0Issues:0

Certifications

Data Engineering

Stargazers:0Issues:0Issues:0

RealtimeStreamingEngineering

This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from data acquisition, processing, sentiment analysis with ChatGPT, production to kafka topic and connection to elasticsearch.

Stargazers:0Issues:0Issues:0

dbt-bigquery-crash-course

A deep dive into the powerful combination of DBT and BigQuery, the game-changers in modern data engineering.

Stargazers:0Issues:0Issues:0

changecapture-e2e

This project shows how to capture changes from postgres database and stream them into kafka

Stargazers:0Issues:0Issues:0

ApacheFlink-SalesAnalytics

This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.

Stargazers:0Issues:0Issues:0

EMR-for-data-engineers

This project demonstrates the use of Amazon Elastic Map Reduce (EMR) for processing large datasets using Apache Spark. It includes a Spark script for ETL (Extract, Transform, Load) operations, AWS command line instructions for setting up and managing the EMR cluster, and a dataset for testing and demonstration purposes.

Stargazers:0Issues:0Issues:0

RedditDataEngineering

This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.

Stargazers:0Issues:0Issues:0

Japan-visa-data-engineering

This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark clusters are set up within a Docker container on Azure.

Stargazers:0Issues:0Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

cpsc449-project1

Back-End API for Game

Stargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:3Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

Algorithms

A collection of algorithms and data structures

License:MITStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:1Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0