Erjan's starred repositories
mage-zoomcamp
This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main
Image-Caption-Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
HashtagCashtag
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on the lambda architecture that aggregates Twitter and US stock market data for user sentiment analysis using open source tools: Apache Kafka for data ingestion, Apache Spark & Spark Streaming for batch & real-time processing, Apache Cassandra for storage, and Flask, Bootstrap and HighCharts for the frontend.
Solving-100-exercises
Solving Python problems and creating programs from scratch
Python-programming-exercises
100+ Python challenging programming exercises
pandas_exercises
Practice your pandas skills!
rt-analytics
An example project that demonstrates real-time big data stream processing using GigaSpaces
tengrinews-open-project
Data Engineering pet project covering GCP, Docker, workflow orchestration with Mage, data transformation with dbt, and batch processing via Spark
audiophile-e2e-pipeline
Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.
Smart-City-Sample
The smart city reference pipeline shows how to integrate various media building blocks, with analytics powered by the OpenVINO™ Toolkit, for traffic or stadium sensing, analytics and management tasks.
changecapture-e2e
This project shows how to capture changes from a Postgres database and stream them into Kafka
mlops-course
Learn how to design, develop, deploy and iterate on production-grade ML applications.
TakeHomeDataChallenges
My solution to the book <A collection of Data Science Take-home Challenges>
data_engineering_best_practices
Sample project to demonstrate data engineering best practices
League-of-Legends-Analytics
DataTalks.Club's Data Engineering Zoomcamp Project
employees-attrition-mlops
Final Project of the MLOps Zoomcamp hosted by DataTalksClub.
dtc-data-engineering-zoomcamp-project
DataTalks.Club's Data Engineering Zoomcamp Project
dataEngineering
A repo to track data engineering projects
Batch-data-engineering-project
A batch data pipeline that retrieves data from a user purchase table and a movie review table and transforms it into a user behaviour metric table.
Data-Engineering
A project portfolio to accompany my resume
Air_Pollution_Pipeline
Data Engineering Project in GCP
spotify-data-engineering-project
In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform it into the desired format and load it into an AWS data store.
steam-data-engineering
A data engineering project with Airflow, dbt, Terraform, GCP and much more!
Data_Engineering_Project_Portfolio
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3