James Mwangi (James-Wachuka)

James-Wachuka

Geek Repo

Location:Kenya

Home Page:james-wachuka.github.io

Twitter:@wachuka_james

Github PK Tool:Github PK Tool

James Mwangi's repositories

weather_data_pipeline

This is a PySpark-based data pipeline that fetches weather data for a few cities, performs some basic processing and transformation on the data, and then writes the processed data to a Google Cloud Storage bucket and a BigQuery table.The data is then viewed in a looker dashboard

Language:PythonStargazers:5Issues:2Issues:0

event-driven-microservices

This project demonstrates an event-driven microservices architecture using Apache Kafka for event streaming and webhook integration with external services

Language:PythonLicense:MITStargazers:4Issues:2Issues:0

python_etl

Using python-sql to create ETL between mysql and postgresql and windows scheduler to automate the job.

Language:PythonStargazers:4Issues:1Issues:0

python-kafka_distributed_task_queue

a simple implementation of a distributed task queue

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

python_tel_chatbot

python telegram chatbot using telegram API

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

awesome-opensource-data-engineering

An Awesome List of Open-Source Data Engineering Projects

License:NOASSERTIONStargazers:0Issues:0Issues:0

coingecko-streamapp

a streaming app and a dashboard for visualizing cryptocurrency data fetched from the CoinGecko API. The streaming app retrieves real-time cryptocurrency information using Spark Streaming and stores it in a PostgreSQL database.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

data-diff

Compare tables within or across databases

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dataquest_DE_learningpath

code from my data engineering learning path by dataquest

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dta_warehouse_example

using mysql and talend open studio to perform ETL

Stargazers:0Issues:1Issues:1

dta_warehouse_hive

A data warehouse implementation in hive.

Language:HiveQLStargazers:0Issues:0Issues:0

face_det

face detection in python using Open CV

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

flaskapp

serving a ml model in flask

Language:PythonStargazers:0Issues:1Issues:0

mentalhealth_analysis-data-pipeline

An end to end data pipeline for for mental health analysis

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

mysql_gcp

using airflow to extract data from mysql transform and load into bigquery

Language:PythonStargazers:0Issues:0Issues:0

podcasts_pipeline

Building a four-step data pipeline using Airflow to download podcast episodes.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Prefect-PostgreSQL-Sensors

The prefect_postgres_sensors package provides Prefect sensors for monitoring changes or conditions within a PostgreSQL database.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pyspark_optimization

using cache/persit methods to optimize pyspark and Pyspark/SQL to query mysql database

Language:PythonStargazers:0Issues:0Issues:0

python_tweepy

Using python and tweepy to followback friends on twitter. This task uses the windows scheduler to follow back every 5 minutes

Language:PythonStargazers:0Issues:0Issues:0

tweepy_airflow

airflow dag that shows twitter trending hashtags every 20 mins

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MapReduce

mapreduce techniques in hadoop-joins, job counters, inputs/outputs

Language:JavaStargazers:0Issues:0Issues:0

ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

python_flask

python-flask basics

Language:PythonStargazers:0Issues:0Issues:0

R_examples

R- data science exercises and examples

License:MITStargazers:0Issues:0Issues:0

scrape_selenium

twitter automation with selenium

Language:PythonStargazers:0Issues:1Issues:0

shell_

first attempt at windows task scheduling

Language:BatchfileStargazers:0Issues:0Issues:0

soda-core

:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

License:Apache-2.0Stargazers:0Issues:0Issues:0

speeddating_R

supervised learning in R

Language:HTMLStargazers:0Issues:0Issues:0

weatherbot

weatherbot -using weather map API and telegram API

Language:PythonLicense:MITStargazers:0Issues:0Issues:0