Victor Oketch Sabare's repositories
stock-price-prediction-spark-cassandra
This is a data pipeline for predicting stock prices using Apache Spark, Apache Cassandra, and machine learning techniques. It collects and preprocesses stock data from the Alpha Vantage API, engineers features, trains models, and performs data analysis and predictions.
Stock_Price_Data_Analysis
This repository contains the code and analysis for my stock price analysis and forecasting project, completed during my internal attachment at Jomo Kenyatta University of Agriculture and Technology. The project analyzes historical stock price data, visualizes trends, and develops a forecasting model using Python and data science techniques.
ds-comm-ke
Data Science Communities in Kenya
The-Forex-Data-Pipeline
The Forex Data Pipeline is a comprehensive solution designed to collect, process, and prepare currency exchange rate data for downstream machine-learning pipelines. This repository showcases the creation of a data pipeline that fetches currency rates from an external API and performs data transformation using PySpark.
The-GitHub-History-of-the-Scala-language
Explore the evolution of the Scala language through its vibrant GitHub history, and find out who has had the most influence on its development and who the experts are. This is a comprehensive collection of historical data, commits, issues, and pull requests related to the development of Scala, a modern, multi-paradigm programming language.
Airports-Average-Distances
In this project, I import an open dataset from Socrata that contains airport codes, latitude coordinates, and longitude coordinates for 13,429 US airports.
Analyzing-Students-Mental-Health
In this project, I explore and analyze the student data to see how the study reached its conclusions and to gain a better understanding of it. Specifically, I analyze how the length of stay (stay) impacts the mental health of the international students in the study.
Apache_Kafka_Project
In this project, I explored Apache Kafka concepts such as asynchronous messaging, real-time stream processing, logging and monitoring, event sourcing, and real-time analytics.
Databriks-Golang-SDK
This repository holds code for installing and configuring the Databricks SDK with Go in Visual Studio Code for ELT tasks with Spark SQL and Python.
dp-203-azure-data-engineer
Exercise files for Microsoft Data Engineer curriculum
Dr.-Semmelweis-and-the-Discovery-of-Handwashing
Reanalyzed the data behind one of the most important discoveries of modern medicine: handwashing
Drive-Safe
🚗 DriveSafe: Your Guardian Angel on the Road 🛡️ Stay alert and safe behind the wheel with DriveSafe! Powered by facial recognition and machine learning, it detects signs of fatigue in real time, ensuring every journey is a safe one. Let DriveSafe be your co-pilot on the road to safer driving! 🌟
ETL-DAG-with-Airflow
Welcome to my repository for building a Directed Acyclic Graph (DAG) using Apache Airflow for analyzing top-level domains (TLDs). This project aims to provide a robust framework for systematically collecting data on TLD usage and performing insightful analyses using Airflow's powerful workflow automation capabilities.
Getting-Started-as-User-Assistance-Developer
A repository to share content and helpful resources about user assistance, information architecture and technical writing.
introduction-to-analytics-engineering-4380318
This is a code repository for the LinkedIn Learning course Intro to Analytics Engineering.
Job-descriptions
Looking for a job in data science? In this project, I have "scraped" (taken from the web) 1000 job descriptions for companies
Reminder-Teams-App
A powerful app with customizable reminders for messages in channels and chats, along with a boomerang notification feature to remind you of unreplied messages. Efficiently manage your reminders and never miss an important message again.
Technical_Writing_Training
This repository contains the coursework that I completed for the Udemy course "Technical Writing: How to Write Software Documentation", offered by Jordan Stanchev.
textbook
The textbook Computational and Inferential Thinking: The Foundations of Data Science
workshop-library
A library of workshops written by and for Microsoft Learn Student Ambassadors and Cloud Advocates and their local communities
zenorocha.com
My personal website ❤️