Labriji Saad's repositories
Youtube-video-transcriptor
In this notebook, I implemented a script to transcribe YouTube videos (and audio files in general) using Google's speech-to-text API.
Twitter-Sentiment-Analysis-with-Python
I aim in this project to analyze the sentiment of tweets provided from the Sentiment140 dataset by developing a machine learning sentiment analysis model involving the use of classifiers. The performance of these classifiers is then evaluated using accuracy and F1 scores.
labrijisaad
LABRIJI Saad's Profile Page README.md
Prediction-du-cours-de-Bourse
Forecast Apple stock prices using Python, machine learning, and time series analysis. Compare performance of four models for comprehensive analysis and prediction.
Kedro-Energy-Forecasting-Machine-Learning-Pipeline
This repo showcases a project that transforms ML model training into a simplified, production-ready Kedro Dockerized Pipeline. It emphasizes best MLOps practices, enabling easy training, evaluation, and deployment of models, including XGBoost, LightGBM and Random Forest, with built-in visualization and logging features for effective monitoring.
Data-warehousing-in-Azure-postgreSQL
In this repository, I address missing values in the Prosper dataset using advanced data cleaning techniques. The refined data is then seamlessly uploaded to a pre-configured Azure Postgres Database via a Jupyter Notebook, showcasing efficient data management and cloud database integration.
EXAMEN-DATA-ENGINEERING
In-class exams for Docker, Git, and ML.
Language-Identifier-SVM
Language identification script that can detect the language of a given text. Currently supports Swahili, Wolof, French, English, Arabic, and Dyula. Customizable language support.
Dimension-Reduction-Clustering-Data
explore the impact of dimensionality reduction on the quality of clustering.
Git-Clustering
Enhanced and Repackaged GIT Clustering: This repository offers an open-source, enhanced version of the GIT (Graph of Intensity Topology) clustering algorithm.
Monthly-Daily-Energy-Forecasting-Docker-API
This repository houses an Energy Forecasting API that uses Machine Learning to predict daily and monthly energy consumption from historical data. It's designed as a practical demonstration of a Machine Learning Engineering workflow, from initial analysis to a deployable API packaged with Docker.
Sentiment-analysis-on-the-Quran-Karim-dataset
In this notebook, I attempted to create a script that utilizes pre-trained CamemBERT and VaderSentiment models to label the sentiment of a Quran Karim dataset in English and French. My goal was to accurately classify the sentiment of each text sample in the dataset.
DevEnvConfigurations
π Centralized repository for my customized IDE settings, system configurations, and tech stack preferences. π οΈ
DimReduce-HealthAnalytics
A project showcasing the application of various dimensionality reduction techniques for visualizing and analyzing simulated health diagnostics data in 2D and 3D.
Technical-Test-NLP-Category-Correction
This repo has a Jupyter Notebook for an e-commerce NLP and data manipulation technical test.
working-with-cassandra
In this repository, I covered the basics of Cassandra π§Ώ
working-with-mongodb
In this repository, I covered the basics of MongoDB π₯
Apache-beam-k-means
Implementing K-means clustering in sequential, streaming, and distributed formats using Apache Beam.
AXA-Direct-ML-Apprenticeship
Repository showcasing my Machine Learning Engineering Apprenticeship at AXA-Direct Assurance, contributing to the development and implementation of Machine Learning solutions.
Chefclub-Data-Internship
Repository showcasing my Data Engineer / Scientist internship at Chefclub, contributing to data infrastructure enhancement and fostering data-driven insights.
Optimal-K-in-K-Means-Clustering
Using the Elbow Method and Silhouette Analysis to find the optimal K in K-Means Clustering.
Technical-Test-AXA-Direct-Assurance
Data Science Technical Test Solution.
unsupervised-learning-project
In the class project related to unsupervised learning.
ClusterLLM
LLM guided text clustering
DeepMoji
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
first-Android-app
Here is the source code for our first inclass Android app.
Mnist-Deep-Learning-Project
his repository is dedicated to tackling three distinct problems using deep learning techniques on the Mnist dataset.