There are 8 repositories under datawrangling topic.
A fluent code explorer for R. 🔍
This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price
Understand the relationships between various features in relation with the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and scikit learn package is used for statistical analysis.
course website for data science tools 1
Data Science and Artificial Intelligence advanced certification course led by the IIT Madras & Intellipaat
Python Data Analytics, Machine Learning & Natural Language Processing
Fundamentals of Quantitative Analysis
Personal finance database creation, SQL analysis, and Power BI dashboard
I aquired a full scholarship from Google Launchpad. Advanced data wrangling skills to work with messy, complex real-world datasets. Highly customized visualizations using the Matplotlib Python library
This repository is a collection of all the solutions of tasks that were assigned to me during my Data Analytics Virtual Internship Experience Program at Quantium. 💻📚📊
All dataset analyzed during ALX data analyst nanodegree program including project files
Data visualization with Python
A small utility for analyzing data structures (e.g. JSON files)
Scrapers, parsers, data wrangling and utilities for TikTok and YouTube
It includes full exploratory data analysis (EDA) and preprocessing steps before applying machine learning algorithms. The project uses various datasets and covers data wrangling, exploratory data analysis, data visualization, feature engineering
To import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. I will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to evaluate them.
A collection of projects as part of the Python for Data Science program at GreyAtom EduTech Pvt Ltd
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
I have successfully completed a 16-week and 8 end-to-end, applied data science projects of the Applied Data Science Lab module at WorldQuant University. The mini-projects included scientific computing, data wrangling, machine learning and natural language processing with Python.
This project is part of the IBM Data Science Professional Certificate course. It aims to identify the best classifier to predict whether the first stage of a Falcon9 rocket will land successfully, helping SpaceX minimize costs.
This project aims to predict employee salaries based on their experience. It involves data wrangling, exploratory data analysis, data visualization, and applies linear regression for prediction.
This project aims to predict student grades using various independent features. It involves data wrangling, exploratory data analysis, data visualization, and linear regression. The project uses Python and Jupyter notebooks for implementation.
Udacity Data Analyst Nanodegree
Designed a dashboard to track employee data for the HR team including attendence, working hours and leaves. This dashboard can streamline the HR processes and also can save the HR team 3-4 hours of work daily.
The goal of this project is to use Netflix data (7787, 12) to classify and group movies and shows into specific clusters. We will utilize techniques such as K-means clustering, Agglomerative clustering, and content-based recommendation systems to analyze the data and provide personalized suggestions to consumers based on their preferences
This project dives deep into the historical and contemporary data of the Olympics to uncover trends, patterns, and insights.
I have developed a comprehensive Amazon Prime Video Dashboard, which offers in-depth insights into the platform's content library. This dashboard is designed to help stakeholders understand various aspects of the available shows
This repository showcases my work in a Data Analytics and Commercial Insights simulation. Tasks include data preparation, customer analytics, and uplift testing using transaction data to generate strategic, data-driven recommendations. Outputs include code, benchmark analysis, and reports aimed at supporting informed business decisions.
IBM Data Science Capstone Project from Coursera