Gonza Ferreiro Volpi's repositories
Networks_Analysis_plus_Recommendations_system
This project explores the classic MovieLens dataset, first from a networks perspective, analyzing the relationship between users and movies. Later, in the main part of the project, we built and evaluate several Recommendations Systems.
SQL_practice_and_application
Some simple notebooks to show basic SQL skills through Pandas
NLP_with_20newsgroups
In this brief project we're gonna explore a few NLP tools using a Sklearn dataset and the following modelling techniques: bag of words, Hashing and TF-IDF vectorizer.
Simple_web_scraper
Simple web scraper using a little bit of regex to obtain a list of books and authors
Dealing_with_class_imbalance
In this repository you'll find a theoretical introduction to the problem of class imbalance, as well as a notebook with examples about how to use some of the algorithms mentioned in the theoretical guide.
Market_value_football_players
Final project of my immersive course in Data Science at General Assembly. It consisted of a lot of Web Scraping + several regression techniques to predict current value of football players. Conclusions inside.
Spark_theoretical_practical_application
In this repository I'll be exploring in deep three labs from my Immersive Course in Data Science about Spark including some basic map reduce and SQL-Spark operations, as well as a bit of modelling through Spark
Basic_skills_with_python
First project ever done with Python about data structures, functions and some statistics/probability to describe and refine a Pokemon gameplay.
From_job_posts_to_salaries_classification
For this project I explored different machine learning classification models to predict four salary categories for Data Science job posts using publications from Indeed.co.uk. The goal was to obtain an accuracy of 0.8 or higher on both the train and test group, which implies predicting correctly at least 80% of the total population.
Random_forest_theoretical_practical_application
This brief project explores first the theoretical background behind Random Forest, followed by its application with the Boston Housing dataset
gonzaferreiro.github.io
Repository for website creation
Predicting_house_prices_with_ames_dataset
This projected aimed to estimate the sale price of properties based on their "fixed" characteristics, such as neighborhood, lot size, number of stories, etc. In second place, I tried to estimate the value of possible changes and renovations to properties from the variation in sale price not explained by the fixed characteristics. The goal was to estimate the potential return on investment when making specific improvements to properties. This project uses the Ames housing data recently made available on Kaggle.
StreamlitDemos
Streamlit demo for Medium post.
SES_Uruguay
This repository and notebook try to analyze the Socioeconomic Status in Uruguay bases on data from a national survey
Several_things
This is my repository to save ideas, bunches of code, things I don't wanna lose, etc.
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
deploying-machine-learning-models
Example Repo for the Udemy Course "Deployment of Machine Learning Models"
python_enigma_machine
Enigma Machine built in Python using OOP
learn-go-with-tests
Learn Go with test-driven development
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks