aolifodaisy / SDSC_22_workshop

This repository contains the materials for one of the Spatial Data Science Conference (2022) workshops, ran by Gladys Kenyon and Juan Ramon Selva

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SDSC_22_workshop (18/05/22)

Exploring the housing market in Spanish cities

Gladys and Juan will take you through an exploratory spatial analysis and market segmentation of property listings data from idealista. The city of focus is Madrid, the capital of Spain. You will create a range of visualuations and maps, and implement an unsupervised machine learning algorithm (k-means), to create clusters of properties. The analysis will be done in Python, every required step of the method is coded for you. If you are comfortable writing Python code, there are optional exersises to extend the analysis to other cities (Barcelona and Valencia).

Running the notebook

The analysis is done in a jupyter notebook. You will need to bring a laptop to run the notebook. You have the option to run it locally or to run it through the internet.

Locally

Follow these instructions to install and run Python from your own computer. You will need sufficient storage space for the software and data. Doing it this way will mean you are set up for future analysis and you can save your work and come back to it later!

If you want to run the notebook locally, it is recommended to create a new conda environment based on the environment.yml file.

conda env create -f environment.yml

Alternatively, Make sure all the required packages are up to date and available in your environment:

  • geopandas
  • contextily
  • seaborn
  • pygeos
  • pysal
  • scikit-learn

Run online

Open in an interactive in-browser environment. On the day there will be a binder link here, you do not need to do anything beforehand.

Binder

Data Source

Properties listing datasets and spatial segmentation have been brought thanks to idealista.

A detailed explanation of the methodology can be accessed via this Data in Brief working paper, authored by David Rey Blanco, Pelayo González Arbués, Fernando López Hernández & Antonio Páez.

Terms of use

Use of this dataset is subject to the license CC 4.0, as indicated at https://github.com/paezha/idealista18/blob/master/LICENSE.md

About

This repository contains the materials for one of the Spatial Data Science Conference (2022) workshops, ran by Gladys Kenyon and Juan Ramon Selva

License:MIT License


Languages

Language:Jupyter Notebook 100.0%