akuskis / funda_analysis

Tool to collect data from funda.nl and do basic overview of this data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Funda Analysis

Description

Analysis of house pricing in the Netherlands by data received from Funda.

Setup

Get conda into your system. The commands below will import the environment and activate it.

conda env create -f environment.yml
conda activate funda

Pulling data

Call the following command from repository to pull data from Funda:

cd scrapy
scrapy crawl funda -O dump.json

Where funda is a name of the spider and dump.json is a place where to store collected data.

Historical data

Analysis

Run JupyterLab and navigate to the analysis folder (JupyterLab is installed as part of environment):

jupyter-lab

Notebooks are located in the analysis folder.

Examples

Source price types

Price

General distribution of the Living Area

Living area

Search price distribution

Filter data by buildings after 1990, 80-100 (m2) and not less than 20 properties per town.

By Filter

Areas are not taken into account (as there are a lot of bad neighborhood in every city), but top 25% should give us approximate price of the property. Anyway it's a sandbox just to get starting bids, because real price is different.

About

Tool to collect data from funda.nl and do basic overview of this data

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 58.2%Language:Python 41.8%