Pvillamanario / playing_with_BiciMad

Just playing with the Open Data Portal of Madrid about the public bikes service (BiciMad)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ih_datamadpt1121_project_m1

Ironhack Madrid - Data Analytics Part Time - November 2021 - Project Module 1

Data:

There are 2 main datasources:

  • Azure SQL Database. The database contains information from the BiciMAD stations including their location (i.e.: latitude / longitude). In order to access the database you may need the following credentials:
Server name:   sqlironhack
Database:      BiciMAD

IMPORTANT => Username and password will be provided in class.

IMPORTANT => Specific datasets will be assigned to each student in order to perform the challenges.


Main Challenge:

You must create a Python App (Data Pipeline) that allow their potential users to find the nearest BiciMAD station to a set of places of interest. The output table should look similar to:

Place of interest Type of place (*) Place address BiciMAD station Station location
Auditorio Carmen Laforet (Ciudad Lineal) Centros Culturales Calle Jazmin, 46 Legazpi Calle Bolívar, 3
Centro Comunitario Casino de la Reina Centros municipales de enseñanzas artísticas Calle Casino, 3 Chamartin Calle Rodríguez Jaén, 40
... ... ... ... ...

(*) There is a list of datasets each one with different places. A specific dataset will be assigned to each student.

Your project must meet the following requirements:

  • It must be contained in a GitHub repository which includes a README file that explains the aim and content of your code. You may follow the structure suggested here.

  • It must create, at least, a .csv file including the requested table (i.e. Main Challenge). Alternatively, you may create an image, pdf, plot or any other output format that you may find convenient. You may also send your output by e-mail, upload it to a cloud repository, etc.

  • It must provide, at least, two options for the final user to select when executing using argparse: (1) To get the table for every 'Place of interest' included in the dataset (or a set of them), (2) To get the table for a specific 'Place of interest' imputed by the user.

Additionally:

  • You must prepare a 4 minutes presentation (ppt, canva, etc.) to explain your project (Instructors will provide further details about the content of the presentation).

  • The last slide of your presentation must include your candidate for the 'Ironhack Data Code Beauty Pageant'.


Bonus 1:

You may include in your table the availability of bikes in each station.


Bonus 2:

You may improve the usability of your app by using FuzzyWuzzy.


Bonus 3:

Feel free to enrich your output data with any data you may find relevant (e.g.: wiki info for every place of interest).


Project Main Stack

About

Just playing with the Open Data Portal of Madrid about the public bikes service (BiciMad)


Languages

Language:Jupyter Notebook 91.1%Language:Python 8.9%