python jupyter-notebook web-scraping data-mining data-science big-data machine-learning complex-networks community-detection corruption-networks crime-prediction crimonology quantitative-criminology social-networks igraph scikit-learn networkx

Crime and political corruption analysis using data mining, machine learning and complex networks

There has been a remarkable increasing in the amount of stored data by private and public companies. On one hand, these huge amounts of data enable a detailed historical review of the processes under investigation; on the other hand, this excess of data makes harder to extract summarized information and also to make good decisions supported by well-established empirical facts. This modern phenomenon has been called a big data and understanding these systems and extracting patterns from these data requires a multidisciplinary approach. In this sense, during the course at the School of Applied Mathematics in the Institute of Mathematics and Computer Science at University of São Paulo we will address topics that involve computer science, statistics, and physics to understand these systems. Among the topics, we will focus on the following ones:

Introduction to Python;
Web scraping;
Data mining;
Machine learning;
Complex networks.

Using these tools, we will focus on two issues that are of great relevance in Brazil: predicting homicides in cities and describing the mechanism behind political corruption networks. In the first topic, we will use machine learning techniques to predict the number of crimes in Brazilian cities. In the second topic, we will use complex networks to describe the interaction between politicians investigated in corruption scandals in Brazil from 1987 to 2014.

Any comments, questions, or concerns can be directed to:

Luiz G. A. Alves lgaalves@northwestern.edu

Course Syllabus

This course is broken up into several modules with each module having a set of Jupyter notebooks to help teach concepts.

Basics, Collections and Files (Day 1)

Imports, Plots, Functions, Dictionaries, and Web Scraping (Day 2)

Data Mining, Statistics, and Data Analysis (Day 3)

Machine Learning Part I (Day 4)

Machine Learning Part II (Day 5)

Complex Network and Analysis of Corruption Networks (Day 6)

Social Network Analysis Using `igraph` and `leidenalg` (Extra)

Software Installation

This bootcamp uses the Anaconda Python 3.7 distribution

You must have Anaconda Python 3.7 installed before the first day of class

Downloading Course Materials

The course materials can be downloaded from the repository's github page. Just download the zip file, unzip it onto your Desktop, and rename the directory school-of-applied-math.

Usage of Course Materials

This text and the majority of the course will conducted with Jupyter Notebook http://jupyter.org. Jupyter Notebook is a 'web-based interactive computational environment', meaning that it allows to write and execute python code in a web page from your own computers. Jupyter Notebook is a relatively new tool and we believe that is an excellent way to teach the basics of python programming and computational data analysis.

Jupyter Notebook is installed by default with the Anaconda Python distribution and can be laucnhed from the Anaconda Navigator program.

Location and period of the course:

Period: July 1 to July 6, 2019.

Hours: 08:00 to 12:00

Location: (Institute of Mathematics and Computer Science at University of São Paulo) / University of São Paulo (rooms of block 3).

Approval Criteria: 85% of attendance and performance of proposed activities.

Target Audience: Senior year students and postgraduate students in applied mathematics, statistics, computer science and physics interested in data science.

Number of vacancies: 20

Enrollment Period: 04/15/2019 to 05/30/2019.

References

About

Lectures on "crime and political corruption analysis using data mining, machine learning and complex networks" at the School of Applied Mathematics in the Institute of Mathematics and Computer Science at University of São Paulo

https://github.com/lgaalves/school_crime_and_corruption_analysis