voskresenskiy / citizen_science

Data analysis for citizen science projects

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Citizen science open data analysis

Here we do data analysis for citizen science projects with the open data https://www.inaturalist.org/projects/city-nature-challenge-2019 The project is in progress and discussions between @Liubov Tupikina, Muki Haklay, CRI.

The project is in process. See the correlaid team github for the names of team members.

Preliminary notebook for analysis is "citizen_science_analysis".

Reseach Questions (RQs)

RQ1: Can we identify macroscopic laws of citizen science projects? The main goal of the project to identify short-term and long-term dynamics in citizen science projects. (general RQ)

RQ2: How citizen science communities are growing in time, what drives their growth if not preferential attachment or exogenous factors?

RQ3: If we are looking at City Nature Challenge data - we have now data for San Francisco, LA, and London, which now have 4 or 3 years time series. How the power law like distributions of human activities are emerging? It's interesting that it is similar for SF and LA and the pattern that they have in year 4, London has in year 3.

RQ4: (a) Are there differences in data collection patterns in each year and between years? Are there also emerging pattern over time in a location? (b) Are there also emerging pattern over time in a location?

RQ5: To what degree the heavy contributors influence the total pattern and distribution of data? Is there any observed impact from major contributors (e.g. someone who is heavy contributors dropping off because they can't keep up with the competition). There is also the question about repeat recruitment - are the pattern of one timers vs people who come back stay the same from year to year? Are there any patterns or places that are behaving differently in terms of patterns of contribution?

RQ6: Since there are specific patterns of participation for different people from various countries we would like to understand the reason of power-law of participation (some participate a lot, many participate only little). We would like to understand how this pattern emerge dynamically (M.Pocco).

RQ7: What is the dfference between citizen science projects evolution in different cities around the globe? London vs. San Francisco? What are parameters which may lead to differences?

Some of the problems for citizen science community are: heterogeneity of collected data, missing data, non-controlled growth of community of observers.

Participate

The project is now discussed together with CorrelAid X Paris x CRI https://github.com/correlaid-paris/

Contact person: Liubov Tupikina (CRI, Bell labs), Muki Haklay (UCL), Frie (Correlaid).

About

Data analysis for citizen science projects


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%