luoluogogogo / COVIDExposureIndices

Exposure indices derived from PlaceIQ movement data by Couture, Dingel, Green, Handbury, and Williams

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Exposure indices derived from PlaceIQ movement data

Summary

This repository contains indices describing exposure on the basis of smartphone movements. These indices are produced by Victor Couture, Jonathan Dingel, Allison Green, Jessie Handbury, and Kevin Williams, with assistance from Hayden Parsley, Serena Xu, and Shih Hsuan Hsu. They are derived from anonymized, aggregated smartphone movement data provided by PlaceIQ.

We are making these indices publicly available to all researchers in the context of the spread of COVID-19. The indices, which describe (potential) exposure varying across locations and time within the United States, could be useful in a variety of applications. The indices published here cover January 2020 to present, with a seven-day lag. We will update them every weekday.

When using these indices, please cite "Measuring movement and social contact with smartphone data: a real-time application to COVID-19" by Couture, Dingel, Green, Handbury, and Williams. The paper describes properties of smartphone data, compares the residential distribution and movement patterns of devices to those in traditional data sources, and discusses potential measurement issues that arise in the context of the ongoing pandemic.

We thank Drew Breunig, Nicholas Sheilas, Stephanie Smiley, Elizabeth Cutrone, and the team at PlaceIQ for their ongoing help. This data release was approved by the University of California, Berkeley Office for Protection of Human Subjects under CPHS Protocol No 2018-05-11122.

Data description

The following indices are available as (compressed) CSV files in this GitHub repository.

Location exposure indices

Currently version 0.5.

  • State-level location exposure index (LEX): Among smartphones that pinged in a given state today, what share of those devices pinged in each state at least once during the previous 14 days? The daily state-level LEX is a 51-by-51 matrix in which each cell reports, among devices that pinged today in the column state, the share of devices that pinged in the row state at least once during the previous 14 days.

  • County-level location exposure index (LEX): Among smartphones that pinged in a given county today, what share of those devices pinged in each county at least once during the previous 14 days? The daily county-level LEX is an approximately 2000-by-2000 matrix in which each cell reports, among devices that pinged today in the column county, the share of devices that pinged in the row county at least once during the previous 14 days.

Daily LEX values are available for January 20 through August 19. To limit computational costs, we are only publishing LEX values for Wednesdays and Saturdays after August 20, 2020.

For tips on downloading and processing these data, please read the notes for users. For a full description of how we compute the LEX measures, please read the LEX documentation PDF.

Device exposure indices

Currently version 0.6.5.

  • State-level device exposure index (DEX): For a smartphone residing in a given state, how many distinct devices also visited any of the commercial venues that this device visited today? The state-level DEX reports the state-level average of this number across all devices residing in the state that day. The DEX values are necessarily only a fraction of the number of distinct individuals that also visited any of the commercial venues visited by a device, since only a fraction of individuals, venues, and visits are in the device sample.

    • DEX-income reports the state-level average of this number for the four groups of devices residing in each of four neighborhood-income quartiles.
    • DEX-education reports the state-level average of this number for the four groups of devices residing in each of four neighborhood-college-share quartiles.
    • DEX-race reports the state-level average of this number for four ethnic/racial demographic groups, weighting each device by the share of its neighborhood residents who belong to the ethnic/racial demographic group.
  • CBSA-level device exposure index (DEX): For a smartphone residing in a given core-based statistical area (CBSA), how many distinct devices also visited any of the commercial venues that this device visited today? CBSA-level counterparts of the state-level measures listed above are available.

  • County-level device exposure index (DEX): For a smartphone residing in a given county, how many distinct devices also visited any of the commercial venues that this device visited today? The county-level DEX reports the county-level average of this number across all devices residing in the county that day. The DEX values are necessarily only a fraction of the number of distinct individuals that also visited any of the commercial venues visited by a device, since only a fraction of individuals, venues, and visits are in the device sample.

  • Adjusted device exposure index (DEX-adjusted): In the context of the ongoing pandemic, the DEX measure may be biased if devices sheltering-in-place are not in the sample due to lack of movement. We report adjusted DEX values to help address this selection bias. DEX-adjusted is computed assuming that the number of devices has not declined since the early-2020 peak and that unobserved devices did not visit any commercial venues.

These indices are reported in a state-level file, a CBSA-level file, and a county-level file. For a codebook and tips on downloading and processing these data, please read the notes for users. For a full description of how we compute the DEX measures, please read the DEX documentation PDF.

Further information and feedback

  • This is ongoing work released rapidly in the context of the ongoing pandemic. The release version less than "1.0" signals that these indices have passed basic sensibility checks but are not a peer-reviewed research product.
  • Please read the documentation to understand relevant caveats.
  • If you have questions, suggestions, or spot a bug, please file an issue at the GitHub repository. We welcome opportunities to improve the documentation and to better understand how you might want to use these and related indices that we are developing.
  • The SQL code to produce the indices will soon be available on the data warehouse Snowflake. Researchers will be able to modify these queries to produce related indices derived from PlaceIQ's smartphone movement data.

Coming soon

Work in progress: Venue-level exposure index, Device network-overlap index

Acknowledgments

These indices are based upon work supported by the National Science Foundation under Grant No. 2030056. Victor Couture thanks the Fisher Center for Real Estate and Urban Economics at Berkeley for generous financial support. Jonathan Dingel thanks the James S. Kemper Foundation Faculty Research Fund and Initiative for Global Markets at the University of Chicago Booth School of Business. Jessie Handbury thanks the Research Sponsors Program of the Wharton Zell-Lurie Real Estate Center and the Wharton Dean's Research Fund. Kevin Williams thanks the Yale School of Management.

For helpful feedback and reactions to our data and documentation, we thank Gabriel Kreindler.

About

Exposure indices derived from PlaceIQ movement data by Couture, Dingel, Green, Handbury, and Williams