Chinese Inter-County Competition & Pro-Business Policy

Materials for my MA thesis at Fudan University (复旦大学)

The idea is, counties want to attract investment, but firms don't care if they locate in one county or another nearby.
So counties have to compete with one another for firm investment: the more counties, the fiercer competition is.
Some experts (e.g. Cheung, 2014) think inter-county competition is the main reason for China's economic success.

We can measure pro-business policy by the effective tax rate — how much of the federal tax rate is actually enforced.
A problem is, there may be some factor affecting both county density and the tax rate, causing spurious correlation.
For example, an area richer to begin with will have more counties and also more development (which will affect taxes).
To overcome endogeneity, we need a variable correlated with county density, but not economic development.

This study uses geography as an instrumental variable for county density, controlling for development.
I use the software ArcGIS to analyze geographic data for land elevation & agricultural productivity.
Then I use regression analysis with Stata to determine how these influence pro-business policy.

Code – Brief Outline

density.R

Uses county adjacency & centroid data extracted from ArcGIS as Excel files (available on request)
To reduce search space, uses neighbours within a given 'depth' (e.g. neighbours of neighbours is depth 2)
Density for a given county is its number of neighbours whose centroid is within 100km of that county's centroid

assembleData.R

Works with geographic data extracted using ArcGIS, contained in Excel files (available on request)
Cleans counties with missing values, salvages values by using data for neighbouring counties
Uses run length encoder to repeat geographic data for firms in each county
After running, the data are assembled into a regression that can be run in R or Stata

robustness.R

Reassembles data using only adjacent neighbours (sharing a border), rather than within 100km
Experiments with different specifications for tax enforcements from the firm-level data
Reassembles data for a subset of counties for which GDP data is available

The Stata do-file shows how to run the main regressions, but isn't meant to be run as-is.

Possible Extensions

In theory, I can omit ArcGIS and Stata and do my entire analysis using R.
My school made me use a custom Office template, but I'd prefer to replicate it in LaTeX.
I'd love to make this ‘reproducible’, i.e. push a button to replicate the whole thesis from scratch

gjoncas / China-Counties

Chinese Inter-County Competition & Pro-Business Policy

Code – Brief Outline

Possible Extensions

About

Languages