nychealth / coronavirus-data

This repository contains data on Coronavirus Disease 2019 (COVID-19) in New York City (NYC), from the NYC Department of Health and Mental Hygiene.

Home Page:https://www1.nyc.gov/site/doh/covid/covid-19-data.page

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Vaccine Status Definition and Age-Adjustment Clarifications

warrtalon opened this issue · comments

Hello, @acharney2 , in this Readme, the definitions for "Vaccinated" and "Unvaccinated" leave room for interpretation.

https://github.com/nychealth/coronavirus-data/blob/master/latest/Readme.md

May we know the following?

Breakthrough Summary.csv:

  1. In what category do you place those who are <14 days after the most recent dose completing their primary series?
  • Are those counted as "Unvaccinated," or is there a 3rd unseen category (e.g. "Partially Vaccinated")?
  1. What is the formula used to create the Hosp/Death adjusted rates? I know it's based on the 2000 age distribution standard, but what is the formula? It appears to nearly triple the crude rate even though the largest Unvaxxed cohort is 75+ (besides 5-12)

Now-Weekly-Breakthrough.csv:

  1. Does this table include all COVID outcomes for each week, or is there another set of outcomes not seen because they don't fit the "Unvaccinated" or "Vaccinated" definitions?
  2. Hosp/Death rates:
  • Can you add columns for crude rate (like you show for Cases)?
  • If not, can you add population columns for Vax and Unvax so we can see the denominators? This would help us in the public to validate the math - it's great for transparency and confidence 👍

My chart here shows what I mean. The yellow line is my calculation for crude rate while the blue line is yours. It makes a huge difference (blue vs. yellow):
image

This 2nd version includes my interpolation for weeks 1/8-1/29 since those seem to have incomplete Vax Record Matching:
image

These details are important, because you can see how that artificial blue hump dramatically impacts the chart on the website:
image

Since the charts are being shared widely on social media by Commissioner Chokshi and very large media accounts before Vax Record Matching is complete, it can lead to public misconceptions and unintended policy impacts. You can see why I care so much about it, and I know you do, too. 🙏

Thank you!
-Clayton

I was just about to ask about the definition of "Vaccinated" and "Unvaccinated" in the breakthrough data myself so hope you get an answer soon!

I was just about to ask about the definition of "Vaccinated" and "Unvaccinated" in the breakthrough data myself so hope you get an answer soon!

Definitions are in that link I gave at the top. Here's a snippet:
image

Hi, thanks for the interest.

re: breakthrough-summary.csv: we do not have a partially vaccinated category in these data, and partially vaccinated people (those who are <14 days after completing the primary series) are excluded from these data, including the denominators.

re: age-adjustment: You can read more about age-adjustment here. This is a standard analytical process in epidemiology. When two populations have very different age distributions, crude rates primarily reveal age differences, whereas age-adjusted rates separate the effect of the age differences from the effect of the thing you want to study. In this case, the standard and appropriate way to compare these two populations is via age-adjusted rates.

Hi, thanks for the interest.

re: breakthrough-summary.csv: we do not have a partially vaccinated category in these data, and partially vaccinated people (those who are <14 days after completing the primary series) are excluded from these data, including the denominators.

re: age-adjustment: You can read more about age-adjustment here. This is a standard analytical process in epidemiology. When two populations have very different age distributions, crude rates primarily reveal age differences, whereas age-adjusted rates separate the effect of the age differences from the effect of the thing you want to study. In this case, the standard and appropriate way to compare these two populations is via age-adjusted rates.

Thank you for the partial vax answer. That makes sense and is good to confirm.

FOr the age-adjustment, I acknowledge awareness of the standard, and I understand why it's usually done. What I'm asking is what exact formula you're using and how it accounts for 75+ having more Unvaxxed than any other age group. This would seemingly cause the age-adjustment to go the other direction. Why would more young people being vaccinated cause age-adjustment to triple the crude rate?

These 2 charts show what I mean:
% by Age Group
image
Raw Count by Age Group
image
Raw Count with Death Rates (based on Week of 1/8/22)
image

May we see the calculations and how this works? If not, it's like a black box that no one in the public can understand.

Thank you!