June 18, 2021

Data disarray damages COVID-19 response, expert says

by Beth Blauer, Johns Hopkins University

Throughout our work at the Coronavirus Resource Center, we have highlighted that state reporting methods for COVID-19 testing, hospitalization, death, and vaccination data are limited by lack of standardization and accessibility. These challenges only worsen when the data is broken into a dizzying number of demographic subcategories. Further complicating the matter, states often only provide demographic data in an aggregated form, preventing local governments and the public from accessing detailed demographic information for their areas.

Why is demographic data important?

With access to detailed demographic data, local governments would be equipped to design targeted outreach programs and deploy mobile vaccination units to locate and assist their most vulnerable residents. This information would also inform policy decisions of city and county leaders by more accurately identifying hotspots, opportunities to invest in public health assets, and track outbreaks in near real-time. Additionally, because COVID-19 is still a relatively new virus, making detailed demographic data publicly available would help the scientific community better understand how the virus spreads and which people are most impacted by infection. This could help with the design of new therapies and better prepare public health professionals for future viruses and variants.

What are the problems?

There are two major issues with COVID-19 demographic data:

Detailed demographic data from states is not accessible to the public and local governments
Demographic categories and definitions differ between states

Exacerbating these challenges, state demographic data is often released in the form of charts or PDF reports, which contain aggregated data and lack individual detail necessary for accurate analysis. This bare-bones, aggregated data often is stored in hard to reach places, recorded manually in assorted reports, press releases, and even images. This manual data collection process requires significant work, unlike other data streams whose collection has been automated due to the format and accessibility of data. Releasing raw, individualized COVID-19 demographic data from which the states calculate these aggregates would remove the need for manual data collection. Publicly releasing the data would also allow county and city-level officials and public health workers to access and utilize this granular demographic information.

More concerning is the fact that the data is so disparate it's essentially impossible to compare between states. There are no standards for categorizing demographic data, so individual decisions to label categories with similar but different names, such as "Hawaiian" vs. "Hawaiian/Pacific Islander," complicate the data. There are currently 1,098 different demographic categories reported by the U.S. states and territories, which is an unmanageable quantity. This description of data disarray does not even include testing, hospitalization, or cross-categorization metrics, such as "white women aged 30-45," which would add thousands more categories.

There are even discrepancies within the same state. In Georgia, the demographic age categories on the state COVID-19 dashboard do not match the age categories on the state vaccination dashboard. A 60-year-old Georgia resident would be in the 60-69 age demographic group for a COVID-19 case, but in the 55-64 age group for their COVID-19 vaccination. This one person has already contributed data to two separate demographic pools, before accounting for sex, ethnicity, and race.

How are these issues manifested in the data?

CRC data scientists have visualized the plethora of demographic categories as tree maps. Each box in the map represents a distinct demographic category and its size is proportional to the number of times that label is used across states.

Racial data is the most complex and convoluted of all the demographic data even though the U.S. Office of Management and Budget only mandates five race categories. Instead, there are over 40 unique demographic categories of race reported within state vaccination data alone. While America is a melting pot where many people have rich, diverse ancestries, there are mathematical and scientific benefits to grouping the data into a reasonable number of consistent categories. A person who is "Middle Eastern" should not become "Asian" when they cross one state border and "white" when they cross another. This lack of data uniformity makes race-based analysis of COVID-19 trends virtually impossible.

What can we do to fix it?

The good news is the data exist. States could release the raw data publicly to allow for better demographic analysis. The data can and should be disaggregated, randomized, individualized, and anonymized. While privacy is not a major concern in a data set this large, it is important to proactively protect individual anonymity while pursuing data analysis to protect the country. These data should be easily accessible, machine-readable, regularly updated, and centralized so that policymakers and the public can make informed decisions and identify risks in near real-time.

As the country moves forward in this pandemic and prepares for the next public health crisis, demographic categories need to be standardized. The United States does not have federal standards for public health data categorization, but state health departments could agree to follow one example, such as that of the Census Bureau or the Office of Management and Budget, which overlap and have been thoroughly vetted in a bipartisan manner. Standardizing and releasing these data streams should be a national priority.

More information: Daily Status Report, Georgia Department of Public Health, 2021.

GA DPH Vaccine Distribution Dashboard, Georgia Department of Public Health.

About Race, 16 October 2020. www.census.gov/topics/population/race/about.html. (Accessed 07 June 2021).

Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity, in: Federal Register, Office of Management and Budget, 30 October 1997, pp. 58782-58790.

Standards for Maintaining, Collecting, and Presenting Federal Data on Race and Ethnicity, in: Federal Register, Office of Management and Budget, 30 September 2016, pp. 67398-67401.

Provided by Johns Hopkins University

Citation: Data disarray damages COVID-19 response, expert says (2021, June 18) retrieved 30 June 2024 from https://medicalxpress.com/news/2021-06-disarray-covid-response-expert.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Data dashboard highlights COVID-19 demographics and time trends

Feedback to editors

Insurance coverage disruptions, challenges accessing care common amid Medicaid unwinding

16 hours ago

Scientists developing a monoclonal antibody to neutralize Nipah virus one of the deadliest zoonotic pathogens

20 hours ago

Researchers develop scalable synthesis of cancer-fighting compounds

Jun 28, 2024

New device inspired by python teeth may reduce the risk of rotator cuff re-tearing

Jun 28, 2024

Serotonin 2C receptor regulates memory in mice and humans: Implications for Alzheimer's disease

Jun 28, 2024

Fears of attack and no phone signal deter women trail runners, finds study

Jun 28, 2024

Creating supranormal hearing in mice

Jun 28, 2024

Visualizing core pathologies of Parkinson's disease and related disorders in live patients

Jun 28, 2024

Novel mechanism for targeting bone marrow adipocytes to prevent bone loss

Jun 28, 2024

Breakthrough research makes cancer-fighting viral agent more effective

Jun 28, 2024

Load comments (0)

Data disarray damages COVID-19 response, expert says

Why is demographic data important?

What are the problems?

How are these issues manifested in the data?

What can we do to fix it?

Insurance coverage disruptions, challenges accessing care common amid Medicaid unwinding

Scientists developing a monoclonal antibody to neutralize Nipah virus one of the deadliest zoonotic pathogens

Researchers develop scalable synthesis of cancer-fighting compounds

New device inspired by python teeth may reduce the risk of rotator cuff re-tearing

Serotonin 2C receptor regulates memory in mice and humans: Implications for Alzheimer's disease

Fears of attack and no phone signal deter women trail runners, finds study

Creating supranormal hearing in mice

Visualizing core pathologies of Parkinson's disease and related disorders in live patients

Novel mechanism for targeting bone marrow adipocytes to prevent bone loss

Breakthrough research makes cancer-fighting viral agent more effective

Data dashboard highlights COVID-19 demographics and time trends

Mobility data used to respond to COVID-19 can leave out older and non-white people

Attitudes toward vaccination more favorable after J&J vaccine pause

How to use statistics to prepare for the next pandemic

Greater than 60 percent receiving COVID-19 vaccine in U.S. are non-Hispanic whites

Black healthcare workers at highest risk of contracting COVID-19

Scientists developing a monoclonal antibody to neutralize Nipah virus one of the deadliest zoonotic pathogens

Gene therapy halts progression of rare genetic condition in young boy

Loss of salt and body fluid stimulates kidney regeneration in mice

Chemical conjugation mitigates immunotoxicity of chemotherapy of lipid nanoparticles

New predictors of metastasis in patients with early-stage pancreatic cancer

New study sheds light on potassium channels to help researchers design better drugs

Phys.org

Tech Xplore

Science X

Data disarray damages COVID-19 response, expert says

Why is demographic data important?

What are the problems?

How are these issues manifested in the data?

What can we do to fix it?

Insurance coverage disruptions, challenges accessing care common amid Medicaid unwinding

Scientists developing a monoclonal antibody to neutralize Nipah virus one of the deadliest zoonotic pathogens

Researchers develop scalable synthesis of cancer-fighting compounds

New device inspired by python teeth may reduce the risk of rotator cuff re-tearing

Serotonin 2C receptor regulates memory in mice and humans: Implications for Alzheimer's disease

Fears of attack and no phone signal deter women trail runners, finds study

Creating supranormal hearing in mice

Visualizing core pathologies of Parkinson's disease and related disorders in live patients

Novel mechanism for targeting bone marrow adipocytes to prevent bone loss

Breakthrough research makes cancer-fighting viral agent more effective

Related Stories

Data dashboard highlights COVID-19 demographics and time trends

Mobility data used to respond to COVID-19 can leave out older and non-white people

Attitudes toward vaccination more favorable after J&J vaccine pause

How to use statistics to prepare for the next pandemic

Greater than 60 percent receiving COVID-19 vaccine in U.S. are non-Hispanic whites

Black healthcare workers at highest risk of contracting COVID-19

Recommended for you

Scientists developing a monoclonal antibody to neutralize Nipah virus one of the deadliest zoonotic pathogens

Gene therapy halts progression of rare genetic condition in young boy

Loss of salt and body fluid stimulates kidney regeneration in mice

Chemical conjugation mitigates immunotoxicity of chemotherapy of lipid nanoparticles

New predictors of metastasis in patients with early-stage pancreatic cancer

New study sheds light on potassium channels to help researchers design better drugs

Newsletter sign up

Donate and enjoy an ad-free experience