Most comprehensive analysis of COVID-19 data reveals previously unattributed deaths

global pandemic
Credit: Pixabay/CC0 Public Domain

A study published in The Lancet Digital Health has used health data from 57 million people in England to build the most complete picture of the pandemic in a single country to date.

In this first-of-its-kind study, researchers from University College London (UCL) combined multiple NHS on national laboratory testing data, primary care consultations, hospitalizations and deaths to reveal the exact trajectory of individuals through the healthcare system during the pandemic, and what impact this had on their health outcomes.

The analysis uncovered 15,486 deaths that occurred within 28 days of a COVID-19 diagnosis but didn't list COVID-19 as being a cause of death. A further 10,884 COVID-19 diagnoses were identified from death records alone with no other related information recorded earlier in .

Researchers also found almost one-third of patients received ventilatory support outside of ICU departments, and that this was associated with the highest rates of death in waves one and two of the pandemic. The authors say this demonstrates the need for planning on how to scale ICU services in the event of future pandemics and healthcare emergencies.

Dr. Chris Tomlinson of UCL, the co-lead researcher of the study, said, "Understanding the impact of COVID-19 requires consideration of how the infection varies in severity and time course—from asymptomatic to cases that are unfortunately fatal.

"These different clinical presentations are captured in a patient's digital records, but across multiple, and often unconnected organizations—including public health bodies, GP surgeries, hospitals, intensive care units and death registries. Analyzing all this data on the scale of an entire population presents a real challenge.

"In this study, we bring together eight complementary and national-level datasets from across the NHS to create the most comprehensive analysis of COVID-19 events to date, with the aim of supporting policy decision-making for COVID-19 and future health crises."

For their analysis, researchers used anonymized patient data from multiple national NHS sources to identify patterns in how patients progressed through the healthcare system. Linking these to demographic factors like age, sex, and ethnicity allowed another layer of analysis. For example, those from non-white ethnicities had a shorter time between infection and , suggesting these groups may have been accessing testing facilities and healthcare later in their disease.

The research was conducted securely in a Trusted Research Environment by members of the CVD-COVID-UK consortium, a National Institute for Health Research (NIHR) and British Heart Foundation (BHF) flagship project led by the BHF Data Science Centre, part of Health Data Research UK.

Professor Cathie Sudlow, Director of the BHF Data Science Centre, said, "Rapid and reliable access to health data has been essential throughout the pandemic. Until now, this data has been locked away in siloed organizations where it is almost impossible to analyze in harmony.

"The BHF Data Science Centre's CVD-COVID-UK consortium is now working to provide trusted researchers rapid access to multiple, linked datasets from across the NHS. By collaborating with research teams like this one who are developing new approaches to analyze these data sets, we're paving the way towards a new future of using health data to improve people's lives."

Professor Spiros Denaxas of UCL, an author on the study, said, "By linking electronic health records on a national scale we were able to identify patterns and patient trajectories in the pandemic which would have otherwise remain hidden in smaller datasets. On-going, secure access to the excellent data the NHS holds is essential for performing high quality health data research and improve patients' health and healthcare."

The researchers note that although they present patterns throughout the , their focus was to analyze COVID-19 related characteristics, rather than causal relationships. The findings are important for identifying potential NHS pinch points and informing future policies.

Dr. Johan Thygesen of UCL, who co-led the study, said, "This work has already enabled other research with highly relevant public health implications, like assessing the blood clotting risks of COVID-19 vaccines. By fully sharing our methods and code, we believe this research has the potential to unlock the power of linked health data for not just future COVID-19 outbreaks, but all kinds of complex health conditions."

More information: COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records, The Lancet Digital Health (2022). … (22)00091-7/fulltext

Provided by Health Data Research UK
Citation: Most comprehensive analysis of COVID-19 data reveals previously unattributed deaths (2022, June 8) retrieved 13 April 2024 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Scientists identify characteristics to better define long COVID


Feedback to editors