Anonymization remains a powerful approach to protecting the privacy of health information

December 8, 2011

De-identification of health data has been crucial for all types of health research, but recent articles in medical and scientific literature have suggested that de-identification methods do not sufficiently protect the identities of individuals and can be easily reversed. A recent review conducted by researchers at CHEO entitled "A Systematic Review of Re-identification Attacks on Health Data" and published in PLoS ONE, did not uncover evidence to support this. "If re-identification rates were as high as some of these articles suggest, it would be worrisome," says lead author, Dr. Khaled El-Emam. "But our review did not support these claims – there is no broad empirical support for a failure of anonymization."

Such a failure would have significant policy implications. For example, it may become necessary to obtain patient consent before data is released (a time-consuming undertaking), incentive to de-identify would decline, and the likelihood of breaches would increase. For this reason, Dr. El-Emam and his team conducted a review that set out to characterize known re-identification attacks on and compare them to attacks on other types of data, calculate the number of records correctly identified in these attacks, and assess whether the results indicate a weakness in current de-identification methods.

After identifying 14 relevant studies and analyzing them in detail, the group was unable to find convincing evidence that existing de-identification methods are not effective. Few of these attacks involved health data which is naturally protected more strenuously. Secondly, many of the attacks were on small databases with large confidence intervals around their success rates. Most importantly, the majority of re-identified data was not de-identified according to existing standards. "Of the 24 studies we examined, only six were attacks on data and only one of these was de-identified according to standards," Dr. El-Emam points out. "In that particular study, the proportion of correctly re-identified records was very low: about 0.013%."

In certain well-publicized re-identification attacks, adversaries were able to make use of such information as an individual's date of birth, gender, and residential zip code. Since these 3 features were not modified in any way, the database would not meet basic standards for de-identification. If anything, such a breach serves to underscore the importance of implementing existing de-identification standards.

Dr. El-Emam concludes by saying that in order to have a more accurate picture of the extent to which de-identification protects against real attacks, future research on re-identification attacks should focus on large databases that have been de-identified according to existing standards, and that success rates should be correlated with how well de-identification was performed. In the meantime, it is suggested that data custodians continue to de-identify using current best practices.

Explore further: Most Canadians can be uniquely identified from their date of birth and postal code

More information: Link to report: www.plosone.org/article/info%3 … journal.pone.0028071

Related Stories

Most Canadians can be uniquely identified from their date of birth and postal code

August 8, 2011
There are increasing pressures for health care providers to make individual-level data readily available for research and policy making. But Canadians are more likely to allow the sharing of their personal data if they believe ...

New report on creating clinical public use microdata files

September 15, 2011
The demand for transparency through publicly available healthcare data is on the rise. This is the case for administrative and clinical data for research, and for clinical trials data used to support new drug approvals. Broad ...

Recommended for you

Exploring the potential of human echolocation

June 25, 2017
People who are visually impaired will often use a cane to feel out their surroundings. With training and practice, people can learn to use the pitch, loudness and timbre of echoes from the cane or other sounds to navigate ...

Team eradicates hepatitis C in 10 patients following lifesaving transplants from infected donors

April 30, 2017
Ten patients at Penn Medicine have been cured of the Hepatitis C virus (HCV) following lifesaving kidney transplants from deceased donors who were infected with the disease. The findings point to new strategies for increasing ...

'bench to bedside to bench': Scientists call for closer basic-clinical collaborations

March 24, 2017
In the era of genome sequencing, it's time to update the old "bench-to-bedside" shorthand for how basic research discoveries inform clinical practice, researchers from The Jackson Laboratory (JAX), National Human Genome Research ...

The ethics of tracking athletes' biometric data

January 18, 2017
(Medical Xpress)—Whether it is a FitBit or a heart rate monitor, biometric technologies have become household devices. Professional sports leagues use some of the most technologically advanced biodata tracking systems to ...

Financial ties between researchers and drug industry linked to positive trial results

January 18, 2017
Financial ties between researchers and companies that make the drugs they are studying are independently associated with positive trial results, suggesting bias in the evidence base, concludes a study published by The BMJ ...

Best of Last Year – The top Medical Xpress articles of 2016

December 23, 2016
(Medical Xpress)—It was a big year for research involving overall health issues, starting with a team led by researchers at the UNC School of Medicine and the National Institutes of Health who unearthed more evidence that ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.