Scientists explore safeguards for genomic data privacy

March 1, 2014 by Nancy Owano, Medical Xpress weblog

By now the general public has become aware that mobile phone applications, bank security systems and credit card databases are not immune to vulnerabilities; information thefts happen. Some computer scientists now say it's time to recognize that vulnerabilities in genetic databases need recognition too. Are anonymous genetic profiles truly anonymous? Is data de-identifying technically feasible for genetic data?

People having their genomes mapped for research or for private information would prefer to assume the data will be anonymous. After all, as an article in Inside Science points out, a person's genome carries information about inherited diseases and physical traits, stored in strands of DNA. "The consequences of being able to search, cross-reference, and analyze this information are profound." The solution is not to stop the research but to find ways where research can continue without privacy abuse.

At a February 16 symposium at the American Association for the Advancement of Science. (AAAS) Annual Meeting, a panel of experts addressed tensions between the need for genetic privacy and the medical community's interest in genetic data. Yaniv Erlich, a fellow at the Whitehead Institute for Biomedical Research, has been a key voice in discussing the genetic privacy issue. At the February meeting, he talked about routes by which genetic data can be breached..

Also, in a review submitted October last year, "Routes for Breaching and Protecting Genetic Privacy," Erlich and Arvind Narayanan, an assistant professor in the Department of Computer Science at Princeton, called attention to the topic of of genetic information.

"We are entering the era of ubiquitous genetic information for research, clinical care, and personal curiosity," they wrote. "Sharing these datasets is vital for rapid progress in understanding the genetic basis of human diseases. However, one growing concern is the ability to protect the genetic privacy of the data originators. Here, we technically map threats to genetic privacy and discuss potential mitigation strategies for privacy-preserving dissemination of ."

(Narayanan, elsewhere, in talking about his doctoral research on problems with data anonymization, said his thesis, "in a sentence, is that the level of anonymity that consumers expect—and companies claim to provide—in published or outsourced databases is fundamentally unrealizable.")

The two authors suggested that privacy by design algorithms include access control as well as differential privacy and cryptographic techniques. "So far," they said, "data custodians of mainly adopted access control as a mitigation strategy." They added that new developments in cryptographic techniques "may usher in an additional arsenal of security by design techniques."

In January 2013, Erlich took part in a study, "Identifying Personal Genomes by Surname Inference," published in Science, that clearly demonstrated a potential for breaches of privacy in genomics studies. The authors of that study are Melissa Gymrek, Amy L. McGuire, David Golan, Eran Halperin and Erlich. They showed how identities of volunteers who donate personal genome sequence data for research may be revealed merely on the basis of publicly available information. The researchers recovered the identities of nearly 50 anonymous participants in the 1000 Genomes Project through free, publicly accessible Internet resources.

"Here, we report that surnames can be recovered from personal genomes by profiling short tandem repeats on the Y chromosome (Y-STRs) and querying recreational genetic genealogy databases. We show that a combination of a surname with other types of metadata, such as age and state, can be used to triangulate the identity of the target. A key feature of this technique is that it entirely relies on free, publicly accessible Internet resources."

The findings were shared with officials at the National Human Genome Research Institute (NHGRI) and National Institute of General Medical Sciences (NIGMS). In response, NIGMS and NHGRI moved certain demographic information from the publicly-accessible portion of the NIGMS cell repository to help reduce the risk of future breaches.

Explore further: Researchers expose new vulnerabilities in the security of personal genetic information

More information: Science 18 January 2013: Vol. 339 no. 6117 pp. 321-324 DOI: 10.1126/science.1229566

Related Stories

Researchers expose new vulnerabilities in the security of personal genetic information

January 17, 2013
Using only a computer, an Internet connection, and publicly accessible online resources, a team of Whitehead Institute researchers has been able to identify nearly 50 individuals who had submitted personal genetic material ...

A privacy risk in your DNA: New policies are needed to safeguard participants' identity in genetic studies

February 7, 2013
The growing ease of DNA sequencing has led to enormous advancements in the scientific field. Through extensive networked databases, researchers can access genetic information to gain valuable knowledge about causative and ...

Crowd-sourcing genetic data could help unravel the causes of disease

July 3, 2013
Earlier this month, researchers and advocates from 40 countries formed a global alliance to enable the secure sharing of genomic and clinical data, aiming to end the era in which only the people who collected your genetic ...

Personal Genome Project Canada launches

December 10, 2012
The Personal Genome Project Canada (PGP-C) launches this week giving Canadians an unprecedented opportunity to participate in a groundbreaking research study about human genetics and health.

Recommended for you

RNA thought to spread cancer shows ability to suppress breast cancer metastasis

October 22, 2018
Researchers at The University of Texas MD Anderson Cancer Center have discovered that a form of RNA called metastasis-associated lung adenocarcinoma transcript 1 (MALAT1) appears to suppress breast cancer metastasis in mice, ...

New tool gives deeper understanding of glioblastoma

October 22, 2018
Researchers in the lab of Charles Danko at the Baker Institute for Animal Health have developed a new tool to study genetic "switches" active in glioblastoma tumors that drive growth of the cancer. In a new paper in Nature ...

Researchers find common genetic link in lung ailments

October 22, 2018
An international research team led by members of the University of Colorado School of Medicine faculty has identified a genetic connection between rheumatoid arthritis-associated interstitial lung disease and idiopathic pulmonary ...

A single missing gene leads to miscarriage

October 19, 2018
A single gene from the mother plays such a crucial role in the development of the placenta that its dysfunction leads to miscarriages. Researchers from the Medical Faculty of Ruhr-Universität Bochum (RUB) have observed this ...

Making gene therapy delivery safer and more efficient

October 18, 2018
Viral vectors used to deliver gene therapies undergo spontaneous changes during manufacturing which affects their structure and function, found researchers from the Perelman School of Medicine at the University of Pennsylvania ...

Student develops microfluidics device to help scientists identify early genetic markers of cancer

October 16, 2018
As anyone who has played "Where's Waldo" knows, searching for a single item in a landscape filled with a mélange of characters and objects can be a challenge. Chrissy O'Keefe, a Ph.D. student in the Department of Biomedical ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.