Researchers have developed a computational framework for standardizing neuroscience data worldwide

December 19, 2014 by Linda Vu
Credit: Wikimedia Commons

Thanks to standardized image file formats—like JPEG, PNG or TIFF—which store information every time you take a digital photo, you can easily share selfies and other pictures with anybody connected to a computer, mobile phone or the Internet. Nobody needs to download any special software to see your picture.

But in many science fields—like neuroscience—sharing data isn't that simple because no standard data format exists. So in November 2014, the Neurodata without Borders initiative—which is supported by the Kavli Foundation, GE, Janelia Farm, Allen Institute for Brain Science and the International Neuroinformatics Coordinating Facility (INCF)—hosted a hackathon to consolidate ideas for designing and implementing a standard neuroscience file format. And BrainFormat, a neuroscience data standardization framework developed at the Lawrence Berkeley National Laboratory (Berkeley Lab), is among the candidates selected for further investigation. It is now a strong contender to contribute to and develop a community-wide data format and storage standard for the neuroscience research community. BrainFormat is free to use, and can be downloaded here.

"This issue of standardizing data formats and sharing files isn't unique to neuroscience. Many science areas, including the global climate community, have grappled with this," says Oliver Ruebel, Berkeley Lab Computational Scientist who developed BrainFormat. "Sharing data allows researchers to do larger, more comprehensive studies. This in-turn increases confidence in scientific results and ultimately leads to breakthroughs."

In conjunction with this work, Berkeley Lab's National Energy Research Scientific Computing Center (NERSC) is also working with Jeff Teeters and Fritz Sommer of the Redwood Center for Theoretical Neuroscience at UC Berkeley on the Collaborative Research Computational Neuroscience (CRCNS) data-sharing portal, which will allow neuroscience researchers worldwide to easily share files without having to download any special software.

Both BrainFormat and CRCNS are being developed as part of a tri-institutional partnership between Berkeley Lab, UC Berkeley and UC San Francisco (UCSF). The computational tools could also help facilitate the White House's Brain Research through Advancing Innovative Neurotechnologies (BRAIN) Initiative.

Dealing With the Deluge of Brain Data

In 2013, President Barack Obama challenged the neuroscience community to gain fundamental insights into how the mind develops and functions, and discover new ways to address brain diseases and trauma. He called this the BRAIN Initiative.

This work is expected to generate a deluge of data for the neuroscience community. After all, measuring activity from a fraction of neurons in the brain of a single mouse could generate almost as much data as the Hadron Collider, which is 17-miles in circumference. So before researchers can even begin taking measurements, they must first develop a standard format for labeling and organizing data, sharing files, and scaling up analytical and visualization methods and software to handle massive amounts of information.

"Neuroscience is currently a field of individual principle investigators, doing individual experiments, and analyzing that data on customized software. This means that data is stored in many different formats and described in different ways, which hinders community access to data," says Kristofer Bouchard, a neuroscientist at Berkeley Lab. "As data volumes grow, we are going to need more people to look at the same data in different ways."

Berkeley Lab is actively seeking ways to expand its contribution to the BRAIN Initiative, and as a scientist in the Computational Research Division (CRD) Ruebel is familiar with helping scientists from a variety of disciplines organize, store, access, analyze, share and massive complex datasets.

To come up with a convention for labeling, organizing, storing and accessing neuroscience data, Ruebel worked closely with Bouchard for applications from UCSF neurosurgeon Edward Chang and Berkeley Lab physicist Peter Denes to design BrainFormat using open source Hierarchical Data Format (HDF) technologies. Over the last 15 years, HDF has helped a variety of scientific disciplines organize and share their data. One prominent user of HDF is NASA's Earth Observing System, the primary data repository for understanding global climate change.

In addition to data format standardization, HDF is also optimized to run on supercomputers. So by building BrainFormat on this technology, neuroscientists will be able to use supercomputers to process and analyze their massive datasets.

"This work really highlights the unique strength of a Berkeley Lab, UC Berkeley and UCSF partnership," says Denes. "UCSF is renowned for its clinical and experimental neuroscience experience with in vivo cortical electrophysiology; UC Berkeley contributes world-class expertise in theoretical neuroscience, statistical learning and data analysis; and Berkeley Lab brings supercomputing and applied mathematics expertise together with electronics and micro- and nano-fabrication."

Denes heads Berkeley Lab's contingent of the tri-institutional partnership to develop instrumentation and computational methods for recording neuroscience data. In addition to developing tools to deal with the data deluge, the BRAIN Initiative is also going to require new hardware to collect more data at higher-resolution, and process it in real-time. Researchers will also need novel algorithms for analyzing . The tri-institutional partnership is also leveraging tools and expertise from different areas of science to tackle these challenges as well.

"Berkeley Lab's strength has always been in science of scale," says Prabhat, Berkeley Lab computational scientist. "Over the years, many science areas have struggled with issues of file format standardization, as well as managing and sharing massive datasets, and our staff built similar infrastructures for them. This isn't a new problem, with BrainFormat and the CRCNS portal we've just extended these solutions to the field of ."

Explore further: Research institutions announce collaboration for sharing, standardizing neuroscience data

Related Stories

Research institutions announce collaboration for sharing, standardizing neuroscience data

August 4, 2014
The Allen Institute for Brain Science, California Institute of Technology, New York University School of Medicine, the Howard Hughes Medical Institute (HHMI) and the University of California, Berkeley (UC Berkeley) are collaborating ...

A new mathematics for experimental science

April 1, 2014
Mathematics is the ultimate scientific tool. For centuries it has been used to describe the forces of nature, from planetary motion to fluid dynamics. It helped unlock the secrets of DNA and unleashed the Digital Revolution. ...

Brain visualization prototype holds promise for precision medicine

June 3, 2013
The ability to combine all of a patient's neurological test results into one detailed, interactive "brain map" could help doctors diagnose and tailor treatment for a range of neurological disorders, from autism to epilepsy. ...

Where will big neuroscience take us?

November 17, 2014
We're entering the era of big neuroscience. In a little over a year, the United States, Europe, Japan and Israel have launched brain research projects with big budgets and bold ambitions. Several other countries are expected ...

Recommended for you

Discovery deepens understanding of brain's sensory circuitry

December 12, 2017
Because they provide an exemplary physiological model of how the mammalian brain receives sensory information, neural structures called "mouse whisker barrels" have been the subject of study by neuroscientists around the ...

Stuttering: Stop signals in the brain disturb speech flow

December 12, 2017
One per cent of adults and five per cent of children are unable to achieve what most of us take for granted—speaking fluently. Instead, they struggle with words, often repeating the beginning of a word, for example "G-g-g-g-g-ood ...

Intermittent fasting found to increase cognitive functions in mice

December 12, 2017
(Medical Xpress)—The Daily Mail spoke with the leader of a team of researchers with the National Institute on Aging in the U.S. and reports that they have found that putting mice on a diet consisting of eating nothing every ...

Neuroscientists show deep brain waves occur more often during navigation and memory formation

December 12, 2017
UCLA neuroscientists are the first to show that rhythmic waves in the brain called theta oscillations happen more often when someone is navigating an unfamiliar environment, and that the more quickly a person moves, the more ...

How Zika virus induces congenital microcephaly

December 12, 2017
Epidemiological studies show that in utero fetal infection with the Zika virus (ZIKV) may lead to microcephaly, an irreversible congenital malformation of the brain characterized by an incomplete development of the cerebral ...

Presurgical imaging may predict whether epilepsy surgery will work

December 11, 2017
Surgery to remove a part of the brain to give relief to patients with epilepsy doesn't always result in complete seizure relief, but statisticians at Rice University have developed a method for integrating neuroimaging scans ...

1 comment

Adjust slider to filter visible comments by rank

Display comments: newest first

RobertKarlStonjek
not rated yet Dec 19, 2014
The software does not run on a standard PC. First you must install 'Python'.

Download Python for all operating systems (21 different operating systems supported) for free from here: https://www.pytho...wnloads/

This is an extremely complex programmer's environment that is way of the head of the average neuroscientist (and I know quite a few of them).

If you are not familiar with programming in python then you have no hope of viewing any images.

It is quite unbelievable that anyone would think they have standardised a format and then not bother to make it easily accessible. They add programming lines to their explanation of the 'easy' installation' but don't mention where you type in those lines or under which option.

I haven't seen DOS command lines in quite some time ~

This is a total fail...what the hell were they thinking???

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.