Data mining opens the door to predictive neuroscience
The discovery, using state-of-the-art informatics tools, increases the likelihood that it will be possible to predict much of the fundamental structure and function of the brain without having to measure every aspect of it. That in turn makes the Holy Grail of modelling the brain in silico -- the goal of the proposed Human Brain Project -- a more realistic, less Herculean, prospect.
"It is the door that opens to a world of predictive biology," says Henry Markram, the senior author on the study, which is published this week in PLoS ONE.
Within a cortical column, the basic processing unit of the mammalian brain, there are roughly 300 different neuronal types. These types are defined both by their anatomical structure and by their electrical properties, and their electrical properties are in turn defined by the combination of ion channels they presentthe tiny pores in their cell membranes through which electrical current passes, which make communication between neurons possible.
Scientists would like to be able to predict, based on a minimal set of experimental data, which combination of ion channels a neuron presents. They know that genes are often expressed together, perhaps because two genes share a common promoterthe stretch of DNA that allows a gene to be transcribed and, ultimately, translated into a functioning proteinor because one gene modifies the activity of another. The expression of certain gene combinations is therefore informative about a neuron's characteristics, and Georges Khazen and co-workers hypothesised that they could extract rules from gene expression patterns to predict those characteristics.
They took a dataset that Prof Markram and others had collected a few years ago, in which they recorded the expression of 26 genes encoding ion channels in different neuronal types from the rat brain. They also had data classifying those types according to a neuron's morphology, its electrophysiological properties and its position within the six, anatomically distinct layers of the cortex. They found that, based on the classification data alone, they could predict those previously measured ion channel patterns with 78 per cent accuracy. And when they added in a subset of data about the ion channels to the classification data, as input to their data-mining programme, they were able to boost that accuracy to 87 per cent for the more commonly occurring neuronal types.
"This shows that it is possible to mine rules from a subset of data and use them to complete the dataset informatically," says one of the study's authors, Felix Schürmann. "Using the methods we have developed, it may not be necessary to measure every single aspect of the behaviour you're interested in." Once the rules have been validated in similar but independently collected datasets, for example, they could be used to predict the entire complement of ion channels presented by a given neuron, based simply on data about that neuron's morphology, its electrical behaviour and a few key genes that it expresses.
Researchers could also use such rules to explore the roles of different genes in regulating transcription processes. And importantly, if rules exist for ion channels, they are also likely to exist for other aspects of brain organisation. For example, the researchers believe it will be possible to predict where synapses are likely to form in neuronal networks, based on information about the ratio of neuronal types in that network. Knowledge of such rules could therefore usher in a new era of predictive biology, and accelerate progress towards understanding and modelling the brain.