BBSRC Portfolio Analyser
Award details
Predicting properties of biological networks from noisy and incomplete data
Reference
BB/E01612X/1
Principal Investigator / Supervisor
Professor Michael Stumpf
Co-Investigators /
Co-Supervisors
Institution
Imperial College London
Department
Life Sciences
Funding type
Research
Value (£)
301,860
Status
Completed
Type
Research Grant
Start date
01/09/2007
End date
31/08/2010
Duration
36 months
Abstract
Biological networks have received great attention in the context of systems biology but the poor quality of the data and the fact that networks are far from complete poses severe limitations on their usefulness in their present state. This project will build on recent theoretical developments in random graph theory and statistics to overcome these limitations. Simulation models for the different experimental procedures used to map out protein-protein interactions will allow us to understand the causes of noise and its effect on analyses of network data. With this better understanding we will then be able to generate predictive models, which allow us to infer properties of the true network from partial and incomplete network data. In particular we will use multi-model inference in likelihood and Bayesian settings in order to infer properties of the true network from partial data. We have already been able to demonstrate the usefulness of such approaches in pilot studies. These tools will then be applied to real-world biological networks in order to predict structural and functional properties of the global protein interaction networks in a range of species for which suitable data is available. Finally we will explore the scope of more realistic network models which allow for changes in the network structure (with time or in response to some stimulus) as well as different interaction strengths.
Summary
Networks aim to put interactions and dependencies among different objects (or agents) into a single coherent context. Their analysis has attracted great attention in different scientific disciplines because they offer a pictorial representation of complex phenomena, and they frequently also allow a detailed mathematical analysis of these phenomena. Unfortunately, observed networks are often very different from the true network because we cannot measure all interactions reliably. Moreover frequently only some small part of the network is considered. Both factors affect our ability to interpret network data reliably. This is especially true for many biological network datasets. The applicants group has developed a range of mathematical tools that allow us to study the effects these sources or error have on our analysis, and to overcome the limitations imposed by them to some extent. In the proposed research we will adapt these mathematical methods so that they can be applied to biological networks, in particular protein-interaction network data. This will involve the formulation of detailed models of the different experimental methods used to obtain protein interaction data. By simulating the experiment we can study the effects (and causes) of error in detail and use this to gain insights into the reliability of different datasets. With this better understanding of the effects of noise and incompleteness on experimental datasets we can then try to predict properties of the true (but partially unobserved) network. We will use this to predict the size of interaction network in different species: it is now known that the number of genes does not correlate well with our understanding of the relative complexity of different organisms (for example the number of human genes is less than twice the number of genes in the fruitfly). The statistical prediction procedures to be developed in the course of the proposed research will allow us to infer the sizes of the interaction networks in different species and will therefore enable us to see if the complexity of the network could help to explain the differences in biological complexity between different species. Finally, we will study new and more realistic models for protein interaction networks.
Committee
Closed Committee - Engineering & Biological Systems (EBS)
Research Topics
Systems Biology, Technology and Methods Development
Research Priority
X – Research Priority information not available
Research Initiative
X - not in an Initiative
Funding Scheme
X – not Funded via a specific Funding Scheme
I accept the
terms and conditions of use
(opens in new window)
export PDF file
back to list
new search