Award details

Predicting properties of biological networks from noisy and incomplete data

ReferenceBB/E01612X/1
Principal Investigator / Supervisor Professor Michael Stumpf
Co-Investigators /
Co-Supervisors
Institution Imperial College London
DepartmentLife Sciences
Funding typeResearch
Value (£) 301,860
StatusCompleted
TypeResearch Grant
Start date 01/09/2007
End date 31/08/2010
Duration36 months

Abstract

Biological networks have received great attention in the context of systems biology but the poor quality of the data and the fact that networks are far from complete poses severe limitations on their usefulness in their present state. This project will build on recent theoretical developments in random graph theory and statistics to overcome these limitations. Simulation models for the different experimental procedures used to map out protein-protein interactions will allow us to understand the causes of noise and its effect on analyses of network data. With this better understanding we will then be able to generate predictive models, which allow us to infer properties of the true network from partial and incomplete network data. In particular we will use multi-model inference in likelihood and Bayesian settings in order to infer properties of the true network from partial data. We have already been able to demonstrate the usefulness of such approaches in pilot studies. These tools will then be applied to real-world biological networks in order to predict structural and functional properties of the global protein interaction networks in a range of species for which suitable data is available. Finally we will explore the scope of more realistic network models which allow for changes in the network structure (with time or in response to some stimulus) as well as different interaction strengths.

Summary

Networks aim to put interactions and dependencies among different objects (or agents) into a single coherent context. Their analysis has attracted great attention in different scientific disciplines because they offer a pictorial representation of complex phenomena, and they frequently also allow a detailed mathematical analysis of these phenomena. Unfortunately, observed networks are often very different from the true network because we cannot measure all interactions reliably. Moreover frequently only some small part of the network is considered. Both factors affect our ability to interpret network data reliably. This is especially true for many biological network datasets. The applicants group has developed a range of mathematical tools that allow us to study the effects these sources or error have on our analysis, and to overcome the limitations imposed by them to some extent. In the proposed research we will adapt these mathematical methods so that they can be applied to biological networks, in particular protein-interaction network data. This will involve the formulation of detailed models of the different experimental methods used to obtain protein interaction data. By simulating the experiment we can study the effects (and causes) of error in detail and use this to gain insights into the reliability of different datasets. With this better understanding of the effects of noise and incompleteness on experimental datasets we can then try to predict properties of the true (but partially unobserved) network. We will use this to predict the size of interaction network in different species: it is now known that the number of genes does not correlate well with our understanding of the relative complexity of different organisms (for example the number of human genes is less than twice the number of genes in the fruitfly). The statistical prediction procedures to be developed in the course of the proposed research will allow us to infer the sizes of the interaction networks in different species and will therefore enable us to see if the complexity of the network could help to explain the differences in biological complexity between different species. Finally, we will study new and more realistic models for protein interaction networks.
Committee Closed Committee - Engineering & Biological Systems (EBS)
Research TopicsSystems Biology, Technology and Methods Development
Research PriorityX – Research Priority information not available
Research Initiative X - not in an Initiative
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file