Award details

Computational identification of protein-protein interactions

ReferenceBB/H006818/1
Principal Investigator / Supervisor Professor Simon Lovell
Co-Investigators /
Co-Supervisors
Dr Simon Whelan
Institution The University of Manchester
DepartmentLife Sciences
Funding typeResearch
Value (£) 317,990
StatusCompleted
TypeResearch Grant
Start date 01/10/2010
End date 14/02/2014
Duration41 months

Abstract

We propose to study the fundamental biological processes of specificity of binding in protein-protein interactions; these processes underpin systems biology and cellular interaction networks. We will deliver a new computational tool for testing for the presence of an interaction between a pairs of protein sequences by examining their sequence alignments. We will develop a sophisticated computational approach, grounded in the statistical models used in phylogenetics, to identify residues involved in protein-protein interactions. Recent research has shown that coevolution across protein interfaces is a powerful predictor of protein-protein interactions when applied at the residue level, but not over whole sequences. However, relatively little research has been performed on site-specific approaches for inferring interactions, and there are plenty of opportunities for improving their power. In common with nearly all previous phylogenetic research, a more realistic substitution model, coupled with careful development of heuristics, will substantially improve inference. We will develop an intermolecular coevolution model based on empirical observations about what occurs at protein binding interfaces. By relaxing the assumption of independence of evolution at differing sites this will allow us to account accurately for coevolution and create a predictive model specifically applicable at the residue level. We will combine our new methodology with appropriate heuristics to investigate the set of interactions occurring in yeast, and compare it to what has been inferred by experimental high-throughput approaches. Our results will be used to formulate hypotheses that will be tested in the laboratory by our collaborators.

Summary

Proteins are extremely important biological molecules. In addition to numerous vital structural roles, they are responsible for the majority of active biochemical functions and molecular processes within living cells. Nearly all proteins work as components of a biological system by binding other molecules, and most function in concert with others, as 'molecular machines' or in elegant 'production lines', such as signalling pathways, to carry out complex biological functions. These protein interactions are also important in combating foreign proteins, such as from a viral infection. Approximately 60% of proteins take part in some kind of protein assembly or 'complex'. These protein complexes play a role in the majority of cellular processes, and modern biology is now able to build the connecting parts list of cellular protein interactions via genomic and post-genomic science. However, in the majority of cases, we don't understand how the various protein specifically recognise their specific partners. What we do know is that in order to form complexes, individual proteins must make contact with ('bind') a limited number of specific partners. It is the rules that control this 'specificity' for binding that we propose to investigate. Binding in complexes is the result of specific contacts in the context of proteins' three-dimensional structures. We propose to determine the key regions for binding (termed 'interfaces'), distinguish them from non-binding regions. The strength of inferred interactions within the interface regions may help determine which amino acids are most important for binding. To achieve our goal of computationally identifying protein binding interfaces, we propose to develop sophisticated computational methods that describe how evolution at interfaces differs from that occurring at non-interacting site on proteins. These models will look for correlations in evolution at specific sites. We will examine sequence data taken from a range of interacting and non-interacting proteins to develop our a sophisticated and rigorous model to explain this evolutionary process. By iteratively improving and simplifying this substitution model we will progressively improve our ability to discriminate between interacting and non-interacting positions, enabling us to better identify both interacting proteins and the specific interfaces by which they interact. The resultant model will provide a powerful new computational tool for studying biological systems, which until now has been lacking in the field. By using phylogenetic methods that are founded on established statistical methodology, we will bring a new degree of rigour to this type of analysis and make the best possible use of information held within our sequences. We will apply the tool to investigate interaction networks in yeast, identifying new potential interactions and to identify errors in experimental methods. We will work with experimental collaborators to confirm these computational inferences and further improve our models.

Impact Summary

The proposed research is basic science and its outcomes would deliver a powerful new computational tool for inferring protein binding sites and protein-protein interactions. In common with all basic science, the end beneficiaries outside the academic sector cannot be easily predicted. However, the relevance of the proposal to current BBSRC strategies suggests that our proposal lies in an area that is likely to see substantial growth in the near future. Our proposed research could result in a crucial tool in systems biology, structural biology, and comparative genomics. Research from these fields is already being picked up by the private sector for use in the biotechnology and pharmaceutical sectors, and has potential for adding value to commercial items conceived or created in the UK. The main benefit for the public and third sector is the intellectual insight and that the application of our tools may help guide future developments and policy. Our Impact Plan details how we will communicate the ideas of our research to a wide and appropriate base of people. Between them, SW and SL have extensive experience of disseminating their work through standard academic channels, such as presentation at conferences and publication of research paper, and by working with the private sector (Pfizer), the popular media, and the internet.
Committee Research Committee C (Genes, development and STEM approaches to biology)
Research TopicsSystems Biology, Technology and Methods Development
Research PrioritySystems Approach to Biological research, Technology Development for the Biosciences
Research Initiative X - not in an Initiative
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file