Award details

A novel and rapid approach to predict protein structure

ReferenceBB/G003912/1
Principal Investigator / Supervisor Professor Michael Sternberg
Co-Investigators /
Co-Supervisors
Institution Imperial College London
DepartmentLife Sciences
Funding typeResearch
Value (£) 322,264
StatusCompleted
TypeResearch Grant
Start date 01/09/2008
End date 30/11/2011
Duration39 months

Abstract

We have developed a novel and rapid approach (Poing) to template-free protein structure prediction. The approach is a coarse-grained simplified classical dynamics simulation that is orders of magnitude faster than existing approaches (fragment-based folding and all-atom molecular dynamics). This enables the technique to be applied to larger proteins. Poing represents a protein structure in the Levitt & Warshel backbone-plus-sidechain model, and explicitly simulates effects known to be important in protein folding as a network of spring-like forces. Our pilot study results show that state-of-the-art template-free predictions can be obtained in cpu hours. In this grant we will develop and disseminate Poing. We will take part in the international blind trial of prediction (CASP). Key steps are: * Months 1 / 5: CASP8 participation. * Months 1 / 24: Analysis, development and improvement of predicted structures. By minimising the number of incorrect structures produced by our model whilst maintaining the stability of the native state, we will increase the effectiveness of clustering in picking correct models from the time-series samples produced, and therefore improve the predictive accuracy of the model. Current shortcomings to be addressed include: non-native internal voids within structures; incorrect tessellation of sidechains; and non-native topological features. * Months 18 / 24: Public deployment via web server and open source dissemination. * Months 22 / 29: CASP 9 participation. * Months 25 / 36: Development of model selection and integration with complementary, fragment-based approaches.

Summary

IMPORTANCE OF KNOWLEDGE ABOUT PROTEIN STRUCTURE Proteins are molecular machines which carry out most of the basic functions of an organism. They are made of chains of smaller molecules called amino acids. There are twenty types of amino acid, and the precise sequence of amino acids determines the shape and function of the protein. A protein is a large molecule, and in water it folds into a globular structure. The amino acids interact with each other in specific ways. It is important for us to know the shape of a protein as this provides insight into its function and can help in the design of experiments. Knowledge of the structure of a protein can be the starting point for the systematic design of novel regulators of activity such as drugs and agricultural agents. PROTEIN STRUCTURE PREDICTION It is slow, expensive and difficult to find out the structure of a protein directly. However, we now have the DNA sequences for many important organisms, including humans, and we generally can get protein sequences from DNA sequences. We know that the structure of a protein depends entirely on the sequence of its amino acids. Thus we can try to predict the structure of a protein from its sequence. Many successful prediction methods use similarities between the sequence for an unknown structure and the sequence for a known structure - . known as template-based modelling, But what if no such similarity can be found? There are two main methods that are yielding useful predictions today. One, fragment folding, tries to make a structure out of little fragments of other structures. This has been the most successful of the template-free methods in the last few years and has about 50% success rate. It requires high performance computing (up to years of cpu time per prediction). Another method, molecular dynamics, simulates the interactions between the atoms in the protein. Although this approach has provided useful predictions for the very smallest of proteins, it requires a computation time of many years on a single processor. OUR APPROACH We have developed with a new method, called poing, which aims to solve some of the problems with these other methods. We base our approach on a highly simplified model, introduced in the mid 70s, representing the protein as a ball-and-spring model. Each amino acid is represented by just two balls, less than a tenth the number that is used in molecular dynamics. This makes poing very fast. The springs between the balls are modelled using heuristics to represent specific effects which are known to be important in how a protein folds. Our preliminary results show that our approach can yield useful predictions with a run time of 20 hours on a single cpu. THIS PROPOSAL We propose to develop the new model to make it more accurate at predicting structures. We will also take part in a regular protein structure prediction experiment, where different prediction methods are tested on new proteins, and then compared with each other. We will also make our software available to the community via a public web server and by allowing others freely to obtain copies of it to change and run on their own computers. All this work will take three years.
Committee Closed Committee - Biomolecular Sciences (BMS)
Research TopicsStructural Biology, Technology and Methods Development
Research PriorityX – Research Priority information not available
Research Initiative X - not in an Initiative
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file