BBSRC Portfolio Analyser
Award details
Enhancements to ProFit
Reference
BB/F015836/1
Principal Investigator / Supervisor
Professor Andrew Martin
Co-Investigators /
Co-Supervisors
Institution
University College London
Department
Structural Molecular Biology
Funding type
Research
Value (£)
98,021
Status
Completed
Type
Research Grant
Start date
01/03/2008
End date
31/03/2009
Duration
13 months
Abstract
Least squares fitting of two or more protein or RNA structures is a fundamental technique to examine structural similarity and variability. ProFit implements McLachlan least squares fitting which overlaps the centres of geometry and performs a modified Conjugate Gradients minimisation of the RMSd through rotational optimization. The code consists of ~16,000 lines of C (of which ~400 lines are the actual fitting code) written by the applicant and the program has >4000 users. The program began life in the late 1980s as FIT and was redesigned in the early 1990s. Over the last 15 years, the program has evolved significantly adding new facilities. These include two methods to create atom equivalences for the user. First, it provides a Needleman and Wunsch sequence alignment with fitting zones being derived from equivalent residues in the sequence alignment. Second it provides an iterative updating procedure; dynamic programming is used iteratively to update an optimal set of equivalent C-alpha atoms. Another recently added facility is fitting of multiple structures. This is an interative procedure in which an averaged structure is generated in each iteration. The primary aim of the work proposed here is to implement a large number of enhancements requested by users. These include removing restrictions relating to multi-chain proteins (currently, sequence alignment and iterative updating of fitting zones can only be used with single-chain proteins). Currently the software is distributed as source code and as a Linux binary. We will provide an official Windows binary and will implement a web-based interface to the program using AJAX. These enhancements will enable the use of the program by Biologists without the ability or desire to install ProFit under Linux, or compile it themselves for Windows. In addition we will provide a Subversion repository and bug tracking using Trac to make the program fully open source and encourage communinity involvement.
Summary
This project will make a number of enhancements to the computer software known as ProFit previously written by the applicant for comparing sets of protein or RNA coordinates. These enhancements are largely requested by current users and will enable additional applications and research. The optimal 'fitting' of two or more sets of protein or RNA coordinates is a fundamental requirement for examining structural similarity and variability. This fitting is done when predicting the three-dimensional structure of proteins based on other known structures and in evaluating the performance of modelling methods. It is also used in studying the variability and flexibility of protein and RNA structures. This is useful when a single molecule is flexible and when one wishes to examine the effects of mutations or different bound ligands on protein structure. Fitting is also used to examine the variability of related protein structures to study evolution or protein structures. ProFit is a very widely used program for fitting two or more protein or RNA structures. A set of equivalent atoms in the structures must be specified, but ProFit also allows the equivalent atoms to be optimized automatically. Once the structures have been fitted, a simple statistic known as the 'root mean square deviation' (RMSd) is calculated to give an indication of the similarity. The work proposed in this project will provide a number of enhancements to the program. Most importantly, a number of additions requested by users will be implemented which include removing restrictions when dealing with structures having more than one poly-peptide chain. This is becoming particularly important as crystallographers are now able to solve enormously complex structures. We will also perform some internal cleanup of the code to make it more maintainable and will provide a Windows version and a web-based graphical interface to improve the usability of the software for Biologists.
Committee
Closed Committee - Engineering & Biological Systems (EBS)
Research Topics
Structural Biology, Technology and Methods Development
Research Priority
X – Research Priority information not available
Research Initiative
Tools and Resources Development Fund (TRDF) [2006-2015]
Funding Scheme
X – not Funded via a specific Funding Scheme
I accept the
terms and conditions of use
(opens in new window)
export PDF file
back to list
new search