Award details

Using Mathematical Modelling to Deconstruct Transcription

ReferenceBB/S009035/1
Principal Investigator / Supervisor Professor Jane Mellor
Co-Investigators /
Co-Supervisors
Dr Andrew Angel, Professor Andre Furger
Institution University of Oxford
DepartmentBiochemistry
Funding typeResearch
Value (£) 676,318
StatusCompleted
TypeResearch Grant
Start date 01/03/2019
End date 30/09/2022
Duration43 months

Abstract

Mathematical modelling of complex processes such as transcription enable us to refine our understanding of genes and their regulatory mechanisms by describing the underlying processes that give rise to the shape of their transcriptional profiles. The data from techniques (NET-seq, GRO-seq, PRO-seq and TT-seq) for assessing nascent transcription produces a profile which is the sum of the various events during transcription: initiation, pausing, backtracking, early termination, elongation and termination. Each technique assesses different aspects of transcription giving distinct profiles for the same genes under the same conditions. An ideal situation would be to amalgamate all data types to give a more holistic view. A mathematical model is ideal for this approach as it can be constructed to account for each technique's strengths and weaknesses and find the underlying mechanisms that best fit all of the data so giving each of the events of transcription a relative importance at different genes and under different conditions. Using our own data, we have shown that the shape of these profiles changes when transcriptional regulatory factors are ablated or environmental conditions change, supporting our hypothesis that the dynamics of transcription giving rise to different profiles are an important component of gene regulation. In this work we will extend and refine our basic mathematical model of transcription, so that using the shape of a transcriptional profile we can interpret how individual genes are regulated, predict how this changes over time and with environmental change, and define more precisely the steps in transcription that are influenced by transcription factors, transcription elongation factors, termination factors and chromatin. This work aims to provide a step change in the understanding of how the regulation of gene expression goes far beyond the simple activation or repression of genes and can be facilitated during the process of transcription itself.

Summary

Transcription is the first step in the complex process of gene expression that brings the characteristics of a cell into being, by transcribing the DNA code of genes into RNA copies, transcripts, which will subsequently be translated into proteins, the workhorses of a cell. As such, transcription is a highly regulated process that when disrupted can cause disease. Thus, there is a real need to understand all aspects of transcriptional regulation and exactly where and how it can go wrong. The best-known regulators of transcription are the DNA binding transcription factors which were thought to act as on/off switches for transcription by RNA polymerase. However, recent data suggests that many transcription factors do not act as the decision point for simply switching a gene on or off but at various stages following the start of transcription, during transcription elongation. Indeed, we have recently shown that the amount of the accessory factors associated with RNA polymerase, known as transcription elongation factors, is determined by the DNA-bound transcription factors, and that this differs on individual genes and with environmental conditions. In order to explore the various stages of transcription, and the impact of transcription factors on these events, we have developed a mathematical model that is trained on experimental data and can determine which of the many stages of transcription changes with mutation or environmental variation. We have begun the process of extending the model by including further details of the transcriptional process and training it on experimental data from humans and yeast, so that it can be applied generally. The purpose of this work to develop the model and exploit its predictive power so that we can describe the key steps at which the transcription of a gene is regulated and how this is likely to change when conditions change. The results of our modelling will enable us to move to a better understanding of how transcription is disrupted when environmental conditions change, when organisms are stressed and in disease where it may well identify potential therapeutic targets. We intend that the model becomes widely dispersed in the academic community.

Impact Summary

Who will benefit and how will they benefit? Other Scientists We aim to release the model as analysis software for other groups to use and so be better able to analyse their transcription data. We will advertise the resource at International Transcription Meetings over the three year period of funding including the CSH Mechanisms of Eukaryotic Transcription Meeting in 2019 and 2021 the EMBL Transcription and Chromatin Meetings in 2020 and 2022. Both PIs will attend one or both of these meetings. In addition, our TT-seq and NET-seq data sets will be available to collaborators pre-publication and to the community at publication. A major aim is to increase the number of trained interdisciplinary scientists comfortable with modelling complex data sets, who will then be able to encourage further adoption of the approach and themselves supervise interdisciplinary science. This is something that will become ever more important as experimental techniques become more sophisticated and large volumes of genome-wide data are generated. To facilitate this, both PIs are actively involved in recruiting and training graduate students and post-docs to use bioinformatics and mathematical modelling in their day to day research and this will continue into the life of this grant. Societal Impact In addition to the fundamental direct scientific benefits of the proposed research, there are a number of indirect ways in which it could impact society. The most prominent of these is the potential for increased understanding of diseases such as Alzheimer's, Parkinson's, ALS, asthma and cardiovascular disease whose causes are thought to include impaired regulation of the transcriptional process. The proposed research could further benefit the diagnosis and treatment of such diseases. In terms of diagnosis, the improved ability to interpret transcriptional profiles will pave the way for personalised medicine in the form of the analysis of the transcriptional profiles from individual patients which could form a broad-spectrum diagnosis or identify specific subtypes of a disease. This will require technical advances with transcription mapping but recent reports suggest such approaches are feasible for laboratory cultured patient-derived cancer cells and will be developed for other cell types in the future. This sort of approach could be used to (i) detect transcriptional changes related to disease, (ii) see how transcription changes in response to disease or chemical agents in viable cell culture lines with possible applications as a safety test for drugs that are based on transcription factors for example the TFs themselves and small molecules that target them, (iii) extended to safety test drugs that target transcription elongation factors (TEFs) in addition to transcription factors (TFs) as both affect different aspects of transcription dynamics. These sorts of approach could give new insights into the mechanisms of action of new classes of compounds designed to target TEFs and TFs and also facilitate identification of the most suitable TFs or TEFs to target with target with small-molecule inhibitors. Mathematical modelling approaches are highly suitable for public engagement. Complex systems must be reduced to a minimal set of components and interactions for an effective model to be constructed, and this leads to the ability to present highly intricate molecular systems on a conceptual level. Additionally, simple stochastic models, of the kind to be employed in the proposed research, translate rather well into animations which can be highly engaging for both non-specialist and specialist audiences. Some animations will be produced in-house and we will utilise the local Oxford Sparks service to produce a professional animation from our research for use in public lectures and displays at a cost of £9000.
Committee Research Committee C (Genes, development and STEM approaches to biology)
Research TopicsSystems Biology
Research PriorityX – Research Priority information not available
Research Initiative X - not in an Initiative
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file