Award details

Hydroxy-Sensitive Cut Counting (HSCC); simultaneous, genome-wide mapping of 5-methylcytosine and 5-hydroxymethylcytosine in mammals

ReferenceBB/J021032/1
Principal Investigator / Supervisor Professor Richard Meehan
Co-Investigators /
Co-Supervisors
Dr Colm Nestor
Institution University of Edinburgh
DepartmentCentre for Molecular Medicine
Funding typeResearch
Value (£) 120,177
StatusCompleted
TypeResearch Grant
Start date 01/10/2012
End date 30/09/2013
Duration12 months

Abstract

DNA methylation, the covalent attachment of a methyl group directly to the fifth position of cytosine (C) in DNA is the most widely studied epigenetic mark in mammals. DNA methylation is involved in maintaining genome integrity by silencing transposable elements, the process of X-inactivation in females, regulation of allele-specific expression at imprinted loci, and may also contribute to defining tissue-specific gene-expression patterns. The recent discovery of substantial amounts of 5-hydroxymethylcytosine (5hmC), a derivative of 5-methylcytosine (5mC) in various mouse tissues and human ES cells has necessitated a re-evaluation of our knowledge of 5mC/5hmC patterns and functions in mammalian cells. Indeed, we recently showed that many of the techniques used to assay DNA methylation patterns, including bisulphite sequencing, failed to differentiate between these two modifications. Our current understanding of 5hmC is now seriously limited by the inability to assess genome-wide patterns of 5hmC in an unbiased and quantitative manner. Here we describe a radical new assay, Hydroxy Cut Counting (HSCC) which will allow determination of 5-mC and 5-hmC levels at over 1.5 million MspI sites throughout the mouse genome. HSCC will exploit the newly described differential sensitivity of the MspI restriction enzyme to its target sequence, CCGG, when the internal cytosine residue is either 5hmC (C5hmCGG; cut) or beta-glucosyl-5-hydroxymethylcytosine, (Cg5hmCGG; uncut). Subsequently EcoP15I digested DNA will be subjected to linker-mediated PCR (LM-PCR) followed by massively parallel sequencing and in silico analyses to determine genome-wide locus-specific 5mC, 5hmC and unmodified C levels in tissues. Whereas, affinity-based techniques, can only reliably sample regions with moderate to high levels of 5hmC and cannot report on regions lacking the mark, HSCC will sample all MspI sites irrespective of 5-hmC levels.

Summary

The human genome contains all the instructions (genes) necessary to create a human being, beginning from a single fertilised egg cell. Although the same genome sequence is present in all cells, there are hundreds of markedly different cell types, such as brain (neurons), kidney, and liver cells, in the human body. Each cell type must 'turn on' a different subset of genes for the correct functioning of each tissue type. A primary mechanism by which cells mark genes to be turned on or off is called, 'epigenetics'. Epigenetics refers to heritable changes in gene expression that are not caused by mutations in the underlying DNA sequence. The most comprehensively studied epigenetic mark in mammals is DNA methylation which involves the attachment of a tag-molecule called a 'methyl group' to cytosines to give 5-methylcytosine (5mC). It typically occurs in a CpG dinucleotide context, although non-CpG methylation occurs in embryonic stem cells, between 60-90% of all CpGs are methylated in mammals. DNA methylation is associated with gene silencing and is essential for normal development. Key processes including genomic imprinting, X-chromosome inactivation, suppression of repetitive elements, stability of gene expression and carcinogenesis either depend or involve dynamic changes in DNA methylation patterns. From this perspective it is important to know where modified DNA resides in the genome. Much of what we know about DNA methylation in mammals is based on a set of techniques which can distinguish between 5mC and an unmethylated cytosine (C). These include the use of methyl-sensitive restriction enzymes (which can cut unmethylated, but not methylated DNA) and bisulfite sequencing, a method that allows accurate quantification of methylation levels at several neighbouring cytosines simultaneously. This analysis has been confounded by the identification of a new type of modified DNA, 5-hydroxymethylcytosine (5hmC) that is present at high levels in mammalian tissues. Indeed, 5hmC is 40% as common as 5mC in mouse brain samples. Despite intense study of DNA methylation for the last 30 years, the presence of 5hmC in mammalian tissues had been missed especially as most the techniques used to identify methylation, do not differentiate between 5mC and 5hmC. We propose a radical, new technique termed Hydroxy Sensitive Cut Counting (HSCC) to simultaneously analyse for 5mC, 5hmC and C at the 1.5 -2.3 million CCGG sequences present throughout the mouse and human genomes respectively. HSCC is based on the observation that the restriction enzyme MspI, can cut its target site, CCGG, if the internal C is 5hydroxymethylcytosine, but not if it is beta-glucosyl-5-hydroxymethylcytosine (ghmC). Using a well characterised and commercially available enzyme T4 Phage beta-glucosyltransferase, we will convert all the 5hmC in the genome to ghmC, and digest the sample before and after treatment with MspI. The sequences surrounding each MspI site will then be sequenced using next generation sequencing. The number of sequences in the treated versus untreated samples for each MspI site is a measure of the amount of 5hydroxymethylcytosine present, allowing for semi-quantification of genome-wide 5hmC levels. The resulting 5hmC profiles will represent tissue 'identifier'; a barcode that will be a read-out of a normal tissue state. HSCC represents a dramatic improvement on the existing 'genome-wide' techniques that use antibodies to harvest DNA containing 5hmC. Such 'affinity' techniques can only assay regions of the genome which contain 5hmC and often exhibit strong sequence-context biases. In contrast, HSCC will assay MspI sites regardless of 5hmC level. This is vital, as knowing where 5hmC is depleted is as important as knowing where it is enriched if we are to understand the role of this exciting new mark in mammalian biology.

Impact Summary

The role of the newly identified base, 5-hydroxymethylcytosine (5-hmC) is a high impact research topic in the field of epigenetics, including potential epidemiology studies. This application is predicated on developing a new technique, Hydroxy-Sensitive Cut Counting (HSCC), which enables simultaneous, genome-wide mapping of 5-methylcytosine (5-mC) and 5-hmC in mammals. HSCC is a new research tool for epigenetics research, which has the potential to provide significant impact 'to facilitate new biological understanding' in many areas including, transcriptional regulation, stem cell biology, developmental biology, nutritional research and the epidemiology of cognitive ageing. The development of HSCC represents potential value for money as a tool that can be used to study the DNA modification patterns in population cohorts, which may reflect the transcription profile of the target tissue. These resulting profiles can act as a read-out of past environmental exposures or disease states that result in altered patterns of transcription. The knowledge gained has the potential to improve prosperity and quality of life. HSCC can address a central question; how dynamic are the epigenetic patterns of DNA modification in the nucleus? The impact of the technique itself would be to contribute to the knowledge base of the areas outlined above. HSCC has the potential to be used by many epigenetic scientists in their future research. This tool is needed, as an inexpensive high resolution genomic mapping data that incorporates C, 5mC and 5hmC profiles is not available at present. The availability of HSCC as a new research tool has the capacity to be part of a turning point in DNA modification research that will lead to new observations, and have significant impact on scientific advances in the area of epigenetics. Beyond the scope of this application are concerns about the epigenetic effects of nutrition, drugs or stress on foetal programming affecting adult health. HSCC representsa 'cutting edge technology' that has the potential to facilitate rapid accumulation of epigenetic profiling in these areas of research. In the case of our application, knowledge transfer would take place by publishing and presenting our work at local and international meetings. HSCC represents 'timeliness and promise' as there is a growing effort to characterise DNA modification profiles in areas of cell, developmental and population biologys.
Committee Research Committee C (Genes, development and STEM approaches to biology)
Research TopicsTechnology and Methods Development
Research PriorityX – Research Priority information not available
Research Initiative Tools and Resources Development Fund (TRDF) [2006-2015]
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file