Award details

Underpinning UK Bioscience Research with high-throughput single molecule sequencing

ReferenceBB/T017864/1
Principal Investigator / Supervisor Professor Peter Keightley
Co-Investigators /
Co-Supervisors
Dr Tim Connelley, Professor Daniel Macqueen, Dr James Prendergast, Dr Javier Santoyo-Lopez, Dr Jacqueline Smith, Dr Alexander Twyford, Professor Michael Watson
Institution University of Edinburgh
DepartmentSch of Biological Sciences
Funding typeResearch
Value (£) 442,788
StatusCompleted
TypeResearch Grant
Start date 10/11/2020
End date 09/04/2021
Duration5 months

Abstract

The PacBio Sequel II is an evolution of the original Sequel single molecule sequencer. The improved technology and flow cell chemistry, in the Sequel II, will allow EdGe to generate highly accurate sequence reads of long DNA and RNA molecules. Single molecule, long read technology has benefits over short read technologies (e.g. Illumina) where assembly or mapping often result in ambiguities. The longer reads generated by single molecule sequencing do not need local assembly; they are more confidently assigned to the correct location. The complexity of genomes (DNA) and transcriptomes (RNA) can best be addressed with longer reads. For example, full length RNA molecules, with usually many variants per gene, can be identified and compared between samples (letter of support from Richard Kuo). When assembling chromosome level sequences, long reads help to resolve issues like repetitive regions and structural variation. The advantages of the Sequel II (over the Sequel) are its greatly increased throughput, due to its larger, 8 million Zero Mode Wave guide flow cell and longer movie times for data collection. The increased number of reads results in a lower cost per base (less than half the cost of Sequel). This means that research is more cost effective, allowing more sequence depth, sample numbers (for additional replicates, or biological questions) or both. For example, 40-50 bacterial samples can be sequenced for de novo assembly on a single SMRT cell. However, the biggest improvement in Sequel II is its accuracy. 'HiFi' reads are generated by circular consensus sequencing mode where the same strand is sequenced several times. This increases accuracy from 86% (single pass accuracy) to >99.9%, allowing detection of single nucleotide variants and helping to generate highly accurate phased genome and transcriptome assemblies. The improved accuracy of the Sequel II will allow hard to sequence areas, like the MHC (co-I Dr Connelley), to be accurately sequenced at scale.

Summary

The genomics revolution is entering its third phase with the advent of technologies that can generate single molecule DNA sequencing data extending 100s of kilobases. Long DNA sequence reads can be used to generate high quality reference genomes, to better understand gene expression, and to sequence and sort DNA in complex mixtures such as environmental samples of soil or water. The Pacific Biosciences Sequel II System is a new long-read DNA sequencing platform that generates extremely large amounts of data at an affordable cost. A key feature of Sequel II is that it can generate highly accurate long reads, by reading a DNA molecule multiple times to correct errors. This grant will acquire the Sequel II platform, the first in Scotland, embed this within Edinburgh Genomics, and allow us to offer low-cost, high quality long read sequencing as a service to the biological and biotechnology research community. We will use the investment to support researchers who want to generate genomic data for diverse topics such as agriculture, biotechnology, genetics, immunology and synthetic biology.

Impact Summary

The initial impact of this investment will be in high quality biological research published in peer reviewed journals, conference proceedings and invited lectures. Often, high impact work is also the subject of print and broadcast media articles, which broaden the reach of the work to inform the general public and policy makers. The Sequel II will allow more studies, at lower costs, and with better results than previous technologies. Because Edinburgh Genomics (EdGe) is an open access Genome Facility, the impact of the investment will be seen across the University of Edinburgh but also in other institutes in the UK, and in collaboration with scientists abroad. The expected lifespan of the sequencer is around five years, and it is expected that more than 100 projects will benefit from access to the sequencer. These studies will have broad societal impact as well as basic scientific value. The research excellence afforded by this new technology will promote invaluable real world benefits to areas such as food security (through animal health, crop disease resistance), health and medical science (in disease areas such as cardiovascular, inflammation, cancer and reproductive biology and exciting technologies like stem cell therapy), and our ecosystems (with environmental research in oceans and forests). Students, postdocs and PIs from the University of Edinburgh and from across the UK will benefit from our new wet and dry lab long read sequencing course. This will train 48 scientists per year in the process of using long read sequence data in biological research. Additional bioinformatics-only workshops will be offered to a larger number (<100) of attendees per year. These 'upskilled' scientists will feed back into more high quality long read biological research across agricultural, medical and environmental biology. Through public engagement, the science that this investment facilitates will be brought to a large audience and will feed back into further valuable research as its value becomes clearer and public opinion demands more from our scientists. Influence on governmental and funding policies will increase the speed and scope of science's impact on our lives.
Committee Not funded via Committee
Research TopicsTechnology and Methods Development
Research PriorityX – Research Priority information not available
Research Initiative Advanced Life Sciences Research Technology Initiative (ALERT) [2013-2014]
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file