Award details

Ensembl and enabling genetics and genomics research in farmed animal species

ReferenceBB/I025506/1
Principal Investigator / Supervisor Dr Paul Flicek
Co-Investigators /
Co-Supervisors
Institution EMBL - European Bioinformatics Institute
DepartmentVertebrate Genomics
Funding typeResearch
Value (£) 259,620
StatusCompleted
TypeResearch Grant
Start date 01/04/2012
End date 31/03/2015
Duration36 months

Abstract

A high quality annotated reference genome sequence is critical to contempary research in the biological sciences. The Ensembl browser and associated annotation tools and database have been shown to be robust and effective means for making genomic information useful to a wide range of users. Draft reference genome sequences have been established for several farmed animal species (chicken, cattle, pig, horse, turkey) and sequencing is well advanced for several others (including sheep, duck, salmon). Annotated assemblies have already been made available through the Ensembl (chicken, cattle, pig, horse) and Pre-Ensembl (duck, turkey, sheep) genome browsers. However, the utility of a bioinformatics resource are critically dependent upon the currency of the resource. Genome sequence assemblies, including the 'finished' human and mouse sequences are subject to continual revision as new data are acquired and errors corrected. This proposal is concerned with maintaining the currency of Ensembl in respect of farmed and companion animal species, including poultry and farmed fish. Whilst first draft genome sequences have been established for several of the species of interest, improved genome assemblies and increased volumes of ancillary data, including RNAseq and ChIPseq data are also being generated for these species. Thus, we will use these growing and improving data to develop up-to-date and enhanced annotation for these species. Not only are the genomes of more farmed animal species but also the genomes of multiple individuals within a species are being sequenced. The recently developed Ensembl variation resources allow these additional data to be captured and visualised for the benefit of scientists engaged in genetics and genomics, and other lines of, research on the target species. We will work with the animal sciences research community to acquire re-sequence data, SNP and CNV genotypes with which to populate the Ensembl-animal variation databases.

Summary

The sequence of almost all genes (a draft genome sequence) has been determined for several farmed and companions animals including cattle, pigs, chickens, turkeys, dogs and horses. Draft genome sequences for several other species such as sheep, ducks and salmon will be completed soon. The strings of billions of bases (symbolised as four letters A, C, G, T) that constitute these genome sequences are not immediately useful to biological research scientists. Annotating these draft genome sequences with features such as the coding and regulatory parts of genes, and bases which differ between individuals within a species (genetic variants) greatly enhances the value and utility of the genome sequence. Visualising the genome sequences complete with annotations in an freely accessible manner further improves the value of the information. The web-mounted Ensembl genome browser, databases and associated annotation tools have been shown to be powerful and effective means of annotating the complex genomes of animal species including humans, mice and more recently farmed and companion animals. This project is concerned with improving the quality of genome annotation for farmed and companion animal genomes. International consortia of scientists are using the so-called next generation sequencing technology, not only to sequence the genomes of more economically important species, but also the genomes of multiple individuals for each species of interest and to improve or finish the reference genome sequences for key species. These new sequencing technologies are also being used increasingly in assays, for example, of the extent of gene expression in different cells or under different conditions (transcriptomics) or of the state of the genome (epigenomics). Mapping the sequence read-outs from these assays back to the relevant genome sequence not only provides a genome-wide framework for analysis but also provides further information with which to annotate the genome sequence itself. Thus, there is a recurring need to refresh the genome sequence annotation for important animal species. We will use the Ensembl system to annotate the genome sequences of key farmed and companion animal species. The resulting annotated genome sequences will be made freely available as resources mounted on the World Wide Web. Recently developed features within the Ensembl system enable the analysis and visualisation of genetic variation (i.e. sequence differences) between individuals of the same species. This genetic variation explains the differences in traits such as growth, milk yield and susceptibility to disease. We will populate the Ensembl-animal Variation databases with sequence and genotype data acquired from the animal genetics research community. Visualising these variation data and making them accessible to the scientific community and the animal breeding industry will facilitate research to understand the genetic control of complex traits in animals and genetic improvement of farmed animals. A high quality annotated reference genome sequence is a critical bioinformatics resource for the effective prosecution of contempary research in the biological sciences. The value and utility of such bioinformatics resources are critically dependent upon the currency of the resource. Thus, this project is concerned with delivering high quality up-to-date annotated reference genomes for key farmed and companion animal species to enable research on these economically or socially important animal species.

Impact Summary

Who will benefit? The primary beneficiaries from this proposed development and maintenance of Ensembl resources for farmed and companion animals will be researchers in academia and industry in the UK and beyond. The access statistics and citations of Ensembl papers provide evidence of the demand for Ensembl resources from the research community. The world's leading animal breeding and aquaculture breeding companies, of which some of the largest are UK companies, have in-house genetics expertise. Thus, these companies have the expertise to exploit the information captured and disseminated through Ensembl resources. Suppliers of species specific 'omics tools such as expression arrays, SNP chips and proteomics system will benefit from access to annotated genomes sequences which include links to features (e.g. probes) on their products. There are potential indirect benefits to the wider public through the addressing of the food security agenda as discussed below. How will they benefit? The proposed enhanced Ensembl resources, especially the genetic variation resources, will enable research to dissect the genetic control of economically important (and complex) traits in farmed animals including feed efficiency and susceptibility to infectious diseases. In companion animals such as dogs these resources will enable the identification of the determinants of inherited diseases. This enabling of genetics research in farmed animals and fish will facilitate advanced genetic improvement for these species. In the past 40+ years, there have been major productivity gains in dairy cattle, pigs and poultry and there have also been significant reductions in the greenhouse gas emissions and global warming potential per tonne of animal product. These gains have been achieved through genetic improvement alone or in combination with better husbandry, nutrition and disease control. Genetic improvement of farmed animal species is a key means of addressing the food securityagenda for the animal agriculture and aquaculture sectors. In companion animals the benefits will be improved tools for selective breeding to minimise inherited diseases and inbreeding and to improve animal welfare. The utility of 'omics technology products such as expression microarrays and SNP chips is greatly enhanced when the features on these products can be linked to a well-annotated genome sequence and other information sources. For example, probe sets for Affymetrix arrays and SNPs on Illumina chips can be linked to annotated genes and genome locations respectively, thus enabling more effective use of these products. Academic and other researchers will benefit from the ability to link the read-out from assay by sequence assays to an annotated genome sequence. Without such a frame of reference such assays are of limited value. The impacts on research will be delivered within the timeframe of the proposed project to enhance Ensembl resources for farmed and companion animals and continue thereafter. Maintaining the currency of the genome assemblies and the associated annotation is critical to ensuring that these impacts continue to be effective. The indirect impacts, for example, on the food security agenda and hence the benefits to the agriculture and aquaculture sectors and the wider public will take longer to be felt. However, the time to impact for genetic tests for susceptibility to inherited or infectious diseases in animals with their positive impacts on animal welfare can be short - 1 to 3 years.
Committee Research Committee D (Molecules, cells and industrial biotechnology)
Research TopicsAnimal Health
Research PriorityX – Research Priority information not available
Research Initiative Bioinformatics and Biological Resources Fund (BBR) [2007-2015]
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file