Award details

Addressing the problem of deep coalescence in ancient radiations: Resolving the explosive radiation of the Lophotrochozoa.

ReferenceBB/R016240/1
Principal Investigator / Supervisor Professor Maximilian Telford
Co-Investigators /
Co-Supervisors
Dr Paschalia Kapli, Professor Ziheng Yang
Institution University College London
DepartmentGenetics Evolution and Environment
Funding typeResearch
Value (£) 429,383
StatusCompleted
TypeResearch Grant
Start date 01/10/2018
End date 30/09/2021
Duration36 months

Abstract

Resolving ancient, rapid radiations is among the most challenging tasks in phylogenetics. Rapid divergences allow the accumulation of limited phylogenetic information on short internal branches. Long terminal branches mean multiple substitutions, eroding phylogenetic signal. An even more problematic product of rapid and ancient divergences is the phenomenon of deep coalescence or incomplete lineage sorting (ILS). Because of polymorphism in the ancestral species, gene sequences may not all coalesce within each species but earlier, in the ancestral population. Deep coalescence means some gene trees do not match the species phylogeny. The multispecies coalescent (MSC) model avoids the incorrect assumption that all markers/loci share the same genealogical history. Within this framework, ILS or incongruence among gene-trees can become a rich source of information regarding the duration of the radiation and the evolutionary history. Focussing on the unresolved relationships between phyla of the major animal clade of Lophotrochozoa as an important example, we will gather a dataset of up to 1000 nuclear genes from existing and novel transcriptomic data sets from diverse Lophotrochozoans and outgroups. We will investigate the subset of all possible topologies that are supported by each of these 1000 loci to gauge the level of discordance between genes. We will compare the results of reconstructing trees using standard analyses of concatenated data with those from coalescence aware methods. Using extensive simulation based on the MSC model parameters measured from our data, we will explore the robustness of different approaches to violations of model assumptions (variable substitution rates both globally and across branches; drifts in amino acid composition across sites and across branches and non homogeneous patterns of substitution across sites). Simulation will allow us to understand the problems of ILS and untangle the lophotrochozoan polytomy.

Summary

Using molecular data (sequences of genes and proteins) to resolve the relationships between living organisms has been a major international research goal for over 25 years. Our improved knowledge of the relationships between species has already radically affected our understanding of how the great diversity of life evolved. Recently, completing this project seemed in reach, thanks to plentiful molecular data from the so called next generation sequencing. Despite the ready availability of data, some major parts of the tree of life have nevertheless proved impossible to resolve.While this is a general problem affecting many different groups of organisms we will focus on resolving the relationships between a significant part portion of the animal kingdom. One of the most striking features of the animal tree is the short time frame over which the constituent major groups (phyla such as chordates, annelids, arthropods and molluscs) diverged from one another over half a billion years ago in so-called rapid, or even explosive radiations. Rapid radiations can cause two problems. First is a lack of phylogenetic signal due to the limited opportunity to accumulate genetic changes that identify individual groups. Second is the insidious problem of Incomplete Lineage Sorting (ILS) or deep coalescence. Due to polymorphism in the ancestral species, gene sequences may not all coalesce within each species but earlier causing gene-trees to diverge from the actual species-trees. Put more simply, ILS results in different genes telling us different things about the relationships of the organisms. Recent efforts to solve these problems focussed on compiling very large genomic datasets that were concatenated into a single data set and analysed as single supergenes. This approach masks the incongruence between genes we expect to exist and may lead to positively misleading estimates. To solve this final and most recalcitrant aspects of the animal phylogeny we must use methods designed todeal with data incongruence. Within the animals we will focus specifically on resolving the relationships between the phyla within the Lophotrochozoa. This is one of three ancient groups of bilaterally symmetrical animals. Lophotrochozoa contains approximately half of animal phyla including flatworms, annelids, and molluscs. Our ultimate goals are I. to use the example of the Lophotrochozoa to better understand the processes involved in rapid radiations and how best to resolve them. II. to use our understanding to reconstruct the phylogenetic relationships between the animal phyla in the Lophotrochozoa. These practical goals will result in the phylogenetic framework that is the essential foundation for understanding the evolution of this major portion of animal diversity. Our work should give us insight into the processes at work during an ancient, explosive radiation. Such radiation events have repeatedly had a major impact on the evolution of life meaning that developing methods to address such questions will be of broad use. The easy problems having been solved, many remaining phylogenetic questions across the tree of life will involve similar rapid radiations.

Impact Summary

Our work will have the most immediate impact in two obvious areas of biology: Specific major question in Zoology/Evolution/Palaeontology. We are addressing an important and long standing question in the evolution of the animal phyla. The Lophotrochozoan clade contains roughly half of all animal phyla but the relationships between these groups is entirely unclear. If we are able to solve this problem then an accurate phylogeny of the Lophotrochozoa will mean we have resolved the relationships between a significant portion of the animal tree. The impact of this stems not simply from understanding this phylogenetic question. An accurate phylogeny is the essential underlying framework for studying all aspects of the evolution of this group. To understand the evolution of the varied morphologies that exist in this very diverse group, knowledge of the phylogenetic background is essential. The same is true for comparative analyses of genetics, genomes, genome structure etc. Finally, there are many Cambrian fossils currently recognised as stem lophotrochozoan species but impossible to interpret further due to the impossibility of relating them to modern groups. Understanding how living lophotrochozoans are related and the likely pathways of their evolution will help interpret the morphological characters of the fossil members of this group. General problem in resolving the relationships across the tree of life. Rapid radiations are a feature of many groups across the tree of life. The difficulty in resolving these stems from lack of informative characters and this may be seriously compounded by the problem of deep coalescence. We are studying this phenomenon in one ancient radiation and the lessons we learn and the approaches we develop will be much more broadly applicable. Familiar instances include the radiations of mammals, birds and land plants but the phenomenon of rapid radiations is very common across the tree of life. The methods developed in this project, for understanding and resolving rapid radiations and deep coalescence, will provide powerful tools for phylogenetic analysis of genomic/transcriptomic datasets. The methods will be widely applicable and results obtained from such analyses, applied across the tree of life may be critical to effective decision making concerning any down stream application of an accurately resolved phylogeny - biodiversity studies, palaeontology, comparative genomics, comparative morphology.
Committee Research Committee C (Genes, development and STEM approaches to biology)
Research TopicsX – not assigned to a current Research Topic
Research PriorityX – Research Priority information not available
Research Initiative X - not in an Initiative
Funding SchemeX – not Funded via a specific Funding Scheme
terms and conditions of use (opens in new window)
export PDF file