Projects using RevBayes

Nicolas Lartillot, Bastien Boussau

Last modified on

Coevolution between substitution rates and life-history traits

In mammals, there is substantial variation in substitution rate between lineages. This variation may be correlated with life-history traits: small mammals, with short generation times, tend to evolve faster. Here, the aim is measure the correlation between substitution rate and body mass or other life-history traits in mammals, by building a model of the joint evolutionary process of the substitution rate and life-history traits across branches, using a variant of the Brownian model used for modelling the auto-correlated molecular clock (Lartillot and Delsuc, 2012).

A relaxed molecular clock informed by life-history and germ-line development in primates

In primates, the mutation rate per generation is mostly determined by the number of replications in the germline. The developmental process of the germline is relatively well characterized, and a model of its modulation, as a function of life-history (age of puberty, generation time, etc) has been proposed (Amster and Sella, 2016). Here, the aim is to use this model (and the empirically measured values reported in the article of Amster and Sella) to calibrate the molecular clock (and its variation) in simian primates, using information about sexual maturity and generation time in extant species.

Nucleotide compositional variation across species and its impact on phylogenetic reconstruction

The models that we have considered in the tutorials are homogeneous across branches. As a result, they predict the same nucleotide composition in all species. In practice, this is not the case. In some extreme situations, not accounting for compositional variation can result in phylogenetic reconstruction artifacts (typically, unrelated species with similar compositional biases artifactually cluster together). A simple but particularly striking example is shown in Foster, 2004. In this article, a model with branch specific nucleotide composition is introduced to improve phylogenetic reconstruction in such cases. Since then, Heaps et al. have proposed improved models. Here, the aim is to design versions of these models with branch-specific equilibrium frequencies, and to see if they improve phylogenetic reconstruction.

Multi-gene phylogenetic reconstruction of the phylogeny of mammals

Designing a model for doing multi-gene phylogenetic reconstruction. Genes may share the same species phylogeny but may differ in their rate of evolution and in their GC content. Can be used to reconstruct the phylogeny or to estimate the variance in substitution rates and in GC content across genes. The data is available from the article

Multi-gene phylogenetic reconstruction of the phylogeny of birds

Correlation between GC composition of ribosomal RNA and growth temperature in Archaea.

The idea is to model the correlated evolution of rRNA GC content and growth temperature across Archaea, and use this model to estimate the correlation and to infer ancestral temperatures along the phylogeny.

Are patterns of absence/presence of genes across genomes informative about the phylogeny?

The idea is to model the process of gain and loss of genes across a phylogeny, and to apply this model to data of absence/presence of genes across metazoans. See article of Pisani et al., (2015) and Ryan et al (2013) for two analyses giving different results on the same dataset.

Convergent evolution toward subterranean lifestyle in isopods, and its consequences on the rate of genome evolution.

Saclier et al have analysed the evolutionary patterns in a group of isopod species, in which there has been a large number of independent transitions from surface to underground lifestyle – giving an opportunity for modeling and investigating the impact of these transitions on genomic sequence evolution.

State-dependent diversification model

The aim of diversification studies is to infer the patterns of species diversification over evolutionary times, with rates of speciation and extinction potentially influenced by other factors. One such model assumes that speciation and extinction depends on a binary character (such as the mode of reproduction). However, current diversification studies assume the phylogeny is known without error. Here, the aim would be to see if we can instead do an integrative analysis, thus using the diversification model as a prior in a dating analysis.

Analysis of the SEMG2 gene in Primates

Here the project is to analyze an alignment of the SEMG2 gene from primate species that differ in their mating systems.

Analysis of mitochondrial protein evolution in Daphnia

Here the project is to analyze a concatenated alignment of 15 DNA sequences coding for proteins from 29 strains of Daphnia pulex, some of which reproduce sexually (named S1 to S14), and others, asexually (named A1 to A14). Sexual reproduction is assumed to be the ancestral condition.

Modèle de conversion génique biaisée

Les génomes des mammifères (et de beaucoup d’autres espèces) sont plus riches en GC qu’attendu étant donné leur biais mutationnel (en général vers AT). Ceci est du à la conversion génique biaisée vers GC. L’objectif de ce mini-projet est de développer un petit modèle phyogénétique qui permet, sous certaines hypothèses à discuter, d’estimer la force de cette conversion génique biaisée.