Detecting macroevolutionary genotype-phenotype associations using error-corrected rates of protein convergence

by Kenji Fukushima, David Pollock

Released as a post by Cold Spring Harbor Laboratory.

2022  

Abstract

On macroevolutionary timescales, extensive mutations and phylogenetic uncertainty mask the signals of genotype-phenotype associations underlying convergent evolution. To overcome this problem, we extended the widely used framework of nonsynonymous-to-synonymous substitution rate ratios and developed the novel metric ω_C, which measures the error-corrected convergence rate of protein evolution. While ω_C distinguishes natural selection from genetic noise and phylogenetic errors in simulation and real examples, its accuracy allows an exploratory genome-wide search of adaptive molecular convergence without phenotypic hypothesis or candidate genes. Using gene expression data, we explored over 20 million branch combinations in vertebrate genes and identified the joint convergence of expression patterns and protein sequences with amino acid substitutions in functionally important sites, providing hypotheses on undiscovered phenotypes. We further extended our method with a heuristic algorithm to detect highly repetitive convergence among computationally nontrivial higher-order phylogenetic combinations. Our approach allows bidirectional searches for genotype-phenotype associations, even in lineages that diverged for hundreds of millions of years.

Archived Files and Locations

www.biorxiv.org (repository)
Type  post
Stage   unknown
Date   2022-04-07