A shift in aggregation avoidance strategy marks a long-term direction to protein evolution

Scott G. Foy, Benjamin A. Wilson, Jason Bertram, Matthew Hj Cordes, Joanna Masel

Research output: Contribution to journal > Article

1 Citation (Scopus)

Abstract

To detect a direction to evolution, without the pitfalls of reconstructing ancestral states, we need to compare “more evolved” to “less evolved” entities. But because all extant species have the same common ancestor, none are chronologically more evolved than any other. However, different gene families were born at different times, allowing us to compare young protein-coding genes to those that are older and hence have been evolving for longer. To be retained during evolution, a protein must not only have a function, but must also avoid toxic dysfunction such as protein aggregation. There is conflict between the two requirements: hydrophobic amino acids form the cores of protein folds, but also promote aggregation. Young genes avoid strongly hydrophobic amino acids, which is presumably the simplest solution to the aggregation problem. Here we show that young genes’ few hydrophobic residues are clustered near one another along the primary sequence, presumably to assist folding. The higher aggregation risk created by the higher hydrophobicity of older genes is counteracted by more subtle effects in the ordering of the amino acids, including a reduction in the clustering of hydrophobic residues until they eventually become more interspersed than if distributed randomly. This interspersion has previously been reported to be a general property of proteins, but here we find that it is restricted to old genes. Quantitatively, the index of dispersion delineates a gradual trend, i.e., a decrease in the clustering of hydrophobic amino acids over billions of years.
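
The index of dispersion mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the hydrophobic residue set, the block size of 6, the shuffle-based normalization, and the function names `dispersion_index` and `normalized_dispersion` are all assumptions chosen for illustration. The idea is to count hydrophobic residues in consecutive blocks along the primary sequence, take the variance-to-mean ratio of those counts, and compare it to the same ratio for shuffled copies of the sequence, so that values above 1 suggest clustering and values below 1 suggest interspersion.

```python
import random
from statistics import mean, pvariance

# Assumption: this residue set is illustrative, not taken from the paper.
HYDROPHOBIC = frozenset("LIVFMW")


def dispersion_index(sequence, block_size=6, hydrophobic=HYDROPHOBIC):
    """Variance-to-mean ratio of hydrophobic counts in consecutive blocks.

    Trailing residues that do not fill a whole block are ignored.
    Returns None when there are fewer than two blocks or no hydrophobic
    residues, because the ratio is undefined in those cases.
    """
    seq = sequence.upper()
    n_blocks = len(seq) // block_size
    counts = [
        sum(res in hydrophobic for res in seq[i * block_size:(i + 1) * block_size])
        for i in range(n_blocks)
    ]
    if n_blocks < 2 or sum(counts) == 0:
        return None
    return pvariance(counts) / mean(counts)


def normalized_dispersion(sequence, block_size=6, n_shuffles=200, seed=0):
    """Observed index divided by the mean index over shuffled sequences.

    Shuffling keeps the amino acid composition fixed and randomizes only
    the order, so the result reflects ordering rather than hydrophobicity.
    """
    observed = dispersion_index(sequence, block_size)
    if observed is None:
        return None
    rng = random.Random(seed)
    residues = list(sequence)
    shuffled_values = []
    for _ in range(n_shuffles):
        rng.shuffle(residues)
        value = dispersion_index("".join(residues), block_size)
        if value is not None:
            shuffled_values.append(value)
    if not shuffled_values:
        return None
    baseline = mean(shuffled_values)
    return observed / baseline if baseline > 0 else None


# Two toy sequences with identical composition (12 L, 12 G).
clustered = "LLLLLLGGGGGG" * 2
interspersed = "LGLGLG" * 4
print(dispersion_index(clustered))     # 3.0
print(dispersion_index(interspersed))  # 0.0
```

In this toy case the two sequences have the same hydrophobicity, so the difference in the index comes purely from residue ordering, which is the kind of effect the abstract describes for old versus young genes.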

Original language: English (US)
Pages (from-to): 1345-1355
Number of pages: 11
Journal: Genetics
Volume: 211
Issue number: 4
DOI: 10.1534/genetics.118.301719
State: Published - Apr 1 2019

Keywords

  • Aggregation propensity
  • Gene age
  • Phylostratigraphy
  • Protein folding
  • Protein misfolding

ASJC Scopus subject areas

  • Genetics

Cite this

A shift in aggregation avoidance strategy marks a long-term direction to protein evolution. / Foy, Scott G.; Wilson, Benjamin A.; Bertram, Jason; Cordes, Matthew Hj; Masel, Joanna.

In: Genetics, Vol. 211, No. 4, 01.04.2019, p. 1345-1355.

Foy, Scott G.; Wilson, Benjamin A.; Bertram, Jason; Cordes, Matthew Hj; Masel, Joanna. / A shift in aggregation avoidance strategy marks a long-term direction to protein evolution. In: Genetics. 2019; Vol. 211, No. 4. pp. 1345-1355.

@article{add143f8f2ec4c45bba9781baea7e909,
title = "A shift in aggregation avoidance strategy marks a long-term direction to protein evolution",
keywords = "Aggregation propensity, Gene age, Phylostratigraphy, Protein folding, Protein misfolding",
author = "Foy, {Scott G.} and Wilson, {Benjamin A.} and Jason Bertram and Cordes, {Matthew Hj} and Joanna Masel",
year = "2019",
month = "4",
day = "1",
doi = "10.1534/genetics.118.301719",
language = "English (US)",
volume = "211",
pages = "1345--1355",
journal = "Genetics",
issn = "0016-6731",
publisher = "Genetics Society of America",
number = "4",

}

TY - JOUR
T1 - A shift in aggregation avoidance strategy marks a long-term direction to protein evolution
AU - Foy, Scott G.
AU - Wilson, Benjamin A.
AU - Bertram, Jason
AU - Cordes, Matthew Hj
AU - Masel, Joanna
PY - 2019/4/1
Y1 - 2019/4/1

AB - To detect a direction to evolution, without the pitfalls of reconstructing ancestral states, we need to compare “more evolved” to “less evolved” entities. But because all extant species have the same common ancestor, none are chronologically more evolved than any other. However, different gene families were born at different times, allowing us to compare young protein-coding genes to those that are older and hence have been evolving for longer. To be retained during evolution, a protein must not only have a function, but must also avoid toxic dysfunction such as protein aggregation. There is conflict between the two requirements: hydrophobic amino acids form the cores of protein folds, but also promote aggregation. Young genes avoid strongly hydrophobic amino acids, which is presumably the simplest solution to the aggregation problem. Here we show that young genes’ few hydrophobic residues are clustered near one another along the primary sequence, presumably to assist folding. The higher aggregation risk created by the higher hydrophobicity of older genes is counteracted by more subtle effects in the ordering of the amino acids, including a reduction in the clustering of hydrophobic residues until they eventually become more interspersed than if distributed randomly. This interspersion has previously been reported to be a general property of proteins, but here we find that it is restricted to old genes. Quantitatively, the index of dispersion delineates a gradual trend, i.e., a decrease in the clustering of hydrophobic amino acids over billions of years.

KW - Aggregation propensity
KW - Gene age
KW - Phylostratigraphy
KW - Protein folding
KW - Protein misfolding
UR - http://www.scopus.com/inward/record.url?scp=85064721235&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85064721235&partnerID=8YFLogxK
U2 - 10.1534/genetics.118.301719
DO - 10.1534/genetics.118.301719
M3 - Article
VL - 211
SP - 1345
EP - 1355
JO - Genetics
JF - Genetics
SN - 0016-6731
IS - 4
ER -