Preview

Exhibit

Good Essays
Open Document
Open Document
7625 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Exhibit
Evolutionary Optimization of Protein Folding
´ ` ´ ¨ Cedric Debes1, Minglei Wang2, Gustavo Caetano-Anolles2*, Frauke Grater1,3*
1 Heidelberg Institute for Theoretical Studies, Heidelberg, Germany, 2 Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, United States of America, 3 CAS-MPG Partner Institute and Key Laboratory for Computational Biology, Shanghai, China

Abstract
Nature has shaped the make up of proteins since their appearance, *3.8 billion years ago. However, the fundamental drivers of structural change responsible for the extraordinary diversity of proteins have yet to be elucidated. Here we explore if protein evolution affects folding speed. We estimated folding times for the present-day catalog of protein domains directly from their size-modified contact order. These values were mapped onto an evolutionary timeline of domain appearance derived from a phylogenomic analysis of protein domains in 989 fully-sequenced genomes. Our results show a clear overall increase of folding speed during evolution, with known ultra-fast downhill folders appearing rather late in the timeline. Remarkably, folding optimization depends on secondary structure. While alpha-folds showed a tendency to fold faster throughout evolution, beta-folds exhibited a trend of folding time increase during the last *1.5 billion years that began during the ‘‘big bang’’ of domain combinations. As a consequence, these domain structures are on average slow folders today. Our results suggest that fast and efficient folding of domains shaped the universe of protein structure. This finding supports the hypothesis that optimization of the kinetic and thermodynamic accessibility of the native fold reduces protein aggregation propensities that hamper cellular functions.
`s ´s ¨ Citation: Debe C, Wang M, Caetano-Anolle G, Grater F (2013) Evolutionary Optimization of Protein Folding. PLoS Comput Biol 9(1): e1002861. doi:10.1371/



References: 1. Andreeva A, Howorth D, Chandonia JM, Brenner SE, Hubbard TJP, et al. (2008) Data growth and its impact on the scop database: new developments. Nucleic Acids Res 36: D419–D425. 2. Qiu L, Pabit SA, Roitberg AE, Hagen SJ (2002) Smaller and faster: the 20residue trp-cage protein folds in 4 micros. J Am Chem Soc 124: 12952–12953. 3. Goldberg ME, Semisotnov GV, Friguet B, Kuwajima K, Ptitsyn OB, et al. (1990) An early immunoreactive folding intermediate of the tryptophan synthase 2 subunit is a molten globule. FEBS Letters 263: 51–56. 4. Matagne A, Chung EW, Ball LJ, Radford SE, Robinson CV, et al. (1998) The origin of the alphadomain intermediate in the folding of hen lysozyme. J Mol Biol 277: 997–1005. 5. Onuchic JN, Wolynes PG (2004) Theory of protein folding. Curr Opin Struct Biol 14: 70–75. 6. Levinthal C (1969) How to fold graciously. In: Debrunnder JTP, Munck E, editors. Mossbauer Spectroscopy in Biological Systems: Proceedings of a meeting held at Allerton House, Monticello, Illinois. University of Illinois Press. pp. 22–24. 7. Nlting B, Schlike W, Hampel P, Grundig F, Gantert S, et al. (2003) Structural determinants of the rate of protein folding. J Theor Biol 223: 299–307. 8. Thirumalai D, Klimov DK (1999) Emergence of stable and fast folding protein structures. Technical Report cond-mat/9910248. 9. Govindarajan S, Recabarren R, Goldstein RA (1999) Estimating the total number of protein folds. Proteins 35: 408–414. 10. Cossio P, Trovato A, Pietrucci F, Seno F, Maritan A, et al. (2010) Exploring the universe of protein structures beyond the protein data bank. PLoS Comput Biol 6: e1000957. 11. Mirny LA, Shakhnovich EI (1999) Universally conserved positions in protein folds: reading evolutionary signals about stability, folding kinetics and function. J Mol Biol 291: 177–196. 12. Xia Y, Levitt M (2004) Simulating protein evolution in sequence and structure space. Curr Opin Struct Biol 14: 202–207. 13. Ortiz AR, Skolnick J (2000) Sequence evolution and the mechanism of protein folding. Biophys J 79: 1787–1799. 14. Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) Scop: a structural classi_cation of proteins database for the investigation of sequences and structures. J Mol Biol 247: 536–540. 15. Caetano-Anolls G, Caetano-Anolls D (2003) An evolutionarily structured universe of protein architecture. Genome Res 13: 1563–1571. 16. Caetano-Anolls G, Caetano-Anolls D (2005) Universal sharing patterns in proteomes and evolution of protein fold architecture and life. J Mol Evol 60: 484–498. 17. Wang M, Jiang YY, Kim KM, Qu G, Ji HF, et al. (2011) A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 28: 567–582. 18. Caetano-Anolls G, Kim KM, Caetano-Anolls D (2012) Erratum to: The phylogenomic roots of modern biochemistry: Origins of proteins, cofactors and protein biosynthesis. J Mol Evol. Epub ahead of print. 19. Wang M, Caetano-Anolls G (2009) The evolutionary mechanics of domain organization in proteomes and the rise of modularity in the protein world. Structure 17: 66–78. 20. Bowman GR, Voelz VA, Pande VS (2011) Taming the complexity of protein folding. Current Opinion in Structural Biology 21: 4–11. 21. Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) How fast-folding proteins fold. Science 334: 517–520. 22. Plaxco KW, Simons KT, Baker D (1998) Contact order, transition state placement and the refolding rates of single domain proteins. J Mol Biol 277: 985–994. 23. Ivankov DN, Garbuzynskiy SO, Alm E, Plaxco KW, Baker D, et al. (2003) Contact order revisited: inuence of protein size on the folding rate. Protein Sci 12: 2057–2062. 24. Bogatyreva NS, Osypov AA, Ivankov DN (2009) Kineticdb: a database of protein folding kinetics. Nucleic Acids Res 37: D342–D346. 25. Ouyang Z, Liang J (2008) Predicting protein folding rates from geometric contact and amino acid sequence. Protein Sci 17: 1256–1263. 26. Vendruscolo M, Dokholyan NV, Paci E, Karplus M (2002) Small-world view of the amino acids that play a key role in protein folding. Phys Rev E Stat Nonlin Soft Matter Phys 65: 061910. 27. Kubelka J, Hofrichter J, Eaton WA (2004) The protein folding ‘speed limit’. Curr Opin Struct Biol 14: 76–88. 28. Sancho DD, Doshi U, Muoz V (2009) Protein folding rates and stability: how much is there beyond size? J Am Chem Soc 131: 2074–2075. 29. Portman JJ (2010) Cooperativity and protein folding rates. Curr Opin Struct Biol 20: 11–15. 30. Cieplak M, Xuan Hoang T (2000) Scaling of folding properties in go models of proteins. Journal of Biological Physics 26: 273–294. 31. Felice FGD, Vieira MNN, Meirelles MNL, Morozova-Roche LA, Dobson CM, et al. (2004) Formation of amyloid aggregates from human lysozyme and its disease-associated variants using hydrostatic pressure. FASEB J 18: 1099–1101. 32. Tanzi RE, Bertram L (2005) Twenty years of the alzheimer’s disease amyloid hypothesis: a genetic perspective. Cell 120: 545–555. 33. Ross CA, Poirier MA (2004) Protein aggregation and neurodegenerative disease. Nat Med 10 Suppl: S10–S17. 34. Monsellier E, Chiti F (2007) Prevention of amyloid-like aggregation as a driving force of protein evolution. EMBO Rep 8: 737–742. 35. Ramanathan A, Agarwal PK (2011) Evolutionarily conserved linkage between enzyme fold, exibility, and catalysis. PLoS Biol 9: e1001193. 36. Hagen SJ, Hofrichter J, Szabo A, Eaton WA (1996) Diffusion-limited contact formation in unfolded cytochrome c: estimating the maximum rate of protein folding. Proc Natl Acad Sci U S A 93: 11615–11617. 37. Jaenicke R (1991) Protein stability and molecular adaptation to extreme conditions. Eur J Biochem 202: 715–728. 38. Han JH, Batey S, Nickson AA, Teichmann SA, Clarke J (2007) The folding and evolution of multidomain proteins. Nat Rev Mol Cell Biol 8: 319–330. PLOS Computational Biology | www.ploscompbiol.org 8 January 2013 | Volume 9 | Issue 1 | e1002861 Evolutionary Optimization of Protein Folding 39. Pauwels K, Molle IV, Tommassen J, Gelder PV (2007) Chaperoning anfinsen: the steric foldases. Mol Microbiol 64: 917–922. 40. Bogumil D, Landan G, Ilhan J, Dagan T (2012) Chaperones divide yeast proteins into classes of expression level and evolutionary rate. Genome Biol Evol 4: 618–625. 41. Vendruscolo M (2012) Proteome folding and aggregation. Curr Opin Struct Biol 22: 138–143. 42. Riddle DS, Santiago JV, Bray-Hall ST, Doshi N, Grantcharova VP, et al. (1997) Functional rapidly folding proteins from simplified amino acid sequences. Nat Struct Biol 4: 805–809. 43. Li L, Shakhnovich EI (2001) Different circular permutations produced different folding nuclei in proteins: a computational study. J Mol Biol 306: 121–132. 44. Jung J, Lee B (2001) Circularly permuted proteins in the protein structure database. Protein Sci 10: 1881–1886. 45. Bliven S, Prli A (2012) Circular permutation in proteins. PLoS Comput Biol 8: e1002445. 46. Coles M, Hulko M, Djuranovic S, Truffault V, Koretke K, et al. (2006) Common evolutionary origin of swapped-hairpin and double-psi beta barrels. Structure 14: 1489–1498. 47. Wolf YI, Grishin NV, Koonin EV (2000) Estimating the number of protein folds and families from complete genome data. J Mol Biol 299: 897–905. 48. Muoz V, Serrano L (1996) Local versus nonlocal interactions in protein folding and stability an experimentalist’s point of view. Folding and Design 1: R71– R77. 49. Kim KM, Caetano-Anolls G (2012) The evolutionary history of protein fold families and proteomes confirms that the archaeal ancestor is more ancient than the ancestors of other superkingdoms. BMC Evol Biol 12: 13. 50. Gough J, Karplus K, Hughey R, Chothia C (2001) Assignment of homology to genome sequences using a library of hidden markov models that represent all proteins of known structure. J Mol Biol 313: 903–919. 51. Swofford DL (2003) PAUP* Phylogenetic Analysis Using Parsimony (*and Other Methods) Version 4.04beta. Sunderland, Massachusetts: Sinauer Associates. 52. Shank EA, Cecconi C, Dill JW, Marqusee S, Bustamante C (2010) The folding cooperativity of a protein is controlled by its chain topology. Nature 465: 637– 640. 53. Wang G, Dunbrack RL (2005) Pisces: recent improvements to a pdb sequence culling server. Nucleic Acids Res 33: W94–W98. 54. Cleveland WS (1981) Lowess: A program for smoothing scatterplots by robust locally weighted regression. The American Statistician 35: p. 54. 55. Cleveland WS, Devlin SJ, Wagenaar JB (1988) Locally weighted regression: An approach to regression analysis by local fitting. Journal of the American Statistical Association 83: 596–610. 56. Bairoch A, Apweiler R (1999) The swiss-prot protein sequence data bank and its supplement tremble in 1999. Nucleic Acids Res 27: 49–54. 57. Shi Y, Zhou J, Arndt D, Wishart DS, Lin G (2008) Protein contact order prediction from primary sequences. BMC Bioinformatics 9: 255. PLOS Computational Biology | www.ploscompbiol.org 9 January 2013 | Volume 9 | Issue 1 | e1002861

You May Also Find These Documents Helpful

  • Good Essays

    | The parental double helix is unzipped, and copied as individual template strands; Watson and Crick assumed this was correct, and it is…

    • 1676 Words
    • 7 Pages
    Good Essays
  • Powerful Essays

    Secondary: local regions of polypeptide chain fold into specific shapes (shapes arise from the bonding forces between amino acids close in proximity of linear sequence…

    • 2586 Words
    • 11 Pages
    Powerful Essays
  • Good Essays

    If the polypeptide chain form beta pleated sheets (folded chains running parallel to each other) then the hydrogen bonds between the CO (carboxyl group) and NH (amino group) occurs between two separate beta pleated polypeptide chains (see figure 10)…

    • 803 Words
    • 4 Pages
    Good Essays
  • Better Essays

    Michael J. Behe wrote this book to show that Darwinism is not consistent with what we now know about biochemistry. The book is a daring attempt to re-establish the argument for design in living things. Chapter three is all about how molecule machines operate a cell.…

    • 1849 Words
    • 8 Pages
    Better Essays
  • Good Essays

    o A yardstick for measuring the absolute time of evolutionary change based on the observation that some genes and other regions of genomes appear to evolve a constant rates…

    • 4658 Words
    • 19 Pages
    Good Essays
  • Better Essays

    Sordoria Lab

    • 1569 Words
    • 7 Pages

    There are four “Evolution Canyons”, each of which consists of two mountain slopes with varying climates. Evolution canyon is a research model site, which is used for understanding microevolution and can be used to study how mutation and…

    • 1569 Words
    • 7 Pages
    Better Essays
  • Satisfactory Essays

    Revision Questions

    • 510 Words
    • 3 Pages

    3. Scientists seeking to determine which molecule is responsible for the transmission of characteristics from one generation to the next knew that the molecule must (1) copy itself precisely, (2) be stable but able to be changed, and (3) be complex enough to determine the organism’s phenotype.…

    • 510 Words
    • 3 Pages
    Satisfactory Essays
  • Powerful Essays

    Gnt1 Tay Sach's

    • 1961 Words
    • 8 Pages

    References: American Museum of Natural History. (n.d). Seminars on science; genetics, genomics, genethics. molecular biology. Retrieved on September 24th, 2012 from http://amnh.ecollege.com/ec/crs/default.learn?CourseID=4572911&CPURL=amnh.ecollege.com&Survey=1&47=13217312&ClientNodeID=910503&coursenav=0&bhcp=1…

    • 1961 Words
    • 8 Pages
    Powerful Essays
  • Good Essays

    Biology Final Review

    • 17056 Words
    • 69 Pages

    BSC2011C Final Review Unit 1 Review Ch. 25, 22, 23, 24, 26, 19, 27 Ch. 25 1. Life is metabolism and heredity. Metabolism is the mechanism that creates order and complexity from chaos, by acquiring and expending energy. Heredity is the ability of an organism to copy itself and it is broken down into: i. Multiplication, ii. Inheritance, iii. Variation. 2. DNA codes via RNA for 20 of naturally occurring amino acids. Amino Acids are the building blocks of proteins and bodies. DNA stores and transmits hereditary information, but proteins do most of the work. DNA IS THE UNIVERSAL DIGITAL CODE FOR LIFE. To replicate and synthesize proteins, DNA relies on the pre-existence of protein molecules and RNA molecules. 3. RNA is the bridge between DNA and proteins, via mRNA for transcription and rRNA for translation. Thus, RNA can survive on its own while DNA relies on the existence of RNA and proteins, with them DNA is helpless. 4. The 4 points of “first life” are: 1. The Abiotic (non-living) synthesis of small organic molecules, such as amino acids and nucleotides. 2. The joining of these small molecules into macromolecules, including proteins and nucleic acids. 3. The packing of these molecules into “protobionts,” droplets with membranes hat maintained an internal chemistry different from that of their surroundings. 4. The origin of self-replicating molecules that eventually made inheritance possible. 5. The first cells to develop occurred in this order: Monomers > Polymers > Protobionts > RNA ‘world’ > DNA protobionts > first cell. 6. Fossils are the evidence of life and evolution. Organisms trapped in sediment > remain mineralized with hard and soft parts. 7. Fossils can be dated by two methods: Radiometric dating & Magnetism. In Radiometric dating, the age is based on the decay of radioactive isotopes. A radioactive “parent” isotope decays to a “daughter” isotope at a constant rate. The rate of decay is expressed by the half-life, the time requires for 50% of the parent…

    • 17056 Words
    • 69 Pages
    Good Essays
  • Satisfactory Essays

    01.05 biology

    • 363 Words
    • 4 Pages

    -Differences and similarities in genetic codes could be used to determine how closely related different species are by comparing and contrasting the amino acids in their genetic code.…

    • 363 Words
    • 4 Pages
    Satisfactory Essays
  • Better Essays

    Pill Bug Lab

    • 2704 Words
    • 11 Pages

    Cited: Wagner, David, Theodore Taigen, Thomas Terry, and Karen Lombard. Biology 102: Foundations of Biology. Fall 2006 Stamford Edition. 129-137.…

    • 2704 Words
    • 11 Pages
    Better Essays
  • Powerful Essays

    It has often been said that living things, including humans, cannot be well-understood without looking at the evolutionary forces that have shaped them. Biological science and medicine are becoming increasingly more evolutionary as our exponentially-growing knowledge base at all levels – from DNA to the process of biological inheritance; from the biology and genetics of populations and species to the evolutionary processes that shape them; from cells to multicellular beings, and from individuals to the planetary biosphere – reveals more and more clearly how living systems work.…

    • 3773 Words
    • 16 Pages
    Powerful Essays
  • Good Essays

    This paper describes the origins of biomolecules hypothesis. Each different hypothesis is derived from a different scientist. It explains their claim and answers the question if the origin of biomolecules using their hypothesis. All the scientists provided evidence to help support their hypothesis. Some of the scientists had experiments to test their hypothesis. They also gave reasoning for supporting their theory.…

    • 1098 Words
    • 5 Pages
    Good Essays
  • Good Essays

    The basic building blocks of proteins are amino acids, the biuret reaction tests for protein. A solution of sodium hydroxide is added to a sample then a few drops of copper sulphate solution, if positive – the solution will turn mauve. There are 20 different amino acids and they can be joined in any order. Therefore there can be many different functions. A protein consists of one or more polypeptide chains (a polypeptide chain being multiple amino acids joined together via condensation, producing a peptide bond). Different proteins have different shapes as the shapes are determined by the sequence of amino acids.…

    • 1015 Words
    • 5 Pages
    Good Essays
  • Powerful Essays

    your inner fish

    • 3496 Words
    • 11 Pages

    Through the combination of molecular and fossil data, we gain a better understanding to the concept of evolution and change.…

    • 3496 Words
    • 11 Pages
    Powerful Essays