Schroeter Elena R, DeHart Caroline J, Cleland Timothy P, Zheng Wenxia, Thomas Paul M, Kelleher Neil L, Bern Marshall, Schweitzer Mary H
Department of Biological Sciences, North Carolina State University , Raleigh, North Carolina 27695, United States.
National Resource for Translational and Developmental Proteomics, Northwestern University , Evanston, Illinois 60208, United States.
J Proteome Res. 2017 Feb 3;16(2):920-932. doi: 10.1021/acs.jproteome.6b00873. Epub 2017 Jan 23.
Sequence data from biomolecules such as DNA and proteins, which provide critical information for evolutionary studies, have been assumed to be forever outside the reach of dinosaur paleontology. Proteins, which are predicted to have greater longevity than DNA, have been recovered from two nonavian dinosaurs, but these results remain controversial. For proteomic data derived from extinct Mesozoic organisms to reach their greatest potential for investigating questions of phylogeny and paleobiology, it must be shown that peptide sequences can be reliably and reproducibly obtained from fossils and that fragmentary sequences for ancient proteins can be increasingly expanded. To test the hypothesis that peptides can be repeatedly detected and validated from fossil tissues many millions of years old, we applied updated extraction methodology, high-resolution mass spectrometry, and bioinformatics analyses on a Brachylophosaurus canadensis specimen (MOR 2598) from which collagen I peptides were recovered in 2009. We recovered eight peptide sequences of collagen I: two identical to peptides recovered in 2009 and six new peptides. Phylogenetic analyses place the recovered sequences within basal archosauria. When only the new sequences are considered, B. canadensis is grouped more closely to crocodylians, but when all sequences (current and those reported in 2009) are analyzed, B. canadensis is placed more closely to basal birds. The data robustly support the hypothesis of an endogenous origin for these peptides, confirm the idea that peptides can survive in specimens tens of millions of years old, and bolster the validity of the 2009 study. Furthermore, the new data expand the coverage of B. canadensis collagen I (a 33.6% increase in collagen I alpha 1 and 116.7% in alpha 2). Finally, this study demonstrates the importance of reexamining previously studied specimens with updated methods and instrumentation, as we obtained roughly the same amount of sequence data as the previous study with substantially less sample material. Data are available via ProteomeXchange with identifier PXD005087.
来自DNA和蛋白质等生物分子的序列数据为进化研究提供了关键信息,一直以来被认为是恐龙古生物学永远无法触及的。蛋白质预计比DNA具有更长的寿命,已从两种非鸟类恐龙中提取出来,但这些结果仍存在争议。为了使源自已灭绝中生代生物的蛋白质组数据在研究系统发育和古生物学问题时发挥最大潜力,必须证明可以从化石中可靠且可重复地获得肽序列,并且古代蛋白质的片段序列能够不断扩充。为了验证能否从距今数百万年的化石组织中反复检测并验证肽段这一假设,我们对2009年从中提取出I型胶原蛋白肽段的短冠龙标本(MOR 2598)应用了更新的提取方法、高分辨率质谱和生物信息学分析。我们获得了I型胶原蛋白的八个肽序列:两个与2009年提取的肽段相同,六个是新肽段。系统发育分析将所获得的序列置于基础主龙类之中。仅考虑新序列时,短冠龙与鳄形类的亲缘关系更近,但当分析所有序列(当前的和2009年报道的)时,短冠龙与基础鸟类的亲缘关系更近。这些数据有力地支持了这些肽段内源性起源的假设,证实了肽段能够在数千万年的标本中存活的观点,并支持了2009年研究的有效性。此外,新数据扩展了短冠龙I型胶原蛋白的覆盖范围(I型胶原蛋白α1增加了33.6%,α2增加了116.7%)。最后,本研究证明了使用更新的方法和仪器重新检查先前研究的标本的重要性,因为我们用少得多的样本材料获得了与先前研究大致相同数量的序列数据。数据可通过ProteomeXchange获得,标识符为PXD005087。