• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质组学作为一种计量工具,用于评估从头组装基因组后基因组注释的准确性:以大西洋宽吻海豚()为例的研究

Proteomics as a Metrological Tool to Evaluate Genome Annotation Accuracy Following De Novo Genome Assembly: A Case Study Using the Atlantic Bottlenose Dolphin ().

机构信息

National Institute of Standards and Technology, NIST Charleston, 331 Fort Johnson Road, Charleston, SC 29412, USA.

出版信息

Genes (Basel). 2023 Aug 25;14(9):1696. doi: 10.3390/genes14091696.

DOI:10.3390/genes14091696
PMID:37761836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10531373/
Abstract

The last decade has witnessed dramatic improvements in whole-genome sequencing capabilities coupled to drastically decreased costs, leading to an inundation of high-quality de novo genomes. For this reason, the continued development of genome quality metrics is imperative. Using the 2016 Atlantic bottlenose dolphin NCBI RefSeq annotation and mass spectrometry-based proteomic analysis of six tissues, we confirmed 10,402 proteins from 4711 protein groups, constituting nearly one-third of the possible predicted proteins. Since the identification of larger proteins with more identified peptides implies reduced database fragmentation and improved gene annotation accuracy, we propose the metric NP, which attempts to capture this quality improvement. The NP metric is calculated by first stratifying proteomic results by identifying the top decile (or 10th 10-quantile) of identified proteins based on the number of peptides per protein and then returns the median molecular weight of the resulting proteins. When using the 2016 versus 2012 genome annotation to search this proteomic data set, there was a 21% improvement in NP. This metric was further demonstrated by using a publicly available proteomic data set to compare human genome annotations from 2004, 2013 and 2016, which showed a 33% improvement in NP. These results demonstrate that proteomics may be a useful metrological tool to benchmark genome accuracy, though there is a need for reference proteomic datasets across species to facilitate the evaluation of new de novo and existing genome.

摘要

在过去的十年中,全基因组测序能力得到了显著提高,成本也大幅降低,导致高质量的从头基因组数量激增。因此,继续开发基因组质量指标势在必行。我们使用了 2016 年大西洋宽吻海豚 NCBI RefSeq 注释和基于质谱的六种组织的蛋白质组学分析,从 4711 个蛋白质组中鉴定了 10402 个蛋白质,构成了近三分之一可能预测的蛋白质。由于具有更多鉴定肽的较大蛋白质的鉴定意味着减少了数据库碎片化和提高了基因注释的准确性,因此我们提出了 NP 度量标准,该度量标准试图捕获这种质量改进。NP 度量标准是通过首先根据每个蛋白质的肽数对蛋白质组学结果进行分层,确定鉴定蛋白的前十分位数(或第 10 个十分位数),然后返回所得蛋白质的中值分子量来计算的。当使用 2016 年与 2012 年的基因组注释来搜索这个蛋白质组数据集时,NP 提高了 21%。通过使用公共蛋白质组数据集来比较 2004 年、2013 年和 2016 年的人类基因组注释,进一步证明了这一度量标准,NP 提高了 33%。这些结果表明,蛋白质组学可能是一种有用的计量工具,可以用于基准基因组的准确性,尽管需要跨物种的参考蛋白质组数据集来促进对新的从头和现有的基因组的评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/5f600c93ff4e/genes-14-01696-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/a46a393ab68a/genes-14-01696-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/50fe6e82215e/genes-14-01696-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/e6871b86a145/genes-14-01696-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/303ff1e0f78f/genes-14-01696-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/5fab29637c00/genes-14-01696-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/70406bffe1a5/genes-14-01696-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/5f600c93ff4e/genes-14-01696-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/a46a393ab68a/genes-14-01696-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/50fe6e82215e/genes-14-01696-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/e6871b86a145/genes-14-01696-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/303ff1e0f78f/genes-14-01696-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/5fab29637c00/genes-14-01696-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/70406bffe1a5/genes-14-01696-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349f/10531373/5f600c93ff4e/genes-14-01696-g007.jpg

相似文献

1
Proteomics as a Metrological Tool to Evaluate Genome Annotation Accuracy Following De Novo Genome Assembly: A Case Study Using the Atlantic Bottlenose Dolphin ().蛋白质组学作为一种计量工具,用于评估从头组装基因组后基因组注释的准确性:以大西洋宽吻海豚()为例的研究
Genes (Basel). 2023 Aug 25;14(9):1696. doi: 10.3390/genes14091696.
2
New de novo assembly of the Atlantic bottlenose dolphin (Tursiops truncatus) improves genome completeness and provides haplotype phasing.新的大西洋宽吻海豚(Tursiops truncatus)从头组装提高了基因组的完整性,并提供了单倍型相位信息。
Gigascience. 2019 Mar 1;8(3). doi: 10.1093/gigascience/giy168.
3
Establishment of epidermal cell lines derived from the skin of the Atlantic bottlenose dolphin (Tursiops truncatus).源自大西洋宽吻海豚(Tursiops truncatus)皮肤的表皮细胞系的建立。
Anat Rec A Discov Mol Cell Evol Biol. 2005 Dec;287(2):1246-55. doi: 10.1002/ar.a.20266.
4
Labeled quantitative mass spectrometry to study the host response during aspergillosis in the common bottlenose dolphin (Tursiops truncatus).基于标记定量质谱技术研究宽吻海豚(Tursiops truncatus)曲霉病感染过程中的宿主反应。
Vet Microbiol. 2019 May;232:42-49. doi: 10.1016/j.vetmic.2019.03.030. Epub 2019 Mar 30.
5
Microarray applications to understand the impact of exposure to environmental contaminants in wild dolphins (Tursiops truncatus).利用微阵列技术了解野生宽吻海豚(瓶鼻海豚)暴露于环境污染物中的影响。
Mar Genomics. 2015 Feb;19:47-57. doi: 10.1016/j.margen.2014.11.002. Epub 2014 Nov 17.
6
Characterization of circulating steroid hormone profiles in the bottlenose dolphin (Tursiops truncatus) by liquid chromatography-tandem mass spectrometry (LC-MS/MS).通过液相色谱-串联质谱法(LC-MS/MS)对宽吻海豚(Tursiops truncatus)循环类固醇激素谱进行表征。
Gen Comp Endocrinol. 2018 Jul 1;263:80-91. doi: 10.1016/j.ygcen.2018.04.003. Epub 2018 Apr 6.
7
Mutations and polymorphism in bottlenose dolphin (Tursiops truncatus, Montagu 1821) albumin gene: First identification of mutations responsible for inherited bisalbuminemia.宽吻海豚(Tursiops truncatus,蒙塔古,1821年)白蛋白基因的突变与多态性:首次鉴定出导致遗传性双白蛋白血症的突变
Res Vet Sci. 2017 Oct;114:12-17. doi: 10.1016/j.rvsc.2017.02.018. Epub 2017 Feb 24.
8
RNA-Seq analysis of seasonal and individual variation in blood transcriptomes of healthy managed bottlenose dolphins.健康圈养宽吻海豚血液转录组中季节性和个体差异的RNA测序分析
BMC Genomics. 2016 Sep 8;17(1):720. doi: 10.1186/s12864-016-3020-8.
9
Occurrence, residency patterns and habitat use of the bottlenose dolphin, Tursiops truncatus truncatus, on two Marine Protected Areas in Southeastern Brazil.巴西东南部两个海洋保护区内宽吻海豚(Tursiops truncatus truncatus)的出现情况、栖息模式及栖息地利用
An Acad Bras Cienc. 2020;92(2):e20180843. doi: 10.1590/0001-3765202020180843. Epub 2020 Aug 12.
10
Predicted microplastic uptake through trophic transfer by the short-beaked common dolphin (Delphinus delphis) and common bottlenose dolphin (Tursiops truncatus) in the Northeast Atlantic Ocean and Mediterranean Sea.预测东北大西洋和地中海短吻海豚(Delphinus delphis)和宽吻海豚(Tursiops truncatus)通过食物链转移摄入微塑料的情况。
Mar Pollut Bull. 2022 Jul;180:113745. doi: 10.1016/j.marpolbul.2022.113745. Epub 2022 May 30.

引用本文的文献

1
Proteomics in Non-model Organisms: A New Analytical Frontier.非模式生物蛋白质组学:一个新的分析前沿领域。
J Proteome Res. 2020 Sep 4;19(9):3595-3606. doi: 10.1021/acs.jproteome.0c00448. Epub 2020 Aug 20.
2
New de novo assembly of the Atlantic bottlenose dolphin (Tursiops truncatus) improves genome completeness and provides haplotype phasing.新的大西洋宽吻海豚(Tursiops truncatus)从头组装提高了基因组的完整性,并提供了单倍型相位信息。
Gigascience. 2019 Mar 1;8(3). doi: 10.1093/gigascience/giy168.

本文引用的文献

1
Global detection of human variants and isoforms by deep proteome sequencing.通过深度蛋白质组测序进行人类变体和异构体的全球检测。
Nat Biotechnol. 2023 Dec;41(12):1776-1786. doi: 10.1038/s41587-023-01714-x. Epub 2023 Mar 23.
2
The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences.PRIDE 数据库资源在 2022 年:一个基于质谱的蛋白质组学证据的中心。
Nucleic Acids Res. 2022 Jan 7;50(D1):D543-D552. doi: 10.1093/nar/gkab1038.
3
BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.
BUSCO 更新:用于真核生物、原核生物和病毒基因组评分的新颖且简化的工作流程以及更广泛和更深的系统发育覆盖范围。
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654. doi: 10.1093/molbev/msab199.
4
OrthoDB in 2020: evolutionary and functional annotations of orthologs.2020 年的 OrthoDB:直系同源物的进化和功能注释。
Nucleic Acids Res. 2021 Jan 8;49(D1):D389-D393. doi: 10.1093/nar/gkaa1009.
5
Proteomics in Non-model Organisms: A New Analytical Frontier.非模式生物蛋白质组学:一个新的分析前沿领域。
J Proteome Res. 2020 Sep 4;19(9):3595-3606. doi: 10.1021/acs.jproteome.0c00448. Epub 2020 Aug 20.
6
Assessing Protein Sequence Database Suitability Using Sequencing.利用测序评估蛋白质序列数据库的适用性
Mol Cell Proteomics. 2020 Jan;19(1):198-208. doi: 10.1074/mcp.TIR119.001752. Epub 2019 Nov 15.
7
BUSCO: Assessing Genome Assembly and Annotation Completeness.BUSCO:评估基因组组装和注释的完整性
Methods Mol Biol. 2019;1962:227-245. doi: 10.1007/978-1-4939-9173-0_14.
8
Reverse and Random Decoy Methods for False Discovery Rate Estimation in High Mass Accuracy Peptide Spectral Library Searches.反转和随机诱饵方法在高质量精度肽谱库搜索中的假发现率估计。
J Proteome Res. 2018 Feb 2;17(2):846-857. doi: 10.1021/acs.jproteome.7b00614. Epub 2018 Jan 11.
9
The Genome of the Beluga Whale (Delphinapterus leucas).白鲸(白鲸属)的基因组。
Genes (Basel). 2017 Dec 11;8(12):378. doi: 10.3390/genes8120378.
10
BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics.BUSCO的应用:从质量评估到基因预测和系统发育基因组学
Mol Biol Evol. 2018 Mar 1;35(3):543-548. doi: 10.1093/molbev/msx319.