Draft versus finished sequence data for DNA and protein diagnostic signature development.

Authors

Gardner Shea N, Lam Marisa W, Smith Jason R, Torres Clinton L, Slezak Tom R

Affiliation

Pathogen Bio-Informatics, Lawrence Livermore National Laboratory, PO Box 808, L-174, Livermore, CA 94551, USA.

Publication information

Nucleic Acids Res. 2005 Oct 20;33(18):5838-50. doi: 10.1093/nar/gki896. Print 2005.

DOI:10.1093/nar/gki896

PMID:16243783

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1266063/

Abstract

Sequencing pathogen genomes is costly, demanding careful allocation of limited sequencing resources. We built a computational Sequencing Analysis Pipeline (SAP) to guide decisions regarding the amount of genomic sequencing necessary to develop high-quality diagnostic DNA and protein signatures. SAP uses simulations to estimate the number of target genomes and close phylogenetic relatives (near neighbors, or NNs) to sequence. Using Marburg and variola virus sequences, we apply SAP to assess whether draft data are sufficient or finished sequencing is required. Simulations indicate that intermediate to high-quality draft with error rates of 10^-3 to 10^-5 (approximately 8x coverage) of target organisms is suitable for DNA signature prediction. Low-quality draft with error rates of approximately 1% (3x to 6x coverage) of target isolates is inadequate for DNA signature prediction, although low-quality draft of NNs is sufficient, as long as the target genomes are of high quality. For protein signature prediction, sequencing errors in target genomes substantially reduce the detection of amino acid sequence conservation, even if the draft is of high quality. In summary, high-quality draft of target and low-quality draft of NNs appears to be a cost-effective investment for DNA signature prediction, but may lead to underestimation of predicted protein signatures.
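
To give a rough feel for why the draft error rates quoted in the abstract matter, the following is a minimal sketch and not the SAP method itself. It assumes sequencing errors are independent per base and asks how likely it is that a signature-sized window is error-free in every target genome. The function name, the 20-base window, and the 10-genome count are hypothetical values chosen only for illustration.

```python
def prob_window_error_free(error_rate: float, window_len: int, n_genomes: int = 1) -> float:
    """Probability that a window of `window_len` bases contains no sequencing
    error in any of `n_genomes` sequences.

    Assumption (not from the paper): errors are independent per base and
    occur at the same rate in every genome.
    """
    if not 0.0 <= error_rate <= 1.0:
        raise ValueError("error_rate must be between 0 and 1")
    if window_len < 0 or n_genomes < 0:
        raise ValueError("window_len and n_genomes must be non-negative")
    # Every base in every genome must be read correctly.
    return (1.0 - error_rate) ** (window_len * n_genomes)


if __name__ == "__main__":
    # Error rates taken from the abstract: ~1% (low-quality draft) and
    # 10^-3 to 10^-5 (intermediate to high-quality draft).
    # Window length and genome count are hypothetical.
    for rate in (1e-2, 1e-3, 1e-5):
        p = prob_window_error_free(rate, window_len=20, n_genomes=10)
        print(f"error rate {rate:g}: P(window error-free in all 10 genomes) = {p:.3f}")
```

Under these simplifying assumptions, the chance that a candidate window survives intact across all targets falls sharply as the per-base error rate approaches 1%, which is consistent in direction with the abstract's conclusion that low-quality draft of target genomes is inadequate for DNA signature prediction.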

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1203/1266063/ed5350c70551/gki896f1.jpg
