• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
PCAP: a whole-genome assembly program.PCAP:一个全基因组组装程序。
Genome Res. 2003 Sep;13(9):2164-70. doi: 10.1101/gr.1390403.
2
CAP3: A DNA sequence assembly program.CAP3:一个DNA序列组装程序。
Genome Res. 1999 Sep;9(9):868-77. doi: 10.1101/gr.9.9.868.
3
Subset selection of high-depth next generation sequencing reads for de novo genome assembly using MapReduce framework.使用MapReduce框架进行从头基因组组装时对高深度下一代测序读数的子集选择。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2164-16-S12-S9. Epub 2015 Dec 9.
4
ARACHNE: a whole-genome shotgun assembler.ARACHNE:一种全基因组鸟枪法测序序列拼接程序。
Genome Res. 2002 Jan;12(1):177-89. doi: 10.1101/gr.208902.
5
Assembly of the working draft of the human genome with GigAssembler.使用GigAssembler组装人类基因组工作草图。
Genome Res. 2001 Sep;11(9):1541-8. doi: 10.1101/gr.183201.
6
LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.LR_Gapcloser:一种基于平铺路径的缺口闭合器,它使用长读长来完成基因组组装。
Gigascience. 2019 Jan 1;8(1):giy157. doi: 10.1093/gigascience/giy157.
7
Fragment assembly with short reads.使用短读段进行片段组装。
Bioinformatics. 2004 Sep 1;20(13):2067-74. doi: 10.1093/bioinformatics/bth205. Epub 2004 Apr 1.
8
Hybrid assembly of the large and highly repetitive genome of , a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.利用MaSuRCA巨读算法对面包小麦的祖先之一——[具体物种名称未给出]的大型高度重复基因组进行混合组装。
Genome Res. 2017 May;27(5):787-792. doi: 10.1101/gr.213405.116. Epub 2017 Jan 27.
9
Application of a superword array in genome assembly.超级词阵列在基因组组装中的应用。
Nucleic Acids Res. 2006 Jan 5;34(1):201-5. doi: 10.1093/nar/gkj419. Print 2006.
10
Direct comparison of performance of single nucleotide variant calling in human genome with alignment-based and assembly-based approaches.直接比较基于比对和组装的方法在人类基因组中单核苷酸变异calling 的性能。
Sci Rep. 2017 Sep 8;7(1):10963. doi: 10.1038/s41598-017-10826-9.

引用本文的文献

1
Accurate assembly of full-length consensus for viral quasispecies.病毒准种全长一致性序列的精确组装。
BMC Bioinformatics. 2025 Feb 1;26(1):36. doi: 10.1186/s12859-025-06045-z.
2
Identification and functional analyses of drought stress resistance genes by transcriptomics of the Mongolian grassland plant Chloris virgata.通过蒙古草原植物虎尾草转录组学鉴定抗旱胁迫基因并进行功能分析
BMC Plant Biol. 2025 Jan 11;25(1):44. doi: 10.1186/s12870-025-06046-3.
3
Phylomitogenomics bolsters the high-level classification of Demospongiae (phylum Porifera).系统发生基因组学支持多孔动物门(多孔动物门)的高级分类。
PLoS One. 2023 Dec 4;18(12):e0287281. doi: 10.1371/journal.pone.0287281. eCollection 2023.
4
Lightweight Pattern Matching Method for DNA Sequencing in Internet of Medical Things.物联网中 DNA 测序的轻量级模式匹配方法。
Comput Intell Neurosci. 2022 Sep 8;2022:6980335. doi: 10.1155/2022/6980335. eCollection 2022.
5
Transcriptome Analysis of , Which Shows the Fastest Germination and Growth in the Major Mongolian Grassland Plant.在主要蒙古草原植物中发芽和生长最快的[植物名称]的转录组分析 。(注:原文中“of ”后面缺少具体内容)
Front Plant Sci. 2021 Jun 28;12:684987. doi: 10.3389/fpls.2021.684987. eCollection 2021.
6
Biochemical and Structural Characterization of an Unusual and Naturally Split Class 3 Intein.一种不寻常且天然分裂的 Class 3 内含肽的生化和结构特征。
Chembiochem. 2021 Jan 15;22(2):364-373. doi: 10.1002/cbic.202000509. Epub 2020 Sep 30.
7
CSA: A high-throughput chromosome-scale assembly pipeline for vertebrate genomes.CSA:脊椎动物基因组的高通量染色体级别的组装流水线。
Gigascience. 2020 May 1;9(5). doi: 10.1093/gigascience/giaa034.
8
Small and mighty: adaptation of superphylum Patescibacteria to groundwater environment drives their genome simplicity.小巧而强大:超门巴氏菌对地下水环境的适应导致其基因组简单。
Microbiome. 2020 Apr 6;8(1):51. doi: 10.1186/s40168-020-00825-w.
9
A comparative evaluation of genome assembly reconciliation tools.基因组组装比对工具的比较评估
Genome Biol. 2017 May 18;18(1):93. doi: 10.1186/s13059-017-1213-3.
10
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation.真核游动放线菌翻译中位置依赖性终止和广泛的强制性移码
Nat Struct Mol Biol. 2017 Jan;24(1):61-68. doi: 10.1038/nsmb.3330. Epub 2016 Nov 21.

本文引用的文献

1
Whole-genome sequence assembly for mammalian genomes: Arachne 2.哺乳动物基因组的全基因组序列组装:Arachne 2
Genome Res. 2003 Jan;13(1):91-6. doi: 10.1101/gr.828403.
2
The phusion assembler.Phusion 组装器。
Genome Res. 2003 Jan;13(1):81-90. doi: 10.1101/gr.731003.
3
Initial sequencing and comparative analysis of the mouse genome.小鼠基因组的初步测序与比较分析。
Nature. 2002 Dec 5;420(6915):520-62. doi: 10.1038/nature01262.
4
Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes.红鳍东方鲀全基因组鸟枪法测序组装与基因组分析
Science. 2002 Aug 23;297(5585):1301-10. doi: 10.1126/science.1072104. Epub 2002 Jul 25.
5
RePS: a sequence assembler that masks exact repeats identified from the shotgun data.RePS:一种序列组装程序,可屏蔽从鸟枪法数据中识别出的精确重复序列。
Genome Res. 2002 May;12(5):824-31. doi: 10.1101/gr.165102.
6
ARACHNE: a whole-genome shotgun assembler.ARACHNE:一种全基因组鸟枪法测序序列拼接程序。
Genome Res. 2002 Jan;12(1):177-89. doi: 10.1101/gr.208902.
7
An Eulerian path approach to DNA fragment assembly.一种用于DNA片段组装的欧拉路径方法。
Proc Natl Acad Sci U S A. 2001 Aug 14;98(17):9748-53. doi: 10.1073/pnas.171285098.
8
An optimized protocol for analysis of EST sequences.一种用于表达序列标签(EST)序列分析的优化方案。
Nucleic Acids Res. 2000 Sep 15;28(18):3657-65. doi: 10.1093/nar/28.18.3657.
9
A case study in genome-level fragment assembly.
Bioinformatics. 2000 Jun;16(6):494-500. doi: 10.1093/bioinformatics/16.6.494.
10
A whole-genome assembly of Drosophila.果蝇的全基因组组装
Science. 2000 Mar 24;287(5461):2196-204. doi: 10.1126/science.287.5461.2196.

PCAP:一个全基因组组装程序。

PCAP: a whole-genome assembly program.

作者信息

Huang Xiaoqiu, Wang Jianmin, Aluru Srinivas, Yang Shiaw-Pyng, Hillier LaDeana

机构信息

Department of Computer Science Iowa State University, Ames, Iowa 50011-1040, USA.

出版信息

Genome Res. 2003 Sep;13(9):2164-70. doi: 10.1101/gr.1390403.

DOI:10.1101/gr.1390403
PMID:12952883
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC403719/
Abstract

We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are detected on the basis of many overlaps with other reads, instead of many shorter word matches with other reads. Contaminated end regions of reads are identified and removed. Generation of a consensus sequence for a contig is based on an alignment of reads in the contig, in which both base quality values and coverage information are used to determine every consensus base. The PCAP program was tested on a mouse whole-genome data set of 30 million reads and a human Chromosome 20 data set of 1.7 million reads. The program is freely available for academic use.

摘要

我们描述了一个名为PCAP的全基因组组装程序,用于处理数千万条 reads。PCAP程序具有多个特性来解决组装中的效率和准确性问题。多个处理器用于执行组装中最耗时的计算。采用了一种更灵敏的方法来避免因测序错误导致的重叠缺失。基于与其他reads的许多重叠来检测reads的重复区域,而不是基于与其他reads的许多较短词匹配。识别并去除reads的污染末端区域。重叠群一致序列的生成基于重叠群中reads的比对,其中碱基质量值和覆盖信息都用于确定每个一致碱基。PCAP程序在一个包含3000万条reads的小鼠全基因组数据集和一个包含170万条reads的人类20号染色体数据集上进行了测试。该程序可供学术使用,免费获取。