• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

靶向组装短序列读段。

Targeted assembly of short sequence reads.

机构信息

Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada.

出版信息

PLoS One. 2011 May 11;6(5):e19816. doi: 10.1371/journal.pone.0019816.

DOI:10.1371/journal.pone.0019816
PMID:21589938
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3092772/
Abstract

As next-generation sequence (NGS) production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.

摘要

随着下一代测序(NGS)的产量不断增加,分析已成为一个重大瓶颈。然而,在仅需要特定序列变异信息的情况下,没有必要对整个基因组数据集进行组装或对齐。相反,可以通过局部组装来挖掘 NGS 数据集以查找感兴趣的序列变异,这是一种更快、更容易、更准确的方法。我们提出了 TASR,这是一种精简的组装程序,通过仅考虑用户提供的输入目标序列的序列空间内的读取,来检测非常大的 NGS 数据集是否存在特定变体。在 NGS 数据集中搜索与目标序列内所有可能的短字完全匹配的读取,然后严格组装这些读取,以生成目标和侧翼序列的共识。通常,特定位置的变体作为不同的目标序列提供,并且通过成功的组装结果揭示了正在检测的数据集中变体的存在。然而,TASR 也可用于查找给定目标侧翼的未知序列。我们证明 TASR 在发现或确认基因组突变、多态性、融合和整合事件方面具有实用性。靶向组装是一种用于检测大型数据集是否存在感兴趣的序列变异的强大方法。TASR 是一种快速、灵活且易于使用的靶向组装工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e18/3092772/9a0ff6d4a002/pone.0019816.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e18/3092772/23549f7494f4/pone.0019816.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e18/3092772/9a0ff6d4a002/pone.0019816.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e18/3092772/23549f7494f4/pone.0019816.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7e18/3092772/9a0ff6d4a002/pone.0019816.g002.jpg

相似文献

1
Targeted assembly of short sequence reads.靶向组装短序列读段。
PLoS One. 2011 May 11;6(5):e19816. doi: 10.1371/journal.pone.0019816.
2
RepAHR: an improved approach for de novo repeat identification by assembly of the high-frequency reads.RepAHR:通过组装高频读段进行从头鉴定重复序列的改进方法。
BMC Bioinformatics. 2020 Oct 19;21(1):463. doi: 10.1186/s12859-020-03779-w.
3
Mapsembler, targeted and micro assembly of large NGS datasets on a desktop computer.Mapsembler,在台式计算机上对大型 NGS 数据集进行靶向和微组装。
BMC Bioinformatics. 2012 Mar 23;13:48. doi: 10.1186/1471-2105-13-48.
4
Bioinformatics Basics for High-Throughput Hybridization-Based Targeted DNA Sequencing from FFPE-Derived Tumor Specimens: From Reads to Variants.基于FFPE来源肿瘤标本的高通量杂交靶向DNA测序的生物信息学基础:从 reads 到变异体
Methods Mol Biol. 2019;1908:37-48. doi: 10.1007/978-1-4939-9004-7_3.
5
Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index.通过将已知遗传变异纳入 minimap2 索引来提高全基因组测序数据中 SNV 的识别能力。
BMC Bioinformatics. 2024 Jul 13;25(1):238. doi: 10.1186/s12859-024-05862-y.
6
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
7
Graph mining for next generation sequencing: leveraging the assembly graph for biological insights.用于下一代测序的图挖掘:利用组装图获取生物学见解。
BMC Genomics. 2016 May 6;17:340. doi: 10.1186/s12864-016-2678-2.
8
Fast and SNP-aware short read alignment with SALT.基于 SALT 的快速 SNP 感知短读序列比对。
BMC Bioinformatics. 2021 Aug 25;22(Suppl 9):172. doi: 10.1186/s12859-021-04088-6.
9
Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads.快速而简单的基于蛋白质比对的微生物组测序读段中直系同源基因家族组装方法。
Microbiome. 2017 Jan 25;5(1):11. doi: 10.1186/s40168-017-0233-2.
10
De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline.使用原位生成的参考序列和基于新型BLAST的组装流程对高度多态的宏基因组数据进行从头组装。
BMC Bioinformatics. 2017 Apr 26;18(1):223. doi: 10.1186/s12859-017-1630-z.

引用本文的文献

1
Immune response stability to the SARS-CoV-2 mRNA vaccine booster is influenced by differential splicing of HLA genes.免疫应答对 SARS-CoV-2 mRNA 疫苗加强针的稳定性受 HLA 基因差异剪接的影响。
Sci Rep. 2024 Apr 18;14(1):8982. doi: 10.1038/s41598-024-59259-1.
2
MTG-Link: leveraging barcode information from linked-reads to assemble specific loci.MTG-Link:利用来自链接读取的条形码信息来组装特定的基因座。
BMC Bioinformatics. 2023 Jul 14;24(1):284. doi: 10.1186/s12859-023-05395-w.
3
Differential haplotype expression in class I MHC genes during SARS-CoV-2 infection of human lung cell lines.

本文引用的文献

1
Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples.深度 RNA 测序分析人前列腺腺癌和对照样本中的通读基因融合。
BMC Med Genomics. 2011 Jan 24;4:11. doi: 10.1186/1755-8794-4-11.
2
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
3
The case for cloud computing in genome informatics.云计算在基因组信息学中的应用。
在 SARS-CoV-2 感染人肺细胞系期间,I 类 MHC 基因中的差异单倍型表达。
Front Immunol. 2023 Feb 1;13:1101526. doi: 10.3389/fimmu.2022.1101526. eCollection 2022.
4
Host genetic susceptibility underlying SARS-CoV-2-associated Multisystem Inflammatory Syndrome in Brazilian Children.巴西儿童 SARS-CoV-2 相关多系统炎症综合征的宿主遗传易感性。
Mol Med. 2022 Dec 12;28(1):153. doi: 10.1186/s10020-022-00583-5.
5
slag: A program for seeded local assembly of genes in complex genomes.Slag:一个用于复杂基因组中基因种子局部组装的程序。
Mol Ecol Resour. 2022 Jul;22(5):1999-2017. doi: 10.1111/1755-0998.13580. Epub 2022 Jan 27.
6
Exome-Wide Search for Genes Associated With Central Nervous System Inflammatory Demyelinating Diseases Following CHIKV Infection: The Tip of the Iceberg.对基孔肯雅病毒感染后中枢神经系统炎性脱髓鞘疾病相关基因进行全外显子组搜索:冰山一角
Front Genet. 2021 Mar 17;12:639364. doi: 10.3389/fgene.2021.639364. eCollection 2021.
7
HAPHPIPE: Haplotype Reconstruction and Phylodynamics for Deep Sequencing of Intrahost Viral Populations.HAPHPIPE:用于深度测序宿主内病毒群体的单倍型重建和系统发生分析。
Mol Biol Evol. 2021 Apr 13;38(4):1677-1690. doi: 10.1093/molbev/msaa315.
8
Mismatch-tolerant, alignment-free sequence classification using multiple spaced seeds and multiindex Bloom filters.使用多间隔种子和多索引布隆过滤器进行容错、无比对的序列分类。
Proc Natl Acad Sci U S A. 2020 Jul 21;117(29):16961-16968. doi: 10.1073/pnas.1903436117. Epub 2020 Jul 8.
9
Is Repressed by Transgene-Induced DNA Methylation in the Arabidopsis Mutant.在拟南芥突变体中,被转座子诱导的 DNA 甲基化所抑制。
Plant Cell. 2020 Apr;32(4):1035-1048. doi: 10.1105/tpc.19.00532. Epub 2020 Feb 7.
10
Assexon: Assembling Exon Using Gene Capture Data.Assexon:利用基因捕获数据组装外显子
Evol Bioinform Online. 2019 Sep 6;15:1176934319874792. doi: 10.1177/1176934319874792. eCollection 2019.
Genome Biol. 2010;11(5):207. doi: 10.1186/gb-2010-11-5-207. Epub 2010 May 5.
4
Ancient human genome sequence of an extinct Palaeo-Eskimo.已灭绝的古爱斯基摩人古人类基因组序列。
Nature. 2010 Feb 11;463(7282):757-62. doi: 10.1038/nature08835.
5
SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors.SNVMix:从肿瘤的下一代测序中预测单核苷酸变异。
Bioinformatics. 2010 Mar 15;26(6):730-6. doi: 10.1093/bioinformatics/btq040. Epub 2010 Feb 3.
6
De novo assembly of human genomes with massively parallel short read sequencing.利用大规模平行短读测序进行人类基因组从头组装。
Genome Res. 2010 Feb;20(2):265-72. doi: 10.1101/gr.097261.109. Epub 2009 Dec 17.
7
Building the sequence map of the human pan-genome.构建人类泛基因组的序列图谱。
Nat Biotechnol. 2010 Jan;28(1):57-63. doi: 10.1038/nbt.1596. Epub 2009 Dec 7.
8
Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution.在单核苷酸分辨率下分析的小叶型乳腺肿瘤中的突变进化。
Nature. 2009 Oct 8;461(7265):809-13. doi: 10.1038/nature08489.
9
Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing.通过大规模平行测序分析T细胞受体β链库
Genome Res. 2009 Oct;19(10):1817-24. doi: 10.1101/gr.092924.109. Epub 2009 Jun 18.
10
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.