• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

准确测序能够形成替代(非 B)结构的 DNA 基序。

Accurate sequencing of DNA motifs able to form alternative (non-B) structures.

机构信息

Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.

Department of Operations and Decision Systems, Université Laval, Quebec, Quebec G1V0A6, Canada.

出版信息

Genome Res. 2023 Jun;33(6):907-922. doi: 10.1101/gr.277490.122. Epub 2023 Jul 11.

DOI:10.1101/gr.277490.122
PMID:37433640
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10519405/
Abstract

Approximately 13% of the human genome at certain motifs have the potential to form noncanonical (non-B) DNA structures (e.g., G-quadruplexes, cruciforms, and Z-DNA), which regulate many cellular processes but also affect the activity of polymerases and helicases. Because sequencing technologies use these enzymes, they might possess increased errors at non-B structures. To evaluate this, we analyzed error rates, read depth, and base quality of Illumina, Pacific Biosciences (PacBio) HiFi, and Oxford Nanopore Technologies (ONT) sequencing at non-B motifs. All technologies showed altered sequencing success for most non-B motif types, although this could be owing to several factors, including structure formation, biased GC content, and the presence of homopolymers. Single-nucleotide mismatch errors had low biases in HiFi and ONT for all non-B motif types but were increased for G-quadruplexes and Z-DNA in all three technologies. Deletion errors were increased for all non-B types but Z-DNA in Illumina and HiFi, as well as only for G-quadruplexes in ONT. Insertion errors for non-B motifs were highly, moderately, and slightly elevated in Illumina, HiFi, and ONT, respectively. Additionally, we developed a probabilistic approach to determine the number of false positives at non-B motifs depending on sample size and variant frequency, and applied it to publicly available data sets (1000 Genomes, Simons Genome Diversity Project, and gnomAD). We conclude that elevated sequencing errors at non-B DNA motifs should be considered in low-read-depth studies (single-cell, ancient DNA, and pooled-sample population sequencing) and in scoring rare variants. Combining technologies should maximize sequencing accuracy in future studies of non-B DNA.

摘要

大约 13%的人类基因组在特定的基序中具有形成非规范(非 B)DNA 结构的潜力(例如 G-四联体、十字结构和 Z-DNA),这些结构调节许多细胞过程,但也影响聚合酶和解旋酶的活性。由于测序技术使用这些酶,它们在非 B 结构中可能会有更高的错误率。为了评估这一点,我们分析了 Illumina、Pacific Biosciences(PacBio)HiFi 和 Oxford Nanopore Technologies(ONT)在非 B 基序上的测序错误率、读深和碱基质量。尽管这可能是由于结构形成、GC 含量偏倚和存在同源多聚体等多种因素造成的,但所有技术在大多数非 B 基序类型上的测序成功率都发生了改变。在所有非 B 基序类型中,HiFi 和 ONT 的单核苷酸错配错误的偏倚较低,但在所有三种技术中,G-四联体和 Z-DNA 的错误率都增加了。在所有非 B 类型中,缺失错误都增加了,但在 Illumina 和 HiFi 中,除了 Z-DNA 外,在 ONT 中也是如此。在 Illumina、HiFi 和 ONT 中,非 B 基序的插入错误分别高度、中度和轻度增加。此外,我们开发了一种概率方法来根据样本大小和变异频率确定非 B 基序中假阳性的数量,并将其应用于公开可用的数据集(1000 基因组、西蒙斯基因组多样性计划和 gnomAD)。我们得出的结论是,在低读深研究(单细胞、古 DNA 和混合样本群体测序)和稀有变异评分中,应该考虑非 B DNA 基序中测序错误的增加。在未来非 B DNA 的研究中,结合多种技术应该可以最大限度地提高测序准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/55d32ab829e4/907f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/00c887bc355f/907f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/f2a35c78addc/907f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/d9f7e3c40551/907f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/fad9fe92b61c/907f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/f1a4bda7116c/907f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/55d32ab829e4/907f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/00c887bc355f/907f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/f2a35c78addc/907f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/d9f7e3c40551/907f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/fad9fe92b61c/907f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/f1a4bda7116c/907f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a9f/10519405/55d32ab829e4/907f06.jpg

相似文献

1
Accurate sequencing of DNA motifs able to form alternative (non-B) structures.准确测序能够形成替代(非 B)结构的 DNA 基序。
Genome Res. 2023 Jun;33(6):907-922. doi: 10.1101/gr.277490.122. Epub 2023 Jul 11.
2
Comparison of the two up-to-date sequencing technologies for genome assembly: HiFi reads of Pacific Biosciences Sequel II system and ultralong reads of Oxford Nanopore.比较两种最新的基因组组装测序技术:太平洋生物科学测序仪二代系统的 HiFi 读取和牛津纳米孔的超长读取。
Gigascience. 2020 Dec 15;9(12). doi: 10.1093/gigascience/giaa123.
3
Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes.比较长读测序技术在细菌和果蝇基因组分析中的应用。
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab083.
4
Comparison of Illumina and Oxford Nanopore Technology for genome analysis of Francisella tularensis, Bacillus anthracis, and Brucella suis.Illumina 与 Oxford Nanopore 技术在分析土拉弗朗西斯菌、炭疽芽孢杆菌和猪布鲁氏菌基因组中的比较。
BMC Genomics. 2023 May 12;24(1):258. doi: 10.1186/s12864-023-09343-z.
5
Comparing assembly strategies for third-generation sequencing technologies across different genomes.比较不同基因组中第三代测序技术的组装策略。
Genomics. 2023 Sep;115(5):110700. doi: 10.1016/j.ygeno.2023.110700. Epub 2023 Aug 18.
6
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing.利用 MinION 长读测序技术评估组装多种细菌基因组的策略。
BMC Genomics. 2019 Jan 9;20(1):23. doi: 10.1186/s12864-018-5381-7.
7
Sequencing of individual barcoded cDNAs using Pacific Biosciences and Oxford Nanopore Technologies reveals platform-specific error patterns.使用 Pacific Biosciences 和 Oxford Nanopore Technologies 对个体条形码 cDNA 进行测序可揭示特定于平台的错误模式。
Genome Res. 2022 Apr;32(4):726-737. doi: 10.1101/gr.276405.121. Epub 2022 Mar 17.
8
Adaptation of Oxford Nanopore technology for hepatitis C whole genome sequencing and identification of within-host viral variants.牛津纳米孔技术在丙型肝炎全基因组测序及宿主内病毒变异体鉴定中的应用。
BMC Genomics. 2021 Mar 2;22(1):148. doi: 10.1186/s12864-021-07460-1.
9
Comparison of Illumina versus Nanopore 16S rRNA Gene Sequencing of the Human Nasal Microbiota.Illumina 与 Nanopore 16S rRNA 基因测序技术在人类鼻腔微生物组中的比较。
Genes (Basel). 2020 Sep 21;11(9):1105. doi: 10.3390/genes11091105.
10
Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA.利用 R2C2 方法提高纳米孔读取准确性,使高度多重化全长单细胞 cDNA 的测序成为可能。
Proc Natl Acad Sci U S A. 2018 Sep 25;115(39):9726-9731. doi: 10.1073/pnas.1806447115. Epub 2018 Sep 10.

引用本文的文献

1
Genetic analysis using long-read sequencing to overcome the difficulties in gene.使用长读长测序进行基因分析以克服基因研究中的困难。
Res Pract Thromb Haemost. 2025 May 17;9(4):102888. doi: 10.1016/j.rpth.2025.102888. eCollection 2025 May.
2
Centromere-size reduction and chromatin state dynamics following intergenomic hybridization in cotton.棉花基因组间杂交后着丝粒大小减小与染色质状态动态变化
PLoS Genet. 2025 May 2;21(5):e1011689. doi: 10.1371/journal.pgen.1011689. eCollection 2025 May.
3
Non-canonical DNA in human and other ape telomere-to-telomere genomes.

本文引用的文献

1
Benchmarking challenging small variants with linked and long reads.使用连锁读段和长读段对具有挑战性的小变异进行基准测试。
Cell Genom. 2022 May;2(5). doi: 10.1016/j.xgen.2022.100128.
2
Variation in G-quadruplex sequence and topology differentially impacts human DNA polymerase fidelity.G-四链体序列和拓扑结构的变异差异影响人类 DNA 聚合酶的保真度。
DNA Repair (Amst). 2022 Nov;119:103402. doi: 10.1016/j.dnarep.2022.103402. Epub 2022 Sep 9.
3
Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data.
人类及其他猿类端粒到端粒基因组中的非规范DNA。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf298.
4
Complete sequencing of ape genomes.猿类基因组的完整测序。
Nature. 2025 May;641(8062):401-418. doi: 10.1038/s41586-025-08816-3. Epub 2025 Apr 9.
5
Single-Molecule Nanopore Sequencing of the CpG Island from the Promoter of O6-Methylguanine-DNA Methyltransferase Provides Insights into the Mechanism of De Novo Methylation of G/C-Rich Regions.对O6-甲基鸟嘌呤-DNA甲基转移酶启动子区域CpG岛进行单分子纳米孔测序,为富含G/C区域的从头甲基化机制提供了见解。
Epigenomes. 2025 Jan 26;9(1):4. doi: 10.3390/epigenomes9010004.
6
Non-canonical DNA in human and other ape telomere-to-telomere genomes.人类及其他猿类端粒到端粒基因组中的非规范DNA。
bioRxiv. 2025 Mar 8:2024.09.02.610891. doi: 10.1101/2024.09.02.610891.
7
Comparative analysis of predicted DNA secondary structures infers complex human centromere topology.对预测的DNA二级结构进行比较分析可推断出复杂的人类着丝粒拓扑结构。
Am J Hum Genet. 2024 Dec 5;111(12):2707-2719. doi: 10.1016/j.ajhg.2024.10.016. Epub 2024 Nov 18.
8
Complete sequencing of ape genomes.猿类基因组的完整测序。
bioRxiv. 2024 Oct 5:2024.07.31.605654. doi: 10.1101/2024.07.31.605654.
9
The complete sequence and comparative analysis of ape sex chromosomes.猿类性染色体的完整序列与比较分析。
Nature. 2024 Jun;630(8016):401-411. doi: 10.1038/s41586-024-07473-2. Epub 2024 May 29.
10
Special Issue "Bioinformatics of Unusual DNA and RNA Structures".特刊:非寻常 DNA 和 RNA 结构的生物信息学
Int J Mol Sci. 2024 May 10;25(10):5226. doi: 10.3390/ijms25105226.
评估全基因组序列数据中稀有变异对复杂性状遗传度的贡献。
Nat Genet. 2022 Mar;54(3):263-273. doi: 10.1038/s41588-021-00997-7. Epub 2022 Mar 7.
4
Sequencing error profiles of Illumina sequencing instruments.Illumina测序仪的测序错误图谱。
NAR Genom Bioinform. 2021 Mar 27;3(1):lqab019. doi: 10.1093/nargab/lqab019. eCollection 2021 Mar.
5
High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing.使用独特分子标识符结合纳米孔或PacBio测序的高精度长读长扩增子序列。
Nat Methods. 2021 Feb;18(2):165-169. doi: 10.1038/s41592-020-01041-y. Epub 2021 Jan 11.
6
Long-read human genome sequencing and its applications.长读长基因组测序及其应用。
Nat Rev Genet. 2020 Oct;21(10):597-614. doi: 10.1038/s41576-020-0236-x. Epub 2020 Jun 5.
7
The mutational constraint spectrum quantified from variation in 141,456 humans.从 141456 名人类个体的变异中量化的突变约束谱。
Nature. 2020 May;581(7809):434-443. doi: 10.1038/s41586-020-2308-7. Epub 2020 May 27.
8
Sequencing of human genomes with nanopore technology.纳米孔技术测序人类基因组。
Nat Commun. 2019 Apr 23;10(1):1869. doi: 10.1038/s41467-019-09637-5.
9
Empirical evaluation of variant calling accuracy using ultra-deep whole-genome sequencing data.使用超深度全基因组测序数据进行变异调用准确性的实证评估。
Sci Rep. 2019 Feb 11;9(1):1784. doi: 10.1038/s41598-018-38346-0.
10
Long-read sequencing technology indicates genome-wide effects of non-B DNA on polymerization speed and error rate.长读测序技术表明非 B DNA 对聚合速度和错误率的全基因组影响。
Genome Res. 2018 Dec;28(12):1767-1778. doi: 10.1101/gr.241257.118. Epub 2018 Nov 6.