• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

短串联重复序列处的逆转录错误与RNA-DNA差异

Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.

作者信息

Fungtammasan Arkarachai, Tomaszkiewicz Marta, Campos-Sánchez Rebeca, Eckert Kristin A, DeGiorgio Michael, Makova Kateryna D

机构信息

Integrative Biosciences, Bioinformatics and Genomics Option, Pennsylvania State University Department of Biology, Pennsylvania State University Center for Medical Genomics, Pennsylvania State University Huck Institute of Genome Sciences, Pennsylvania State University.

Department of Biology, Pennsylvania State University Center for Medical Genomics, Pennsylvania State University.

出版信息

Mol Biol Evol. 2016 Oct;33(10):2744-58. doi: 10.1093/molbev/msw139. Epub 2016 Jul 12.

DOI:10.1093/molbev/msw139
PMID:27413049
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5026258/
Abstract

Transcript variation has important implications for organismal function in health and disease. Most transcriptome studies focus on assessing variation in gene expression levels and isoform representation. Variation at the level of transcript sequence is caused by RNA editing and transcription errors, and leads to nongenetically encoded transcript variants, or RNA-DNA differences (RDDs). Such variation has been understudied, in part because its detection is obscured by reverse transcription (RT) and sequencing errors. It has only been evaluated for intertranscript base substitution differences. Here, we investigated transcript sequence variation for short tandem repeats (STRs). We developed the first maximum-likelihood estimator (MLE) to infer RT error and RDD rates, taking next generation sequencing error rates into account. Using the MLE, we empirically evaluated RT error and RDD rates for STRs in a large-scale DNA and RNA replicated sequencing experiment conducted in a primate species. The RT error rates increased exponentially with STR length and were biased toward expansions. The RDD rates were approximately 1 order of magnitude lower than the RT error rates. The RT error rates estimated with the MLE from a primate data set were concordant with those estimated with an independent method, barcoded RNA sequencing, from a Caenorhabditis elegans data set. Our results have important implications for medical genomics, as STR allelic variation is associated with >40 diseases. STR nonallelic transcript variation can also contribute to disease phenotype. The MLE and empirical rates presented here can be used to evaluate the probability of disease-associated transcripts arising due to RDD.

摘要

转录本变异对健康和疾病状态下的机体功能具有重要影响。大多数转录组研究聚焦于评估基因表达水平和异构体表现形式的变异。转录本序列水平的变异由RNA编辑和转录错误引起,会导致非基因编码的转录本变体,即RNA-DNA差异(RDDs)。此类变异一直未得到充分研究,部分原因是其检测会被逆转录(RT)和测序错误所掩盖。此前仅针对转录本间碱基替换差异进行过评估。在此,我们研究了短串联重复序列(STRs)的转录本序列变异。我们开发了首个最大似然估计器(MLE)来推断RT错误率和RDD率,同时考虑了新一代测序错误率。通过MLE,我们在一个灵长类物种进行的大规模DNA和RNA重复测序实验中,凭经验评估了STRs的RT错误率和RDD率。RT错误率随STR长度呈指数增长,且倾向于扩增。RDD率比RT错误率低约1个数量级。用MLE从灵长类数据集估计的RT错误率与用独立方法(条形码RNA测序)从秀丽隐杆线虫数据集估计的结果一致。我们的结果对医学基因组学具有重要意义,因为STR等位基因变异与40多种疾病相关。STR非等位基因转录本变异也可能导致疾病表型。本文提出的MLE和经验率可用于评估因RDD产生疾病相关转录本的概率。

相似文献

1
Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.短串联重复序列处的逆转录错误与RNA-DNA差异
Mol Biol Evol. 2016 Oct;33(10):2744-58. doi: 10.1093/molbev/msw139. Epub 2016 Jul 12.
2
Rapid multiplexed genotyping of simple tandem repeats using capture and high-throughput sequencing.利用捕获和高通量测序技术快速多重基因分型简单串联重复。
Hum Mutat. 2013 Sep;34(9):1304-11. doi: 10.1002/humu.22359. Epub 2013 Jun 17.
3
Introduction of the Python script STRinNGS for analysis of STR regions in FASTQ or BAM files and expansion of the Danish STR sequence database to 11 STRs.用于分析FASTQ或BAM文件中STR区域的Python脚本STRinNGS介绍以及丹麦STR序列数据库扩展至11个STR。
Forensic Sci Int Genet. 2016 Mar;21:68-75. doi: 10.1016/j.fsigen.2015.12.006. Epub 2015 Dec 12.
4
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.从全基因组测序数据中准确分型短串联重复序列及其应用。
Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30.
5
Detection theory in identification of RNA-DNA sequence differences using RNA-sequencing.利用RNA测序鉴定RNA-DNA序列差异中的检测理论
PLoS One. 2014 Nov 14;9(11):e112040. doi: 10.1371/journal.pone.0112040. eCollection 2014.
6
The accuracy, feasibility and challenges of sequencing short tandem repeats using next-generation sequencing platforms.使用下一代测序平台对短串联重复序列进行测序的准确性、可行性及挑战。
PLoS One. 2014 Dec 1;9(12):e113862. doi: 10.1371/journal.pone.0113862. eCollection 2014.
7
Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.在不进行比对的情况下,从大型下一代 DNA 和 RNA 测序数据中检测重复扩展。
Sci Rep. 2022 Jul 30;12(1):13124. doi: 10.1038/s41598-022-17267-z.
8
Profiling short tandem repeats from short reads.从短读段中分析短串联重复序列。
Methods Mol Biol. 2013;1038:113-35. doi: 10.1007/978-1-62703-514-9_7.
9
Massively parallel sequencing of forensic STRs: Considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.法医 STRs 的大规模平行测序:国际法医遗传学会(ISFG)DNA 委员会关于最小命名要求的考虑。
Forensic Sci Int Genet. 2016 May;22:54-63. doi: 10.1016/j.fsigen.2016.01.009. Epub 2016 Jan 21.
10
Rapid detection of expanded short tandem repeats in personal genomics using hybrid sequencing.利用混合测序技术快速检测个人基因组中的扩展短串联重复序列。
Bioinformatics. 2014 Mar 15;30(6):815-22. doi: 10.1093/bioinformatics/btt647. Epub 2013 Nov 8.

引用本文的文献

1
Re-analysis of mobile mRNA datasets raises questions about the extent of long-distance mRNA communication.对移动mRNA数据集的重新分析引发了关于长距离mRNA通讯程度的问题。
Nat Plants. 2025 May;11(5):977-984. doi: 10.1038/s41477-025-01979-x. Epub 2025 Apr 16.
2
Cryo-X-Ray Phase Contrast Imaging Enables Combined 3D Structural Quantification and Nucleic Acid Analysis of Myocardial Biopsies.低温X射线相衬成像实现心肌活检组织的三维结构定量与核酸分析联用
Adv Sci (Weinh). 2024 Dec;11(48):e2409163. doi: 10.1002/advs.202409163. Epub 2024 Oct 30.
3
Discrete measurements of RNA polymerase and reverse transcriptase fidelity reveal evolutionary tuning.

本文引用的文献

1
Conserved rates and patterns of transcription errors across bacterial growth states and lifestyles.细菌不同生长状态和生活方式下转录错误的保守速率和模式。
Proc Natl Acad Sci U S A. 2016 Mar 22;113(12):3311-6. doi: 10.1073/pnas.1525329113. Epub 2016 Feb 16.
2
A survey of best practices for RNA-seq data analysis.RNA测序数据分析的最佳实践调查。
Genome Biol. 2016 Jan 26;17:13. doi: 10.1186/s13059-016-0881-8.
3
Heterozygosity increases microsatellite mutation rate.杂合性会增加微卫星突变率。
离散测量 RNA 聚合酶和逆转录酶的保真度揭示了进化的调节。
RNA. 2024 Aug 16;30(9):1246-1258. doi: 10.1261/rna.080002.124.
4
Plasma-Derived HIV-1 Virions Contain Considerable Levels of Defective Genomes.血浆衍生的 HIV-1 病毒粒子含有相当水平的缺陷基因组。
J Virol. 2022 Mar 23;96(6):e0201121. doi: 10.1128/jvi.02011-21.
5
Early sample tagging and pooling enables simultaneous SARS-CoV-2 detection and variant sequencing.早期样本标记和汇集可实现 SARS-CoV-2 的同时检测和变异测序。
Sci Transl Med. 2021 Nov 3;13(618):eabj2266. doi: 10.1126/scitranslmed.abj2266.
6
Relatively semi-conservative replication and a folded slippage model for short tandem repeats.短串联重复序列的相对半保守复制和折叠滑动模型
BMC Genomics. 2020 Aug 17;21(1):563. doi: 10.1186/s12864-020-06949-5.
7
Ampliconic Genes on the Great Ape Y Chromosomes: Rapid Evolution of Copy Number but Conservation of Expression Levels.大猿类 Y 染色体上的扩增基因:拷贝数快速进化,但表达水平保持不变。
Genome Biol Evol. 2020 Jun 1;12(6):842-859. doi: 10.1093/gbe/evaa088.
8
Rare Taxa Exhibit Disproportionate Cell-Level Metabolic Activity in Enriched Anaerobic Digestion Microbial Communities.稀有分类群在富集的厌氧消化微生物群落中表现出不成比例的细胞水平代谢活性。
mSystems. 2019 Jan 22;4(1). doi: 10.1128/mSystems.00208-18. eCollection 2019 Jan-Feb.
9
Deciphering highly similar multigene family transcripts from Iso-Seq data with IsoCon.从 Iso-Seq 数据中破译高度相似的多基因家族转录本,使用 IsoCon。
Nat Commun. 2018 Nov 2;9(1):4601. doi: 10.1038/s41467-018-06910-x.
10
Stimulation of reverse transcriptase generated cDNAs with specific indels by template RNA structure: retrotransposon, dNTP balance, RT-reagent usage.通过模板RNA结构刺激逆转录酶产生具有特定插入缺失的cDNA:逆转座子、dNTP平衡、逆转录试剂使用情况。
Nucleic Acids Res. 2017 Sep 29;45(17):10143-10155. doi: 10.1093/nar/gkx689.
Biol Lett. 2016 Jan;12(1):20150929. doi: 10.1098/rsbl.2015.0929.
4
Genetic Architectures of Quantitative Variation in RNA Editing Pathways.RNA编辑途径中数量变异的遗传结构
Genetics. 2016 Feb;202(2):787-98. doi: 10.1534/genetics.115.179481. Epub 2015 Nov 27.
5
Quantitative and functional interrogation of parent-of-origin allelic expression biases in the brain.对大脑中亲本来源等位基因表达偏差进行定量和功能研究。
Elife. 2015 Jul 3;4:e07860. doi: 10.7554/eLife.07860.
6
Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.人类基因组学。基因型-组织表达(GTEx)试点分析:人类多组织基因调控
Science. 2015 May 8;348(6235):648-60. doi: 10.1126/science.1262110. Epub 2015 May 7.
7
G&T-seq: parallel sequencing of single-cell genomes and transcriptomes.G&T-seq:单细胞基因组和转录组的并行测序。
Nat Methods. 2015 Jun;12(6):519-22. doi: 10.1038/nmeth.3370. Epub 2015 Apr 27.
8
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.从全基因组测序数据中准确分型短串联重复序列及其应用。
Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30.
9
Genetic evidence that both dNTP-stabilized and strand slippage mechanisms may dictate DNA polymerase errors within mononucleotide microsatellites.遗传证据表明,dNTP 稳定机制和链滑动机制都可能决定单核苷酸微卫星内的 DNA 聚合酶错误。
DNA Repair (Amst). 2015 May;29:91-100. doi: 10.1016/j.dnarep.2015.02.016. Epub 2015 Feb 27.
10
Integrative analysis of haplotype-resolved epigenomes across human tissues.人类组织中单体型分辨率表观基因组的综合分析。
Nature. 2015 Feb 19;518(7539):350-354. doi: 10.1038/nature14217.