• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

位置相关分析可提高从有噪声的芯片信号或短读段中重构全长转录本和可变剪接异构体的能力。

Positional correlation analysis improves reconstruction of full-length transcripts and alternative isoforms from noisy array signals or short reads.

机构信息

Bioinformatics and Systems Engineering division, RIKEN Yokohama Institute, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.

出版信息

Bioinformatics. 2012 Apr 1;28(7):929-37. doi: 10.1093/bioinformatics/bts065. Epub 2012 Feb 13.

DOI:10.1093/bioinformatics/bts065
PMID:22332235
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3315713/
Abstract

MOTIVATION

A reconstruction of full-length transcripts observed by next-generation sequencer or tiling arrays is an essential technique to know all phenomena of transcriptomes. Several techniques of the reconstruction have been developed. However, problems of high-level noises and biases still remain and interrupt the reconstruction. A method is required that is robust against noise and bias and correctly reconstructs transcripts regardless of equipment used.

RESULTS

We propose a completely new statistical method that reconstructs full-length transcripts and can be applied on both next-generation sequencers and tiling arrays. The method called ARTADE2 analyzes 'positional correlation', meaning correlations of expression values for every combination on genomic positions of multiple transcriptional data. ARTADE2 then reconstructs full-length transcripts using a logistic model based on the positional correlation and the Markov model. ARTADE2 elucidated 17 591 full-length transcripts from 55 transcriptome datasets and showed notable performance compared with other recent prediction methods. Moreover, 1489 novel transcripts were discovered. We experimentally tested 16 novel transcripts, among which 14 were confirmed by reverse transcription-polymerase chain reaction and sequence mapping. The method also showed notable performance for reconstructing of mRNA observed by a next-generation sequencer. Moreover, the positional correlation and factor analysis embedded in ARTADE2 successfully detected regions at which alternative isoforms may exist, and thus are expected to be applied for discovering transcript biomarkers for a wide range of disciplines including preemptive medicine.

AVAILABILITY

http://matome.base.riken.jp

CONTACT

toyoda@base.riken.jp

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

通过下一代测序仪或平铺阵列观察到全长转录本的重建是了解转录组所有现象的重要技术。已经开发了几种重建技术。然而,高水平噪声和偏差的问题仍然存在,并中断了重建。需要一种能够抵抗噪声和偏差并正确重建转录本的方法,而与所使用的设备无关。

结果

我们提出了一种全新的统计方法,该方法可以重建全长转录本,并可应用于下一代测序仪和平铺阵列。该方法称为 ARTADE2,分析了“位置相关性”,即多个转录组数据的基因组位置上的表达值的组合的相关性。然后,ARTADE2 使用基于位置相关性和马尔可夫模型的逻辑模型来重建全长转录本。ARTADE2 从 55 个转录组数据集重建了 17591 个全长转录本,并与其他最近的预测方法相比表现出了显著的性能。此外,还发现了 1489 个新的转录本。我们通过实验测试了 16 个新的转录本,其中 14 个通过逆转录-聚合酶链反应和序列映射得到了证实。该方法还显示出在重建下一代测序仪观察到的 mRNA 方面的显著性能。此外,ARTADE2 中嵌入的位置相关性和因子分析成功地检测到了可能存在替代异构体的区域,因此有望应用于发现包括预防性医学在内的广泛学科的转录生物标志物。

可用性

http://matome.base.riken.jp

联系方式

toyoda@base.riken.jp

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/4fe1783c186f/bts065f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/87730cbce1ab/bts065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/c279705500c7/bts065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/9b418f20765b/bts065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/8f75cecb71bb/bts065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/7f9d56624913/bts065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/755e85836a1f/bts065f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/9b5cbbb88232/bts065f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/08c0bdca9f9c/bts065f8a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/3de3c75e35bc/bts065f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/4fe1783c186f/bts065f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/87730cbce1ab/bts065f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/c279705500c7/bts065f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/9b418f20765b/bts065f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/8f75cecb71bb/bts065f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/7f9d56624913/bts065f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/755e85836a1f/bts065f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/9b5cbbb88232/bts065f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/08c0bdca9f9c/bts065f8a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/3de3c75e35bc/bts065f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01a2/3315713/4fe1783c186f/bts065f10.jpg

相似文献

1
Positional correlation analysis improves reconstruction of full-length transcripts and alternative isoforms from noisy array signals or short reads.位置相关分析可提高从有噪声的芯片信号或短读段中重构全长转录本和可变剪接异构体的能力。
Bioinformatics. 2012 Apr 1;28(7):929-37. doi: 10.1093/bioinformatics/bts065. Epub 2012 Feb 13.
2
Knowledge-based reconstruction of mRNA transcripts with short sequencing reads for transcriptome research.基于知识的短测序 reads 转录本重构用于转录组研究。
PLoS One. 2012;7(2):e31440. doi: 10.1371/journal.pone.0031440. Epub 2012 Feb 1.
3
ARTADE2DB: improved statistical inferences for Arabidopsis gene functions and structure predictions by dynamic structure-based dynamic expression (DSDE) analyses.ARTADE2DB:通过基于结构的动态表达(DSDE)分析提高拟南芥基因功能和结构预测的统计推断。
Plant Cell Physiol. 2011 Feb;52(2):254-64. doi: 10.1093/pcp/pcq202. Epub 2011 Jan 12.
4
Next-generation tag sequencing for cancer gene expression profiling.用于癌症基因表达谱分析的新一代标签测序
Genome Res. 2009 Oct;19(10):1825-35. doi: 10.1101/gr.094482.109. Epub 2009 Jun 18.
5
Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads.利用多映射 RNA-seq reads 进行单倍型和异构体特异性表达估计。
Genome Biol. 2011;12(2):R13. doi: 10.1186/gb-2011-12-2-r13. Epub 2011 Feb 10.
6
The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE.通过 deepSuperSAGE 研究盐响应的鹰嘴豆根和根瘤转录组。
BMC Plant Biol. 2011 Feb 14;11:31. doi: 10.1186/1471-2229-11-31.
7
Next-generation sequencing facilitates quantitative analysis of wild-type and Nrl(-/-) retinal transcriptomes.新一代测序技术有助于对野生型和Nrl基因敲除小鼠视网膜转录组进行定量分析。
Mol Vis. 2011;17:3034-54. Epub 2011 Nov 23.
8
FDM: a graph-based statistical method to detect differential transcription using RNA-seq data.FDM:一种基于图的统计方法,用于检测使用 RNA-seq 数据的差异转录。
Bioinformatics. 2011 Oct 1;27(19):2633-40. doi: 10.1093/bioinformatics/btr458. Epub 2011 Aug 8.
9
Ryūtō: network-flow based transcriptome reconstruction.龙童:基于网络流的转录组重构。
BMC Bioinformatics. 2019 Apr 16;20(1):190. doi: 10.1186/s12859-019-2786-5.
10
A Full-Length mRNA Transcriptome Generated From Hybrid-Corrected PacBio Long-Reads Improves the Transcript Annotation and Identifies Thousands of Novel Splice Variants in Atlantic Salmon.通过混合校正的PacBio长读长生成的全长mRNA转录组改善了转录本注释并鉴定了大西洋鲑鱼中数千种新的剪接变体。
Front Genet. 2021 Apr 27;12:656334. doi: 10.3389/fgene.2021.656334. eCollection 2021.

引用本文的文献

1
Novel Stress-Inducible Antisense RNAs of Protein-Coding Loci Are Synthesized by RNA-Dependent RNA Polymerase.蛋白质编码基因座的新型应激诱导反义RNA由RNA依赖性RNA聚合酶合成。
Plant Physiol. 2017 Sep;175(1):457-472. doi: 10.1104/pp.17.00787. Epub 2017 Jul 14.
2
A Stress-Activated Transposon in Arabidopsis Induces Transgenerational Abscisic Acid Insensitivity.拟南芥中应激激活的转座子诱导跨代脱落酸不敏感。
Sci Rep. 2016 Mar 15;6:23181. doi: 10.1038/srep23181.
3
Small open reading frames associated with morphogenesis are hidden in plant genomes.

本文引用的文献

1
Genome sequence and analysis of the tuber crop potato.马铃薯块茎作物的基因组序列与分析。
Nature. 2011 Jul 10;475(7355):189-95. doi: 10.1038/nature10158.
2
Identification of novel transcripts in annotated genomes using RNA-Seq.利用 RNA-Seq 鉴定注释基因组中的新型转录本。
Bioinformatics. 2011 Sep 1;27(17):2325-9. doi: 10.1093/bioinformatics/btr355. Epub 2011 Jun 21.
3
Improving RNA-Seq expression estimates by correcting for fragment bias.通过纠正片段偏倚来提高 RNA-Seq 表达估计。
与形态发生相关的小开放阅读框隐藏在植物基因组中。
Proc Natl Acad Sci U S A. 2013 Feb 5;110(6):2395-400. doi: 10.1073/pnas.1213958110. Epub 2013 Jan 22.
Genome Biol. 2011;12(3):R22. doi: 10.1186/gb-2011-12-3-r22. Epub 2011 Mar 16.
4
ARTADE2DB: improved statistical inferences for Arabidopsis gene functions and structure predictions by dynamic structure-based dynamic expression (DSDE) analyses.ARTADE2DB:通过基于结构的动态表达(DSDE)分析提高拟南芥基因功能和结构预测的统计推断。
Plant Cell Physiol. 2011 Feb;52(2):254-64. doi: 10.1093/pcp/pcq202. Epub 2011 Jan 12.
5
De novo assembled expressed gene catalog of a fast-growing Eucalyptus tree produced by Illumina mRNA-Seq.利用 Illumina mRNA-Seq 技术对快速生长的桉树进行从头组装表达基因目录
BMC Genomics. 2010 Dec 1;11:681. doi: 10.1186/1471-2164-11-681.
6
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.通过 RNA-Seq 进行转录本组装和定量分析揭示了细胞分化过程中未注释的转录本和异构体转换。
Nat Biotechnol. 2010 May;28(5):511-5. doi: 10.1038/nbt.1621. Epub 2010 May 2.
7
Genome-wide analysis of endogenous abscisic acid-mediated transcription in dry and imbibed seeds of Arabidopsis using tiling arrays.利用基因芯片技术对拟南芥干种子和吸胀种子中内源性脱落酸介导的转录进行全基因组分析。
Plant J. 2010 Apr 1;62(1):39-51. doi: 10.1111/j.1365-313X.2010.04135.x. Epub 2010 Jan 20.
8
Ribosome and transcript copy numbers, polysome occupancy and enzyme dynamics in Arabidopsis.拟南芥核糖体和转录本拷贝数、多核糖体占据率和酶动力学。
Mol Syst Biol. 2009;5:314. doi: 10.1038/msb.2009.68. Epub 2009 Oct 13.
9
Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function.鸟枪法蛋白质组学数据的确定性蛋白质推断为拟南芥花粉发育和功能提供了新见解。
Genome Res. 2009 Oct;19(10):1786-800. doi: 10.1101/gr.089060.108. Epub 2009 Jun 22.
10
Large-scale Arabidopsis phosphoproteome profiling reveals novel chloroplast kinase substrates and phosphorylation networks.大规模拟南芥磷酸化蛋白质组分析揭示了新的叶绿体激酶底物和磷酸化网络。
Plant Physiol. 2009 Jun;150(2):889-903. doi: 10.1104/pp.109.138677. Epub 2009 Apr 17.