• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

聚腺苷酸(Poly(A))标签深度测序数据处理以提取聚腺苷酸位点。

Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

作者信息

Wu Xiaohui, Ji Guoli, Li Qingshun Quinn

机构信息

Department of Automation, Xiamen University, 422 South Siming Road, Xiamen, Fujian, 361005, China,

出版信息

Methods Mol Biol. 2015;1255:39-48. doi: 10.1007/978-1-4939-2175-1_4.

DOI:10.1007/978-1-4939-2175-1_4
PMID:25487202
Abstract

Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.

摘要

聚腺苷酸化(poly(A))是真核生物mRNA成熟过程中一个必不可少的转录后加工步骤。新一代测序(NGS)技术的出现为生成大规模数据提供了可行的方法,并为深入研究聚腺苷酸化带来了新机遇,特别是针对转录组3'-UTR与转录本聚(A)尾连接处的深度测序。为了利用这些前所未有的大量数据,我们提出了一种自动化流程,通过整合NGS数据清理、处理、映射、归一化和聚类来识别聚腺苷酸化位点。在这个流程中,一系列Perl脚本被无缝整合,以将单端或双端序列迭代映射到参考基因组。映射后,将相同基因组坐标处的聚(A)标签(PATs)分组到一个切割位点,并去除内部引物假象。然后引入模糊区域以解析基因组注释进行切割位点聚类。最后,在24个核苷酸的近距离范围内且来自不同样本的切割位点可以聚类为聚(A)簇。该程序可用于从不同组织或处理的数百万个NGS序列中识别数千个可靠的聚(A)簇。

相似文献

1
Poly(A)-tag deep sequencing data processing to extract poly(A) sites.聚腺苷酸(Poly(A))标签深度测序数据处理以提取聚腺苷酸位点。
Methods Mol Biol. 2015;1255:39-48. doi: 10.1007/978-1-4939-2175-1_4.
2
Extraction of poly(A) sites from large-scale RNA-Seq data.从大规模RNA测序数据中提取聚腺苷酸化位点
Methods Mol Biol. 2015;1255:25-37. doi: 10.1007/978-1-4939-2175-1_3.
3
DNA/RNA hybrid primer mediated poly(A) tag library construction for Illumina sequencing.用于Illumina测序的DNA/RNA杂交引物介导的聚腺苷酸标签文库构建
Methods Mol Biol. 2015;1255:175-84. doi: 10.1007/978-1-4939-2175-1_15.
4
Experimental Genome-Wide Determination of RNA Polyadenylation in Chlamydomonas reinhardtii.莱茵衣藻RNA多聚腺苷酸化的全基因组实验测定
PLoS One. 2016 Jan 5;11(1):e0146107. doi: 10.1371/journal.pone.0146107. eCollection 2016.
5
Bioinformatics analysis of alternative polyadenylation in green alga Chlamydomonas reinhardtii using transcriptome sequences from three different sequencing platforms.利用来自三个不同测序平台的转录组序列对莱茵衣藻中可变聚腺苷酸化进行生物信息学分析。
G3 (Bethesda). 2014 Mar 13;4(5):871-83. doi: 10.1534/g3.114.010249.
6
Characterization and prediction of mRNA alternative polyadenylation sites in rice genes.水稻基因中mRNA可变聚腺苷酸化位点的表征与预测
Biomed Mater Eng. 2014;24(6):3779-85. doi: 10.3233/BME-141207.
7
Prediction of plant mRNA polyadenylation sites.植物mRNA聚腺苷酸化位点的预测
Methods Mol Biol. 2015;1255:13-23. doi: 10.1007/978-1-4939-2175-1_2.
8
Genome-wide determination of poly(A) site choice in plants.植物中聚腺苷酸化位点选择的全基因组测定
Methods Mol Biol. 2015;1255:159-74. doi: 10.1007/978-1-4939-2175-1_14.
9
Poly(A)-ClickSeq: click-chemistry for next-generation 3΄-end sequencing without RNA enrichment or fragmentation.聚腺苷酸点击测序法:用于下一代3′端测序的无需RNA富集或片段化的点击化学法。
Nucleic Acids Res. 2017 Jul 7;45(12):e112. doi: 10.1093/nar/gkx286.
10
PAT-Seq: A Method for Simultaneous Quantitation of Gene Expression, Poly(A)-Site Selection and Poly(A)-Length Distribution in Yeast Transcriptomes.PAT-Seq:一种同时定量酵母转录组中基因表达、聚腺苷酸化位点选择和聚腺苷酸长度分布的方法。
Methods Mol Biol. 2019;2049:141-164. doi: 10.1007/978-1-4939-9736-7_9.

引用本文的文献

1
HDA6-dependent histone deacetylation regulates mRNA polyadenylation in .HDA6 依赖性组蛋白去乙酰化调节. 中的 mRNA 多聚腺苷酸化。
Genome Res. 2020 Oct;30(10):1407-1417. doi: 10.1101/gr.255232.119. Epub 2020 Aug 5.
2
Root Hair Single Cell Type Specific Profiles of Gene Expression and Alternative Polyadenylation Under Cadmium Stress.镉胁迫下根毛单细胞类型特异性基因表达和可变聚腺苷酸化谱
Front Plant Sci. 2019 May 10;10:589. doi: 10.3389/fpls.2019.00589. eCollection 2019.
3
PlantAPA: A Portal for Visualization and Analysis of Alternative Polyadenylation in Plants.
植物APA:植物中可变聚腺苷酸化可视化与分析门户
Front Plant Sci. 2016 Jun 21;7:889. doi: 10.3389/fpls.2016.00889. eCollection 2016.