Datasets of Iso-Seq transcripts for decoding transcriptome complexity in four Leishmania species.

Authors

González-de la Fuente Sandra, Requena Jose M, Aguado Begoña

Affiliations

Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Genomic and NGS Facility (GENGS), 28049 Madrid, Spain.

Centro de Biología Molecular Severo Ochoa (CSIC-UAM), Departamento de Biología Molecular, Instituto Universitario de Biología Molecular (IUBM), Universidad Autónoma de Madrid, 28049 Madrid, Spain.

Publication information

Data Brief. 2023 Nov 20;52:109838. doi: 10.1016/j.dib.2023.109838. eCollection 2024 Feb.

Abstract

The Iso-Seq technology, based on PacBio sequencing, generates high-quality, full-length transcripts that provide insight into transcriptome complexity. In this study, total RNA from promastigotes of four Leishmania species was sequenced using the Single Molecule, Real-Time (SMRT) Sequencing (PacBio) methodology. The Iso-Seq transcripts were categorized as complete or truncated according to the presence or absence, respectively, of the Spliced-Leader (SL) sequence at their 5'-end. Moreover, only transcripts with a poly-A tail at their 3'-end were considered. The supplied datasets represent valuable information that may help to uncover novel transcripts and alternative splicing events in a parasite that regulates its gene expression at the post-transcriptional level. A better knowledge of gene expression regulation in Leishmania will open avenues for the development of new drugs to treat leishmaniasis, a devastating disease with worldwide distribution. Additionally, the bioinformatics pipeline followed here may guide the analysis of Iso-Seq data derived from related trypanosomatids like Trypanosoma cruzi (Chagas disease agent) and Trypanosoma brucei (sleeping sickness agent). © 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
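The classification rule described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the `sl_tag` placeholder (a made-up string standing in for the real Spliced-Leader sequence), the 50-nt search window, and the 10-nt minimum poly-A length are all assumptions chosen for illustration.

```python
from typing import Optional


def has_poly_a(seq: str, min_len: int = 10) -> bool:
    """Return True if the sequence ends in at least min_len consecutive A's.

    min_len is an assumed threshold, not a value taken from the paper.
    """
    seq = seq.upper()
    tail_len = len(seq) - len(seq.rstrip("A"))
    return tail_len >= min_len


def classify_transcript(
    seq: str,
    sl_tag: str,
    search_window: int = 50,
    min_poly_a: int = 10,
) -> Optional[str]:
    """Classify a transcript as 'complete' or 'truncated'.

    Returns None when the transcript has no poly-A tail at its 3'-end,
    mirroring the statement that only poly-A transcripts were considered.
    sl_tag is a hypothetical stand-in for the Spliced-Leader sequence;
    search_window (how far into the 5'-end to look) is also an assumption.
    """
    seq = seq.upper()
    if not has_poly_a(seq, min_poly_a):
        return None  # discarded: no poly-A at the 3'-end
    if sl_tag.upper() in seq[:search_window]:
        return "complete"  # SL found near the 5'-end
    return "truncated"  # poly-A present but SL missing


# Hypothetical placeholder, NOT the real Leishmania SL sequence.
SL_TAG = "ACGTTGCA"

body = "GGCC" * 5
print(classify_transcript(SL_TAG + body + "A" * 12, SL_TAG))
print(classify_transcript(body + "A" * 12, SL_TAG))
print(classify_transcript(SL_TAG + body, SL_TAG))
```

An exact substring match is used here only to keep the sketch short; a real analysis of long reads would more likely tolerate mismatches when locating the SL sequence.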

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebda/10698239/16ef72dc3754/gr1.jpg
