• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Linc2function:一种使用深度学习方法进行长链非编码RNA(lncRNA)鉴定和功能预测的综合流程及网络服务器。

Linc2function: A Comprehensive Pipeline and Webserver for Long Non-Coding RNA (lncRNA) Identification and Functional Predictions Using Deep Learning Approaches.

作者信息

Ramakrishnaiah Yashpal, Morris Adam P, Dhaliwal Jasbir, Philip Melcy, Kuhlmann Levin, Tyagi Sonika

机构信息

Central Clinical School, Monash University, Melbourne, VIC 3000, Australia.

School of Computing Technologies, Royal Melbourne Institute of Technology University, Melbourne, VIC 3000, Australia.

出版信息

Epigenomes. 2023 Sep 15;7(3):22. doi: 10.3390/epigenomes7030022.

DOI:10.3390/epigenomes7030022
PMID:37754274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10528440/
Abstract

Long non-coding RNAs (lncRNAs), comprising a significant portion of the human transcriptome, serve as vital regulators of cellular processes and potential disease biomarkers. However, the function of most lncRNAs remains unknown, and furthermore, existing approaches have focused on gene-level investigation. Our work emphasizes the importance of transcript-level annotation to uncover the roles of specific transcript isoforms. We propose that understanding the mechanisms of lncRNA in pathological processes requires solving their structural motifs and interactomes. A complete lncRNA annotation first involves discriminating them from their coding counterparts and then predicting their functional motifs and target bio-molecules. Current in silico methods mainly perform primary-sequence-based discrimination using a reference model, limiting their comprehensiveness and generalizability. We demonstrate that integrating secondary structure and interactome information, in addition to using transcript sequence, enables a comprehensive functional annotation. Annotating lncRNA for newly sequenced species is challenging due to inconsistencies in functional annotations, specialized computational techniques, limited accessibility to source code, and the shortcomings of reference-based methods for cross-species predictions. To address these challenges, we developed a pipeline for identifying and annotating transcript sequences at the isoform level. We demonstrate the effectiveness of the pipeline by comprehensively annotating the lncRNA associated with two specific disease groups. The source code of our pipeline is available under the MIT licensefor local use by researchers to make new predictions using the pre-trained models or to re-train models on new sequence datasets. Non-technical users can access the pipeline through a web server setup.

摘要

长链非编码RNA(lncRNA)构成了人类转录组的很大一部分,是细胞过程的重要调节因子和潜在的疾病生物标志物。然而,大多数lncRNA的功能仍然未知,此外,现有方法主要集中在基因水平的研究上。我们的工作强调了转录本水平注释对于揭示特定转录本异构体作用的重要性。我们提出,了解lncRNA在病理过程中的机制需要解析它们的结构基序和相互作用组。完整的lncRNA注释首先要将它们与编码对应物区分开来,然后预测它们的功能基序和靶标生物分子。当前的计算机方法主要使用参考模型进行基于一级序列的区分,限制了它们的全面性和通用性。我们证明,除了使用转录本序列外,整合二级结构和相互作用组信息能够实现全面的功能注释。由于功能注释不一致、专门的计算技术、源代码获取受限以及基于参考的跨物种预测方法的缺点,对新测序物种的lncRNA进行注释具有挑战性。为了应对这些挑战,我们开发了一个用于在异构体水平识别和注释转录本序列的流程。我们通过全面注释与两个特定疾病组相关的lncRNA来证明该流程的有效性。我们流程的源代码可根据麻省理工学院许可获取,供研究人员本地使用,以便使用预训练模型进行新的预测或在新的序列数据集上重新训练模型。非技术用户可以通过设置的网络服务器访问该流程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/b2f84c62ab79/epigenomes-07-00022-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/8927db3ae98f/epigenomes-07-00022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/0cd0bff76a8f/epigenomes-07-00022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/e5bffc2b0a79/epigenomes-07-00022-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/b2f84c62ab79/epigenomes-07-00022-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/8927db3ae98f/epigenomes-07-00022-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/0cd0bff76a8f/epigenomes-07-00022-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/e5bffc2b0a79/epigenomes-07-00022-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2cf2/10528440/b2f84c62ab79/epigenomes-07-00022-g004.jpg

相似文献

1
Linc2function: A Comprehensive Pipeline and Webserver for Long Non-Coding RNA (lncRNA) Identification and Functional Predictions Using Deep Learning Approaches.Linc2function:一种使用深度学习方法进行长链非编码RNA(lncRNA)鉴定和功能预测的综合流程及网络服务器。
Epigenomes. 2023 Sep 15;7(3):22. doi: 10.3390/epigenomes7030022.
2
ICAnnoLncRNA: A Snakemake Pipeline for a Long Non-Coding-RNA Search and Annotation in Transcriptomic Sequences.ICAnnoLncRNA:一种用于转录组序列中长非编码 RNA 搜索和注释的 SnakeMake 流程。
Genes (Basel). 2023 Jun 24;14(7):1331. doi: 10.3390/genes14071331.
3
lncRNA-screen: an interactive platform for computationally screening long non-coding RNAs in large genomics datasets.lncRNA-screen:一个用于在大型基因组数据集中通过计算筛选长链非编码RNA的交互式平台。
BMC Genomics. 2017 Jun 5;18(1):434. doi: 10.1186/s12864-017-3817-0.
4
LncADeep: an ab initio lncRNA identification and functional annotation tool based on deep learning.LncADeep:一种基于深度学习的从头鉴定长非编码 RNA 及其功能注释工具。
Bioinformatics. 2018 Nov 15;34(22):3825-3834. doi: 10.1093/bioinformatics/bty428.
5
PLAIDOH: a novel method for functional prediction of long non-coding RNAs identifies cancer-specific LncRNA activities.PLAIDOH:一种新的长非编码 RNA 功能预测方法鉴定了癌症特异性 LncRNA 活性。
BMC Genomics. 2019 Feb 15;20(1):137. doi: 10.1186/s12864-019-5497-4.
6
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.
7
Biological Function of Long Non-coding RNA (LncRNA) Xist.长链非编码RNA(LncRNA)Xist的生物学功能
Front Cell Dev Biol. 2021 Jun 10;9:645647. doi: 10.3389/fcell.2021.645647. eCollection 2021.
8
Lantern: an integrative repository of functional annotations for lncRNAs in the human genome.Lantern:一个整合了人类基因组中长非编码 RNA 功能注释的资源库。
BMC Bioinformatics. 2021 May 26;22(1):279. doi: 10.1186/s12859-021-04207-3.
9
Systematic and computational identification of Androctonus crassicauda long non-coding RNAs.系统和计算鉴定粗尾蝎长非编码 RNA。
Sci Rep. 2021 Feb 25;11(1):4720. doi: 10.1038/s41598-021-83815-8.
10
Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data.共长链非编码RNA:基于人类RNA测序数据研究长链非编码RNA在基因本体注释和京都基因与基因组百科全书通路中的组合效应。
Database (Oxford). 2015 Sep 10;2015. doi: 10.1093/database/bav082. Print 2015.

引用本文的文献

1
Long Intergenic Non-Coding RNAs of Human Chromosome 18: Focus on Cancers.人类18号染色体的长链基因间非编码RNA:聚焦于癌症
Biomedicines. 2024 Feb 28;12(3):544. doi: 10.3390/biomedicines12030544.

本文引用的文献

1
Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures.基于可解释机器学习的药物-食物相互作用化学结构预测模型的开发与验证。
Sensors (Basel). 2023 Apr 13;23(8):3962. doi: 10.3390/s23083962.
2
Prediction of anticancer peptides based on an ensemble model of deep learning and machine learning using ordinal positional encoding.基于使用有序位置编码的深度学习和机器学习集成模型预测抗癌肽。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac630.
3
A Survey of Current Resources to Study lncRNA-Protein Interactions.
lncRNA-蛋白质相互作用研究的现有资源综述。
Noncoding RNA. 2021 Jun 8;7(2):33. doi: 10.3390/ncrna7020033.
4
Population-scale tissue transcriptomics maps long non-coding RNAs to complex disease.人群规模组织转录组学将长非编码 RNA 映射到复杂疾病。
Cell. 2021 May 13;184(10):2633-2648.e19. doi: 10.1016/j.cell.2021.03.050. Epub 2021 Apr 16.
5
Deep learning based DNA:RNA triplex forming potential prediction.基于深度学习的 DNA:RNA 三链形成势能预测。
BMC Bioinformatics. 2020 Nov 12;21(1):522. doi: 10.1186/s12859-020-03864-0.
6
Towards a comprehensive pipeline to identify and functionally annotate long noncoding RNA (lncRNA).构建一个用于识别长链非编码RNA(lncRNA)并对其进行功能注释的综合流程。
Comput Biol Med. 2020 Dec;127:104028. doi: 10.1016/j.compbiomed.2020.104028. Epub 2020 Oct 13.
7
RNA structure prediction using positive and negative evolutionary information.利用正向和负向进化信息进行 RNA 结构预测。
PLoS Comput Biol. 2020 Oct 30;16(10):e1008387. doi: 10.1371/journal.pcbi.1008387. eCollection 2020 Oct.
8
Genome-Wide Technologies to Study RNA-Chromatin Interactions.用于研究RNA-染色质相互作用的全基因组技术。
Noncoding RNA. 2020 May 27;6(2):20. doi: 10.3390/ncrna6020020.
9
DynaVenn: web-based computation of the most significant overlap between ordered sets.DynaVenn:基于网络的有序集合间最重要重叠的计算。
BMC Bioinformatics. 2019 Dec 30;20(1):743. doi: 10.1186/s12859-019-3320-5.
10
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.