• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2L-piRNA:一种用于识别Piwi相互作用RNA及其功能的双层集成分类器。

2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function.

作者信息

Liu Bin, Yang Fan, Chou Kuo-Chen

机构信息

School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China; Key Laboratory of Network Oriented Intelligent Computation, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China; Gordon Life Science Institute, Belmont, MA 02478, USA.

School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China.

出版信息

Mol Ther Nucleic Acids. 2017 Jun 16;7:267-277. doi: 10.1016/j.omtn.2017.04.008. Epub 2017 Apr 13.

DOI:10.1016/j.omtn.2017.04.008
PMID:28624202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5415553/
Abstract

Involved with important cellular or gene functions and implicated with many kinds of cancers, piRNAs, or piwi-interacting RNAs, are of small non-coding RNA with around 19-33 nt in length. Given a small non-coding RNA molecule, can we predict whether it is of piRNA according to its sequence information alone? Furthermore, there are two types of piRNA: one has the function of instructing target mRNA deadenylation, and the other does not. Can we discriminate one from the other? With the avalanche of RNA sequences emerging in the postgenomic age, it is urgent to address the two problems for both basic research and drug development. Unfortunately, to the best of our knowledge, so far no computational methods whatsoever could be used to deal with the second problem, let alone deal with the two problems together. Here, by incorporating the physicochemical properties of nucleotides into the pseudo K-tuple nucleotide composition (PseKNC), we proposed a powerful predictor called 2L-piRNA. It is a two-layer ensemble classifier, in which the first layer is for identifying whether a query RNA molecule is piRNA or non-piRNA, and the second layer for identifying whether a piRNA is with or without the function of instructing target mRNA deadenylation. Rigorous cross-validations have indicated that the success rates achieved by the proposed predictor are quite high. For the convenience of most biologists and drug development scientists, the web server for 2L-piRNA has been established at http://bioinformatics.hitsz.edu.cn/2L-piRNA/, by which users can easily get their desired results without the need to go through the mathematical details.

摘要

PIWI相互作用RNA(piRNA)参与重要的细胞或基因功能,并与多种癌症相关,是一类长度约为19 - 33个核苷酸的小型非编码RNA。对于一个小型非编码RNA分子,我们能否仅根据其序列信息预测它是否为piRNA?此外,piRNA有两种类型:一种具有指导靶mRNA去腺苷酸化的功能,另一种则没有。我们能否将它们区分开来?在后基因组时代,随着RNA序列的大量涌现,从基础研究和药物开发的角度来看,迫切需要解决这两个问题。不幸的是,据我们所知,到目前为止还没有任何计算方法可用于处理第二个问题,更不用说同时处理这两个问题了。在这里,通过将核苷酸的物理化学性质纳入伪K元核苷酸组成(PseKNC),我们提出了一种强大的预测器,称为2L-piRNA。它是一个两层的集成分类器,其中第一层用于识别查询的RNA分子是piRNA还是非piRNA,第二层用于识别piRNA是否具有指导靶mRNA去腺苷酸化的功能。严格的交叉验证表明,所提出的预测器取得的成功率相当高。为了方便大多数生物学家和药物开发科学家,已在http://bioinformatics.hitsz.edu.cn/2L-piRNA/建立了2L-piRNA的网络服务器,用户可以通过该服务器轻松获得他们想要的结果,而无需了解数学细节。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/aa38a91f76ce/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/afa2ee31b4f8/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/2993ca0e53ad/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/fe2a26127a68/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/d6b18db502d7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/aa38a91f76ce/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/afa2ee31b4f8/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/2993ca0e53ad/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/fe2a26127a68/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/d6b18db502d7/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faca/5415553/aa38a91f76ce/gr5.jpg

相似文献

1
2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function.2L-piRNA:一种用于识别Piwi相互作用RNA及其功能的双层集成分类器。
Mol Ther Nucleic Acids. 2017 Jun 16;7:267-277. doi: 10.1016/j.omtn.2017.04.008. Epub 2017 Apr 13.
2
iPSW(2L)-PseKNC: A two-layer predictor for identifying promoters and their strength by hybrid features via pseudo K-tuple nucleotide composition.iPSW(2L)-PseKNC:一种双层预测器,通过伪 K- 元核苷酸组成的混合特征来识别启动子及其强度。
Genomics. 2019 Dec;111(6):1785-1793. doi: 10.1016/j.ygeno.2018.12.001. Epub 2018 Dec 5.
3
iEnhancer-2L: a two-layer predictor for identifying enhancers and their strength by pseudo k-tuple nucleotide composition.iEnhancer-2L:一种通过伪 k-元核苷酸组成识别增强子及其强度的两层预测器。
Bioinformatics. 2016 Feb 1;32(3):362-9. doi: 10.1093/bioinformatics/btv604. Epub 2015 Oct 17.
4
iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.iPromoter-2L:一种双层预测器,通过基于多窗口的 PseKNC 来识别启动子及其类型。
Bioinformatics. 2018 Jan 1;34(1):33-40. doi: 10.1093/bioinformatics/btx579.
5
iEnhancer-EL: identifying enhancers and their strength with ensemble learning approach.iEnhancer-EL:基于集成学习方法识别增强子及其强度。
Bioinformatics. 2018 Nov 15;34(22):3835-3842. doi: 10.1093/bioinformatics/bty458.
6
iRNA-2methyl: Identify RNA 2'-O-methylation Sites by Incorporating Sequence-Coupled Effects into General PseKNC and Ensemble Classifier.iRNA-2甲基化:通过将序列耦合效应纳入通用伪核苷酸组成和集成分类器来识别RNA 2'-O-甲基化位点。
Med Chem. 2017;13(8):734-743. doi: 10.2174/1573406413666170623082245.
7
iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC.iRNA-PseColl:通过将核苷酸的集体效应纳入伪核苷酸组成来识别不同RNA修饰的发生位点
Mol Ther Nucleic Acids. 2017 Jun 16;7:155-163. doi: 10.1016/j.omtn.2017.03.006. Epub 2017 Mar 29.
8
iDNA6mA-PseKNC: Identifying DNA N-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC.iDNA6mA-PseKNC:通过将核苷酸理化性质纳入 PseKNC 来鉴定 DNA N6-甲基腺苷位点。
Genomics. 2019 Jan;111(1):96-102. doi: 10.1016/j.ygeno.2018.01.005. Epub 2018 Jan 31.
9
iRNA-PseU: Identifying RNA pseudouridine sites.iRNA-PseU:识别RNA假尿苷位点。
Mol Ther Nucleic Acids. 2016;5(7):e332. doi: 10.1038/mtna.2016.37.
10
iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences.iRNA-AI:识别RNA序列中腺苷到肌苷的编辑位点。
Oncotarget. 2017 Jan 17;8(3):4208-4217. doi: 10.18632/oncotarget.13758.

引用本文的文献

1
Biological Sequence Classification: A Review on Data and General Methods.生物序列分类:数据与通用方法综述
Research (Wash D C). 2022 Dec 19;2022:0011. doi: 10.34133/research.0011. eCollection 2022.
2
Decoding the Non-coding: Tools and Databases Unveiling the Hidden World of "Junk" RNAs for Innovative Therapeutic Exploration.解码非编码:工具与数据库揭示“垃圾”RNA的隐秘世界以进行创新治疗探索
ACS Pharmacol Transl Sci. 2024 Jun 7;7(7):1901-1915. doi: 10.1021/acsptsci.3c00388. eCollection 2024 Jul 12.
3
Sequence based model using deep neural network and hybrid features for identification of 5-hydroxymethylcytosine modification.

本文引用的文献

1
iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory.iPhos-PseEvo:通过灰色系统理论将进化信息纳入通用 PseAAC 来识别人类磷酸化蛋白质。
Mol Inform. 2017 May;36(5-6). doi: 10.1002/minf.201600010. Epub 2016 May 12.
2
iRNA-PseU: Identifying RNA pseudouridine sites.iRNA-PseU:识别RNA假尿苷位点。
Mol Ther Nucleic Acids. 2016;5(7):e332. doi: 10.1038/mtna.2016.37.
3
Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou's general PseAAC.
基于序列的深度学习神经网络模型和混合特征用于 5-羟甲基胞嘧啶修饰的识别。
Sci Rep. 2024 Apr 20;14(1):9116. doi: 10.1038/s41598-024-59777-y.
4
StackDPP: a stacking ensemble based DNA-binding protein prediction model.StackDPP:一种基于堆叠集成的 DNA 结合蛋白预测模型。
BMC Bioinformatics. 2024 Mar 14;25(1):111. doi: 10.1186/s12859-024-05714-9.
5
Databases and computational methods for the identification of piRNA-related molecules: A survey.用于鉴定piRNA相关分子的数据库和计算方法:一项综述。
Comput Struct Biotechnol J. 2024 Jan 22;23:813-833. doi: 10.1016/j.csbj.2024.01.011. eCollection 2024 Dec.
6
A GHKNN model based on the physicochemical property extraction method to identify SNARE proteins.一种基于物理化学性质提取方法的GHKNN模型,用于识别SNARE蛋白。
Front Genet. 2022 Nov 23;13:935717. doi: 10.3389/fgene.2022.935717. eCollection 2022.
7
Identification of intelligence-related proteins through a robust two-layer predictor.通过强大的双层预测器鉴定与智力相关的蛋白质。
Commun Integr Biol. 2022 Nov 15;15(1):253-264. doi: 10.1080/19420889.2022.2143101. eCollection 2022.
8
Tissue Recognition Based on Electrical Impedance Classified by Support Vector Machine in Spinal Operation Area.基于支持向量机的脊柱手术区域电阻抗分类的组织识别。
Orthop Surg. 2022 Sep;14(9):2276-2285. doi: 10.1111/os.13406. Epub 2022 Aug 1.
9
A proteome-wide systems toxicological approach deciphers the interaction network of chemotherapeutic drugs in the cardiovascular milieu.一种全蛋白质组范围的系统毒理学方法解析了心血管环境中化疗药物的相互作用网络。
RSC Adv. 2018 Jun 4;8(36):20211-20221. doi: 10.1039/c8ra02877j. eCollection 2018 May 30.
10
piRNAs and PIWI Proteins as Diagnostic and Prognostic Markers of Genitourinary Cancers.piRNAs 和 PIWI 蛋白作为泌尿生殖系统癌症的诊断和预后标志物。
Biomolecules. 2022 Jan 22;12(2):186. doi: 10.3390/biom12020186.
通过将组成、物理化学和结构特征纳入到周元的通用 PseAAC 中,提高了抗菌肽预测的准确性。
Sci Rep. 2017 Feb 13;7:42362. doi: 10.1038/srep42362.
4
iATC-mISF: a multi-label classifier for predicting the classes of anatomical therapeutic chemicals.iATC-mISF:一种用于预测解剖治疗化学物质类别的多标签分类器。
Bioinformatics. 2017 Feb 1;33(3):341-346. doi: 10.1093/bioinformatics/btw644.
5
Pse-Analysis: a python package for DNA/RNA and protein/ peptide sequence analysis based on pseudo components and kernel methods.伪分析:一个基于伪组件和核方法用于DNA/RNA以及蛋白质/肽序列分析的Python软件包。
Oncotarget. 2017 Feb 21;8(8):13338-13343. doi: 10.18632/oncotarget.14524.
6
Unb-DPC: Identify mycobacterial membrane protein types by incorporating un-biased dipeptide composition into Chou's general PseAAC.Unb-DPC:通过将无偏差二肽组成纳入周的通用伪氨基酸组成来鉴定分枝杆菌膜蛋白类型。
J Theor Biol. 2017 Feb 21;415:13-19. doi: 10.1016/j.jtbi.2016.12.004. Epub 2016 Dec 8.
7
iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences.iRNA-AI:识别RNA序列中腺苷到肌苷的编辑位点。
Oncotarget. 2017 Jan 17;8(3):4208-4217. doi: 10.18632/oncotarget.13758.
8
OOgenesis_Pred: A sequence-based method for predicting oogenesis proteins by six different modes of Chou's pseudo amino acid composition.卵子发生预测:一种基于序列的方法,通过六种不同模式的周氏伪氨基酸组成预测卵子发生蛋白。
J Theor Biol. 2017 Feb 7;414:128-136. doi: 10.1016/j.jtbi.2016.11.028. Epub 2016 Dec 2.
9
iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.iOri-Human:通过将二核苷酸物理化学性质纳入伪核苷酸组成来识别人类复制起点。
Oncotarget. 2016 Oct 25;7(43):69783-69793. doi: 10.18632/oncotarget.11975.
10
Analysis and comparison of lignin peroxidases between fungi and bacteria using three different modes of Chou's general pseudo amino acid composition.使用三种不同模式的周氏一般伪氨基酸组成对真菌和细菌中的木质素过氧化物酶进行分析和比较。
J Theor Biol. 2016 Dec 21;411:1-5. doi: 10.1016/j.jtbi.2016.09.001. Epub 2016 Sep 8.