• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RNAcode_Web——方便鉴定进化保守的蛋白质编码区。

RNAcode_Web - Convenient identification of evolutionary conserved protein coding regions.

机构信息

Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany.

Max-Planck-Institute for Mathematics in the Sciences, Inselstraße 22, D-04103 Leipzig, Germany.

出版信息

J Integr Bioinform. 2023 Aug 25;20(3). doi: 10.1515/jib-2022-0046. eCollection 2023 Sep 1.

DOI:10.1515/jib-2022-0046
PMID:37615674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10757073/
Abstract

The differentiation of regions with coding potential from non-coding regions remains a key task in computational biology. Methods such as RNAcode that exploit patterns of sequence conservation for this task have a substantial advantage in classification accuracy in particular for short coding sequences, compared to methods that rely on a single input sequence. However, they require sequence alignments as input. Frequently, suitable multiple sequence alignments are not readily available and are tedious, and sometimes difficult to construct. We therefore introduce here a new web service that provides access to the well-known coding sequence detector RNAcode with minimal user overhead. It requires as input only a single target nucleotide sequence. The service automates the collection, selection, and preparation of homologous sequences from the NCBI database, as well as the construction of the multiple sequence alignment that are needed as input for RNAcode. The service automatizes the entire pre- and postprocessing and thus makes the investigation of specific genomic regions for previously unannotated coding regions, such as small peptides or additional introns, a simple task that is easily accessible to non-expert users. RNAcode_Web is accessible online at rnacode.bioinf.uni-leipzig.de.

摘要

从编码区域中区分非编码区域仍然是计算生物学中的一项关键任务。与依赖单一输入序列的方法相比,利用序列保守模式进行此任务的方法(如 RNAcode)在分类准确性方面具有显著优势,特别是对于短编码序列。然而,它们需要序列比对作为输入。通常,合适的多重序列比对不容易获得,并且很繁琐,有时难以构建。因此,我们在这里引入了一个新的网络服务,该服务提供对著名的编码序列检测器 RNAcode 的访问,用户只需付出最小的努力。它只需要一个单一的目标核苷酸序列作为输入。该服务自动从 NCBI 数据库中收集、选择和准备同源序列,并构建 RNAcode 所需的多重序列比对。该服务自动完成整个预处理和后处理,因此,对于以前未注释的编码区域(如小肽或额外的内含子)的特定基因组区域的研究成为一项简单的任务,即使是非专业用户也可以轻松访问。RNAcode_Web 可在线访问,网址为 rnacode.bioinf.uni-leipzig.de。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/7f9073ea48d3/j_jib-2022-0046_fig_003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/635b5578bb7d/j_jib-2022-0046_fig_001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/4539648b6935/j_jib-2022-0046_fig_002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/7f9073ea48d3/j_jib-2022-0046_fig_003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/635b5578bb7d/j_jib-2022-0046_fig_001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/4539648b6935/j_jib-2022-0046_fig_002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa16/10757073/7f9073ea48d3/j_jib-2022-0046_fig_003.jpg

相似文献

1
RNAcode_Web - Convenient identification of evolutionary conserved protein coding regions.RNAcode_Web——方便鉴定进化保守的蛋白质编码区。
J Integr Bioinform. 2023 Aug 25;20(3). doi: 10.1515/jib-2022-0046. eCollection 2023 Sep 1.
2
RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.RNAcode:在比较序列数据中稳健地区分编码和非编码区域。
RNA. 2011 Apr;17(4):578-94. doi: 10.1261/rna.2536111. Epub 2011 Feb 28.
3
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign:利用氨基酸促进蛋白质编码DNA序列的多重比对。
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.
4
uPEPperoni: an online tool for upstream open reading frame location and analysis of transcript conservation.uPEPperoni:一个用于上游开放阅读框定位和转录保守性分析的在线工具。
BMC Bioinformatics. 2014 Feb 1;15:36. doi: 10.1186/1471-2105-15-36.
5
Detecting overlapping coding sequences in virus genomes.检测病毒基因组中的重叠编码序列。
BMC Bioinformatics. 2006 Feb 16;7:75. doi: 10.1186/1471-2105-7-75.
6
Aligning Protein-Coding Nucleotide Sequences with MACSE.使用MACSE比对蛋白质编码核苷酸序列。
Methods Mol Biol. 2021;2231:51-70. doi: 10.1007/978-1-0716-1036-7_4.
7
New tools to analyze overlapping coding regions.用于分析重叠编码区域的新工具。
BMC Bioinformatics. 2016 Dec 13;17(1):530. doi: 10.1186/s12859-016-1389-7.
8
Protein-coding structured RNAs: A computational survey of conserved RNA secondary structures overlapping coding regions in drosophilids.蛋白质编码的结构 RNA:对果蝇中编码区域重叠的保守 RNA 二级结构的计算调查。
Biochimie. 2011 Nov;93(11):2019-23. doi: 10.1016/j.biochi.2011.07.023. Epub 2011 Jul 31.
9
Ensembl comparative genomics resources.Ensembl比较基因组学资源。
Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav096. Print 2016.
10
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.

引用本文的文献

1
The hidden bacterial microproteome.隐藏的细菌微蛋白质组
Mol Cell. 2025 Mar 6;85(5):1024-1041.e6. doi: 10.1016/j.molcel.2025.01.025. Epub 2025 Feb 19.
2
The Cryptic Bacterial Microproteome.神秘的细菌微蛋白质组
bioRxiv. 2024 Feb 18:2024.02.17.580829. doi: 10.1101/2024.02.17.580829.

本文引用的文献

1
A workflow to identify novel proteins based on the direct mapping of peptide-spectrum-matches to genomic locations.基于肽-谱匹配直接映射到基因组位置来鉴定新蛋白质的工作流程。
BMC Bioinformatics. 2021 May 26;22(1):277. doi: 10.1186/s12859-021-04159-8.
2
The largely unexplored biology of small proteins in pro- and eukaryotes.原核生物和真核生物中小蛋白的大部分未被探索的生物学。
FEBS J. 2021 Dec;288(24):7002-7024. doi: 10.1111/febs.15845. Epub 2021 Jun 8.
3
Progressive Cactus is a multiple-genome aligner for the thousand-genome era.渐进仙人掌是一个适用于千基因组时代的多基因组比对工具。
Nature. 2020 Nov;587(7833):246-251. doi: 10.1038/s41586-020-2871-y. Epub 2020 Nov 11.
4
Enrichment and identification of small proteins in a simplified human gut microbiome.简化人体肠道微生物组中小蛋白的富集和鉴定。
J Proteomics. 2020 Feb 20;213:103604. doi: 10.1016/j.jprot.2019.103604. Epub 2019 Dec 10.
5
Unraveling the hidden universe of small proteins in bacterial genomes.揭示细菌基因组中小蛋白的隐藏宇宙。
Mol Syst Biol. 2019 Feb 22;15(2):e8290. doi: 10.15252/msb.20188290.
6
Mining for missed sORF-encoded peptides.挖掘缺失的短开放阅读框编码肽。
Expert Rev Proteomics. 2019 Mar;16(3):257-266. doi: 10.1080/14789450.2019.1571919. Epub 2019 Feb 13.
7
Classification and function of small open reading frames.小开放阅读框的分类与功能。
Nat Rev Mol Cell Biol. 2017 Sep;18(9):575-589. doi: 10.1038/nrm.2017.58. Epub 2017 Jul 12.
8
Mining for Micropeptides.挖掘微肽
Trends Cell Biol. 2017 Sep;27(9):685-696. doi: 10.1016/j.tcb.2017.04.006. Epub 2017 May 18.
9
FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome.FEELnc:一种用于长链非编码RNA注释的工具及其在犬转录组中的应用。
Nucleic Acids Res. 2017 May 5;45(8):e57. doi: 10.1093/nar/gkw1306.
10
COME: a robust coding potential calculation tool for lncRNA identification and characterization based on multiple features.COME:一种基于多种特征的用于lncRNA鉴定和表征的强大编码潜力计算工具。
Nucleic Acids Res. 2017 Jan 9;45(1):e2. doi: 10.1093/nar/gkw798. Epub 2016 Sep 7.