• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一体化策略用于微量蛋白鉴定和序列分析。

An Integrated Approach for Microprotein Identification and Sequence Analysis.

机构信息

The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center.

The Heart Institute, Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center; Department of Pediatrics, University of Cincinnati College of Medicine;

出版信息

J Vis Exp. 2022 Jul 12(185). doi: 10.3791/63841.

DOI:10.3791/63841
PMID:35913170
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9521633/
Abstract

Next-generation sequencing (NGS) has propelled the field of genomics forward and produced whole genome sequences for numerous animal species and model organisms. However, despite this wealth of sequence information, comprehensive gene annotation efforts have proven challenging, especially for small proteins. Notably, conventional protein annotation methods were designed to intentionally exclude putative proteins encoded by short open reading frames (sORFs) less than 300 nucleotides in length to filter out the exponentially higher number of spurious noncoding sORFs throughout the genome. As a result, hundreds of functional small proteins called microproteins (<100 amino acids in length) have been incorrectly classified as noncoding RNAs or overlooked entirely. Here we provide a detailed protocol to leverage free, publicly available bioinformatic tools to query genomic regions for microprotein-coding potential based on evolutionary conservation. Specifically, we provide step-by-step instructions on how to examine sequence conservation and coding potential using Phylogenetic Codon Substitution Frequencies (PhyloCSF) on the user-friendly University of California Santa Cruz (UCSC) Genome Browser. Additionally, we detail steps to efficiently generate multiple species alignments of identified microprotein sequences to visualize amino acid sequence conservation and recommend resources to analyze microprotein characteristics, including predicted domain structures. These powerful tools can be used to help identify putative microprotein-coding sequences in noncanonical genomic regions or to rule out the presence of a conserved coding sequence with translational potential in a noncoding transcript of interest.

摘要

下一代测序(NGS)推动了基因组学领域的发展,为许多动物物种和模式生物生成了全基因组序列。然而,尽管有了如此丰富的序列信息,全面的基因注释工作仍然具有挑战性,尤其是对于小蛋白。值得注意的是,传统的蛋白质注释方法旨在有意排除短开放阅读框(sORF)编码的假定蛋白,这些 sORF 的长度小于 300 个核苷酸,以过滤掉基因组中数量呈指数级增加的虚假非编码 sORF。因此,数百种被称为微蛋白的功能小蛋白(长度小于 100 个氨基酸)被错误地归类为非编码 RNA 或完全被忽视。在这里,我们提供了一个详细的方案,利用免费的、公开的生物信息学工具,根据进化保守性查询基因组区域的微蛋白编码潜力。具体来说,我们提供了如何使用用户友好的加州大学圣克鲁兹分校(UCSC)基因组浏览器上的 Phylogenetic Codon Substitution Frequencies(PhyloCSF)检查序列保守性和编码潜力的分步说明。此外,我们详细介绍了如何有效地生成已识别的微蛋白序列的多种物种比对,以可视化氨基酸序列保守性,并推荐用于分析微蛋白特征的资源,包括预测的结构域结构。这些强大的工具可用于帮助识别非规范基因组区域中的假定微蛋白编码序列,或排除具有翻译潜力的保守编码序列在感兴趣的非编码转录本中的存在。

相似文献

1
An Integrated Approach for Microprotein Identification and Sequence Analysis.一体化策略用于微量蛋白鉴定和序列分析。
J Vis Exp. 2022 Jul 12(185). doi: 10.3791/63841.
2
Revisiting sORFs: overcoming challenges to identify and characterize functional microproteins.重新审视短开放阅读框(sORFs):克服鉴定和表征功能微蛋白的挑战。
FEBS J. 2022 Jan;289(1):53-74. doi: 10.1111/febs.15769. Epub 2021 Feb 24.
3
Microproteins-Discovery, structure, and function.微量蛋白——发现、结构与功能。
Proteomics. 2023 Dec;23(23-24):e2100211. doi: 10.1002/pmic.202100211. Epub 2023 Aug 21.
4
Microproteins: Overlooked regulators of physiology and disease.微小蛋白:被忽视的生理与疾病调节因子。
iScience. 2023 Apr 29;26(6):106781. doi: 10.1016/j.isci.2023.106781. eCollection 2023 Jun 16.
5
Computational discovery and annotation of conserved small open reading frames in fungal genomes.计算发现和注释真菌基因组中的保守小开放阅读框。
BMC Bioinformatics. 2019 Feb 4;19(Suppl 13):551. doi: 10.1186/s12859-018-2550-2.
6
Discovery and annotation of small proteins using genomics, proteomics, and computational approaches.利用基因组学、蛋白质组学和计算方法发现和注释小蛋白。
Genome Res. 2011 Apr;21(4):634-41. doi: 10.1101/gr.109280.110. Epub 2011 Mar 2.
7
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
8
D-sORF: Accurate Ab Initio Classification of Experimentally Detected Small Open Reading Frames (sORFs) Associated with Translational Machinery.D-sORF:对实验检测到的与翻译机制相关的小开放阅读框(sORF)进行准确的从头分类。
Biology (Basel). 2024 Jul 26;13(8):563. doi: 10.3390/biology13080563.
9
Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures.短开放阅读框 (sORFs) 和微蛋白:它们的鉴定和验证措施的最新进展。
J Biomed Sci. 2022 Mar 17;29(1):19. doi: 10.1186/s12929-022-00802-5.
10
Integrated workflow for discovery of microprotein-coding small open reading frames.微蛋白编码小开放阅读框发现的综合工作流程。
STAR Protoc. 2023 Dec 15;4(4):102649. doi: 10.1016/j.xpro.2023.102649. Epub 2023 Oct 23.

引用本文的文献

1
Investigating the role of long non-coding RNA in hypertrophic cardiomyopathy.研究长链非编码RNA在肥厚型心肌病中的作用。
bioRxiv. 2025 Jul 31:2025.07.26.666851. doi: 10.1101/2025.07.26.666851.
2
LncRNA-Encoded Micropeptides: Expression Validation, Translational Mechanisms, and Roles in Cellular Metabolism.长链非编码RNA编码的微肽:表达验证、翻译机制及其在细胞代谢中的作用
Int J Mol Sci. 2025 Jun 19;26(12):5913. doi: 10.3390/ijms26125913.
3
Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy.

本文引用的文献

1
De novo birth of functional microproteins in the human lineage.人类谱系中功能性微蛋白的从头生成。
Cell Rep. 2022 Dec 20;41(12):111808. doi: 10.1016/j.celrep.2022.111808.
2
The cardiac-enriched microprotein mitolamban regulates mitochondrial respiratory complex assembly and function in mice.富含心脏的微蛋白mitolamban 调节小鼠线粒体呼吸复合物的组装和功能。
Proc Natl Acad Sci U S A. 2022 Feb 8;119(6). doi: 10.1073/pnas.2120476119.
3
TUNAR lncRNA Encodes a Microprotein that Regulates Neural Differentiation and Neurite Formation by Modulating Calcium Dynamics.
探索人类蛋白质组的暗物质:非规范开放阅读框(ncORF)在癌症诊断、生物学及治疗中的新兴作用
Cancers (Basel). 2024 Jul 26;16(15):2660. doi: 10.3390/cancers16152660.
4
Microproteins: Overlooked regulators of physiology and disease.微小蛋白:被忽视的生理与疾病调节因子。
iScience. 2023 Apr 29;26(6):106781. doi: 10.1016/j.isci.2023.106781. eCollection 2023 Jun 16.
TUNAR长链非编码RNA编码一种微蛋白,该微蛋白通过调节钙动力学来调控神经分化和神经突形成。
Front Cell Dev Biol. 2021 Dec 31;9:747667. doi: 10.3389/fcell.2021.747667. eCollection 2021.
4
SignalP 6.0 predicts all five types of signal peptides using protein language models.SignalP 6.0 使用蛋白质语言模型预测所有五种类型的信号肽。
Nat Biotechnol. 2022 Jul;40(7):1023-1025. doi: 10.1038/s41587-021-01156-3. Epub 2022 Jan 3.
5
The dark proteome: translation from noncanonical open reading frames.暗蛋白质组学:从非规范开放阅读框的翻译。
Trends Cell Biol. 2022 Mar;32(3):243-258. doi: 10.1016/j.tcb.2021.10.010. Epub 2021 Nov 26.
6
Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution.基因间开放阅读框作为从头起源基因诞生和蛋白质进化的基本结构模块。
Genome Res. 2021 Dec;31(12):2303-2315. doi: 10.1101/gr.275638.121. Epub 2021 Nov 22.
7
Micropeptide ASAP encoded by LINC00467 promotes colorectal cancer progression by directly modulating ATP synthase activity.LINC00467 编码的微肽 ASAP 通过直接调节 ATP 合酶活性促进结直肠癌的进展。
J Clin Invest. 2021 Nov 15;131(22). doi: 10.1172/JCI152911.
8
A putative long noncoding RNA-encoded micropeptide maintains cellular homeostasis in pancreatic β cells.一种假定的长链非编码RNA编码的微肽维持胰腺β细胞的细胞稳态。
Mol Ther Nucleic Acids. 2021 Jul 16;26:307-320. doi: 10.1016/j.omtn.2021.06.027. eCollection 2021 Dec 3.
9
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.比较 44 种 Sarbecovirus 基因组分析 SARS-CoV-2 的基因组成和 COVID-19 的突变影响。
Nat Commun. 2021 May 11;12(1):2642. doi: 10.1038/s41467-021-22905-7.
10
Revisiting sORFs: overcoming challenges to identify and characterize functional microproteins.重新审视短开放阅读框(sORFs):克服鉴定和表征功能微蛋白的挑战。
FEBS J. 2022 Jan;289(1):53-74. doi: 10.1111/febs.15769. Epub 2021 Feb 24.