• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种整合的蛋白质基因组学方法揭示了小鼠肾脏髓质中注释 lincRNA 编码的肽。

An integrative proteogenomics approach reveals peptides encoded by annotated lincRNA in the mouse kidney inner medulla.

机构信息

Epithelial Systems Biology Laboratory, Systems Biology Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland.

出版信息

Physiol Genomics. 2020 Oct 1;52(10):485-491. doi: 10.1152/physiolgenomics.00048.2020. Epub 2020 Aug 31.

DOI:10.1152/physiolgenomics.00048.2020
PMID:32866085
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7750510/
Abstract

Long noncoding RNAs (lncRNAs) are intracellular transcripts longer than 200 nucleotides and lack protein-coding information. A subclass of lncRNA known as long intergenic noncoding RNAs (lincRNAs) are transcribed from genomic regions that share no overlap with annotated protein-coding genes. Increasing evidence has shown that some annotated lincRNA transcripts do in fact contain open reading frames (ORFs) encoding functional short peptides in the cell. Few robust methods for lincRNA-encoded peptide identification have been reported, and the tissue-specific expression of these peptides has been largely unexplored. Here we propose an integrative workflow for lincRNA-encoded peptide discovery and test it on the mouse kidney inner medulla (IM). In brief, low molecular weight protein fractions were enriched from homogenate of IMs and trypsinized into shorter peptides, which were sequenced by high resolution liquid chromatography-tandem mass spectrometry (LC-MS/MS). To curate a hypothetical lincRNA-encoded peptide database for peptide-spectrum matching following LC-MS/MS, we performed RNA-Seq on IMs, computationally removed reads overlapping with annotated protein-coding genes, and remapped the remaining reads to a database of mouse noncoding transcripts to infer lincRNA expression. Expressed lincRNAs were searched for ORFs by an existing rule-based algorithm, and translated ORFs were used for peptide-spectrum matching. Peptides identified by LC-MS/MS were further evaluated by using several quality control criteria and bioinformatics methods. We discovered three novel lincRNA-encoded peptides, which are conserved in mouse, rat, and human. The workflow can be adapted for discovery of small protein-coding genes in any species or tissue where noncoding transcriptome information is available.

摘要

长链非编码 RNA(lncRNA)是长度超过 200 个核苷酸且缺乏蛋白编码信息的细胞内转录本。长链非编码 RNA 的一个亚类,长基因间非编码 RNA(lincRNA),由与注释的蛋白编码基因没有重叠的基因组区域转录。越来越多的证据表明,一些注释的 lincRNA 转录本实际上包含开放阅读框(ORF),在细胞中编码功能短肽。目前已经报道了几种用于鉴定 lincRNA 编码肽的稳健方法,但这些肽的组织特异性表达在很大程度上尚未得到探索。在这里,我们提出了一种用于 lincRNA 编码肽发现的综合工作流程,并在小鼠肾脏髓质(IM)上进行了测试。简而言之,从 IM 匀浆中富集低分子量蛋白质分数,并将其用胰蛋白酶切成较短的肽,然后通过高分辨率液相色谱-串联质谱(LC-MS/MS)进行测序。为了在 LC-MS/MS 后进行肽谱匹配,整理一个假设的 lincRNA 编码肽数据库,我们对 IM 进行了 RNA-Seq,计算去除与注释的蛋白编码基因重叠的读数,并将剩余的读数重新映射到小鼠非编码转录本数据库,以推断 lincRNA 的表达。通过现有的基于规则的算法搜索表达的 lincRNA 的 ORF,并使用翻译的 ORF 进行肽谱匹配。通过使用几种质量控制标准和生物信息学方法进一步评估通过 LC-MS/MS 鉴定的肽。我们发现了三个新的 lincRNA 编码肽,这些肽在小鼠、大鼠和人类中保守。该工作流程可适应于任何具有非编码转录组信息的物种或组织中发现小蛋白编码基因。

相似文献

1
An integrative proteogenomics approach reveals peptides encoded by annotated lincRNA in the mouse kidney inner medulla.一种整合的蛋白质基因组学方法揭示了小鼠肾脏髓质中注释 lincRNA 编码的肽。
Physiol Genomics. 2020 Oct 1;52(10):485-491. doi: 10.1152/physiolgenomics.00048.2020. Epub 2020 Aug 31.
2
Long Intergenic Noncoding RNA (lincRNA) Discovery from Non-Strand-Specific RNA-Seq Data.从非链特异性RNA测序数据中发现长基因间非编码RNA(lincRNA)
Methods Mol Biol. 2022;2443:465-482. doi: 10.1007/978-1-0716-2067-0_24.
3
A deep learning method for lincRNA detection using auto-encoder algorithm.一种使用自动编码器算法进行长链非编码RNA(lincRNA)检测的深度学习方法。
BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):511. doi: 10.1186/s12859-017-1922-3.
4
Genome-wide discovery of long intergenic noncoding RNAs and their epigenetic signatures in the rat.在大鼠中全基因组范围内发现长基因间非编码 RNA 及其表观遗传特征。
Sci Rep. 2017 Nov 1;7(1):14817. doi: 10.1038/s41598-017-13844-9.
5
The influence of transcript assembly on the proteogenomics discovery of microproteins.转录本组装对微小蛋白质的蛋白质基因组学发现的影响。
PLoS One. 2018 Mar 27;13(3):e0194518. doi: 10.1371/journal.pone.0194518. eCollection 2018.
6
Discovery of coding regions in the human genome by integrated proteogenomics analysis workflow.通过整合蛋白质基因组学分析流程发现人类基因组中的编码区域。
Nat Commun. 2018 Mar 2;9(1):903. doi: 10.1038/s41467-018-03311-y.
7
Identification of transcribed protein coding sequence remnants within lincRNAs.鉴定 lincRNAs 中转录蛋白编码序列的残段。
Nucleic Acids Res. 2018 Sep 28;46(17):8720-8729. doi: 10.1093/nar/gky608.
8
Long noncoding RNAs are rarely translated in two human cell lines.长非编码 RNA 在两种人类细胞系中很少被翻译。
Genome Res. 2012 Sep;22(9):1646-57. doi: 10.1101/gr.134767.111.
9
Systematic identification of long intergenic non-coding RNAs expressed in bovine oocytes.系统鉴定牛卵母细胞中表达的长基因间非编码 RNA。
Reprod Biol Endocrinol. 2020 Feb 21;18(1):13. doi: 10.1186/s12958-020-00573-4.
10
Proteogenomics-Guided Evaluation of RNA-Seq Assembly and Protein Database Construction for Emergent Model Organisms.基于蛋白质基因组学的新兴模式生物 RNA-Seq 组装和蛋白质数据库构建评估。
Proteomics. 2020 May;20(10):e1900261. doi: 10.1002/pmic.201900261. Epub 2020 May 18.

引用本文的文献

1
LncRNA-Mediated Tissue-Specific Plastic Responses to Salinity Changes in Oysters.长链非编码RNA介导的牡蛎对盐度变化的组织特异性可塑性反应
Int J Mol Sci. 2025 May 9;26(10):4523. doi: 10.3390/ijms26104523.
2
CircRNA and lncRNA-encoded peptide in diseases, an update review.环状 RNA 和长链非编码 RNA 编码肽在疾病中的研究进展综述
Mol Cancer. 2024 Sep 30;23(1):214. doi: 10.1186/s12943-024-02131-7.
3
Long noncoding RNA study: Genome-wide approaches.长链非编码RNA研究:全基因组方法。
Genes Dis. 2022 Nov 29;10(6):2491-2510. doi: 10.1016/j.gendis.2022.10.024. eCollection 2023 Nov.
4
Peptidomics Methods Applied to the Study of Flower Development.肽组学方法在花发育研究中的应用。
Methods Mol Biol. 2023;2686:509-536. doi: 10.1007/978-1-0716-3299-4_24.
5
CALINCA-A Novel Pipeline for the Identification of lncRNAs in Podocyte Disease.CALINCA-一种用于鉴定足细胞疾病中 lncRNAs 的新方法。
Cells. 2021 Mar 20;10(3):692. doi: 10.3390/cells10030692.

本文引用的文献

1
The UCSC Genome Browser database: 2019 update.UCSC 基因组浏览器数据库:2019 年更新。
Nucleic Acids Res. 2019 Jan 8;47(D1):D853-D858. doi: 10.1093/nar/gky1095.
2
UniProt: a worldwide hub of protein knowledge.UniProt:蛋白质知识的全球枢纽。
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515. doi: 10.1093/nar/gky1049.
3
GENCODE reference annotation for the human and mouse genomes.GENCODE 人类和小鼠基因组参考注释。
Nucleic Acids Res. 2019 Jan 8;47(D1):D766-D773. doi: 10.1093/nar/gky955.
4
The small peptide world in long noncoding RNAs.长非编码 RNA 中的小肽世界。
Brief Bioinform. 2019 Sep 27;20(5):1853-1864. doi: 10.1093/bib/bby055.
5
Towards a complete map of the human long non-coding RNA transcriptome.构建人类长非编码 RNA 转录组完整图谱。
Nat Rev Genet. 2018 Sep;19(9):535-548. doi: 10.1038/s41576-018-0017-y.
6
An update on sORFs.org: a repository of small ORFs identified by ribosome profiling.sORFs.org 更新:核糖体图谱鉴定的小开放阅读框数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D497-D502. doi: 10.1093/nar/gkx1130.
7
Mining for Micropeptides.挖掘微肽
Trends Cell Biol. 2017 Sep;27(9):685-696. doi: 10.1016/j.tcb.2017.04.006. Epub 2017 May 18.
8
mTORC1 and muscle regeneration are regulated by the LINC00961-encoded SPAR polypeptide.mTORC1 和肌肉再生受 LINC00961 编码的 SPAR 多肽调节。
Nature. 2017 Jan 12;541(7636):228-232. doi: 10.1038/nature21034. Epub 2016 Dec 26.
9
Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow.利用高严格性的蛋白质基因组学工作流程改进 GENCODE 参考基因注释。
Nat Commun. 2016 Jun 2;7:11778. doi: 10.1038/ncomms11778.
10
JBrowse: a dynamic web platform for genome visualization and analysis.JBrowse:一个用于基因组可视化和分析的动态网络平台。
Genome Biol. 2016 Apr 12;17:66. doi: 10.1186/s13059-016-0924-1.