• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过整合保守性、二级结构以及高通量测序和芯片数据预测和鉴定秀丽隐杆线虫中的非编码 RNA。

Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array data.

机构信息

Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA.

出版信息

Genome Res. 2011 Feb;21(2):276-85. doi: 10.1101/gr.110189.110. Epub 2010 Dec 22.

DOI:10.1101/gr.110189.110
PMID:21177971
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3032931/
Abstract

We present an integrative machine learning method, incRNA, for whole-genome identification of noncoding RNAs (ncRNAs). It combines a large amount of expression data, RNA secondary-structure stability, and evolutionary conservation at the protein and nucleic-acid level. Using the incRNA model and data from the modENCODE consortium, we are able to separate known C. elegans ncRNAs from coding sequences and other genomic elements with a high level of accuracy (97% AUC on an independent validation set), and find more than 7000 novel ncRNA candidates, among which more than 1000 are located in the intergenic regions of C. elegans genome. Based on the validation set, we estimate that 91% of the approximately 7000 novel ncRNA candidates are true positives. We then analyze 15 novel ncRNA candidates by RT-PCR, detecting the expression for 14. In addition, we characterize the properties of all the novel ncRNA candidates and find that they have distinct expression patterns across developmental stages and tend to use novel RNA structural families. We also find that they are often targeted by specific transcription factors (∼59% of intergenic novel ncRNA candidates). Overall, our study identifies many new potential ncRNAs in C. elegans and provides a method that can be adapted to other organisms.

摘要

我们提出了一种整合机器学习方法 incRNA,用于全基因组鉴定非编码 RNA(ncRNA)。它结合了大量的表达数据、RNA 二级结构稳定性以及在蛋白质和核酸水平上的进化保守性。利用 incRNA 模型和 modENCODE 联盟的数据,我们能够以高精度(独立验证集上的 AUC 为 97%)将已知的秀丽隐杆线虫 ncRNA 与编码序列和其他基因组元件区分开来,并发现了 7000 多个新的 ncRNA 候选者,其中 1000 多个位于秀丽隐杆线虫基因组的基因间区域。基于验证集,我们估计大约 7000 个新的 ncRNA 候选者中有 91%是真正的阳性。然后,我们通过 RT-PCR 分析了 15 个新的 ncRNA 候选者,检测到了 14 个的表达情况。此外,我们还分析了所有新的 ncRNA 候选者的特性,发现它们在发育阶段具有不同的表达模式,并倾向于使用新的 RNA 结构家族。我们还发现它们经常被特定的转录因子靶向(约 59%的基因间新 ncRNA 候选者)。总的来说,我们的研究在秀丽隐杆线虫中鉴定了许多新的潜在 ncRNA,并提供了一种可适用于其他生物体的方法。

相似文献

1
Prediction and characterization of noncoding RNAs in C. elegans by integrating conservation, secondary structure, and high-throughput sequencing and array data.通过整合保守性、二级结构以及高通量测序和芯片数据预测和鉴定秀丽隐杆线虫中的非编码 RNA。
Genome Res. 2011 Feb;21(2):276-85. doi: 10.1101/gr.110189.110. Epub 2010 Dec 22.
2
Microarray analysis of ncRNA expression patterns in Caenorhabditis elegans after RNAi against snoRNA associated proteins.针对与小核仁RNA相关蛋白进行RNA干扰后,秀丽隐杆线虫中非编码RNA表达模式的微阵列分析。
BMC Genomics. 2008 Jun 11;9:278. doi: 10.1186/1471-2164-9-278.
3
Experimental annotation of the human pathogen Candida albicans coding and noncoding transcribed regions using high-resolution tiling arrays.使用高分辨率平铺阵列对人类病原体白念珠菌编码和非编码转录区域进行实验注释。
Genome Biol. 2010;11(7):R71. doi: 10.1186/gb-2010-11-7-r71. Epub 2010 Jul 9.
4
Identification of intermediate-size non-coding RNAs involved in the UV-induced DNA damage response in C. elegans.鉴定秀丽隐杆线虫中参与 UV 诱导的 DNA 损伤反应的中等大小非编码 RNA。
PLoS One. 2012;7(11):e48066. doi: 10.1371/journal.pone.0048066. Epub 2012 Nov 7.
5
Structure-based whole-genome realignment reveals many novel noncoding RNAs.基于结构的全基因组重排揭示了许多新的非编码 RNA。
Genome Res. 2013 Jun;23(6):1018-27. doi: 10.1101/gr.137091.111. Epub 2013 Jan 7.
6
Organization of the Caenorhabditis elegans small non-coding transcriptome: genomic features, biogenesis, and expression.秀丽隐杆线虫小非编码转录组的组织:基因组特征、生物发生及表达
Genome Res. 2006 Jan;16(1):20-9. doi: 10.1101/gr.4139206. Epub 2005 Dec 12.
7
Biocomputational prediction of non-coding RNAs in model cyanobacteria.模式蓝藻中非编码RNA的生物计算预测
BMC Genomics. 2009 Mar 23;10:123. doi: 10.1186/1471-2164-10-123.
8
Systematic identification of non-coding RNA 2,2,7-trimethylguanosine cap structures in Caenorhabditis elegans.秀丽隐杆线虫中非编码RNA 2,2,7-三甲基鸟苷帽结构的系统鉴定。
BMC Mol Biol. 2007 Sep 29;8:86. doi: 10.1186/1471-2199-8-86.
9
Prediction of structured non-coding RNAs in the genomes of the nematodes Caenorhabditis elegans and Caenorhabditis briggsae.秀丽隐杆线虫和briggsae秀丽隐杆线虫基因组中结构化非编码RNA的预测。
J Exp Zool B Mol Dev Evol. 2006 Jul 15;306(4):379-92. doi: 10.1002/jez.b.21086.
10
Profiling Caenorhabditis elegans non-coding RNA expression with a combined microarray.利用组合微阵列分析秀丽隐杆线虫非编码RNA表达
Nucleic Acids Res. 2006 May 31;34(10):2976-83. doi: 10.1093/nar/gkl371. Print 2006.

引用本文的文献

1
CGC1, a new reference genome for .CGC1,一个用于……的新参考基因组。 (原文不完整,翻译只能到这里)
Genome Res. 2025 Aug 1;35(8):1902-1918. doi: 10.1101/gr.280274.124.
2
Detecting gene expression in Caenorhabditis elegans.检测秀丽隐杆线虫中的基因表达。
Genetics. 2025 Jan 8;229(1):1-108. doi: 10.1093/genetics/iyae167.
3
CGC1, a new reference genome for .CGC1,一种用于……的新参考基因组。 (原文不完整,翻译可能不太准确,需结合完整内容进一步完善)
bioRxiv. 2024 Dec 6:2024.12.04.626850. doi: 10.1101/2024.12.04.626850.
4
Long Non-coding RNAs: Mechanisms, Experimental, and Computational Approaches in Identification, Characterization, and Their Biomarker Potential in Cancer.长链非编码RNA:癌症中识别、表征及其生物标志物潜力的机制、实验及计算方法
Front Genet. 2021 Jul 1;12:649619. doi: 10.3389/fgene.2021.649619. eCollection 2021.
5
The computational approaches of lncRNA identification based on coding potential: and challenges.基于编码潜能的lncRNA识别计算方法及挑战
Comput Struct Biotechnol J. 2020 Nov 19;18:3666-3677. doi: 10.1016/j.csbj.2020.11.030. eCollection 2020.
6
Translational adaptation to heat stress is mediated by RNA 5-methylcytosine in Caenorhabditis elegans.热应激的翻译适应是由秀丽隐杆线虫中的 RNA 5-甲基胞嘧啶介导的。
EMBO J. 2021 Mar 15;40(6):e105496. doi: 10.15252/embj.2020105496. Epub 2020 Dec 7.
7
Predicting functional long non-coding RNAs validated by low throughput experiments.预测经低通量实验验证的功能性长非编码 RNA。
RNA Biol. 2019 Nov;16(11):1555-1564. doi: 10.1080/15476286.2019.1644590. Epub 2019 Jul 26.
8
Distribution of ncRNAs expression across hypothalamic-pituitary-gonadal axis in Capra hircus.绵羊下丘脑-垂体-性腺轴中非编码 RNA 的表达分布。
BMC Genomics. 2018 May 30;19(1):417. doi: 10.1186/s12864-018-4767-x.
9
A Review on Recent Computational Methods for Predicting Noncoding RNAs.关于预测非编码 RNA 的最新计算方法的综述
Biomed Res Int. 2017;2017:9139504. doi: 10.1155/2017/9139504. Epub 2017 May 3.
10
Recurrently deregulated lncRNAs in hepatocellular carcinoma.肝细胞癌中反复失调的长非编码 RNA。
Nat Commun. 2017 Feb 13;8:14421. doi: 10.1038/ncomms14421.

本文引用的文献

1
Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.模式生物基因组计划对秀丽隐杆线虫基因组的综合分析。
Science. 2010 Dec 24;330(6012):1775-87. doi: 10.1126/science.1196914. Epub 2010 Dec 22.
2
Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays.RNA-Seq 和 tiling 阵列转录组数据的比较和校准。
BMC Genomics. 2010 Jun 17;11:383. doi: 10.1186/1471-2164-11-383.
3
Genome-wide identification of binding sites defines distinct functions for Caenorhabditis elegans PHA-4/FOXA in development and environmental response.全基因组鉴定结合位点揭示了秀丽隐杆线虫 PHA-4/FOXA 在发育和环境响应中的不同功能。
PLoS Genet. 2010 Feb 19;6(2):e1000848. doi: 10.1371/journal.pgen.1000848.
4
Identification of novel non-coding RNAs using profiles of short sequence reads from next generation sequencing data.利用下一代测序数据的短序列读取谱鉴定新型非编码 RNA。
BMC Genomics. 2010 Feb 1;11:77. doi: 10.1186/1471-2164-11-77.
5
De novo prediction of structured RNAs from genomic sequences.从头预测基因组序列中的结构 RNA。
Trends Biotechnol. 2010 Jan;28(1):9-19. doi: 10.1016/j.tibtech.2009.09.006. Epub 2009 Nov 26.
6
TERRA: telomeric repeat-containing RNA.TERRA:含端粒重复序列的RNA。
EMBO J. 2009 Sep 2;28(17):2503-10. doi: 10.1038/emboj.2009.166. Epub 2009 Jul 23.
7
Unlocking the secrets of the genome.揭开基因组的秘密。
Nature. 2009 Jun 18;459(7249):927-30. doi: 10.1038/459927a.
8
Dynamic expression of small non-coding RNAs, including novel microRNAs and piRNAs/21U-RNAs, during Caenorhabditis elegans development.秀丽隐杆线虫发育过程中包括新型微小RNA和piRNA/21U-RNA在内的小非编码RNA的动态表达。
Genome Biol. 2009;10(5):R54. doi: 10.1186/gb-2009-10-5-r54. Epub 2009 May 21.
9
Infernal 1.0: inference of RNA alignments.Infernal 1.0:RNA比对推断
Bioinformatics. 2009 May 15;25(10):1335-7. doi: 10.1093/bioinformatics/btp157. Epub 2009 Mar 23.
10
Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals.染色质特征揭示了哺乳动物中一千多种高度保守的大型非编码RNA。
Nature. 2009 Mar 12;458(7235):223-7. doi: 10.1038/nature07672. Epub 2009 Feb 1.