Suppr超能文献

人类转录组中的长链非编码RNA图谱

The landscape of long noncoding RNAs in the human transcriptome.

作者信息

Iyer Matthew K, Niknafs Yashar S, Malik Rohit, Singhal Udit, Sahu Anirban, Hosono Yasuyuki, Barrette Terrence R, Prensner John R, Evans Joseph R, Zhao Shuang, Poliakov Anton, Cao Xuhong, Dhanasekaran Saravana M, Wu Yi-Mi, Robinson Dan R, Beer David G, Feng Felix Y, Iyer Hariharan K, Chinnaiyan Arul M

机构信息

1] Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, Michigan, USA. [2] Department of Computational Medicine and Bioinformatics, Ann Arbor, Michigan, USA.

1] Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, Michigan, USA. [2] Department of Cellular and Molecular Biology, University of Michigan, Ann Arbor, Michigan, USA.

出版信息

Nat Genet. 2015 Mar;47(3):199-208. doi: 10.1038/ng.3192. Epub 2015 Jan 19.

Abstract

Long noncoding RNAs (lncRNAs) are emerging as important regulators of tissue physiology and disease processes including cancer. To delineate genome-wide lncRNA expression, we curated 7,256 RNA sequencing (RNA-seq) libraries from tumors, normal tissues and cell lines comprising over 43 Tb of sequence from 25 independent studies. We applied ab initio assembly methodology to this data set, yielding a consensus human transcriptome of 91,013 expressed genes. Over 68% (58,648) of genes were classified as lncRNAs, of which 79% were previously unannotated. About 1% (597) of the lncRNAs harbored ultraconserved elements, and 7% (3,900) overlapped disease-associated SNPs. To prioritize lineage-specific, disease-associated lncRNA expression, we employed non-parametric differential expression testing and nominated 7,942 lineage- or cancer-associated lncRNA genes. The lncRNA landscape characterized here may shed light on normal biology and cancer pathogenesis and may be valuable for future biomarker development.

摘要

长链非编码RNA(lncRNA)正成为包括癌症在内的组织生理学和疾病进程的重要调节因子。为了描绘全基因组lncRNA的表达情况,我们整理了来自肿瘤、正常组织和细胞系的7256个RNA测序(RNA-seq)文库,这些文库包含来自25项独立研究的超过43太字节的序列。我们将从头组装方法应用于该数据集,得到了一个包含91013个表达基因的人类转录组共识。超过68%(58648个)的基因被归类为lncRNA,其中79%以前未被注释。约1%(597个)的lncRNA含有超保守元件,7%(3900个)与疾病相关的单核苷酸多态性(SNP)重叠。为了对特定谱系、与疾病相关的lncRNA表达进行优先级排序,我们采用了非参数差异表达测试,并确定了7942个与谱系或癌症相关的lncRNA基因。这里所描绘的lncRNA图谱可能有助于揭示正常生物学和癌症发病机制,并且可能对未来生物标志物的开发具有重要价值。

相似文献

1
The landscape of long noncoding RNAs in the human transcriptome.人类转录组中的长链非编码RNA图谱
Nat Genet. 2015 Mar;47(3):199-208. doi: 10.1038/ng.3192. Epub 2015 Jan 19.

引用本文的文献

7
Long non-coding RNA : A crucial factor in fibrotic diseases.长链非编码RNA:纤维化疾病中的关键因素。
Mol Ther Nucleic Acids. 2025 Jul 17;36(3):102630. doi: 10.1016/j.omtn.2025.102630. eCollection 2025 Sep 9.

本文引用的文献

1
Genenames.org: the HGNC resources in 2015.Genenames.org:2015年的HGNC资源。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1079-85. doi: 10.1093/nar/gku1071. Epub 2014 Oct 31.
3
A draft map of the human proteome.人类蛋白质组草图。
Nature. 2014 May 29;509(7502):575-81. doi: 10.1038/nature13302.
7
The NHGRI GWAS Catalog, a curated resource of SNP-trait associations.NHGRI GWAS Catalog,一个经过精心策划的 SNP 与特征关联资源。
Nucleic Acids Res. 2014 Jan;42(Database issue):D1001-6. doi: 10.1093/nar/gkt1229. Epub 2013 Dec 6.
8
Pfam: the protein families database.Pfam:蛋白质家族数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27.
9
The UCSC Genome Browser database: 2014 update.UCSC 基因组浏览器数据库:2014 年更新。
Nucleic Acids Res. 2014 Jan;42(Database issue):D764-70. doi: 10.1093/nar/gkt1168. Epub 2013 Nov 21.
10
RefSeq: an update on mammalian reference sequences.RefSeq:哺乳动物参考序列的更新。
Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验