• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

癌症中功能神秘的基因:利用 TCGA 数据绘制注释的局限性图谱。

Functionally Enigmatic Genes in Cancer: Using TCGA Data to Map the Limitations of Annotations.

机构信息

Johns Hopkins University, Bloomberg School of Public Health, Center for Alternatives to Animal Testing (CAAT), Baltimore, MD, United States.

ToxTrack LLC, Baltimore, USA.

出版信息

Sci Rep. 2020 Mar 5;10(1):4106. doi: 10.1038/s41598-020-60456-x.

DOI:10.1038/s41598-020-60456-x
PMID:32139709
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7057977/
Abstract

Cancer is a comparatively well-studied disease, yet despite decades of intense focus, we demonstrate here using data from The Cancer Genome Atlas that a substantial number of genes implicated in cancer are relatively poorly studied. Those genes will likely be missed by any data analysis pipeline, such as enrichment analysis, that depends exclusively on annotations for understanding biological function. There is no indication that the amount of research - indicated by number of publications - is correlated with any objective metric of gene significance. Moreover, these genes are not missing at random but reflect that our information about genes is gathered in a biased manner: poorly studied genes are more likely to be primate-specific and less likely to have a Mendelian inheritance pattern, and they tend to cluster in some biological processes and not others. While this likely reflects both technological limitations as well as the fact that well-known genes tend to gather more interest from the research community, in the absence of a concerted effort to study genes in an unbiased way, many genes (and biological processes) will remain opaque.

摘要

癌症是一种研究得比较透彻的疾病,但尽管我们已经进行了数十年的深入研究,在这里,我们利用癌症基因组图谱的数据表明,大量与癌症相关的基因仍未得到充分研究。这些基因很可能会被任何完全依赖注释来理解生物功能的数据分析管道(如富集分析)所遗漏。没有迹象表明研究的数量(以出版物数量为指标)与基因重要性的任何客观指标相关。此外,这些基因并不是随机缺失的,而是反映了我们获取基因信息的方式存在偏差:研究较少的基因更有可能是灵长类动物所特有的,并且不太可能具有孟德尔遗传模式,而且它们往往在某些生物过程中聚集,而在其他过程中则不聚集。虽然这可能反映了技术限制以及众所周知的基因往往会从研究界获得更多关注的事实,但如果没有一致的努力以无偏倚的方式研究基因,许多基因(和生物过程)将仍然不为人知。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/cf32bc857dbb/41598_2020_60456_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/af78c59e2bdc/41598_2020_60456_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/e3a705a82033/41598_2020_60456_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/c6e47365a13f/41598_2020_60456_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/6ba740dbe4a1/41598_2020_60456_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/fe9bcfff4b2d/41598_2020_60456_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/cf32bc857dbb/41598_2020_60456_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/af78c59e2bdc/41598_2020_60456_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/e3a705a82033/41598_2020_60456_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/c6e47365a13f/41598_2020_60456_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/6ba740dbe4a1/41598_2020_60456_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/fe9bcfff4b2d/41598_2020_60456_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d6e/7057977/cf32bc857dbb/41598_2020_60456_Fig6_HTML.jpg

相似文献

1
Functionally Enigmatic Genes in Cancer: Using TCGA Data to Map the Limitations of Annotations.癌症中功能神秘的基因:利用 TCGA 数据绘制注释的局限性图谱。
Sci Rep. 2020 Mar 5;10(1):4106. doi: 10.1038/s41598-020-60456-x.
2
3
The Cancer Omics Atlas: an integrative resource for cancer omics annotations.癌症组学图谱:癌症组学注释的综合资源
BMC Med Genomics. 2018 Aug 8;11(1):63. doi: 10.1186/s12920-018-0381-7.
4
Cancer genome sequencing: the challenges ahead.癌症基因组测序:未来的挑战。
Bioessays. 2007 Aug;29(8):783-94. doi: 10.1002/bies.20610.
5
Substantial batch effects in TCGA exome sequences undermine pan-cancer analysis of germline variants.TCGA 外显子序列中大量的批次效应破坏了种系变异的泛癌症分析。
BMC Cancer. 2019 Aug 7;19(1):783. doi: 10.1186/s12885-019-5994-5.
6
OrthoFiller: utilising data from multiple species to improve the completeness of genome annotations.OrthoFiller:利用多个物种的数据提高基因组注释的完整性。
BMC Genomics. 2017 May 18;18(1):390. doi: 10.1186/s12864-017-3771-x.
7
The Cancer Genome Atlas: Creating Lasting Value beyond Its Data.癌症基因组图谱:在其数据之外创造持久价值。
Cell. 2018 Apr 5;173(2):283-285. doi: 10.1016/j.cell.2018.03.042.
8
A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data.利用癌症基因组图谱基因表达数据进行的全面基因组泛癌分类。
BMC Genomics. 2017 Jul 3;18(1):508. doi: 10.1186/s12864-017-3906-0.
9
Ontology-Based Prediction and Prioritization of Gene Functional Annotations.基于本体的基因功能注释预测与优先级排序
IEEE/ACM Trans Comput Biol Bioinform. 2016 Mar-Apr;13(2):248-60. doi: 10.1109/TCBB.2015.2459694.
10
Exploring cancer genomic data from the cancer genome atlas project.探索来自癌症基因组图谱计划的癌症基因组数据。
BMB Rep. 2016 Nov;49(11):607-611. doi: 10.5483/bmbrep.2016.49.11.145.

引用本文的文献

1
Pathway Analysis Interpretation in the Multi-Omic Era.多组学时代的通路分析解读
BioTech (Basel). 2025 Jul 29;14(3):58. doi: 10.3390/biotech14030058.
2
Development and experimental validation of dephosphorylation-related biomarkers to assess prognosis and immunotherapeutic response in gliomas.用于评估胶质瘤预后和免疫治疗反应的去磷酸化相关生物标志物的开发与实验验证
Front Immunol. 2025 Jan 3;15:1488894. doi: 10.3389/fimmu.2024.1488894. eCollection 2024.
3
Gene prioritization based on random walks with restarts and absorbing states, to define gene sets regulating drug pharmacodynamics from single-cell analyses.

本文引用的文献

1
Weighted Gene Correlation Network Analysis (WGCNA) Reveals Novel Transcription Factors Associated With Bisphenol A Dose-Response.加权基因共表达网络分析(WGCNA)揭示了与双酚A剂量反应相关的新型转录因子。
Front Genet. 2018 Nov 12;9:508. doi: 10.3389/fgene.2018.00508. eCollection 2018.
2
Massive mining of publicly available RNA-seq data from human and mouse.大规模挖掘人类和小鼠公共可用的 RNA-seq 数据。
Nat Commun. 2018 Apr 10;9(1):1366. doi: 10.1038/s41467-018-03751-6.
3
The long tail of molecular alterations in non-small cell lung cancer: a single-institution experience of next-generation sequencing in clinical molecular diagnostics.
基于带有重启动和吸收状态的随机游走的基因优先级排序,从单细胞分析中定义调节药物药效动力学的基因集。
PLoS One. 2022 Nov 7;17(11):e0268956. doi: 10.1371/journal.pone.0268956. eCollection 2022.
4
Applications for open access normalized synthesis in metastatic prostate cancer trials.转移性前列腺癌试验中开放获取标准化合成的应用。
Front Artif Intell. 2022 Sep 12;5:984836. doi: 10.3389/frai.2022.984836. eCollection 2022.
5
Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.基于用户提交的多样化实验,对途径数据库进行系统评估。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac355.
6
A Non-Hazardous Deparaffinization Protocol Enables Quantitative Proteomics of Core Needle Biopsy-Sized Formalin-Fixed and Paraffin-Embedded (FFPE) Tissue Specimens.一种非危害脱蜡方案可实现核心针活检大小福尔马林固定石蜡包埋(FFPE)组织标本的定量蛋白质组学分析。
Int J Mol Sci. 2022 Apr 18;23(8):4443. doi: 10.3390/ijms23084443.
7
Labour classified by cervical dilatation & fetal membrane rupture demonstrates differential impact on RNA-seq data for human myometrium tissues.根据宫颈扩张和胎膜破裂对人类子宫肌组织的 RNA-seq 数据的影响进行分类的劳动力。
PLoS One. 2021 Nov 19;16(11):e0260119. doi: 10.1371/journal.pone.0260119. eCollection 2021.
8
Multi-omics mapping of human papillomavirus integration sites illuminates novel cervical cancer target genes.多组学图谱描绘人类乳头瘤病毒整合位点,揭示新型宫颈癌靶标基因。
Br J Cancer. 2021 Nov;125(10):1408-1419. doi: 10.1038/s41416-021-01545-0. Epub 2021 Sep 15.
9
Identification of pathological transcription in autosomal dominant polycystic kidney disease epithelia.常染色体显性多囊肾病上皮细胞中的病理性转录鉴定。
Sci Rep. 2021 Jul 23;11(1):15139. doi: 10.1038/s41598-021-94442-8.
10
Using Multilayer Heterogeneous Networks to Infer Functions of Phosphorylated Sites.利用多层异质网络推断磷酸化位点的功能。
J Proteome Res. 2021 Jul 2;20(7):3532-3548. doi: 10.1021/acs.jproteome.1c00150. Epub 2021 Jun 24.
非小细胞肺癌中分子改变的长尾:单机构在临床分子诊断中进行下一代测序的经验。
J Clin Pathol. 2018 Sep;71(9):767-773. doi: 10.1136/jclinpath-2018-205032. Epub 2018 Mar 13.
4
Gene annotation bias impedes biomedical research.基因注释偏差阻碍了生物医学研究。
Sci Rep. 2018 Jan 22;8(1):1362. doi: 10.1038/s41598-018-19333-x.
5
TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells.TIMER:用于肿瘤浸润免疫细胞综合分析的网络服务器。
Cancer Res. 2017 Nov 1;77(21):e108-e110. doi: 10.1158/0008-5472.CAN-17-0307.
6
An Expanded View of Complex Traits: From Polygenic to Omnigenic.复杂性状的扩展观点:从多基因到泛基因
Cell. 2017 Jun 15;169(7):1177-1186. doi: 10.1016/j.cell.2017.05.038.
7
The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.2017年的STRING数据库:质量可控的蛋白质-蛋白质相互作用网络,广泛可用。
Nucleic Acids Res. 2017 Jan 4;45(D1):D362-D368. doi: 10.1093/nar/gkw937. Epub 2016 Oct 18.
8
Mitochondrial Mg(2+) homeostasis decides cellular energy metabolism and vulnerability to stress.线粒体镁离子稳态决定细胞能量代谢及对应激的易感性。
Sci Rep. 2016 Jul 26;6:30027. doi: 10.1038/srep30027.
9
Systematic pan-cancer analysis of tumour purity.肿瘤纯度的系统性泛癌分析。
Nat Commun. 2015 Dec 4;6:8971. doi: 10.1038/ncomms9971.
10
MPTP's pathway of toxicity indicates central role of transcription factor SP1.MPTP的毒性途径表明转录因子SP1的核心作用。
Arch Toxicol. 2015 May;89(5):743-55. doi: 10.1007/s00204-015-1509-6. Epub 2015 Apr 8.