• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

前后对比:遗留 TCGA 基因组数据公共数据库与统一 TCGA 基因组数据公共数据库的数据比较。

Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons' Data.

机构信息

Eli and Edythe L. Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA 02142, USA; The University of Texas Southwestern Medical School, Dallas, TX 75390, USA.

Department of Genetics, Lineberger Comprehensive Cancer Center, the University of North Carolin at Chapel Hill, Chapel Hill, NC 27599, USA.

出版信息

Cell Syst. 2019 Jul 24;9(1):24-34.e10. doi: 10.1016/j.cels.2019.06.006.

DOI:10.1016/j.cels.2019.06.006
PMID:31344359
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6707074/
Abstract

We present a systematic analysis of the effects of synchronizing a large-scale, deeply characterized, multi-omic dataset to the current human reference genome, using updated software, pipelines, and annotations. For each of 5 molecular data platforms in The Cancer Genome Atlas (TCGA)-mRNA and miRNA expression, single nucleotide variants, DNA methylation and copy number alterations-comprehensive sample, gene, and probe-level studies were performed, towards quantifying the degree of similarity between the 'legacy' GRCh37 (hg19) TCGA data and its GRCh38 (hg38) version as 'harmonized' by the Genomic Data Commons. We offer gene lists to elucidate differences that remained after controlling for confounders, and strategies to mitigate their impact on biological interpretation. Our results demonstrate that the hg19 and hg38 TCGA datasets are very highly concordant, promote informed use of either legacy or harmonized omics data, and provide a rubric that encourages similar comparisons as new data emerge and reference data evolve.

摘要

我们展示了一种系统的分析方法,该方法将大规模、深度特征化的多组学数据集同步到当前人类参考基因组,使用了更新的软件、管道和注释。对于癌症基因组图谱 (TCGA) 中的 5 种分子数据平台中的每一种 - mRNA 和 miRNA 表达、单核苷酸变体、DNA 甲基化和拷贝数改变 - 都进行了全面的样本、基因和探针水平研究,以量化“传统”GRCh37(hg19)TCGA 数据与其通过基因组数据共享资源“协调”的 GRCh38(hg38)版本之间的相似程度。我们提供了基因列表,以阐明在控制混杂因素后仍然存在的差异,并提供了减轻这些差异对生物学解释影响的策略。我们的结果表明,hg19 和 hg38 TCGA 数据集非常一致,促进了对传统或协调的组学数据的明智使用,并提供了一个准则,鼓励在新数据出现和参考数据演变时进行类似的比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/7d5d34019fdf/nihms-1535521-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/e4fb9032c1e7/nihms-1535521-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/c345d1e89b46/nihms-1535521-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/c0f209e282a3/nihms-1535521-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/95a6478012a2/nihms-1535521-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/7d5d34019fdf/nihms-1535521-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/e4fb9032c1e7/nihms-1535521-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/c345d1e89b46/nihms-1535521-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/c0f209e282a3/nihms-1535521-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/95a6478012a2/nihms-1535521-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a98/6707074/7d5d34019fdf/nihms-1535521-f0005.jpg

相似文献

1
Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons' Data.前后对比:遗留 TCGA 基因组数据公共数据库与统一 TCGA 基因组数据公共数据库的数据比较。
Cell Syst. 2019 Jul 24;9(1):24-34.e10. doi: 10.1016/j.cels.2019.06.006.
2
Uniform genomic data analysis in the NCI Genomic Data Commons.在 NCI 基因组数据共享中心进行统一的基因组数据分析。
Nat Commun. 2021 Feb 22;12(1):1226. doi: 10.1038/s41467-021-21254-9.
3
Similarities and differences between variants called with human reference genome HG19 or HG38.与使用人类参考基因组 HG19 或 HG38 调用的变体之间的相似性和差异。
BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):101. doi: 10.1186/s12859-019-2620-0.
4
Omics Pipe: a community-based framework for reproducible multi-omics data analysis.组学管道:一个基于社区的可重复多组学数据分析框架。
Bioinformatics. 2015 Jun 1;31(11):1724-8. doi: 10.1093/bioinformatics/btv061. Epub 2015 Jan 30.
5
New functionalities in the TCGAbiolinks package for the study and integration of cancer data from GDC and GTEx.TCGAbiolinks 包中的新功能,用于研究和整合来自 GDC 和 GTEx 的癌症数据。
PLoS Comput Biol. 2019 Mar 5;15(3):e1006701. doi: 10.1371/journal.pcbi.1006701. eCollection 2019 Mar.
6
Large-scale profiling of microRNAs for The Cancer Genome Atlas.癌症基因组图谱的大规模微小RNA分析
Nucleic Acids Res. 2016 Jan 8;44(1):e3. doi: 10.1093/nar/gkv808. Epub 2015 Aug 13.
7
canEvolve: a web portal for integrative oncogenomics.canEvolve:一个整合肿瘤基因组学的网络门户。
PLoS One. 2013;8(2):e56228. doi: 10.1371/journal.pone.0056228. Epub 2013 Feb 13.
8
The Cancer Omics Atlas: an integrative resource for cancer omics annotations.癌症组学图谱:癌症组学注释的综合资源
BMC Med Genomics. 2018 Aug 8;11(1):63. doi: 10.1186/s12920-018-0381-7.
9
Misannotated Multi-Nucleotide Variants in Public Cancer Genomics Datasets Lead to Inaccurate Mutation Calls with Significant Implications.公共癌症基因组学数据集中标注错误的多核苷酸变体导致不准确的突变调用,具有重要影响。
Cancer Res. 2021 Jan 15;81(2):282-288. doi: 10.1158/0008-5472.CAN-20-2151. Epub 2020 Oct 28.
10
Serum-based six-miRNA signature as a potential marker for EC diagnosis: Comparison with TCGA miRNAseq dataset and identification of miRNA-mRNA target pairs by integrated analysis of TCGA miRNAseq and RNAseq datasets.基于血清的六种miRNA特征作为子宫内膜癌诊断的潜在标志物:与TCGA miRNA测序数据集的比较以及通过整合分析TCGA miRNA测序和RNA测序数据集鉴定miRNA-mRNA靶标对
Asia Pac J Clin Oncol. 2018 Oct;14(5):e289-e301. doi: 10.1111/ajco.12847. Epub 2018 Jan 30.

引用本文的文献

1
Identification and Validation of New Molecular Subtypes within the Early and Late Mild Cognitive Impairment Stages of Alzheimer's Disease.阿尔茨海默病早期和晚期轻度认知障碍阶段新分子亚型的识别与验证
medRxiv. 2025 May 24:2023.04.06.23288268. doi: 10.1101/2023.04.06.23288268.
2
HRProfiler Detects Homologous Recombination Deficiency in Breast and Ovarian Cancers Using Whole-Genome and Whole-Exome Sequencing Data.HRProfiler利用全基因组和全外显子组测序数据检测乳腺癌和卵巢癌中的同源重组缺陷。
Cancer Res. 2025 May 6. doi: 10.1158/0008-5472.CAN-24-2639.
3
Genetic regulation of TERT splicing affects cancer risk by altering cellular longevity and replicative potential.

本文引用的文献

1
SeSAMe: reducing artifactual detection of DNA methylation by Infinium BeadChips in genomic deletions.SeSAMe:减少基因组缺失中 Infinium BeadChips 检测到的 DNA 甲基化假阳性。
Nucleic Acids Res. 2018 Nov 16;46(20):e123. doi: 10.1093/nar/gky691.
2
The Cancer Genome Atlas: Creating Lasting Value beyond Its Data.癌症基因组图谱:在其数据之外创造持久价值。
Cell. 2018 Apr 5;173(2):283-285. doi: 10.1016/j.cell.2018.03.042.
3
lncRNA Epigenetic Landscape Analysis Identifies EPIC1 as an Oncogenic lncRNA that Interacts with MYC and Promotes Cell-Cycle Progression in Cancer.
端粒酶逆转录酶(TERT)剪接的基因调控通过改变细胞寿命和复制潜力影响癌症风险。
Nat Commun. 2025 Feb 16;16(1):1676. doi: 10.1038/s41467-025-56947-y.
4
A comparative analysis of gene expression profiling by statistical and machine learning approaches.通过统计和机器学习方法对基因表达谱进行的比较分析。
Bioinform Adv. 2024 Dec 18;5(1):vbae199. doi: 10.1093/bioadv/vbae199. eCollection 2025.
5
Interpreting Lung Cancer Health Disparity at Transcriptome Level.在转录组水平解读肺癌健康差异
bioRxiv. 2025 Jan 13:2025.01.09.632292. doi: 10.1101/2025.01.09.632292.
6
The role of lnc‑MAPKAPK5‑AS1 in immune cell infiltration in hepatocellular carcinoma: Bioinformatics analysis and validation.长链非编码RNA-MAPKAPK5-反义链1在肝细胞癌免疫细胞浸润中的作用:生物信息学分析与验证
Oncol Lett. 2025 Jan 14;29(3):141. doi: 10.3892/ol.2025.14887. eCollection 2025 Mar.
7
Interpreting Lung Cancer Health Disparity between African American Males and European American Males.解读非裔美国男性与欧美裔美国男性之间的肺癌健康差异。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:7141-7143. doi: 10.1109/bibm62325.2024.10822014.
8
Genetic regulation of splicing contributes to reduced or elevated cancer risk by altering cellular longevity and replicative potential.剪接的基因调控通过改变细胞寿命和复制潜力,导致癌症风险降低或升高。
medRxiv. 2024 Nov 5:2024.11.04.24316722. doi: 10.1101/2024.11.04.24316722.
9
StableLift: Optimized Germline and Somatic Variant Detection Across Genome Builds.StableLift:跨基因组版本的优化种系和体细胞变异检测
bioRxiv. 2024 Nov 3:2024.10.31.621401. doi: 10.1101/2024.10.31.621401.
10
Biobanks as an Indispensable Tool in the "Era" of Precision Medicine: Key Role in the Management of Complex Diseases, Such as Melanoma.生物样本库作为精准医学“时代”不可或缺的工具:在黑色素瘤等复杂疾病管理中的关键作用。
J Pers Med. 2024 Jul 6;14(7):731. doi: 10.3390/jpm14070731.
lncRNA 表观遗传景观分析鉴定 EPIC1 为致癌 lncRNA,它与 MYC 相互作用并促进癌症中的细胞周期进程。
Cancer Cell. 2018 Apr 9;33(4):706-720.e9. doi: 10.1016/j.ccell.2018.03.006. Epub 2018 Apr 2.
4
Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context.泛癌分析 lncRNA 调控作用支持它们在每种肿瘤背景下靶向癌症基因。
Cell Rep. 2018 Apr 3;23(1):297-312.e12. doi: 10.1016/j.celrep.2018.03.064.
5
DNA methylation loss in late-replicating domains is linked to mitotic cell division.DNA 甲基化在复制晚期的丢失与有丝分裂细胞分裂有关。
Nat Genet. 2018 Apr;50(4):591-602. doi: 10.1038/s41588-018-0073-4. Epub 2018 Apr 2.
6
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines.采用多种基因组分析流水线的肿瘤外显子组突变调用的可扩展开放科学方法。
Cell Syst. 2018 Mar 28;6(3):271-281.e7. doi: 10.1016/j.cels.2018.03.002.
7
PAX8 activates a p53-p21-dependent pro-proliferative effect in high grade serous ovarian carcinoma.PAX8 在高级别浆液性卵巢癌中激活依赖 p53-p21 的促增殖作用。
Oncogene. 2018 Apr;37(17):2213-2224. doi: 10.1038/s41388-017-0040-z. Epub 2018 Jan 30.
8
The ISB Cancer Genomics Cloud: A Flexible Cloud-Based Platform for Cancer Genomics Research.国际生物信息学研究所癌症基因组云平台:一个用于癌症基因组学研究的灵活的基于云的平台。
Cancer Res. 2017 Nov 1;77(21):e7-e10. doi: 10.1158/0008-5472.CAN-17-0617.
9
Comprehensive characterization, annotation and innovative use of Infinium DNA methylation BeadChip probes.Infinium DNA甲基化芯片探针的全面表征、注释及创新性应用
Nucleic Acids Res. 2017 Feb 28;45(4):e22. doi: 10.1093/nar/gkw967.
10
MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data.MuSE:使用样本特异性误差模型考虑肿瘤异质性可提高从测序数据中检测突变的灵敏度和特异性。
Genome Biol. 2016 Aug 24;17(1):178. doi: 10.1186/s13059-016-1029-6.