• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用PheTK对大规模生物样本库数据进行表型全基因组关联研究分析。

PheWAS analysis on large-scale biobank data with PheTK.

作者信息

Tran Tam C, Schlueter David J, Zeng Chenjie, Mo Huan, Carroll Robert J, Denny Joshua C

机构信息

National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States.

University of Toronto, ON, M5S 1A1, Canada.

出版信息

Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae719.

DOI:10.1093/bioinformatics/btae719
PMID:39657951
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11709244/
Abstract

SUMMARY

With the rapid growth of genetic data linked to electronic health record (EHR) data in huge cohorts, large-scale phenome-wide association study (PheWAS) have become powerful discovery tools in biomedical research. PheWAS is an analysis method to study phenotype associations utilizing longitudinal EHR data. Previous PheWAS packages were developed mostly with smaller datasets and with earlier PheWAS approaches. PheTK was designed to simplify analysis and efficiently handle biobank-scale data. PheTK uses multithreading and supports a full PheWAS workflow including extraction of data from OMOP databases and Hail matrix tables as well as PheWAS analysis for both phecode version 1.2 and phecodeX. Benchmarking results showed PheTK took 64% less time than the R PheWAS package to complete the same workflow. PheTK can be run locally or on cloud platforms such as the All of Us Researcher Workbench (All of Us) or the UK Biobank (UKB) Research Analysis Platform (RAP).

AVAILABILITY AND IMPLEMENTATION

The PheTK package is freely available on the Python Package Index, on GitHub under GNU General Public License (GPL-3) at https://github.com/nhgritctran/PheTK, and on Zenodo, DOI 10.5281/zenodo.14217954, at https://doi.org/10.5281/zenodo.14217954. PheTK is implemented in Python and platform independent.

摘要

摘要

随着与大型队列电子健康记录(EHR)数据相关的基因数据迅速增长,大规模表型组全关联研究(PheWAS)已成为生物医学研究中强大的发现工具。PheWAS是一种利用纵向EHR数据研究表型关联的分析方法。以前的PheWAS软件包大多是使用较小的数据集和早期的PheWAS方法开发的。PheTK旨在简化分析并有效处理生物样本库规模的数据。PheTK使用多线程,并支持完整的PheWAS工作流程,包括从OMOP数据库和Hail矩阵表中提取数据,以及对phecode版本1.2和phecodeX进行PheWAS分析。基准测试结果表明,在完成相同工作流程时,PheTK比R PheWAS软件包所需时间少64%。PheTK可以在本地运行,也可以在云平台上运行,如“我们所有人”研究工作台(“我们所有人”)或英国生物样本库(UKB)研究分析平台(RAP)。

可用性和实现方式

PheTK软件包可在Python软件包索引、GitHub上根据GNU通用公共许可证(GPL-3)免费获取,网址为https://github.com/nhgritctran/PheTK,也可在Zenodo上获取,DOI为10.5281/zenodo.14217954,网址为https://doi.org/10.5281/zenodo.14217954。PheTK用Python实现,与平台无关。

相似文献

1
PheWAS analysis on large-scale biobank data with PheTK.使用PheTK对大规模生物样本库数据进行表型全基因组关联研究分析。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae719.
2
PheWAS analysis on large-scale biobank data with PheTK.使用PheTK对大规模生物样本库数据进行全表型组关联研究分析。
medRxiv. 2024 Feb 13:2024.02.12.24302720. doi: 10.1101/2024.02.12.24302720.
3
R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment.R全表型组关联研究分析与绘图工具:用于R环境下全表型组关联研究的数据分析与绘图工具。
Bioinformatics. 2014 Aug 15;30(16):2375-6. doi: 10.1093/bioinformatics/btu197. Epub 2014 Apr 14.
4
Phenome-wide association study (PheWAS) of colorectal cancer risk SNP effects on health outcomes in UK Biobank.全表型关联研究(PheWAS)分析结直肠癌风险 SNP 对英国生物银行健康结局的影响。
Br J Cancer. 2022 Mar;126(5):822-830. doi: 10.1038/s41416-021-01655-9. Epub 2021 Dec 15.
5
BIGwas: Single-command quality control and association testing for multi-cohort and biobank-scale GWAS/PheWAS data.BIGwas:用于多队列和生物库规模 GWAS/PheWAS 数据的单命令质量控制和关联测试。
Gigascience. 2021 Jun 29;10(6). doi: 10.1093/gigascience/giab047.
6
PheWAS-ME: a web-app for interactive exploration of multimorbidity patterns in PheWAS.PheWAS-ME:一个用于交互式探索 PheWAS 中多种疾病模式的网络应用程序。
Bioinformatics. 2021 Jul 19;37(12):1778-1780. doi: 10.1093/bioinformatics/btaa870.
7
Next-generation phenotyping: introducing phecodeX for enhanced discovery research in medical phenomics.下一代表型分析:引入 phecodeX 以增强医学表型组学中的发现研究。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad655.
8
PheMIME: an interactive web app and knowledge base for phenome-wide, multi-institutional multimorbidity analysis.PheMIME:一个交互式网络应用程序和知识库,用于进行表型广泛的、多机构的多病种分析。
J Am Med Inform Assoc. 2024 Nov 1;31(11):2440-2446. doi: 10.1093/jamia/ocae182.
9
Abdominal CT metrics in 17,646 patients reveal associations between myopenia, myosteatosis, and medical phenotypes: a phenome-wide association study.17646 例患者的腹部 CT 指标揭示了肌肉减少症、肌肉脂肪变性与医学表型之间的关联:一项表型全基因组关联研究。
EBioMedicine. 2024 May;103:105116. doi: 10.1016/j.ebiom.2024.105116. Epub 2024 Apr 17.
10
Clinical and genetic contributions to medical comorbidity in bipolar disorder: a study using electronic health records-linked biobank data.双相情感障碍中医疗共病的临床和遗传贡献:一项使用电子健康记录关联生物样本库数据的研究。
Mol Psychiatry. 2024 Sep;29(9):2701-2713. doi: 10.1038/s41380-024-02530-8. Epub 2024 Mar 28.

引用本文的文献

1
Neurobeachin (NBEA) is a novel gene associated with GLP-1 receptor agonist associated weight loss.神经海滩蛋白(NBEA)是一种与胰高血糖素样肽-1受体激动剂相关体重减轻有关的新基因。
Diabetes Obes Metab. 2025 Oct;27(10):5632-5642. doi: 10.1111/dom.16612. Epub 2025 Jul 18.
2
Phenome-wide association of APOE alleles in the All of Us Research Program.“我们所有人”研究计划中APOE等位基因的全表型组关联研究
EBioMedicine. 2025 May 30;117:105768. doi: 10.1016/j.ebiom.2025.105768.
3
The effect of type 2 diabetes genetic predisposition on non-cardiovascular comorbidities.2型糖尿病遗传易感性对非心血管合并症的影响。
medRxiv. 2025 May 7:2025.05.05.25326966. doi: 10.1101/2025.05.05.25326966.
4
Metabolic reaction fluxes as amplifiers and buffers of risk alleles for coronary artery disease.代谢反应通量作为冠状动脉疾病风险等位基因的放大器和缓冲器。
Mol Syst Biol. 2025 Apr 2. doi: 10.1038/s44320-025-00097-2.
5
Phenome-Wide Association of Alleles in the Research Program.研究项目中基因座的全表型组关联研究
medRxiv. 2024 Sep 4:2024.09.04.24313010. doi: 10.1101/2024.09.04.24313010.

本文引用的文献

1
Data-driven science and diversity in the All of Us Research Program.“我们所有人”研究项目中的数据驱动型科学与多样性
Sci Transl Med. 2023 Dec 13;15(726):eade9214. doi: 10.1126/scitranslmed.ade9214.
2
World's biggest set of human genome sequences opens to scientists.世界上最大的人类基因组序列集向科学家开放。
Nature. 2023 Dec;624(7990):16-17. doi: 10.1038/d41586-023-03763-3.
3
Next-generation phenotyping: introducing phecodeX for enhanced discovery research in medical phenomics.下一代表型分析:引入 phecodeX 以增强医学表型组学中的发现研究。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad655.
4
Systematic replication of smoking disease associations using survey responses and EHR data in the All of Us Research Program.在 All of Us 研究计划中,使用调查回复和电子健康记录数据对吸烟疾病关联进行系统复制。
J Am Med Inform Assoc. 2023 Dec 22;31(1):139-153. doi: 10.1093/jamia/ocad205.
5
Genome-Wide Association Study of Intracranial Artery Stenosis Followed by Phenome-Wide Association Study.颅内动脉狭窄的全基因组关联研究及表型全基因组关联研究。
Transl Stroke Res. 2023 Jun;14(3):322-333. doi: 10.1007/s12975-022-01049-w. Epub 2022 Jun 14.
6
Phenome-Wide Association Studies.全表型组关联研究
JAMA. 2022 Jan 4;327(1):75-76. doi: 10.1001/jama.2021.20356.
7
pyPheWAS: A Phenome-Disease Association Tool for Electronic Medical Record Analysis.pyPheWAS:用于电子病历分析的表型-疾病关联工具。
Neuroinformatics. 2022 Apr;20(2):483-505. doi: 10.1007/s12021-021-09553-4. Epub 2022 Jan 3.
8
Large-Scale Phenome-Wide Association Study of Variants Demonstrates Protection Against Ischemic Stroke.大规模表型全基因组关联研究表明,某些变异可预防缺血性脑卒中。
Circ Genom Precis Med. 2018 Jul;11(7):e002162. doi: 10.1161/CIRCGEN.118.002162.
9
Software Application Profile: PHESANT: a tool for performing automated phenome scans in UK Biobank.软件应用简介:PHESANT:一种用于在英国生物银行中进行自动表型组扫描的工具。
Int J Epidemiol. 2018 Feb;47(1):29-35. doi: 10.1093/ije/dyx204. Epub 2017 Oct 5.
10
A phenome-wide association study of a lipoprotein-associated phospholipase A2 loss-of-function variant in 90 000 Chinese adults.一项针对9万名中国成年人中脂蛋白相关磷脂酶A2功能丧失变异体的全表型组关联研究。
Int J Epidemiol. 2016 Oct;45(5):1588-1599. doi: 10.1093/ije/dyw087. Epub 2016 Jun 14.