Scheinfeldt Laura B, Brangan Andrew, Kusic Dara M, Kumar Sudhir, Gharani Neda
Coriell Institute for Medical Research, Camden, NJ 08003, USA.
Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA 19122, USA.
J Pers Med. 2021 Feb 16;11(2):131. doi: 10.3390/jpm11020131.
Pharmacogenomics holds the promise of personalized drug efficacy optimization and drug toxicity minimization. Much of the research conducted to date, however, suffers from an ascertainment bias towards European participants. Here, we leverage publicly available, whole genome sequencing data collected from global populations, evolutionary characteristics, and annotated protein features to construct a new in silico machine learning pharmacogenetic identification method called XGB-PGX. When applied to pharmacogenetic data, XGB-PGX outperformed all existing prediction methods and identified over 2000 new pharmacogenetic variants. While there are modest pharmacogenetic allele frequency distribution differences across global population samples, the most striking distinction is between the relatively rare putatively neutral pharmacogene variants and the relatively common established and newly predicted functional pharamacogenetic variants. Our findings therefore support a focus on individual patient pharmacogenetic testing rather than on clinical presumptions about patient race, ethnicity, or ancestral geographic residence. We further encourage more attention be given to the impact of common variation on drug response and propose a new 'common treatment, common variant' perspective for pharmacogenetic prediction that is distinct from the types of variation that underlie complex and Mendelian disease. XGB-PGX has identified many new pharmacovariants that are present across all global communities; however, communities that have been underrepresented in genomic research are likely to benefit the most from XGB-PGX's in silico predictions.
药物基因组学有望实现个性化药物疗效优化和药物毒性最小化。然而,迄今为止进行的许多研究都存在对欧洲参与者的确定偏倚。在这里,我们利用从全球人群收集的公开可用的全基因组测序数据、进化特征和注释的蛋白质特征,构建了一种新的计算机机器学习药物遗传学识别方法,称为XGB-PGX。当应用于药物遗传学数据时,XGB-PGX的表现优于所有现有的预测方法,并识别出2000多个新的药物遗传学变异。虽然全球人群样本中的药物遗传等位基因频率分布存在适度差异,但最显著的区别在于相对罕见的假定中性药物基因变异与相对常见的已确定和新预测的功能性药物遗传学变异之间。因此,我们的研究结果支持关注个体患者的药物遗传学检测,而不是基于患者种族、民族或祖籍地理居住地的临床推测。我们进一步鼓励更多地关注常见变异对药物反应的影响,并提出一种新的“常见治疗,常见变异”药物遗传学预测观点,这与复杂疾病和孟德尔疾病背后的变异类型不同。XGB-PGX已经识别出许多在所有全球群体中都存在的新药物变异;然而,在基因组研究中代表性不足的群体可能从XGB-PGX的计算机预测中受益最大。