Wu Jiaxin, Gan Mingxin, Jiang Rui
Department of Automation, Tsinghua University, Beijing, China.
Int J Comput Biol Drug Des. 2011;4(4):316-31. doi: 10.1504/IJCBDD.2011.044446. Epub 2011 Dec 24.
Recent advancements of the next-generation sequencing technology have enabled the direct sequencing of rare genetic variants in both case and control individuals. Although there have been a few statistical methods for uncovering potential associations between multiple rare variants and human inherited diseases, most of these methods require computational approaches to filter out non-functional variants for the purpose of maximising the statistical power. To tackle this problem, we formulate the detection of genetic variants that are associated with a specific type of disease from the perspective of one-class novelty learning. We focus on a typical type of genetic variants called Single Amino Acid Polymorphisms (SAAPs), and we take advantages of a feature selection mechanism and two one-class learning methods to prioritise candidate SAAPs. Systematic validation demonstrates that the proposed model is effective in recovering disease-associated SAAPs.
新一代测序技术的最新进展使得能够对病例个体和对照个体中的罕见遗传变异进行直接测序。尽管已经有一些统计方法用于揭示多个罕见变异与人类遗传疾病之间的潜在关联,但这些方法大多需要计算方法来过滤掉无功能的变异,以最大化统计功效。为了解决这个问题,我们从一类新颖性学习的角度来制定与特定类型疾病相关的遗传变异的检测方法。我们专注于一种典型的遗传变异类型,即单氨基酸多态性(SAAPs),并利用特征选择机制和两种一类学习方法对候选SAAPs进行优先级排序。系统验证表明,所提出的模型在恢复与疾病相关的SAAPs方面是有效的。