Zhong Hua, Song Mingzhou
IEEE/ACM Trans Comput Biol Bioinform. 2018 Feb 26. doi: 10.1109/TCBB.2018.2809743.
Directional association measured by functional dependency can answer important questions about relationships between variables, for example in the discovery of molecular interactions in biological systems. However, when no prior information is available about the functional form of a directional association, no widely established statistical procedure exists to detect it. To address this issue, we introduce an exact functional test for directional association that examines the strength of functional dependency. The test promotes functional patterns by reducing statistical power on non-functional patterns. We designed an algorithm that carries out the test using a fast branch-and-bound strategy, which achieved a substantial speedup over brute-force enumeration. On data from an epidemiological study of liver cancer, the test identified the hepatitis status of a subject as the most influential of the risk factors for the cancer phenotype. On human lung cancer transcriptome data, the test selected 1049 transcription start sites of putative noncoding RNAs directionally associated with lung cancers, with associations stronger than those of 95% of 589 curated cancer genes. These predictions include non-monotonic interaction patterns, to which other routine tests were insensitive. Complementing symmetric (non-directional) association methods such as Fisher's exact test, the exact functional test is a unique exact statistical test for evaluating evidence for causal relationships.
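To make the idea of a directional (asymmetric) association concrete, the following Python sketch scores how strongly one discrete variable behaves as a function of another on a contingency table. It is not the exact functional test or its branch-and-bound algorithm, neither of which is specified in the abstract. It assumes a functional chi-square style statistic and a normalized index, sqrt(statistic / (n * (s - 1))) with s the number of levels of the dependent variable. The names `functional_chisq`, `functional_index`, and `transpose`, and the toy counts, are hypothetical.

```python
import math


def functional_chisq(table):
    """Assumed functional chi-square style statistic for the direction X -> Y.

    table[i][j] is the number of samples with X = i and Y = j.
    Each row is compared with a uniform distribution over Y, and the same
    deviation measured on the Y marginal is subtracted.
    Assumes a non-empty rectangular table with a positive total count.
    """
    n_cols = len(table[0])
    n = sum(sum(row) for row in table)
    stat = 0.0
    for row in table:
        row_total = sum(row)
        if row_total == 0:
            continue  # an empty row carries no evidence
        expected = row_total / n_cols
        stat += sum((c - expected) ** 2 / expected for c in row)
    col_totals = [sum(row[j] for row in table) for j in range(n_cols)]
    expected = n / n_cols
    stat -= sum((c - expected) ** 2 / expected for c in col_totals)
    return stat


def functional_index(table):
    """Assumed normalization of the statistic to the range [0, 1]."""
    n_cols = len(table[0])
    n = sum(sum(row) for row in table)
    return math.sqrt(max(functional_chisq(table), 0.0) / (n * (n_cols - 1)))


def transpose(table):
    """Swap the roles of X and Y to score the opposite direction."""
    return [list(col) for col in zip(*table)]


# Toy non-monotonic pattern: Y = 0 when X is 0 or 2, and Y = 1 when X is 1.
# Y is a function of X, but X is not a function of Y.
table_xy = [
    [5, 0],
    [0, 10],
    [5, 0],
]

index_x_to_y = functional_index(table_xy)
index_y_to_x = functional_index(transpose(table_xy))
print(f"X -> Y index: {index_x_to_y:.2f}")
print(f"Y -> X index: {index_y_to_x:.2f}")
```

On this toy table the index is higher for X -> Y than for Y -> X, whereas a symmetric test such as Fisher's exact test gives the same answer for a table and its transpose. An exact test would go further and derive a p-value by enumerating tables that share the observed marginal sums, which is the step the abstract accelerates with branch-and-bound.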