Am J Psychiatry. 2014 Mar;171(3):350-9. doi: 10.1176/appi.ajp.2013.11121766.
The Combined Analysis of Psychiatric Studies (CAPS) project conducted extensive review and regularization across studies of all schizophrenia linkage data available as of 2011 from the National Institute of Mental Health-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI). The authors reanalyzed the data using statistical methods tailored to accumulation of evidence across multiple, potentially highly heterogeneous, sets of data.
Data were subdivided based on contributing study, major population group, and presence or absence within families of schizophrenia with a substantial affective component. The posterior probability of linkage (PPL) statistical framework was used to sequentially update linkage evidence across these data subsets (omnibus results).
While some loci previously implicated using the HGI data were also identified in the present omnibus analysis (2q36.1, 15q23), others were not. Several loci were found that had not previously been reported in the HGI samples but are supported by independent linkage or association studies (3q28, 12q23.1, 11p11.2, Xq26.1). Not surprisingly, differences were seen across population groups. Of particular interest are signals on 11p15.3, 11p11.2, and Xq26.1, for which data from families with a substantial affective component support linkage while data from the remaining families provide evidence against linkage. All three of these loci overlap with loci reported in independent studies of bipolar disorder or mixed bipolar-schizophrenia samples.
Public data repositories provide the opportunity to leverage large multisite data sets for studying complex disorders. Analysis with a statistical method specifically designed for such data enables us to extract new information from an existing data resource.
精神病学研究综合分析(CAPS)项目对截至 2011 年,由美国国立精神卫生研究所资助的合作基因组研究精神疾病中心(亦称人类遗传学倡议)所拥有的所有精神分裂症连锁数据进行了广泛的研究和规范化。作者使用专门针对多组数据的证据积累的统计方法重新分析了这些数据,这些数据可能具有高度异质性。
根据贡献研究、主要人群群体以及是否存在具有大量情感成分的精神分裂症家族,将数据进行细分。使用连锁后验概率(PPL)统计框架来依次更新这些数据子集(整体结果)的连锁证据。
虽然先前使用 HGI 数据确定的一些基因座也在本研究中被确定(2q36.1,15q23),但也有其他基因座没有被确定。还发现了一些以前在 HGI 样本中未报告但得到独立连锁或关联研究支持的基因座(3q28,12q23.1,11p11.2,Xq26.1)。不同人群群体之间存在差异并不奇怪。特别值得关注的是 11p15.3、11p11.2 和 Xq26.1 上的信号,这些信号来自具有大量情感成分的家族的数据支持连锁,而其余家族的数据则提供了不支持连锁的证据。这三个基因座都与独立的双相情感障碍或混合双相-精神分裂症样本研究中报告的基因座重叠。
公共数据存储库为研究复杂疾病提供了利用大型多站点数据集的机会。使用专门为此类数据设计的统计方法进行分析,使我们能够从现有数据资源中提取新信息。