• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组学数据中因果分离的新方法。

New methods for separating causes from effects in genomics data.

机构信息

Center for Health Informatics and Bioinformatics, New York University Langone Medical Center, New York, NY 10016, USA.

出版信息

BMC Genomics. 2012;13 Suppl 8(Suppl 8):S22. doi: 10.1186/1471-2164-13-S8-S22. Epub 2012 Dec 17.

DOI:10.1186/1471-2164-13-S8-S22
PMID:23282373
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3535696/
Abstract

BACKGROUND

The discovery of molecular pathways is a challenging problem and its solution relies on the identification of causal molecular interactions in genomics data. Causal molecular interactions can be discovered using randomized experiments; however such experiments are often costly, infeasible, or unethical. Fortunately, algorithms that infer causal interactions from observational data have been in development for decades, predominantly in the quantitative sciences, and many of them have recently been applied to genomics data. While these algorithms can infer unoriented causal interactions between involved molecular variables (i.e., without specifying which one is the cause and which one is the effect), causally orienting all inferred molecular interactions was assumed to be an unsolvable problem until recently. In this work, we use transcription factor-target gene regulatory interactions in three different organisms to evaluate a new family of methods that, given observational data for just two causally related variables, can determine which one is the cause and which one is the effect.

RESULTS

We have found that a particular family of causal orientation methods (IGCI Gaussian) is often able to accurately infer directionality of causal interactions, and that these methods usually outperform other causal orientation techniques. We also introduced a novel ensemble technique for causal orientation that combines decisions of individual causal orientation methods. The ensemble method was found to be more accurate than any best individual causal orientation method in the tested data.

CONCLUSIONS

This work represents a first step towards establishing context for practical use of causal orientation methods in the genomics domain. We have found that some causal orientation methodologies yield accurate predictions of causal orientation in genomics data, and we have improved on this capability with a novel ensemble method. Our results suggest that these methods have the potential to facilitate reconstruction of molecular pathways by minimizing the number of required randomized experiments to find causal directionality and by avoiding experiments that are infeasible and/or unethical.

摘要

背景

发现分子途径是一个具有挑战性的问题,其解决方案依赖于在基因组学数据中识别因果分子相互作用。可以使用随机实验来发现因果分子相互作用;但是,这样的实验通常成本高昂、不可行或不道德。幸运的是,从观察数据中推断因果相互作用的算法已经开发了几十年,主要在定量科学中,其中许多最近已应用于基因组学数据。虽然这些算法可以推断涉及分子变量之间的无向因果相互作用(即,不指定哪个是原因,哪个是结果),但直到最近,因果定向所有推断出的分子相互作用都被认为是一个无法解决的问题。在这项工作中,我们使用三个不同生物体中的转录因子-靶基因调控相互作用来评估一组新的方法,这些方法给定仅两个因果相关变量的观察数据,就可以确定哪个是原因,哪个是结果。

结果

我们发现,一组特定的因果定向方法(IGCI 高斯)通常能够准确推断因果相互作用的方向,并且这些方法通常优于其他因果定向技术。我们还引入了一种用于因果定向的新集成技术,该技术将单个因果定向方法的决策进行组合。在测试数据中,集合方法比任何最佳的单个因果定向方法都更准确。

结论

这项工作代表了在基因组学领域建立因果定向方法实际应用背景的第一步。我们发现,一些因果定向方法在基因组学数据中能够准确预测因果定向,并且我们通过一种新颖的集成方法提高了这一能力。我们的结果表明,这些方法有可能通过减少找到因果方向所需的随机实验数量,并避免不可行和/或不道德的实验,从而促进分子途径的重建。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/6c7ead032f11/1471-2164-13-S8-S22-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/3b29398b2239/1471-2164-13-S8-S22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/66635ef80a0f/1471-2164-13-S8-S22-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/5d6adb360af7/1471-2164-13-S8-S22-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/26406415e739/1471-2164-13-S8-S22-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/61e6124e8885/1471-2164-13-S8-S22-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/b3e126e1d43d/1471-2164-13-S8-S22-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/66ca8be4cf6f/1471-2164-13-S8-S22-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/3c128ee3a1bd/1471-2164-13-S8-S22-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/6c7ead032f11/1471-2164-13-S8-S22-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/3b29398b2239/1471-2164-13-S8-S22-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/66635ef80a0f/1471-2164-13-S8-S22-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/5d6adb360af7/1471-2164-13-S8-S22-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/26406415e739/1471-2164-13-S8-S22-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/61e6124e8885/1471-2164-13-S8-S22-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/b3e126e1d43d/1471-2164-13-S8-S22-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/66ca8be4cf6f/1471-2164-13-S8-S22-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/3c128ee3a1bd/1471-2164-13-S8-S22-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df2e/3535696/6c7ead032f11/1471-2164-13-S8-S22-9.jpg

相似文献

1
New methods for separating causes from effects in genomics data.基因组学数据中因果分离的新方法。
BMC Genomics. 2012;13 Suppl 8(Suppl 8):S22. doi: 10.1186/1471-2164-13-S8-S22. Epub 2012 Dec 17.
2
An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation.用于逆向工程基因调控局部因果通路的主动学习因果发现方法评估
Sci Rep. 2016 Mar 4;6:22558. doi: 10.1038/srep22558.
3
Comparative analysis of module-based versus direct methods for reverse-engineering transcriptional regulatory networks.基于模块的方法与直接方法在转录调控网络逆向工程中的比较分析
BMC Syst Biol. 2009 May 7;3:49. doi: 10.1186/1752-0509-3-49.
4
Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier.利用来自位置特异性得分矩阵的进化信息和集成分类器提高蛋白质-蛋白质相互作用的预测准确性。
J Theor Biol. 2017 Apr 7;418:105-110. doi: 10.1016/j.jtbi.2017.01.003. Epub 2017 Jan 11.
5
A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions.用于重建基因-基因相互作用的数据合并与荟萃分析方法的比较评估。
BMC Bioinformatics. 2016 Jun 6;17 Suppl 5(Suppl 5):194. doi: 10.1186/s12859-016-1038-1.
6
CSN: unsupervised approach for inferring biological networks based on the genome alone.CSN:一种基于基因组的无监督方法,用于推断生物网络。
BMC Bioinformatics. 2020 May 15;21(1):190. doi: 10.1186/s12859-020-3479-9.
7
Reconstructing genome-wide regulatory network of E. coli using transcriptome data and predicted transcription factor activities.利用转录组数据和预测的转录因子活性重建大肠杆菌的全基因组调控网络。
BMC Bioinformatics. 2011 Jun 13;12:233. doi: 10.1186/1471-2105-12-233.
8
Towards the identification of essential genes using targeted genome sequencing and comparative analysis.利用靶向基因组测序和比较分析鉴定必需基因
BMC Genomics. 2006 Oct 19;7:265. doi: 10.1186/1471-2164-7-265.
9
Overrepresentation of interactions between homologous proteins in interactomes.相互作用组中同源蛋白质间相互作用的过度呈现。
FEBS Lett. 2007 Jan 9;581(1):52-6. doi: 10.1016/j.febslet.2006.11.076. Epub 2006 Dec 8.
10
Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.系统发育dCor:作为系统发育谱分析新指标的距离相关性
BMC Bioinformatics. 2017 Sep 5;18(1):396. doi: 10.1186/s12859-017-1815-5.

引用本文的文献

1
Analysis of cause-effect inference by comparing regression errors.通过比较回归误差进行因果推断分析。
PeerJ Comput Sci. 2019 Jan 21;5:e169. doi: 10.7717/peerj-cs.169. eCollection 2019.
2
Bayesian Estimation of Causal Direction in Acyclic Structural Equation Models with Individual-specific Confounder Variables and Non-Gaussian Distributions.具有个体特定混杂变量和非高斯分布的无环结构方程模型中因果方向的贝叶斯估计
J Mach Learn Res. 2014 Aug;15:2629-2652.
3
Markov Boundary Discovery with Ridge Regularized Linear Models.基于岭正则化线性模型的马尔可夫边界发现

本文引用的文献

1
[Economic matching theory in renal transplantation. Sveriges Riksbank Prize in Economic Sciences in Memory of Alfred Nobel 2012].[肾移植中的经济匹配理论。2012年诺贝尔经济学纪念奖(瑞典银行经济学奖)]
Lakartidningen. 2012 Dec;109(49-50):2276-7.
2
Assessing quality and completeness of human transcriptional regulatory pathways on a genome-wide scale.在全基因组范围内评估人类转录调控途径的质量和完整性。
Biol Direct. 2011 Feb 28;6:15. doi: 10.1186/1745-6150-6-15.
3
A comprehensive assessment of methods for de-novo reverse-engineering of genome-scale regulatory networks.
J Causal Inference. 2016 Mar;4(1):31-48. doi: 10.1515/jci-2015-0011. Epub 2015 Nov 3.
4
A Complex Systems Approach to Causal Discovery in Psychiatry.一种用于精神医学因果发现的复杂系统方法。
PLoS One. 2016 Mar 30;11(3):e0151174. doi: 10.1371/journal.pone.0151174. eCollection 2016.
5
An Evaluation of Active Learning Causal Discovery Methods for Reverse-Engineering Local Causal Pathways of Gene Regulation.用于逆向工程基因调控局部因果通路的主动学习因果发现方法评估
Sci Rep. 2016 Mar 4;6:22558. doi: 10.1038/srep22558.
6
Plasma levels of interleukin-1 receptor antagonist (IL1Ra) predict radiographic progression of symptomatic knee osteoarthritis.血浆白细胞介素-1受体拮抗剂(IL1Ra)水平可预测有症状的膝关节骨关节炎的影像学进展。
Osteoarthritis Cartilage. 2015 Nov;23(11):1915-24. doi: 10.1016/j.joca.2015.08.006.
7
Genomics in 2012: challenges and opportunities in the next generation sequencing era.2012 年的基因组学:下一代测序时代的挑战与机遇。
BMC Genomics. 2012;13 Suppl 8(Suppl 8):S1. doi: 10.1186/1471-2164-13-S8-S1. Epub 2012 Dec 17.
基于从头开始反向工程的基因组规模调控网络方法的综合评估。
Genomics. 2011 Jan;97(1):7-18. doi: 10.1016/j.ygeno.2010.10.003. Epub 2010 Oct 14.
4
The Notch/Hes1 pathway sustains NF-κB activation through CYLD repression in T cell leukemia.Notch/Hes1 通路通过抑制 CYLD 来维持 T 细胞白血病中的 NF-κB 激活。
Cancer Cell. 2010 Sep 14;18(3):268-81. doi: 10.1016/j.ccr.2010.08.006.
5
Variation in transcription factor binding among humans.人类转录因子结合的变异性。
Science. 2010 Apr 9;328(5975):232-5. doi: 10.1126/science.1183621. Epub 2010 Mar 18.
6
Lessons from the DREAM2 Challenges.来自DREAM2挑战赛的经验教训。
Ann N Y Acad Sci. 2009 Mar;1158:159-95. doi: 10.1111/j.1749-6632.2009.04497.x.
7
ChIP-on-chip significance analysis reveals large-scale binding and regulation by human transcription factor oncogenes.芯片上的染色质免疫沉淀技术意义分析揭示了人类转录因子致癌基因的大规模结合与调控。
Proc Natl Acad Sci U S A. 2009 Jan 6;106(1):244-9. doi: 10.1073/pnas.0806445106. Epub 2008 Dec 31.
8
An international standardization programme towards the application of gene expression profiling in routine leukaemia diagnostics: the Microarray Innovations in LEukemia study prephase.白血病常规诊断中基因表达谱应用的国际标准化计划:白血病微阵列创新研究前期阶段。
Br J Haematol. 2008 Sep;142(5):802-7. doi: 10.1111/j.1365-2141.2008.07261.x.
9
Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm.转录因子与果蝇胚盘内数千个活跃和非活跃区域结合。
PLoS Biol. 2008 Feb;6(2):e27. doi: 10.1371/journal.pbio.0060027.
10
RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.RegulonDB(版本6.0):大肠杆菌K-12超越转录的基因调控模型、活跃(实验性)注释启动子及Textpresso导航
Nucleic Acids Res. 2008 Jan;36(Database issue):D120-4. doi: 10.1093/nar/gkm994. Epub 2007 Dec 23.