• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 MAP 的连锁缺失值填补。

Missing value imputation for epistatic MAPs.

机构信息

School of Computer Science and Informatics, University College Dublin, Dublin, Ireland.

出版信息

BMC Bioinformatics. 2010 Apr 20;11:197. doi: 10.1186/1471-2105-11-197.

DOI:10.1186/1471-2105-11-197
PMID:20406472
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2873538/
Abstract

BACKGROUND

Epistatic miniarray profiling (E-MAPs) is a high-throughput approach capable of quantifying aggravating or alleviating genetic interactions between gene pairs. The datasets resulting from E-MAP experiments typically take the form of a symmetric pairwise matrix of interaction scores. These datasets have a significant number of missing values - up to 35% - that can reduce the effectiveness of some data analysis techniques and prevent the use of others. An effective method for imputing interactions would therefore increase the types of possible analysis, as well as increase the potential to identify novel functional interactions between gene pairs. Several methods have been developed to handle missing values in microarray data, but it is unclear how applicable these methods are to E-MAP data because of their pairwise nature and the significantly larger number of missing values. Here we evaluate four alternative imputation strategies, three local (Nearest neighbor-based) and one global (PCA-based), that have been modified to work with symmetric pairwise data.

RESULTS

We identify different categories for the missing data based on their underlying cause, and show that values from the largest category can be imputed effectively. We compare local and global imputation approaches across a variety of distinct E-MAP datasets, showing that both are competitive and preferable to filling in with zeros. In addition we show that these methods are effective in an E-MAP from a different species, suggesting that pairwise imputation techniques will be increasingly useful as analogous epistasis mapping techniques are developed in different species. We show that strongly alleviating interactions are significantly more difficult to predict than strongly aggravating interactions. Finally we show that imputed interactions, generated using nearest neighbor methods, are enriched for annotations in the same manner as measured interactions. Therefore our method potentially expands the number of mapped epistatic interactions. In addition we make implementations of our algorithms available for use by other researchers.

CONCLUSIONS

We address the problem of missing value imputation for E-MAPs, and suggest the use of symmetric nearest neighbor based approaches as they offer consistently accurate imputations across multiple datasets in a tractable manner.

摘要

背景

上位微型数组分析(E-MAPs)是一种高通量方法,能够量化基因对之间的加重或缓解遗传相互作用。E-MAP 实验产生的数据集通常采用交互作用评分的对称成对矩阵形式。这些数据集有大量的缺失值 - 高达 35% - 这可能会降低一些数据分析技术的有效性,并阻止使用其他技术。因此,有效的交互作用插补方法将增加可能的分析类型,并增加识别基因对之间新的功能相互作用的潜力。已经开发了几种处理微阵列数据中缺失值的方法,但由于其成对性质和缺失值数量显著增加,尚不清楚这些方法在 E-MAP 数据中的适用性。在这里,我们评估了四种替代插补策略,三种局部(基于最近邻)和一种全局(基于 PCA),这些方法已被修改为适用于对称成对数据。

结果

我们根据缺失值的潜在原因将其分为不同类别,并表明最大类别的值可以有效插补。我们比较了局部和全局插补方法在各种不同的 E-MAP 数据集上的表现,表明这两种方法都具有竞争力,并且优于用零填充。此外,我们还表明,这些方法在来自不同物种的 E-MAP 中是有效的,这表明随着在不同物种中开发类似的上位映射技术,成对插补技术将越来越有用。我们表明,强烈缓解的相互作用比强烈加重的相互作用更难预测。最后,我们表明,使用最近邻方法生成的插补相互作用与测量相互作用一样,在相同的方式下富集了注释。因此,我们的方法有可能扩展映射的上位相互作用的数量。此外,我们还提供了我们的算法的实现,供其他研究人员使用。

结论

我们解决了 E-MAPs 中缺失值插补的问题,并建议使用对称最近邻方法,因为它们以可管理的方式在多个数据集上提供一致准确的插补。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/9b9877dfc29c/1471-2105-11-197-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/55149e1f9453/1471-2105-11-197-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/d92899c91fe1/1471-2105-11-197-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/946e64ccdc94/1471-2105-11-197-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/a001236f2d7d/1471-2105-11-197-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/6da982e1ee74/1471-2105-11-197-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/0e8bde030b93/1471-2105-11-197-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/84e8194e5568/1471-2105-11-197-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/056811bd5b64/1471-2105-11-197-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/9b9877dfc29c/1471-2105-11-197-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/55149e1f9453/1471-2105-11-197-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/d92899c91fe1/1471-2105-11-197-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/946e64ccdc94/1471-2105-11-197-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/a001236f2d7d/1471-2105-11-197-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/6da982e1ee74/1471-2105-11-197-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/0e8bde030b93/1471-2105-11-197-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/84e8194e5568/1471-2105-11-197-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/056811bd5b64/1471-2105-11-197-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67f0/2873538/9b9877dfc29c/1471-2105-11-197-9.jpg

相似文献

1
Missing value imputation for epistatic MAPs.基于 MAP 的连锁缺失值填补。
BMC Bioinformatics. 2010 Apr 20;11:197. doi: 10.1186/1471-2105-11-197.
2
Imputing and predicting quantitative genetic interactions in epistatic MAPs.推断和预测上位性多效性图谱中的数量遗传相互作用。
Methods Mol Biol. 2011;781:353-61. doi: 10.1007/978-1-61779-276-2_17.
3
Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach.通过一种新的集成方法提高基于微阵列的上位性数据缺失值估计的准确性。
Genomics. 2011 May;97(5):257-64. doi: 10.1016/j.ygeno.2011.03.001. Epub 2011 Mar 21.
4
Integrative missing value estimation for microarray data.微阵列数据的综合缺失值估计
BMC Bioinformatics. 2006 Oct 12;7:449. doi: 10.1186/1471-2105-7-449.
5
Missing value imputation for gene expression data by tailored nearest neighbors.通过定制最近邻算法对基因表达数据进行缺失值插补
Stat Appl Genet Mol Biol. 2017 Apr 25;16(2):95-106. doi: 10.1515/sagmb-2015-0098.
6
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion.通过网络引导的矩阵补全在上位性全基因组关联图谱中的数据插补
J Comput Biol. 2015 Jun;22(6):595-608. doi: 10.1089/cmb.2014.0158. Epub 2015 Feb 6.
7
Missing value imputation for microarray gene expression data using histone acetylation information.利用组蛋白乙酰化信息对微阵列基因表达数据进行缺失值插补
BMC Bioinformatics. 2008 May 29;9:252. doi: 10.1186/1471-2105-9-252.
8
A hybrid imputation approach for microarray missing value estimation.一种用于微阵列缺失值估计的混合插补方法。
BMC Genomics. 2015;16 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2164-16-S9-S1. Epub 2015 Aug 17.
9
Missing value imputation for microarray data: a comprehensive comparison study and a web tool.微阵列数据的缺失值插补:一项综合比较研究及网络工具
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S12. doi: 10.1186/1752-0509-7-S6-S12. Epub 2013 Dec 13.
10
Prediction of regulatory gene pairs using dynamic time warping and gene ontology.使用动态时间规整和基因本体论预测调控基因对。
Int J Data Min Bioinform. 2014;10(2):121-45. doi: 10.1504/ijdmb.2014.064010.

引用本文的文献

1
A comprehensive survey on computational learning methods for analysis of gene expression data.关于用于基因表达数据分析的计算学习方法的全面综述。
Front Mol Biosci. 2022 Nov 7;9:907150. doi: 10.3389/fmolb.2022.907150. eCollection 2022.
2
Prediction of Genetic Interactions Using Machine Learning and Network Properties.利用机器学习和网络特性预测遗传相互作用。
Front Bioeng Biotechnol. 2015 Oct 26;3:172. doi: 10.3389/fbioe.2015.00172. eCollection 2015.
3
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion.通过网络引导的矩阵补全在上位性全基因组关联图谱中的数据插补

本文引用的文献

1
DRYGIN: a database of quantitative genetic interaction networks in yeast.DRYGIN:酵母中定量遗传互作网络数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D502-7. doi: 10.1093/nar/gkp820. Epub 2009 Oct 30.
2
Functional organization of the S. cerevisiae phosphorylation network.酿酒酵母磷酸化网络的功能组织
Cell. 2009 Mar 6;136(5):952-63. doi: 10.1016/j.cell.2008.12.039.
3
High-throughput, quantitative analyses of genetic interactions in E. coli.大肠杆菌中基因相互作用的高通量定量分析。
J Comput Biol. 2015 Jun;22(6):595-608. doi: 10.1089/cmb.2014.0158. Epub 2015 Feb 6.
4
On protocols and measures for the validation of supervised methods for the inference of biological networks.关于生物网络推断监督方法验证的协议与措施
Front Genet. 2013 Dec 3;4:262. doi: 10.3389/fgene.2013.00262.
5
High-resolution network biology: connecting sequence with function.高分辨率网络生物学:连接序列与功能。
Nat Rev Genet. 2013 Dec;14(12):865-79. doi: 10.1038/nrg3574. Epub 2013 Nov 7.
6
Searching for synergies: matrix algebraic approaches for efficient pair screening.寻找协同作用:用于高效对筛选的矩阵代数方法。
PLoS One. 2013 Jul 25;8(7):e68598. doi: 10.1371/journal.pone.0068598. Print 2013.
7
Imputing and predicting quantitative genetic interactions in epistatic MAPs.推断和预测上位性多效性图谱中的数量遗传相互作用。
Methods Mol Biol. 2011;781:353-61. doi: 10.1007/978-1-61779-276-2_17.
8
Improved functional overview of protein complexes using inferred epistatic relationships.利用推断的上位关系改进蛋白质复合物的功能概述。
BMC Syst Biol. 2011 May 23;5:80. doi: 10.1186/1752-0509-5-80.
9
Quantitative maps of genetic interactions in yeast - comparative evaluation and integrative analysis.酵母中基因相互作用的定量图谱——比较评估与综合分析
BMC Syst Biol. 2011 Mar 24;5:45. doi: 10.1186/1752-0509-5-45.
Nat Methods. 2008 Sep;5(9):781-7. doi: 10.1038/nmeth.1240.
4
Up-to-date catalogues of yeast protein complexes.最新的酵母蛋白质复合物目录。
Nucleic Acids Res. 2009 Feb;37(3):825-31. doi: 10.1093/nar/gkn1005. Epub 2008 Dec 18.
5
A genetic interaction map of RNA-processing factors reveals links between Sem1/Dss1-containing complexes and mRNA export and splicing.RNA加工因子的遗传相互作用图谱揭示了含Sem1/Dss1的复合物与mRNA输出及剪接之间的联系。
Mol Cell. 2008 Dec 5;32(5):735-46. doi: 10.1016/j.molcel.2008.11.012.
6
Finding friends and enemies in an enemies-only network: a graph diffusion kernel for predicting novel genetic interactions and co-complex membership from yeast genetic interactions.在仅含敌人的网络中寻找朋友和敌人:一种用于从酵母遗传相互作用预测新型遗传相互作用和共复合体成员关系的图扩散核
Genome Res. 2008 Dec;18(12):1991-2004. doi: 10.1101/gr.077693.108. Epub 2008 Oct 2.
7
Predicting quantitative genetic interactions by means of sequential matrix approximation.通过序列矩阵近似预测数量遗传相互作用。
PLoS One. 2008 Sep 26;3(9):e3284. doi: 10.1371/journal.pone.0003284.
8
Conservation and rewiring of functional modules revealed by an epistasis map in fission yeast.通过裂殖酵母上位性图谱揭示功能模块的保守性与重连
Science. 2008 Oct 17;322(5900):405-10. doi: 10.1126/science.1162609. Epub 2008 Sep 25.
9
Optimal stepwise experimental design for pairwise functional interaction studies.用于成对功能相互作用研究的最优逐步实验设计。
Bioinformatics. 2008 Dec 1;24(23):2733-9. doi: 10.1093/bioinformatics/btn472. Epub 2008 Sep 18.
10
Local coherence in genetic interaction patterns reveals prevalent functional versatility.基因相互作用模式中的局部连贯性揭示了普遍存在的功能多样性。
Bioinformatics. 2008 Oct 15;24(20):2376-83. doi: 10.1093/bioinformatics/btn440. Epub 2008 Aug 20.