Insights into the Effects of Violating Statistical Assumptions for Dimensionality Reduction for Chemical "-omics" Data with Multiple Explanatory Variables.

Author information

Brown Amber O, Green Peter J, Frankham Greta J, Stuart Barbara H, Ueland Maiken

Affiliations

Australian Museum Research Institute, Australian Museum, Sydney 2001, NSW, Australia.

Centre for Forensic Science, University of Technology Sydney, Ultimo 2007, NSW, Australia.

Publication information

ACS Omega. 2023 Jun 9;8(24):22042-22054. doi: 10.1021/acsomega.3c01613. eCollection 2023 Jun 20.

DOI: 10.1021/acsomega.3c01613
PMID: 37360494
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10286096/
Abstract

Biological volatilome analysis is inherently complex due to the considerable number of compounds (i.e., dimensions) and differences in peak areas by orders of magnitude, between and within compounds found within datasets. Traditional volatilome analysis relies on dimensionality reduction techniques which aid in the selection of compounds that are considered relevant to respective research questions prior to further analysis. Currently, compounds of interest are identified using either supervised or unsupervised statistical methods which assume the data residuals are normally distributed and exhibit linearity. However, biological data often violate the statistical assumptions of these models related to normality and the presence of multiple explanatory variables which are innate to biological samples. In an attempt to address deviations from normality, volatilome data can be log transformed. However, whether the effects of each assessed variable are additive or multiplicative should be considered prior to transformation, as this will impact the effect of each variable on the data. If assumptions of normality and variable effects are not investigated prior to dimensionality reduction, ineffective or erroneous compound dimensionality reduction can impact downstream analyses. It is the aim of this manuscript to assess the impact of single and multivariable statistical models, with and without the log transformation, on volatilome dimensionality reduction prior to any supervised or unsupervised classification analysis. As a proof of concept, Shingleback lizard volatilomes were collected across their species distribution and from captivity and were assessed. Shingleback volatilomes are suspected to be influenced by multiple explanatory variables related to habitat (Bioregion), sex, parasite presence, total body volume, and captive status.
This work determined that the exclusion of relevant multiple explanatory variables from analysis overestimates the effect of Bioregion and the identification of significant compounds. The log transformation increased the number of compounds that were identified as significant, as did analyses that assumed that residuals were normally distributed. Among the methods considered in this work, the most conservative form of dimensionality reduction was achieved through analyzing untransformed data using Monte Carlo tests with multiple explanatory variables.
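The comparison described in the abstract (single versus multiple explanatory variables, raw versus log-transformed peak areas, Monte Carlo testing) can be illustrated with a minimal sketch. This is not the authors' code: the abstract does not specify their Monte Carlo procedure, so a generic residual-permutation test (Freedman-Lane style) on a nested linear model is swapped in here. The function names (`rss`, `f_stat`, `monte_carlo_p`, `demo`), the simulated variables, and every effect size are hypothetical and chosen only for illustration.

```python
import numpy as np


def rss(y, X):
    """Residual sum of squares from an ordinary least-squares fit of y on X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)


def f_stat(y, X_full, X_reduced):
    """F statistic comparing a full linear model with a nested reduced model.

    Assumes both design matrices have full column rank.
    """
    rss_full, rss_red = rss(y, X_full), rss(y, X_reduced)
    df_num = X_full.shape[1] - X_reduced.shape[1]
    df_den = len(y) - X_full.shape[1]
    return ((rss_red - rss_full) / df_num) / (rss_full / df_den)


def monte_carlo_p(y, X_full, X_reduced, n_perm=999, seed=0):
    """Monte Carlo p-value for the terms in X_full that are absent from X_reduced.

    Residuals of the reduced model are permuted and added back to its fitted
    values, so no normality assumption is placed on the residuals.
    """
    rng = np.random.default_rng(seed)
    beta, *_ = np.linalg.lstsq(X_reduced, y, rcond=None)
    fitted = X_reduced @ beta
    resid = y - fitted
    observed = f_stat(y, X_full, X_reduced)
    exceed = 0
    for _ in range(n_perm):
        y_star = fitted + rng.permutation(resid)
        if f_stat(y_star, X_full, X_reduced) >= observed:
            exceed += 1
    # Add-one correction keeps the estimated p-value strictly above zero.
    return (exceed + 1) / (n_perm + 1)


def demo(seed=42, n=60):
    """Simulate one compound with multiplicative effects and test 'region'."""
    rng = np.random.default_rng(seed)
    region = rng.integers(0, 2, size=n).astype(float)  # hypothetical Bioregion indicator
    sex = rng.integers(0, 2, size=n).astype(float)     # hypothetical sex indicator
    volume = rng.normal(size=n)                        # hypothetical standardized body volume
    noise = rng.normal(scale=0.5, size=n)
    # Effects multiply the peak area, so they become additive on the log scale.
    peak = 1000.0 * np.exp(0.3 * region + 0.8 * sex + 0.5 * volume + noise)

    ones = np.ones(n)
    covars = np.column_stack([ones, sex, volume])
    results = {}
    for label, y in (("raw", peak), ("log", np.log(peak))):
        # Single explanatory variable: region against an intercept-only model.
        results[(label, "single")] = monte_carlo_p(
            y, np.column_stack([ones, region]), ones[:, None])
        # Multiple explanatory variables: region adjusted for sex and volume.
        results[(label, "multi")] = monte_carlo_p(
            y, np.column_stack([covars, region]), covars)
    return results


if __name__ == "__main__":
    for key, p in demo().items():
        print(key, round(p, 3))
```

Running the script prints one estimated p-value for each combination of scale and model. Repeating the test for every compound and keeping those below a chosen threshold would give one possible form of the compound selection step the abstract refers to; the threshold and any multiple-testing correction are left open here.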


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/bf9e0dc39f81/ao3c01613_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/f8bbeb6ebe6d/ao3c01613_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/395b68fdd523/ao3c01613_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/ddfe0a28c4b3/ao3c01613_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/106ee03a4f57/ao3c01613_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b8a/10286096/580b56fc3784/ao3c01613_0006.jpg

Correction

Correction to "Insights into the Effects of Violating Statistical Assumptions for Dimensionality Reduction for Chemical "-omics" Data with Multiple Explanatory Variables".
ACS Omega. 2024 Mar 19;9(13):15724. doi: 10.1021/acsomega.4c01126. eCollection 2024 Apr 2.