• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

描述性高通量生物学研究中临界点的识别方法。

Simple method for cutoff point identification in descriptive high-throughput biological studies.

机构信息

Department of Environmental Health Sciences, School of Public Health and Health Sciences, University of Massachusetts, 686 North Pleasant Street Amherst, Amherst, MA, 01003, USA.

出版信息

BMC Genomics. 2022 Mar 14;23(1):204. doi: 10.1186/s12864-022-08427-6.

DOI:10.1186/s12864-022-08427-6
PMID:35287573
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8922865/
Abstract

BACKGROUND

Rapid development of high-throughput omics technologies generates an increasing interest in algorithms for cutoff point identification. Existing cutoff methods and tools identify cutoff points based on an association of continuous variables with another variable, such as phenotype, disease state, or treatment group. These approaches are not applicable for descriptive studies in which continuous variables are reported without known association with any biologically meaningful variables.

RESULTS

The most common shape of the ranked distribution of continuous variables in high-throughput descriptive studies corresponds to a biphasic curve, where the first phase includes a big number of variables with values slowly growing with rank and the second phase includes a smaller number of variables rapidly growing with rank. This study describes an easy algorithm to identify the boundary between these phases to be used as a cutoff point.

DISCUSSION

The major assumption of that approach is that a small number of variables with high values dominate the biological system and determine its major processes and functions. This approach was tested on three different datasets: human genes and their expression values in the human cerebral cortex, mammalian genes and their values of sensitivity to chemical exposures, and human proteins and their expression values in the human heart. In every case, the described cutoff identification method produced shortlists of variables (genes, proteins) highly relevant for dominant functions/pathways of the analyzed biological systems.

CONCLUSIONS

The described method for cutoff identification may be used to prioritize variables in descriptive omics studies for a focused functional analysis, in situations where other methods of dichotomization of data are inaccessible.

摘要

背景

高通量组学技术的快速发展引发了人们对用于确定截止点的算法的浓厚兴趣。现有的截止点方法和工具是基于连续变量与另一个变量(如表型、疾病状态或治疗组)的关联来确定截止点。这些方法不适用于描述性研究,因为在这些研究中,连续变量是在与任何有意义的生物学变量没有已知关联的情况下报告的。

结果

在高通量描述性研究中,连续变量的排序分布最常见的形状是双相曲线,其中第一相包括大量随着等级缓慢增长的变量,第二相包括数量较少的变量,随着等级迅速增长。本研究描述了一种简单的算法,可以识别这些相之间的边界作为截止点。

讨论

该方法的主要假设是,少数具有高值的变量主导着生物系统,并决定其主要的过程和功能。该方法在三个不同的数据集上进行了测试:人类基因及其在人类大脑皮层中的表达值、哺乳动物基因及其对化学暴露的敏感性值,以及人类蛋白质及其在人类心脏中的表达值。在每种情况下,所描述的截止识别方法都产生了变量(基因、蛋白质)的短名单,这些变量与分析生物系统的主要功能/途径高度相关。

结论

所描述的截止识别方法可用于在无法使用其他数据二值化方法的情况下,对描述性组学研究中的变量进行优先排序,以进行重点功能分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aa4/8922865/6742754cd279/12864_2022_8427_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aa4/8922865/6742754cd279/12864_2022_8427_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4aa4/8922865/6742754cd279/12864_2022_8427_Fig1_HTML.jpg

相似文献

1
Simple method for cutoff point identification in descriptive high-throughput biological studies.描述性高通量生物学研究中临界点的识别方法。
BMC Genomics. 2022 Mar 14;23(1):204. doi: 10.1186/s12864-022-08427-6.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
Evaluate Cutpoints: Adaptable continuous data distribution system for determining survival in Kaplan-Meier estimator.评估切点:适用于 Kaplan-Meier 估计器的可适应连续数据分布系统,用于确定生存情况。
Comput Methods Programs Biomed. 2019 Aug;177:133-139. doi: 10.1016/j.cmpb.2019.05.023. Epub 2019 May 23.
4
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
5
Engineering Aspects of Olfaction嗅觉的工程学方面
6
Using iterative cluster merging with improved gap statistics to perform online phenotype discovery in the context of high-throughput RNAi screens.在高通量RNA干扰筛选的背景下,使用带有改进间隙统计量的迭代聚类合并来进行在线表型发现。
BMC Bioinformatics. 2008 Jun 5;9:264. doi: 10.1186/1471-2105-9-264.
7
petal: Co-expression network modelling in R.花瓣:R语言中的共表达网络建模
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):51. doi: 10.1186/s12918-016-0298-8.
8
Determining the cutoff based on a continuous variable to define two populations with application to vaccines.基于连续变量确定截断值以定义两个群体并应用于疫苗。
J Biopharm Stat. 2013 May;23(3):662-80. doi: 10.1080/10543406.2012.756502.
9
A random forest algorithm-based approach to capture latent decision variables and their cutoff values.基于随机森林算法的方法来捕捉潜在的决策变量及其截断值。
J Biomed Inform. 2020 Oct;110:103548. doi: 10.1016/j.jbi.2020.103548. Epub 2020 Aug 28.
10
Enhancement of Plant Productivity in the Post-Genomics Era.后基因组时代植物生产力的提高
Curr Genomics. 2016 Aug;17(4):295-6. doi: 10.2174/138920291704160607182507.

引用本文的文献

1
Underexplored Molecular Mechanisms of Toxicity.毒性的未充分探索的分子机制
J Xenobiot. 2024 Jul 18;14(3):939-949. doi: 10.3390/jox14030052.
2
Prognostic role of TEAD4 in TNBC: in-silico inhibition of the TEAD4-YAP interaction by flufenamic acid analogs.TEAD4在三阴性乳腺癌中的预后作用:氟芬那酸类似物对TEAD4-YAP相互作用的计算机模拟抑制
In Silico Pharmacol. 2024 Jul 17;12(2):64. doi: 10.1007/s40203-024-00239-8. eCollection 2024.
3
Sequential disruption of SPLASH-identified vRNA-vRNA interactions challenges their role in influenza A virus genome packaging.

本文引用的文献

1
Unbiased approach for the identification of molecular mechanisms sensitive to chemical exposures.针对化学暴露敏感的分子机制的无偏鉴定方法。
Chemosphere. 2021 Jan;262:128362. doi: 10.1016/j.chemosphere.2020.128362. Epub 2020 Sep 18.
2
The GTEx Consortium atlas of genetic regulatory effects across human tissues.GTEx 联盟人类组织遗传调控效应图谱
Science. 2020 Sep 11;369(6509):1318-1330. doi: 10.1126/science.aaz1776.
3
Evaluate Cutpoints: Adaptable continuous data distribution system for determining survival in Kaplan-Meier estimator.
顺序破坏 SPLASH 鉴定的 vRNA-vRNA 相互作用挑战了它们在甲型流感病毒基因组包装中的作用。
Nucleic Acids Res. 2023 Jul 7;51(12):6479-6494. doi: 10.1093/nar/gkad442.
4
Towards Whole Health Toxicology: In-Silico Prediction of Diseases Sensitive to Multi-Chemical Exposures.迈向全健康毒理学:对多化学物质暴露敏感疾病的计算机模拟预测
Toxics. 2022 Dec 8;10(12):764. doi: 10.3390/toxics10120764.
5
Chemical Exposures Affect Innate Immune Response to SARS-CoV-2.化学暴露会影响对 SARS-CoV-2 的先天免疫反应。
Int J Mol Sci. 2021 Nov 19;22(22):12474. doi: 10.3390/ijms222212474.
评估切点:适用于 Kaplan-Meier 估计器的可适应连续数据分布系统,用于确定生存情况。
Comput Methods Programs Biomed. 2019 Aug;177:133-139. doi: 10.1016/j.cmpb.2019.05.023. Epub 2019 May 23.
4
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets.Metascape 为系统水平数据集的分析提供了面向生物学家的资源。
Nat Commun. 2019 Apr 3;10(1):1523. doi: 10.1038/s41467-019-09234-6.
5
EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data.EmptyDrops:用于区分基于液滴的单细胞 RNA 测序数据中的细胞和空液滴。
Genome Biol. 2019 Mar 22;20(1):63. doi: 10.1186/s13059-019-1662-y.
6
Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets.利用纳升液滴对单个细胞进行高度并行的全基因组表达谱分析。
Cell. 2015 May 21;161(5):1202-1214. doi: 10.1016/j.cell.2015.05.002.
7
Gateways to the FANTOM5 promoter level mammalian expression atlas.通向FANTOM5启动子水平哺乳动物表达图谱的途径。
Genome Biol. 2015 Jan 5;16(1):22. doi: 10.1186/s13059-014-0560-6.
8
Proteomics. Tissue-based map of the human proteome.蛋白质组学。人类蛋白质组组织图谱。
Science. 2015 Jan 23;347(6220):1260419. doi: 10.1126/science.1260419.
9
A draft map of the human proteome.人类蛋白质组草图。
Nature. 2014 May 29;509(7502):575-81. doi: 10.1038/nature13302.
10
Cutoff Finder: a comprehensive and straightforward Web application enabling rapid biomarker cutoff optimization.Cutoff Finder:一个全面而直接的网络应用程序,可实现快速生物标志物截止值优化。
PLoS One. 2012;7(12):e51862. doi: 10.1371/journal.pone.0051862. Epub 2012 Dec 14.