• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

特征列表综合分析的等价方法。

An equivalence approach to the integrative analysis of feature lists.

机构信息

Genetics, Microbiology and Statistics Department, Universitat de Barcelona, Avinguda Diagonal, 648, Barcelona, 08028, Spain.

出版信息

BMC Bioinformatics. 2019 Aug 27;20(1):441. doi: 10.1186/s12859-019-3008-x.

DOI:10.1186/s12859-019-3008-x
PMID:31455218
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6712676/
Abstract

BACKGROUND

Although a few comparison methods based on the biological meaning of gene lists have been developed, the goProfiles approach is one of the few that are being used for that purpose. It consists of projecting lists of genes into predefined levels of the Gene Ontology, in such a way that a multinomial model can be used for estimation and testing. Of particular interest is the fact that it may be used for proving equivalence (in the sense of "enough similarity") between two lists, instead of proving differences between them, which seems conceptually better suited to the end goal of establishing similarity among gene lists. An equivalence method has been derived that uses a distance-based approach and the confidence interval inclusion principle. Equivalence is declared if the upper limit of a one-sided confidence interval for the distance between two profiles is below a pre-established equivalence limit.

RESULTS

In this work, this method is extended to establish the equivalence of any number of gene lists. Additionally, an algorithm to obtain the smallest equivalence limit that would allow equivalence between two or more lists to be declared is presented. This algorithm is at the base of an iterative method of graphic visualization to represent the most to least equivalent gene lists. These methods deal adequately with the problem of adjusting for multiple testing. The applicability of these techniques is illustrated in two typical situations: (i) a collection of cancer-related gene lists, suggesting which of them are more reasonable to combine -as claimed by the authors- and (ii) a collection of pathogenesis-based transcript sets, showing which of these are more closely related. The methods developed are available in the goProfiles Bioconductor package.

CONCLUSIONS

The method provides a simple yet powerful and statistically well-grounded way to classify a set of genes or other feature lists by establishing their equivalence at a given equivalence threshold. The classification results can be viewed using standard visualization methods. This may be applied to a variety of problems, from deciding whether a series of datasets generating the lists can be combined to the simplification of groups of lists.

摘要

背景

虽然已经开发了一些基于基因列表生物学意义的比较方法,但 goProfiles 方法是少数用于此目的的方法之一。它由将基因列表投影到预先定义的基因本体论水平组成,以便可以使用多项式模型进行估计和测试。特别有趣的是,它可以用于证明两个列表之间的等效性(在“足够相似”的意义上),而不是证明它们之间的差异,这似乎在概念上更适合于建立基因列表之间相似性的最终目标。已经得出了一种使用基于距离的方法和置信区间包含原理的等效性方法。如果两个配置文件之间距离的单侧置信区间上限低于预先设定的等效极限,则声明等效性。

结果

在这项工作中,该方法扩展到建立任意数量的基因列表的等效性。此外,还提出了一种算法来获得允许声明两个或更多列表之间等效性的最小等效极限。该算法是一种图形可视化迭代方法的基础,用于表示最等效和最不等效的基因列表。这些方法适当地解决了多重测试调整的问题。这些技术的适用性在两种典型情况下得到了说明:(i)一组与癌症相关的基因列表,建议其中哪些更合理-正如作者所声称的那样-和(ii)一组基于发病机制的转录组,显示哪些更密切相关。开发的方法可在 goProfiles Bioconductor 包中使用。

结论

该方法提供了一种简单但强大且具有统计学依据的方法,通过在给定的等效阈值下建立它们的等效性来对一组基因或其他特征列表进行分类。分类结果可以使用标准可视化方法查看。这可以应用于各种问题,从决定是否可以组合生成列表的一系列数据集到简化列表组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/2a733408a37f/12859_2019_3008_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/d28189b5b85f/12859_2019_3008_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/7eb2858ad506/12859_2019_3008_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/a896bd4f3695/12859_2019_3008_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/7260524c8026/12859_2019_3008_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/2a733408a37f/12859_2019_3008_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/d28189b5b85f/12859_2019_3008_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/7eb2858ad506/12859_2019_3008_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/a896bd4f3695/12859_2019_3008_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/7260524c8026/12859_2019_3008_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/630f/6712676/2a733408a37f/12859_2019_3008_Fig5_HTML.jpg

相似文献

1
An equivalence approach to the integrative analysis of feature lists.特征列表综合分析的等价方法。
BMC Bioinformatics. 2019 Aug 27;20(1):441. doi: 10.1186/s12859-019-3008-x.
2
An equivalence test between features lists, based on the Sorensen-Dice index and the joint frequencies of GO term enrichment.基于 Sorensen-Dice 指数和 GO 术语富集的共同频率对特征列表进行等效性检验。
BMC Bioinformatics. 2022 May 31;23(1):207. doi: 10.1186/s12859-022-04739-2.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Comparison of lists of genes based on functional profiles.基于功能谱的基因列表比较。
BMC Bioinformatics. 2011 Oct 16;12:401. doi: 10.1186/1471-2105-12-401.
5
Multiple testing to establish superiority/equivalence of a new treatment compared with kappa standard treatments.进行多次检验以确定一种新疗法相对于κ种标准疗法的优越性/等效性。
Stat Med. 1997 Nov 15;16(21):2489-506. doi: 10.1002/(sici)1097-0258(19971115)16:21<2489::aid-sim684>3.0.co;2-d.
6
GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data.GOexpress:一个用于通过对基因表达数据进行监督学习来识别和可视化稳健基因本体特征的R/Bioconductor软件包。
BMC Bioinformatics. 2016 Mar 11;17:126. doi: 10.1186/s12859-016-0971-3.
7
Extending pathways based on gene lists using InterPro domain signatures.使用InterPro结构域签名基于基因列表扩展通路。
BMC Bioinformatics. 2008 Jan 4;9:3. doi: 10.1186/1471-2105-9-3.
8
Elements of significance testing with equivalence problems.
Methods Inf Med. 1991 Aug;30(3):194-8.
9
Mulcom: a multiple comparison statistical test for microarray data in Bioconductor.Mulcom:Bioconductor 中用于微阵列数据的多重比较统计检验。
BMC Bioinformatics. 2011 Sep 28;12:382. doi: 10.1186/1471-2105-12-382.
10
Application of genetic algorithms and constructive neural networks for the analysis of microarray cancer data.遗传算法和构造神经网络在微阵列癌症数据分析中的应用。
Theor Biol Med Model. 2014 May 7;11 Suppl 1(Suppl 1):S7. doi: 10.1186/1742-4682-11-S1-S7.

引用本文的文献

1
Network pharmacology and AI in cancer research uncovering biomarkers and therapeutic targets for RALGDS mutations.癌症研究中的网络药理学与人工智能:揭示RALGDS突变的生物标志物和治疗靶点
Sci Rep. 2025 Mar 29;15(1):10938. doi: 10.1038/s41598-025-91568-x.
2
An equivalence test between features lists, based on the Sorensen-Dice index and the joint frequencies of GO term enrichment.基于 Sorensen-Dice 指数和 GO 术语富集的共同频率对特征列表进行等效性检验。
BMC Bioinformatics. 2022 May 31;23(1):207. doi: 10.1186/s12859-022-04739-2.

本文引用的文献

1
PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements.PANTHER 版本 11:来自基因本体论和 Reactome 通路的注释数据扩展,以及数据分析工具增强。
Nucleic Acids Res. 2017 Jan 4;45(D1):D183-D189. doi: 10.1093/nar/gkw1138. Epub 2016 Nov 29.
2
Semantic Similarity in the Gene Ontology.基因本体论中的语义相似性。
Methods Mol Biol. 2017;1446:161-173. doi: 10.1007/978-1-4939-3743-1_12.
3
Primer on the Gene Ontology.基因本体论入门
Methods Mol Biol. 2017;1446:25-37. doi: 10.1007/978-1-4939-3743-1_3.
4
VennPainter: A Tool for the Comparison and Identification of Candidate Genes Based on Venn Diagrams.VennPainter:一种基于维恩图比较和鉴定候选基因的工具。
PLoS One. 2016 Apr 27;11(4):e0154315. doi: 10.1371/journal.pone.0154315. eCollection 2016.
5
A description of the Molecular Signatures Database (MSigDB) Web site.分子特征数据库(MSigDB)网站的描述。
Methods Mol Biol. 2014;1150:153-60. doi: 10.1007/978-1-4939-0512-6_9.
6
CORaL: comparison of ranked lists for analysis of gene expression data.CORaL:用于基因表达数据分析的排名列表比较
J Comput Biol. 2013 Jun;20(6):433-43. doi: 10.1089/cmb.2013.0017. Epub 2013 May 15.
7
clusterProfiler: an R package for comparing biological themes among gene clusters.clusterProfiler:一个用于比较基因簇间生物学主题的 R 包。
OMICS. 2012 May;16(5):284-7. doi: 10.1089/omi.2011.0118. Epub 2012 Mar 28.
8
Ten years of pathway analysis: current approaches and outstanding challenges.十年的通路分析:当前方法和突出挑战。
PLoS Comput Biol. 2012;8(2):e1002375. doi: 10.1371/journal.pcbi.1002375. Epub 2012 Feb 23.
9
Safe harbours for the integration of new DNA in the human genome.人类基因组中新 DNA 整合的安全港。
Nat Rev Cancer. 2011 Dec 1;12(1):51-8. doi: 10.1038/nrc3179.
10
Comparison of lists of genes based on functional profiles.基于功能谱的基因列表比较。
BMC Bioinformatics. 2011 Oct 16;12:401. doi: 10.1186/1471-2105-12-401.