• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将分子特征计算为双目标函数的最优解:方法及其在肿瘤基因组学预测中的应用

Computing molecular signatures as optima of a bi-objective function: method and application to prediction in oncogenomics.

作者信息

Gardeux Vincent, Chelouah Rachid, Wanderley Maria F Barbosa, Siarry Patrick, Braga Antônio P, Reyal Fabien, Rouzier Roman, Pusztai Lajos, Natowicz René

机构信息

EISTI engineering school, Department of Computer Science, Cergy, France. ; LISSI laboratory, University of Paris-Est, Créteil, France.

EISTI engineering school, Department of Computer Science, Cergy, France.

出版信息

Cancer Inform. 2015 Apr 19;14:33-45. doi: 10.4137/CIN.S21111. eCollection 2015.

DOI:10.4137/CIN.S21111
PMID:25983540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4426938/
Abstract

BACKGROUND

Filter feature selection methods compute molecular signatures by selecting subsets of genes in the ranking of a valuation function. The motivations of the valuation functions choice are almost always clearly stated, but those for selecting the genes according to their ranking are hardly ever explicit.

METHOD

We addressed the computation of molecular signatures by searching the optima of a bi-objective function whose solution space was the set of all possible molecular signatures, ie, the set of subsets of genes. The two objectives were the size of the signature-to be minimized-and the interclass distance induced by the signature-to be maximized-.

RESULTS

We showed that: 1) the convex combination of the two objectives had exactly n optimal non empty signatures where n was the number of genes, 2) the n optimal signatures were nested, and 3) the optimal signature of size k was the subset of k top ranked genes that contributed the most to the interclass distance. We applied our feature selection method on five public datasets in oncology, and assessed the prediction performances of the optimal signatures as input to the diagonal linear discriminant analysis (DLDA) classifier. They were at the same level or better than the best-reported ones. The predictions were robust, and the signatures were almost always significantly smaller. We studied in more details the performances of our predictive modeling on two breast cancer datasets to predict the response to a preoperative chemotherapy: the performances were higher than the previously reported ones, the signatures were three times smaller (11 versus 30 gene signatures), and the genes member of the signature were known to be involved in the response to chemotherapy.

CONCLUSIONS

Defining molecular signatures as the optima of a bi-objective function that combined the signature size and the interclass distance was well founded and efficient for prediction in oncogenomics. The complexity of the computation was very low because the optimal signatures were the sets of genes in the ranking of their valuation. Software can be freely downloaded from http://gardeux-vincent.eu/DeltaRanking.php.

摘要

背景

过滤特征选择方法通过在评估函数的排名中选择基因子集来计算分子特征。评估函数选择的动机几乎总是明确阐述的,但根据基因排名来选择基因的动机却几乎从未明确说明。

方法

我们通过搜索一个双目标函数的最优解来解决分子特征的计算问题,该双目标函数的解空间是所有可能的分子特征集,即基因子集的集合。这两个目标分别是特征的大小(要最小化)和由特征诱导的类间距离(要最大化)。

结果

我们表明:1)两个目标的凸组合恰好有n个最优非空特征,其中n是基因的数量;2)这n个最优特征是嵌套的;3)大小为k的最优特征是对类间距离贡献最大的k个排名靠前的基因的子集。我们将我们的特征选择方法应用于肿瘤学的五个公共数据集,并评估了最优特征作为对角线性判别分析(DLDA)分类器输入的预测性能。它们与报告的最佳性能处于同一水平或更好。预测是稳健的,并且特征几乎总是明显更小。我们更详细地研究了我们在两个乳腺癌数据集上进行预测建模以预测术前化疗反应的性能:性能高于先前报道的性能,特征小三倍(11个基因特征对30个基因特征),并且已知特征中的基因成员参与化疗反应。

结论

将分子特征定义为结合特征大小和类间距离的双目标函数的最优解,在肿瘤基因组学预测中是有充分依据且有效的。计算复杂度非常低,因为最优特征是其评估排名中的基因集。软件可从http://gardeux - vincent.eu/DeltaRanking.php免费下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/59a61b2ad754/cin-14-2015-033f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/426267b31196/cin-14-2015-033f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/7721f6d83b7b/cin-14-2015-033f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/59a61b2ad754/cin-14-2015-033f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/426267b31196/cin-14-2015-033f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/7721f6d83b7b/cin-14-2015-033f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d768/4426938/59a61b2ad754/cin-14-2015-033f3.jpg

相似文献

1
Computing molecular signatures as optima of a bi-objective function: method and application to prediction in oncogenomics.将分子特征计算为双目标函数的最优解:方法及其在肿瘤基因组学预测中的应用
Cancer Inform. 2015 Apr 19;14:33-45. doi: 10.4137/CIN.S21111. eCollection 2015.
2
Prediction of early breast cancer metastasis from DNA microarray data using high-dimensional cox regression models.使用高维Cox回归模型从DNA微阵列数据预测早期乳腺癌转移
Cancer Inform. 2015 May 5;14(Suppl 2):129-38. doi: 10.4137/CIN.S17284. eCollection 2015.
3
Genes and functions from breast cancer signatures.乳腺癌标志物的基因和功能。
BMC Cancer. 2018 Apr 27;18(1):473. doi: 10.1186/s12885-018-4388-4.
4
Evaluation of public cancer datasets and signatures identifies TP53 mutant signatures with robust prognostic and predictive value.对公共癌症数据集和特征的评估确定了具有强大预后和预测价值的TP53突变特征。
BMC Cancer. 2015 Mar 26;15:179. doi: 10.1186/s12885-015-1102-7.
5
Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm.使用帕累托最优聚类算法从RNA测序数据中识别基因特征。
BMC Syst Biol. 2018 Dec 21;12(Suppl 8):126. doi: 10.1186/s12918-018-0650-2.
6
Hybrid method for prediction of metastasis in breast cancer patients using gene expression signals.利用基因表达信号预测乳腺癌患者转移的混合方法。
J Med Signals Sens. 2013 Apr;3(2):79-86.
7
Robust selection of cancer survival signatures from high-throughput genomic data using two-fold subsampling.利用双重抽样从高通量基因组数据中稳健选择癌症生存特征
PLoS One. 2014 Oct 8;9(10):e108818. doi: 10.1371/journal.pone.0108818. eCollection 2014.
8
biosigner: A New Method for the Discovery of Significant Molecular Signatures from Omics Data.生物标志物:一种从组学数据中发现显著分子特征的新方法。
Front Mol Biosci. 2016 Jun 21;3:26. doi: 10.3389/fmolb.2016.00026. eCollection 2016.
9
Predictors of primary breast cancers responsiveness to preoperative epirubicin/cyclophosphamide-based chemotherapy: translation of microarray data into clinically useful predictive signatures.原发性乳腺癌对术前表柔比星/环磷酰胺化疗反应的预测指标:将微阵列数据转化为临床有用的预测特征
J Transl Med. 2005 Aug 9;3:32. doi: 10.1186/1479-5876-3-32.
10
Construction and optimization of gene expression signatures for prediction of survival in two-arm clinical trials.构建和优化基因表达谱,用于预测双臂临床试验中的生存情况。
BMC Bioinformatics. 2020 Jul 25;21(1):333. doi: 10.1186/s12859-020-03655-7.

引用本文的文献

1
Framework for feature selection of predicting the diagnosis and prognosis of necrotizing enterocolitis.预测坏死性小肠结肠炎的诊断和预后的特征选择框架。
PLoS One. 2022 Aug 19;17(8):e0273383. doi: 10.1371/journal.pone.0273383. eCollection 2022.
2
RIFS: a randomly restarted incremental feature selection algorithm.RIFS:一种随机重启的增量特征选择算法。
Sci Rep. 2017 Oct 12;7(1):13013. doi: 10.1038/s41598-017-13259-6.

本文引用的文献

1
Interrogating differences in expression of targeted gene sets to predict breast cancer outcome.分析靶向基因集表达的差异预测乳腺癌的预后。
BMC Cancer. 2013 Jul 2;13:326. doi: 10.1186/1471-2407-13-326.
2
The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures.特征选择方法对分子特征准确性、稳定性和可解释性的影响。
PLoS One. 2011;6(12):e28210. doi: 10.1371/journal.pone.0028210. Epub 2011 Dec 21.
3
Lack of sufficiently strong informative features limits the potential of gene expression analysis as predictive tool for many clinical classification problems.
缺乏足够有力的信息特征限制了基因表达分析作为预测工具在许多临床分类问题中的潜力。
BMC Bioinformatics. 2011 Dec 1;12:463. doi: 10.1186/1471-2105-12-463.
4
Estrogen receptor (ESR1) mRNA expression and benefit from tamoxifen in the treatment and prevention of estrogen receptor-positive breast cancer.雌激素受体(ESR1)mRNA 表达与他莫昔芬在治疗和预防雌激素受体阳性乳腺癌中的获益。
J Clin Oncol. 2011 Nov 1;29(31):4160-7. doi: 10.1200/JCO.2010.32.9615. Epub 2011 Sep 26.
5
The tumour suppressor SOX11 is associated with improved survival among high grade epithelial ovarian cancers and is regulated by reversible promoter methylation.抑癌基因 SOX11 与高级别上皮性卵巢癌患者的生存改善相关,且受到可逆转的启动子甲基化调控。
BMC Cancer. 2011 Sep 24;11:405. doi: 10.1186/1471-2407-11-405.
6
ESR1 is co-expressed with closely adjacent uncharacterised genes spanning a breast cancer susceptibility locus at 6q25.1.ESR1 与紧邻的未被描述的基因共同表达,这些基因跨越了位于 6q25.1 的乳腺癌易感性位点。
PLoS Genet. 2011 Apr;7(4):e1001382. doi: 10.1371/journal.pgen.1001382. Epub 2011 Apr 28.
7
Tumor suppressor RARRES1 interacts with cytoplasmic carboxypeptidase AGBL2 to regulate the α-tubulin tyrosination cycle.抑癌基因 RARRES1 与细胞质羧肽酶 AGBL2 相互作用,调节α-微管蛋白的酪氨酸化循环。
Cancer Res. 2011 Feb 15;71(4):1219-28. doi: 10.1158/0008-5472.CAN-10-2294. Epub 2011 Feb 8.
8
Evaluation of a 30-gene paclitaxel, fluorouracil, doxorubicin, and cyclophosphamide chemotherapy response predictor in a multicenter randomized trial in breast cancer.多中心随机临床试验中紫杉醇、氟尿嘧啶、多柔比星和环磷酰胺 30 基因化疗反应预测因子的评价。
Clin Cancer Res. 2010 Nov 1;16(21):5351-61. doi: 10.1158/1078-0432.CCR-10-1265. Epub 2010 Sep 9.
9
The pro-metastatic protein anterior gradient-2 predicts poor prognosis in tamoxifen-treated breast cancers.促转移蛋白前梯度-2 预测他莫昔芬治疗的乳腺癌预后不良。
Oncogene. 2010 Aug 26;29(34):4838-47. doi: 10.1038/onc.2010.228. Epub 2010 Jun 7.
10
Advances in metaheuristics for gene selection and classification of microarray data.元启发式算法在基因选择和微阵列数据分析分类中的应用进展。
Brief Bioinform. 2010 Jan;11(1):127-41. doi: 10.1093/bib/bbp035. Epub 2009 Sep 29.