The top-scoring 'N' algorithm: a generalized relative expression classification method from small numbers of biomolecules.

Affiliation

Institute for Systems Biology, Seattle, WA 98109, USA.

Publication information

BMC Bioinformatics. 2012 Sep 11;13:227. doi: 10.1186/1471-2105-13-227.

DOI: 10.1186/1471-2105-13-227
PMID: 22966958
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3663421/
Abstract

BACKGROUND

Relative expression algorithms such as the top-scoring pair (TSP) and the top-scoring triplet (TST) have several strengths that distinguish them from other classification methods, including resistance to overfitting, invariance to most data normalization methods, and biological interpretability. The top-scoring 'N' (TSN) algorithm is a generalized form of other relative expression algorithms which uses generic permutations and a dynamic classifier size to control both the permutation and combination space available for classification.
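
To make the idea of a relative expression classifier concrete, here is a minimal Python sketch of the simplest case mentioned above, a top-scoring pair. It is not the authors' implementation: the function names `fit_tsp` and `predict_tsp`, the toy data, and the scoring rule (the absolute difference between the two classes in how often feature i is measured below feature j) are assumptions for illustration, and both classes are assumed to be present in the training data.

```python
from itertools import combinations


def fit_tsp(samples, labels):
    """Return (i, j, class_if_less) for the highest-scoring feature pair.

    samples: list of equal-length lists of expression values
    labels:  list of 0/1 class labels, one per sample
    Assumes each class has at least one sample (hypothetical helper).
    """
    n_features = len(samples[0])
    groups = {
        0: [s for s, y in zip(samples, labels) if y == 0],
        1: [s for s, y in zip(samples, labels) if y == 1],
    }
    best = None
    for i, j in combinations(range(n_features), 2):
        # Fraction of samples in each class where feature i is below feature j.
        p = {c: sum(s[i] < s[j] for s in g) / len(g) for c, g in groups.items()}
        score = abs(p[0] - p[1])
        # Strict ">" keeps the first pair found when scores tie.
        if best is None or score > best[0]:
            # "x_i < x_j" votes for the class in which it is more frequent.
            best = (score, i, j, 0 if p[0] > p[1] else 1)
    _, i, j, class_if_less = best
    return i, j, class_if_less


def predict_tsp(model, sample):
    """Classify one sample from the ordering of the two selected features."""
    i, j, class_if_less = model
    return class_if_less if sample[i] < sample[j] else 1 - class_if_less


# Toy data (made up): three features, two samples per class.
samples = [[1.0, 5.0, 3.0], [2.0, 6.0, 2.5], [7.0, 1.0, 3.0], [8.0, 2.0, 2.9]]
labels = [0, 0, 1, 1]

model = fit_tsp(samples, labels)
print(model)                                   # (0, 1, 0)
print(predict_tsp(model, [3.0, 9.0, 4.0]))     # 0
print(predict_tsp(model, [30.0, 90.0, 40.0]))  # 0, same sample scaled by 10
print(predict_tsp(model, [9.0, 3.0, 4.0]))     # 1
```

Because the decision depends only on the ordering of the two values within a sample, rescaling a sample by any positive factor leaves the prediction unchanged, which is the kind of normalization invariance the abstract refers to.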

RESULTS

TSN was tested on nine cancer datasets, showing statistically significant differences in classification accuracy between classifier sizes (choices of N). TSN also performed competitively against a wide variety of classification methods, including artificial neural networks, classification trees, discriminant analysis, k-nearest neighbor, naïve Bayes, and support vector machines, when tested on the Microarray Quality Control II datasets. Furthermore, TSN exhibits low levels of overfitting on training data compared to other methods, giving confidence that results obtained during cross-validation will be more generally applicable to external validation sets.

CONCLUSIONS

TSN preserves the strengths of other relative expression algorithms while allowing a much larger permutation and combination space to be explored, potentially improving classification accuracy when fewer measured features are available.
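
A rough way to see how quickly that space grows is to count it. The sketch below assumes, as an illustrative simplification, that a size-N classifier is defined by a choice of N features out of those measured plus an ordering among them; the abstract does not spell out how TSN enumerates or restricts the space, and the function name `search_space` and the feature count of 100 are hypothetical.

```python
from math import comb, factorial


def search_space(n_features, n):
    """Count feature combinations, orderings per combination, and their product.

    Assumption (not from the source): a candidate classifier is one ordered
    arrangement of n distinct features chosen from n_features.
    """
    combos = comb(n_features, n)   # ways to choose which n features to use
    orderings = factorial(n)       # possible rank orderings of those n features
    return combos, orderings, combos * orderings


for n in (2, 3, 4):
    print(n, search_space(100, n))
# 2 (4950, 2, 9900)
# 3 (161700, 6, 970200)
# 4 (3921225, 24, 94109400)
```

Under this assumption, moving from pairs to quadruplets on just 100 features enlarges the candidate space by roughly four orders of magnitude, which suggests why a larger N can still offer many candidate classifiers when few features are measured.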
