Reclassification as supervised clustering.

Authors

Sierra A, Corbacho F

Affiliation

Escuela Técnica Superior de Informática, Universidad Autónoma de Madrid, Spain.

Publication information

Neural Comput. 2000 Nov;12(11):2537-46. doi: 10.1162/089976600300014836.

DOI:10.1162/089976600300014836
PMID:11110126
Abstract

In some branches of science, such as molecular biology, classes may be defined but not completely trusted. Sometimes posterior analysis proves them to be partially incorrect. Despite its relevance, this phenomenon has not received much attention within the neural computation community. We define reclassification as the task of redefining some given classes by maximum likelihood learning in a model that contains both supervised and unsupervised information. This approach leads to supervised clustering with an additional complexity penalizing term on the number of new classes. As a proof of concept, a simple reclassification algorithm is designed and applied to a data set of gene sequences. To test the performance of the algorithm, two of the original classes are merged. The algorithm is capable of unraveling the original three-class hidden structure, in contrast to the unsupervised version (K-means); moreover, it predicts the subdivision of one of the original classes into two different ones.
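The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a penalized squared-error objective as a stand-in for the paper's likelihood: each given class is split by K-means into k sub-clusters, and the k that minimizes within-cluster squared error plus `penalty * k` is kept. The names `kmeans`, `reclassify`, `penalty`, and `max_split`, and the toy 2-D points, are all hypothetical.

```python
from math import dist


def kmeans(points, k, iters=50):
    """Plain K-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        # Next centre: the point farthest from all centres chosen so far.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda j: dist(p, centers[j])) for p in points]
        new_centers = []
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep an empty cluster's centre
        if new_centers == centers:
            break
        centers = new_centers
    sse = sum(dist(p, centers[a]) ** 2 for p, a in zip(points, assign))
    return assign, sse


def reclassify(points, labels, penalty, max_split=3):
    """Split each given class into sub-clusters (assumed objective:
    within-cluster squared error + penalty * number of sub-clusters)."""
    new_labels = [None] * len(points)
    for cls in sorted(set(labels)):
        idx = [i for i, lab in enumerate(labels) if lab == cls]
        pts = [points[i] for i in idx]
        best = None
        for k in range(1, min(max_split, len(pts)) + 1):
            assign, sse = kmeans(pts, k)
            cost = sse + penalty * k
            if best is None or cost < best[0]:
                best = (cost, assign)
        for i, a in zip(idx, best[1]):
            new_labels[i] = (cls, a)  # new class = (given class, sub-cluster)
    return new_labels


# Toy data: three tight groups, two of which were merged under the label "AB".
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (0, 10), (0, 11), (1, 10), (1, 11)]
labels = ["AB"] * 8 + ["C"] * 4
new_labels = reclassify(points, labels, penalty=10.0)
print(sorted(set(new_labels)))
```

On this toy data the merged class "AB" is split in two and "C" is left whole, so three classes come back, loosely mirroring the merge-and-recover test the abstract describes. The outcome depends on the assumed `penalty` value: a much larger penalty would leave "AB" unsplit, and a much smaller one would over-split every class.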

Similar articles

1
Reclassification as supervised clustering.
Neural Comput. 2000 Nov;12(11):2537-46. doi: 10.1162/089976600300014836.
2
Systematic learning of gene functional classes from DNA array expression data by using multilayer perceptrons.
Genome Res. 2002 Nov;12(11):1703-15. doi: 10.1101/gr.192502.
3
Data classification with radial basis function networks based on a novel kernel density estimation algorithm.
IEEE Trans Neural Netw. 2005 Jan;16(1):225-36. doi: 10.1109/TNN.2004.836229.
4
Detection of compositional constraints in nucleic acid sequences using neural networks.
Comput Appl Biosci. 1995 Feb;11(1):29-37. doi: 10.1093/bioinformatics/11.1.29.
5
Best harmony, unified RPCL and automated model selection for unsupervised and supervised learning on Gaussian mixtures, three-layer nets and ME-RBF-SVM models.
Int J Neural Syst. 2001 Feb;11(1):43-69. doi: 10.1142/S0129065701000497.
6
Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks.
Nucleic Acids Res. 1993 Feb 11;21(3):607-13. doi: 10.1093/nar/21.3.607.
7
Supervised neural network modeling: an empirical investigation into learning from imbalanced data with labeling errors.
IEEE Trans Neural Netw. 2010 May;21(5):813-30. doi: 10.1109/TNN.2010.2042730. Epub 2010 Mar 15.
8
Parametric embedding for class visualization.
Neural Comput. 2007 Sep;19(9):2536-56. doi: 10.1162/neco.2007.19.9.2536.
9
Online semi-supervised growing neural gas.
Int J Neural Syst. 2012 Oct;22(5):1250023. doi: 10.1142/S0129065712500232. Epub 2012 Sep 19.
10
Review of MR image segmentation techniques using pattern recognition.
Med Phys. 1993 Jul-Aug;20(4):1033-48. doi: 10.1118/1.597000.

Cited by

1
A robust approach based on Weibull distribution for clustering gene expression data.
Algorithms Mol Biol. 2011 May 31;6(1):14. doi: 10.1186/1748-7188-6-14.