• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种金融数据的综合聚类检测、优化和解释方法。

An Integrated Cluster Detection, Optimization, and Interpretation Approach for Financial Data.

出版信息

IEEE Trans Cybern. 2022 Dec;52(12):13848-13861. doi: 10.1109/TCYB.2021.3109066. Epub 2022 Nov 18.

DOI:10.1109/TCYB.2021.3109066
PMID:34550896
Abstract

In many financial applications, such as fraud detection, reject inference, and credit evaluation, detecting clusters automatically is critical because it helps to understand the subpatterns of the data that can be used to infer user's behaviors and identify potential risks. Due to the complexity of human behaviors and changing social environments, the distributions of financial data are usually complex and it is challenging to find clusters and give reasonable interpretations. The goal of this study is to develop an integrated approach to detect clusters in financial data, and optimize the scope of the clusters such that the clusters can be easily interpreted. Specifically, we first proposed a new cluster quality evaluation criterion, which is free from large-scale computation and can guide base clustering algorithms such as k -Means to detect hyperellipsoidal clusters adaptively. Then, we designed a new solver for a revised support vector data description model, which efficiently refines the centroids and scopes of the detected clusters to make the clusters tighter such that the data in the clusters share greater similarities, and thus, the clusters can be easily interpreted with eigenvectors. Using ten financial datasets, the experiments showed that the proposed algorithm can efficiently find reasonable number of clusters. The proposed approach is suitable for large-scale financial datasets whose features are meaningful, and also applicable to financial mining tasks, such as data distribution interpretation and anomaly detection.

摘要

在许多金融应用中,如欺诈检测、拒绝推断和信用评估,自动检测聚类至关重要,因为它有助于理解数据的子模式,可用于推断用户的行为和识别潜在风险。由于人类行为的复杂性和不断变化的社会环境,金融数据的分布通常很复杂,很难找到聚类并给出合理的解释。本研究的目的是开发一种综合方法来检测金融数据中的聚类,并优化聚类的范围,以便聚类可以很容易地进行解释。具体来说,我们首先提出了一种新的聚类质量评估标准,该标准无需大规模计算,可以指导 k-Means 等基础聚类算法自适应地检测超椭球聚类。然后,我们设计了一个新的求解器来修正支持向量数据描述模型,该模型可以有效地细化检测到的聚类的质心和范围,使聚类更加紧密,从而使聚类中的数据具有更大的相似性,因此可以使用特征向量更容易地解释聚类。使用十个金融数据集的实验表明,所提出的算法可以有效地找到合理数量的聚类。所提出的方法适用于特征有意义的大规模金融数据集,也适用于金融挖掘任务,如数据分布解释和异常检测。

相似文献

1
An Integrated Cluster Detection, Optimization, and Interpretation Approach for Financial Data.一种金融数据的综合聚类检测、优化和解释方法。
IEEE Trans Cybern. 2022 Dec;52(12):13848-13861. doi: 10.1109/TCYB.2021.3109066. Epub 2022 Nov 18.
2
A multiple kernel density clustering algorithm for incomplete datasets in bioinformatics.一种用于生物信息学中不完整数据集的多核密度聚类算法。
BMC Syst Biol. 2018 Nov 22;12(Suppl 6):111. doi: 10.1186/s12918-018-0630-6.
3
PFClust: a novel parameter free clustering algorithm.PFClust:一种新颖的无参数聚类算法。
BMC Bioinformatics. 2013 Jul 3;14:213. doi: 10.1186/1471-2105-14-213.
4
Identification of Enterprise Financial Risk Based on Clustering Algorithm.基于聚类算法的企业财务风险识别。
Comput Intell Neurosci. 2022 Apr 12;2022:1086945. doi: 10.1155/2022/1086945. eCollection 2022.
5
Digital Industry Financial Risk Early Warning System Based on Improved K-Means Clustering Algorithm.基于改进 K-Means 聚类算法的数字产业金融风险预警系统。
Comput Intell Neurosci. 2022 May 28;2022:6797185. doi: 10.1155/2022/6797185. eCollection 2022.
6
Simultaneous gene clustering and subset selection for sample classification via MDL.通过最小描述长度实现用于样本分类的同步基因聚类和子集选择
Bioinformatics. 2003 Jun 12;19(9):1100-9. doi: 10.1093/bioinformatics/btg039.
7
Weighted graph cuts without eigenvectors a multilevel approach.无需特征向量的加权图割:一种多级方法。
IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):1944-57. doi: 10.1109/TPAMI.2007.1115.
8
Evolutionary Multiobjective Clustering and Its Applications to Patient Stratification.进化多目标聚类及其在患者分层中的应用。
IEEE Trans Cybern. 2019 May;49(5):1680-1693. doi: 10.1109/TCYB.2018.2817480. Epub 2018 Apr 2.
9
An effective density-based clustering and dynamic maintenance framework for evolving medical data streams.用于演化型医疗数据流的有效基于密度的聚类和动态维护框架。
Int J Med Inform. 2019 Jun;126:176-186. doi: 10.1016/j.ijmedinf.2019.03.016. Epub 2019 Mar 28.
10
DICON: interactive visual analysis of multidimensional clusters.DICON:多维聚类的交互式可视化分析。
IEEE Trans Vis Comput Graph. 2011 Dec;17(12):2581-90. doi: 10.1109/TVCG.2011.188.

引用本文的文献

1
EasyGraph: A multifunctional, cross-platform, and effective library for interdisciplinary network analysis.EasyGraph:一个用于跨学科网络分析的多功能、跨平台且高效的库。
Patterns (N Y). 2023 Sep 5;4(10):100839. doi: 10.1016/j.patter.2023.100839. eCollection 2023 Oct 13.
2
Financial literacy, behavioral traits, and ePayment adoption and usage in Japan.日本的金融素养、行为特征与电子支付的采用和使用情况。
Financ Innov. 2023;9(1):101. doi: 10.1186/s40854-023-00504-3. Epub 2023 Jun 11.
3
The aggregate and sectoral time-varying market efficiency during crisis periods in Turkey: a comparative analysis with COVID-19 outbreak and the global financial crisis.
土耳其危机时期的总体和部门随时间变化的市场效率:与新冠疫情爆发及全球金融危机的比较分析
Financ Innov. 2023;9(1):80. doi: 10.1186/s40854-023-00484-4. Epub 2023 May 1.
4
A multidimensional review of the cash management problem.现金管理问题的多维度审视。
Financ Innov. 2023;9(1):67. doi: 10.1186/s40854-023-00473-7. Epub 2023 Mar 15.
5
Is a correlation-based investment strategy beneficial for long-term international portfolio investors?基于相关性的投资策略对长期国际投资组合投资者有益吗?
Financ Innov. 2023;9(1):64. doi: 10.1186/s40854-023-00471-9. Epub 2023 Mar 9.
6
Understanding the adoption context of China's digital currency electronic payment.了解中国数字货币电子支付的应用背景。
Financ Innov. 2023;9(1):63. doi: 10.1186/s40854-023-00467-5. Epub 2023 Mar 2.
7
Stock profiling using time-frequency-varying systematic risk measure.使用时变系统风险度量的股票特征分析。
Financ Innov. 2023;9(1):52. doi: 10.1186/s40854-023-00457-7. Epub 2023 Feb 14.
8
Management disclosure of risk factors and COVID-19.管理层对风险因素及新冠疫情的披露。
Financ Innov. 2023;9(1):53. doi: 10.1186/s40854-023-00459-5. Epub 2023 Feb 17.
9
Did weekly economic index and volatility index impact US food sales during the first year of the pandemic?在疫情的第一年,每周经济指数和波动率指数对美国食品销售有影响吗?
Financ Innov. 2023;9(1):57. doi: 10.1186/s40854-023-00460-y. Epub 2023 Feb 10.
10
Empirical evidence on the ownership and liquidity of real estate tokens.关于房地产代币所有权和流动性的实证证据。
Financ Innov. 2023;9(1):45. doi: 10.1186/s40854-022-00427-5. Epub 2023 Jan 19.