Early classification of multivariate temporal observations by extraction of interpretable shapelets.

Affiliation

Center for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, USA.

Publication information

BMC Bioinformatics. 2012 Aug 8;13:195. doi: 10.1186/1471-2105-13-195.

DOI:10.1186/1471-2105-13-195
PMID:22873729
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3475011/
Abstract

BACKGROUND

Early classification of time series is beneficial for biomedical informatics problems including, but not limited to, disease change detection. Early classification can be of tremendous help by identifying the onset of a disease before it has time to fully take hold. In addition, extracting patterns from the original time series helps domain experts gain insight into the classification results. This problem has been studied recently using time series segments called shapelets. In this paper, we present a method, which we call Multivariate Shapelets Detection (MSD), that allows for early and patient-specific classification of multivariate time series. The method extracts time series patterns, called multivariate shapelets, from all dimensions of the time series that distinctly manifest the target class locally. Time series are then classified by searching for the earliest closest patterns.
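The idea of classifying by the earliest closest pattern can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each shapelet is stored as one segment per dimension together with a single distance threshold and a class label, and it uses a plain Euclidean distance summed over all dimensions. The names `window_distance` and `classify_early`, the threshold value, and the toy data are all hypothetical.

```python
import math

def window_distance(shapelet, series, start):
    """Euclidean distance between a multivariate shapelet and the equally
    long window of `series` that begins at `start`, over all dimensions."""
    total = 0.0
    for dim_shapelet, dim_series in zip(shapelet, series):
        for offset, value in enumerate(dim_shapelet):
            total += (value - dim_series[start + offset]) ** 2
    return math.sqrt(total)

def classify_early(series, shapelets):
    """Reveal the series one time point at a time. Return (label, points used)
    for the first time step at which some shapelet lies within its threshold
    of the newest window; the closest shapelet wins ties at that step.
    Return (None, full length) if no shapelet ever matches."""
    length = len(series[0])
    for t in range(1, length + 1):              # t = number of observed points
        best = None
        for pattern, threshold, label in shapelets:
            width = len(pattern[0])
            if width > t:                       # not enough data observed yet
                continue
            d = window_distance(pattern, series, t - width)
            if d <= threshold and (best is None or d < best[0]):
                best = (d, label)
        if best is not None:
            return best[1], t
    return None, length

# Toy example (invented values): 2 dimensions, 6 time points.
series = [[0, 0, 1, 3, 1, 0],
          [5, 5, 4, 2, 4, 5]]
# One hypothetical shapelet of width 2 with an assumed threshold of 0.5.
shapelets = [([[1, 3], [4, 2]], 0.5, "infected")]

label, used = classify_early(series, shapelets)
print(label, used, used / len(series[0]))
```

In this toy run the shapelet first matches once four of the six time points have been observed, so a label is produced without waiting for the rest of the series.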

RESULTS

The proposed early classification method for multivariate time series has been evaluated on eight gene expression datasets from viral infection and drug response studies in humans. In our experiments, the MSD method outperformed the baseline methods, achieving highly accurate classification by using as little as 40%-64% of the time series. The obtained results provide evidence that using conventional classification methods on short time series is not as accurate as using the proposed methods specialized for early classification.
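To make the two quantities in this paragraph concrete, the sketch below computes accuracy and earliness, where earliness is taken to be the mean fraction of each series that was observed before a label was issued. This definition, the function name `evaluate_early`, and the numbers in the example are assumptions for illustration only; they are not values or code from the paper.

```python
def evaluate_early(predicted, actual, points_used, series_lengths):
    """Return (accuracy, earliness) for a set of early classifications.

    accuracy  = share of series whose predicted label equals the true label
    earliness = mean fraction of each series observed before deciding
    """
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    accuracy = correct / len(actual)
    fractions = [used / n for used, n in zip(points_used, series_lengths)]
    earliness = sum(fractions) / len(fractions)
    return accuracy, earliness

# Invented example: three series, labels issued after 4, 5 and 8 points.
accuracy, earliness = evaluate_early(
    predicted=["infected", "healthy", "infected"],
    actual=["infected", "healthy", "healthy"],
    points_used=[4, 5, 8],
    series_lengths=[10, 10, 16],
)
print(f"accuracy={accuracy:.2f}, earliness={earliness:.2f}")
```

A lower earliness value means a decision was reached after seeing less of the series, which is the sense in which the abstract reports 40%-64%.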

CONCLUSION

For the early classification task, we proposed a method called Multivariate Shapelets Detection (MSD), which extracts patterns from all dimensions of the time series. We showed that the MSD method can classify the time series early by using as little as 40%-64% of the time series' length.
