• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有隐私保护的多源选择迁移学习

Multi-Source Selection Transfer Learning with Privacy-Preserving.

作者信息

Wu Weifei

机构信息

Beijing Institute of Remote Sensing Equipment, Beijing, 100084 People's Republic of China.

出版信息

Neural Process Lett. 2022;54(6):4921-4950. doi: 10.1007/s11063-022-10841-6. Epub 2022 May 7.

DOI:10.1007/s11063-022-10841-6
PMID:35573261
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9077647/
Abstract

Transfer learning has ability to create learning task of weakly labeled or unlabeled target domain by using knowledge of source domain to help, which can effectively improve the performance of target learning task. At present, the increased awareness of privacy protection restricts access to data sources and poses new challenges to the development of transfer learning. However, the research on privacy protection in transfer learning is very rare. The existing work mainly uses differential privacy technology and does not consider the distribution difference between data sources, or does not consider the conditional probability distribution of data, which causes negative transfer to harm the effect of algorithm. Therefore, this paper proposes multi-source selection transfer learning algorithm with privacy-preserving MultiSTLP, which is used in scenarios where target domain contains unlabeled data sets with only a small amount of group probability information and multiple source domains with a large number of labeled data sets. Group probability means that the class label of each sample in target data set is unknown, but the probability of each class in a given data group is available, and multiple source domains indicate that there are more than two source domains. The number of data set contains more than two data sets of source domain and one data set of target domain. The algorithm adapts to the marginal probability distribution and conditional probability distribution differences between domains, and can protect the privacy of target data and improve classification accuracy by fusing the idea of multi-source transfer learning and group probability into support vector machine. At the same time, it can select the representative dataset in source domains to improve efficiency relied on speeding up the training process of algorithm. Experimental results on several real datasets show the effectiveness of MultiSTLP, and it also has some advantages compared with the state-of-the-art transfer learning algorithm.

摘要

迁移学习能够利用源域的知识来帮助创建弱标记或未标记目标域的学习任务,从而有效提高目标学习任务的性能。目前,隐私保护意识的增强限制了对数据源的访问,给迁移学习的发展带来了新的挑战。然而,关于迁移学习中隐私保护的研究非常少见。现有的工作主要使用差分隐私技术,没有考虑数据源之间的分布差异,或者没有考虑数据的条件概率分布,这会导致负迁移损害算法的效果。因此,本文提出了具有隐私保护的多源选择迁移学习算法MultiSTLP,该算法用于目标域包含只有少量组概率信息的未标记数据集以及多个带有大量标记数据集的源域的场景。组概率是指目标数据集中每个样本的类别标签未知,但给定数据组中每个类别的概率是已知的,多个源域表示有两个以上的源域。数据集的数量包含两个以上的源域数据集和一个目标域数据集。该算法适应域之间的边际概率分布和条件概率分布差异,并且通过将多源迁移学习和组概率的思想融合到支持向量机中,可以保护目标数据的隐私并提高分类准确率。同时,它可以在源域中选择有代表性的数据集,依靠加快算法的训练过程来提高效率。在几个真实数据集上的实验结果表明了MultiSTLP的有效性,并且与当前最先进的迁移学习算法相比也具有一些优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/21f44b904d38/11063_2022_10841_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/52657e7b944e/11063_2022_10841_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/7220cbd94e97/11063_2022_10841_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/38bd545cff1a/11063_2022_10841_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/5340635f33c8/11063_2022_10841_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/b884a145872b/11063_2022_10841_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/21f44b904d38/11063_2022_10841_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/52657e7b944e/11063_2022_10841_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/7220cbd94e97/11063_2022_10841_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/38bd545cff1a/11063_2022_10841_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/5340635f33c8/11063_2022_10841_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/b884a145872b/11063_2022_10841_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a846/9077647/21f44b904d38/11063_2022_10841_Fig6_HTML.jpg

相似文献

1
Multi-Source Selection Transfer Learning with Privacy-Preserving.具有隐私保护的多源选择迁移学习
Neural Process Lett. 2022;54(6):4921-4950. doi: 10.1007/s11063-022-10841-6. Epub 2022 May 7.
2
Multi-source fast transfer learning algorithm based on support vector machine.基于支持向量机的多源快速迁移学习算法
Appl Intell (Dordr). 2021;51(11):8451-8465. doi: 10.1007/s10489-021-02194-9. Epub 2021 Apr 6.
3
Multi-Source Deep Transfer Neural Network Algorithm.多源深度迁移神经网络算法。
Sensors (Basel). 2019 Sep 16;19(18):3992. doi: 10.3390/s19183992.
4
Unsupervised multi-source domain adaptation with no observable source data.无可见源数据的无监督多源域自适应。
PLoS One. 2021 Jul 9;16(7):e0253415. doi: 10.1371/journal.pone.0253415. eCollection 2021.
5
Multi-Source Decentralized Transfer for Privacy-Preserving BCIs.多源去中心化迁移的隐私保护脑机接口。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:2710-2720. doi: 10.1109/TNSRE.2022.3207494. Epub 2022 Sep 26.
6
Source-Free Domain Adaptation (SFDA) for Privacy-Preserving Seizure Subtype Classification.源自由领域自适应 (SFDA) 用于保护隐私的癫痫发作亚型分类。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:2315-2325. doi: 10.1109/TNSRE.2023.3274563. Epub 2023 May 19.
7
A transfer learning model with multi-source domains for biomedical event trigger extraction.一种用于生物医学事件触发词提取的多源域迁移学习模型。
BMC Genomics. 2021 Jan 7;22(1):31. doi: 10.1186/s12864-020-07315-1.
8
Sample Complexity Bounds for Differentially Private Learning.差分隐私学习的样本复杂度界
JMLR Workshop Conf Proc. 2011;2011:155-186.
9
Demonstrating the successful application of synthetic learning in spine surgery for training multi-center models with increased patient privacy.展示了合成学习在脊柱外科中的成功应用,该方法用于训练具有更高患者隐私保护的多中心模型。
Sci Rep. 2023 Aug 1;13(1):12481. doi: 10.1038/s41598-023-39458-y.
10
Incomplete Multisource Transfer Learning.不完全多源迁移学习。
IEEE Trans Neural Netw Learn Syst. 2018 Feb;29(2):310-323. doi: 10.1109/TNNLS.2016.2618765. Epub 2016 Nov 14.

本文引用的文献

1
Multi-source fast transfer learning algorithm based on support vector machine.基于支持向量机的多源快速迁移学习算法
Appl Intell (Dordr). 2021;51(11):8451-8465. doi: 10.1007/s10489-021-02194-9. Epub 2021 Apr 6.
2
Unsupervised multi-sense language models for natural language processing tasks.用于自然语言处理任务的无监督多义语言模型。
Neural Netw. 2021 Oct;142:397-409. doi: 10.1016/j.neunet.2021.05.023. Epub 2021 May 25.
3
Transfer learning for establishment of recognition of COVID-19 on CT imaging using small-sized training datasets.
使用小型训练数据集进行迁移学习以建立基于CT成像的新型冠状病毒肺炎识别模型
Knowl Based Syst. 2021 Apr 22;218:106849. doi: 10.1016/j.knosys.2021.106849. Epub 2021 Feb 6.
4
Multi-Source Deep Transfer Neural Network Algorithm.多源深度迁移神经网络算法。
Sensors (Basel). 2019 Sep 16;19(18):3992. doi: 10.3390/s19183992.
5
Feature Selection with Annealing for Computer Vision and Big Data Learning.基于退火的计算机视觉和大数据学习特征选择。
IEEE Trans Pattern Anal Mach Intell. 2017 Feb;39(2):272-286. doi: 10.1109/TPAMI.2016.2544315. Epub 2016 Mar 21.
6
Machine learning: Trends, perspectives, and prospects.机器学习:趋势、观点和展望。
Science. 2015 Jul 17;349(6245):255-60. doi: 10.1126/science.aaa8415.