• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HySec-Flow:基于SGX的大数据分析框架实现隐私保护的基因组计算

HySec-Flow: Privacy-Preserving Genomic Computing with SGX-based Big-Data Analytics Framework.

作者信息

Widanage Chathura, Liu Weijie, Li Jiayu, Chen Hongbo, Wang XiaoFeng, Tang Haixu, Fox Judy

机构信息

Indiana University.

University of Virginia.

出版信息

IEEE Int Conf Cloud Comput. 2021 Sep;2021:733-743. doi: 10.1109/CLOUD53861.2021.00098. Epub 2021 Nov 13.

DOI:10.1109/CLOUD53861.2021.00098
PMID:35662807
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9165173/
Abstract

Trusted execution environments (TEE) such as Intel's Software Guard Extension (SGX) have been widely studied to boost security and privacy protection for the computation of sensitive data such as human genomics. However, a performance hurdle is often generated by SGX, especially from the small enclave memory. In this paper, we propose a new Hybrid Secured Flow framework (called "HySec-Flow") for large-scale genomic data analysis using SGX platforms. Here, the data-intensive computing tasks can be partitioned into independent subtasks to be deployed into distinct secured and non-secured containers, therefore allowing for parallel execution while alleviating the limited size of Page Cache (EPC) memory in each enclave. We illustrate our contributions using a workflow supporting indexing, alignment, dispatching, and merging the execution of SGX- enabled containers. We provide details regarding the architecture of the trusted and untrusted components and the underlying Scorn and Graphene support as generic shielding execution frameworks to port legacy code. We thoroughly evaluate the performance of our privacy-preserving reads mapping algorithm using real human genome sequencing data. The results demonstrate that the performance is enhanced by partitioning the time-consuming genomic computation into subtasks compared to the conventional execution of the data-intensive reads mapping algorithm in an enclave. The proposed HySec-Flow framework is made available as an open-source and adapted to the data-parallel computation of other large-scale genomic tasks requiring security and scalable computational resources.

摘要

诸如英特尔软件防护扩展(SGX)之类的可信执行环境(TEE)已得到广泛研究,以增强对人类基因组学等敏感数据计算的安全性和隐私保护。然而,SGX常常会带来性能障碍,尤其是来自小的飞地内存。在本文中,我们提出了一种新的混合安全流框架(称为“HySec-Flow”),用于使用SGX平台进行大规模基因组数据分析。在这里,数据密集型计算任务可以被划分为独立的子任务,以部署到不同的安全和非安全容器中,从而允许并行执行,同时缓解每个飞地中页面缓存(EPC)内存有限的问题。我们使用一个支持索引、比对、调度和合并启用SGX的容器执行的工作流程来说明我们的贡献。我们提供了关于可信和不可信组件的架构以及底层Scorn和Graphene支持的详细信息,作为移植遗留代码的通用屏蔽执行框架。我们使用真实的人类基因组测序数据全面评估了我们的隐私保护读段比对算法的性能。结果表明,与在飞地中传统执行数据密集型读段比对算法相比,将耗时的基因组计算划分为子任务可提高性能。所提出的HySec-Flow框架作为开源框架提供,并适用于其他需要安全和可扩展计算资源的大规模基因组任务的数据并行计算。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/b26aebba0f97/nihms-1804039-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/a685eeabd696/nihms-1804039-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/d63d0177cd5f/nihms-1804039-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/ba8a38d3e826/nihms-1804039-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/c54dae51c8a7/nihms-1804039-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/21b9bd7d8b8b/nihms-1804039-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/3d221bbec034/nihms-1804039-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/8ad42d5ba302/nihms-1804039-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/59e5d7fb6caf/nihms-1804039-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/050793f55aca/nihms-1804039-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/b26aebba0f97/nihms-1804039-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/a685eeabd696/nihms-1804039-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/d63d0177cd5f/nihms-1804039-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/ba8a38d3e826/nihms-1804039-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/c54dae51c8a7/nihms-1804039-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/21b9bd7d8b8b/nihms-1804039-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/3d221bbec034/nihms-1804039-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/8ad42d5ba302/nihms-1804039-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/59e5d7fb6caf/nihms-1804039-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/050793f55aca/nihms-1804039-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0178/9165173/b26aebba0f97/nihms-1804039-f0012.jpg

相似文献

1
HySec-Flow: Privacy-Preserving Genomic Computing with SGX-based Big-Data Analytics Framework.HySec-Flow:基于SGX的大数据分析框架实现隐私保护的基因组计算
IEEE Int Conf Cloud Comput. 2021 Sep;2021:733-743. doi: 10.1109/CLOUD53861.2021.00098. Epub 2021 Nov 13.
2
Practical and Efficient in-Enclave Verification of Privacy Compliance.实用且高效的飞地隐私合规性验证
Proc (Int Conf Dependable Syst Netw). 2021 Jun;2021:413-425. doi: 10.1109/dsn48987.2021.00052. Epub 2021 Aug 6.
3
Privacy-preserving genotype imputation in a trusted execution environment.在可信执行环境中进行隐私保护的基因型推断。
Cell Syst. 2021 Oct 20;12(10):983-993.e7. doi: 10.1016/j.cels.2021.08.001. Epub 2021 Aug 26.
4
Sketching algorithms for genomic data analysis and querying in a secure enclave.在安全飞地中进行基因组数据分析和查询的草图算法。
Nat Methods. 2020 Mar;17(3):295-301. doi: 10.1038/s41592-020-0761-8. Epub 2020 Mar 4.
5
PPPCT: Privacy-Preserving framework for Parallel Clustering Transcriptomics data.PPPCT:用于平行聚类转录组学数据的隐私保护框架。
Comput Biol Med. 2024 May;173:108351. doi: 10.1016/j.compbiomed.2024.108351. Epub 2024 Mar 21.
6
PREMIX: PRivacy-preserving EstiMation of Individual admiXture.预混:个体混合比例的隐私保护估计
AMIA Annu Symp Proc. 2017 Feb 10;2016:1747-1755. eCollection 2016.
7
PRESAGE: PRivacy-preserving gEnetic testing via SoftwAre Guard Extension.PRESAGE:通过软件防护扩展实现隐私保护的基因检测
BMC Med Genomics. 2017 Jul 26;10(Suppl 2):48. doi: 10.1186/s12920-017-0281-2.
8
Trust Beyond Border: Lightweight, Verifiable User Isolation for Protecting In-Enclave Services.超越边界的信任:用于保护飞地内服务的轻量级、可验证的用户隔离
IEEE Trans Dependable Secure Comput. 2023 Jan-Feb;20(1):522-538. doi: 10.1109/tdsc.2021.3138427. Epub 2021 Dec 28.
9
Leaky Cauldron on the Dark Land: Understanding Memory Side-Channel Hazards in SGX.黑暗之地的漏锅:理解SGX中的内存侧信道风险
Conf Comput Commun Secur. 2017 Oct-Nov;2017:2421-2434. doi: 10.1145/3133956.3134038.
10
eTPM: A Trusted Cloud Platform Enclave TPM Scheme Based on Intel SGX Technology.eTPM:一种基于 Intel SGX 技术的可信云平台 Enclave TPM 方案。
Sensors (Basel). 2018 Nov 6;18(11):3807. doi: 10.3390/s18113807.

引用本文的文献

1
Learning-augmented sketching offers improved performance for privacy preserving and secure GWAS.学习增强的草图绘制为隐私保护和安全的全基因组关联研究提供了更好的性能。
iScience. 2025 Feb 13;28(3):112011. doi: 10.1016/j.isci.2025.112011. eCollection 2025 Mar 21.
2
Learning-Augmented Sketching Offers Improved Performance for Privacy Preserving and Secure GWAS.学习增强的草图绘制为隐私保护和安全的全基因组关联研究提供了更好的性能。
bioRxiv. 2024 Sep 24:2024.09.19.613975. doi: 10.1101/2024.09.19.613975.
3
Privacy-Enhancing Technologies in Biomedical Data Science.

本文引用的文献

1
Sketching algorithms for genomic data analysis and querying in a secure enclave.在安全飞地中进行基因组数据分析和查询的草图算法。
Nat Methods. 2020 Mar;17(3):295-301. doi: 10.1038/s41592-020-0761-8. Epub 2020 Mar 4.
2
Benefits and limitations of genome-wide association studies.全基因组关联研究的优势和局限性。
Nat Rev Genet. 2019 Aug;20(8):467-484. doi: 10.1038/s41576-019-0127-1.
3
Leaky Cauldron on the Dark Land: Understanding Memory Side-Channel Hazards in SGX.黑暗之地的漏锅:理解SGX中的内存侧信道风险
生物医学数据科学中的隐私增强技术。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):317-343. doi: 10.1146/annurev-biodatasci-120423-120107.
4
Genomic privacy preservation in genome-wide association studies: taxonomy, limitations, challenges, and vision.全基因组关联研究中的基因组隐私保护:分类法、局限性、挑战和展望。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae356.
5
The evolving privacy and security concerns for genomic data analysis and sharing as observed from the iDASH competition.从 iDASH 竞赛中观察到的基因组数据分析和共享的不断发展的隐私和安全问题。
J Am Med Inform Assoc. 2022 Nov 14;29(12):2182-2190. doi: 10.1093/jamia/ocac165.
Conf Comput Commun Secur. 2017 Oct-Nov;2017:2421-2434. doi: 10.1145/3133956.3134038.
4
Secure top most significant genome variants search: iDASH 2017 competition.安全搜索最重要的基因组变异:iDASH 2017 竞赛。
BMC Med Genomics. 2018 Oct 11;11(Suppl 4):82. doi: 10.1186/s12920-018-0399-x.
5
PRESAGE: PRivacy-preserving gEnetic testing via SoftwAre Guard Extension.PRESAGE:通过软件防护扩展实现隐私保护的基因检测
BMC Med Genomics. 2017 Jul 26;10(Suppl 2):48. doi: 10.1186/s12920-017-0281-2.
6
PRINCESS: Privacy-protecting Rare disease International Network Collaboration via Encryption through Software guard extensionS.公主:通过软件保护扩展进行加密的隐私保护罕见病国际网络协作。
Bioinformatics. 2017 Mar 15;33(6):871-878. doi: 10.1093/bioinformatics/btw758.
7
Global gene expression analysis using RNA-seq uncovered a new role for SR1/CAMTA3 transcription factor in salt stress.使用RNA测序进行的全基因组基因表达分析揭示了SR1/CAMTA3转录因子在盐胁迫中的新作用。
Sci Rep. 2016 Jun 2;6:27021. doi: 10.1038/srep27021.
8
DIDA: Distributed Indexing Dispatched Alignment.DIDA:分布式索引调度对齐
PLoS One. 2015 Apr 29;10(4):e0126409. doi: 10.1371/journal.pone.0126409. eCollection 2015.
9
Identifying personal genomes by surname inference.姓氏推断识别个人基因组。
Science. 2013 Jan 18;339(6117):321-4. doi: 10.1126/science.1229566.
10
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.