PriLive: privacy-preserving real-time filtering for next-generation sequencing.

Affiliations

Bioinformatics Division (MF 1), Department for Methods Development and Research Infrastructure.

Centre for Biological Threats and Special Pathogens: Highly Pathogenic Viruses (ZBS 1).

Publication information

Bioinformatics. 2018 Jul 15;34(14):2376-2383. doi: 10.1093/bioinformatics/bty128.

PMID: 29522157

Abstract

MOTIVATION

In next-generation sequencing, re-identification of individuals and other privacy-breaching strategies can be applied even for anonymized data. This also holds true for applications in which human DNA is acquired as a by-product, e.g. for viral or metagenomic samples from a human host. Conventional data protection strategies including cryptography and post-hoc filtering are only appropriate for the final and processed sequencing data. This can result in an insufficient level of data protection and a considerable time delay in the further analysis workflow.

RESULTS

We present PriLive, a novel tool for the automated removal of sensitive data while the sequencing machine is running. Thereby, human sequence information can be detected and removed before being completely produced. This facilitates the compliance with strict data protection regulations. The unique characteristic to cause almost no time delay for further analyses is also a clear benefit for applications other than data protection. Especially if the sequencing data are dominated by known background signals, PriLive considerably accelerates consequent analyses by having only fractions of input data. Besides these conceptual advantages, PriLive achieves filtering results at least as accurate as conventional post-hoc filtering tools.
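The idea of discarding background reads before they are completely produced can be illustrated with a toy sketch. This is not PriLive's algorithm, which the abstract does not describe in detail; it is a simplified stand-in that assumes exact k-mer matching against a background reference. All names here (`build_kmer_set`, `filter_while_sequencing`, `min_hits`) and the example sequences are hypothetical.

```python
from typing import Dict, List, Set


def build_kmer_set(reference: str, k: int) -> Set[str]:
    """Collect every k-mer of the background (e.g. human) reference."""
    return {reference[i:i + k] for i in range(len(reference) - k + 1)}


def filter_while_sequencing(cycles: List[str], background: Set[str],
                            k: int, min_hits: int) -> Dict[int, str]:
    """Consume one base call per read per cycle and drop flagged reads early.

    cycles[c][r] is the base called for read r in cycle c.
    Returns the surviving reads keyed by read index.
    """
    n_reads = len(cycles[0])
    prefixes = {r: "" for r in range(n_reads)}   # bases kept so far
    hits = {r: 0 for r in range(n_reads)}        # background k-mer matches
    for calls in cycles:
        for r in list(prefixes):                 # only reads still retained
            prefixes[r] += calls[r]
            # Check the k-mer that ends at the newest base (toy assumption).
            if len(prefixes[r]) >= k and prefixes[r][-k:] in background:
                hits[r] += 1
            if hits[r] >= min_hits:
                del prefixes[r]                  # discard before the read completes
    return prefixes


# Toy data: three reads of length 6, transposed into per-cycle base calls.
background = build_kmer_set("ACGTACGTTGCA", k=4)
reads = ["ACGTAC", "GGGGGG", "TTGCAT"]
cycles = ["".join(read[c] for read in reads) for c in range(6)]
kept = filter_while_sequencing(cycles, background, k=4, min_hits=2)
```

In this toy run the first and third reads are dropped at the fifth cycle, so their final base is never stored, and only the second read is retained. A real tool would use alignment that tolerates mismatches rather than exact k-mer lookup.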

AVAILABILITY AND IMPLEMENTATION

PriLive is open-source software available at https://gitlab.com/rki_bioinformatics/PriLive.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

1
PriLive: privacy-preserving real-time filtering for next-generation sequencing.
Bioinformatics. 2018 Jul 15;34(14):2376-2383. doi: 10.1093/bioinformatics/bty128.
2
HiLive: real-time mapping of illumina reads while sequencing.
Bioinformatics. 2017 Mar 15;33(6):917-919. doi: 10.1093/bioinformatics/btw659.
3
FaStore: a space-saving solution for raw sequencing data.
Bioinformatics. 2018 Aug 15;34(16):2748-2756. doi: 10.1093/bioinformatics/bty205.
4
ANAQUIN: a software toolkit for the analysis of spike-in controls for next generation sequencing.
Bioinformatics. 2017 Jun 1;33(11):1723-1724. doi: 10.1093/bioinformatics/btx038.
5
E2FM: an encrypted and compressed full-text index for collections of genomic sequences.
Bioinformatics. 2017 Sep 15;33(18):2808-2817. doi: 10.1093/bioinformatics/btx313.
6
PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples.
Bioinformatics. 2018 Sep 1;34(17):i715-i721. doi: 10.1093/bioinformatics/bty595.
7
ViraPipe: scalable parallel pipeline for viral metagenome analysis from next generation sequencing reads.
Bioinformatics. 2018 Mar 15;34(6):928-935. doi: 10.1093/bioinformatics/btx702.
8
LiveKraken--real-time metagenomic classification of illumina data.
Bioinformatics. 2018 Nov 1;34(21):3750-3752. doi: 10.1093/bioinformatics/bty433.
9
A space and time-efficient index for the compacted colored de Bruijn graph.
Bioinformatics. 2018 Jul 1;34(13):i169-i177. doi: 10.1093/bioinformatics/bty292.
10
Simulating the dynamics of targeted capture sequencing with CapSim.
Bioinformatics. 2018 Mar 1;34(5):873-874. doi: 10.1093/bioinformatics/btx691.

Cited by

1
Evaluation of WGS performance for bacterial pathogen characterization with the Illumina technology optimized for time-critical situations.
Microb Genom. 2021 Nov;7(11). doi: 10.1099/mgen.0.000699.