• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NGS-物流:跨多个地点的 NGS 序列变体的联合分析。

NGS-Logistics: federated analysis of NGS sequence variants across multiple locations.

机构信息

KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Kasteelpark Arenberg 10, Box 2446, 3001 Leuven, Belgium ; iMinds Medical IT Department, Kasteelpark Arenberg 10, Box 2446, 3001 Leuven, Belgium.

KU Leuven, Center of Human Genetics Gasthuisberg, O&N I Herestraat 49, Box 602, 3000 Leuven, Belgium.

出版信息

Genome Med. 2014 Sep 17;6(9):71. doi: 10.1186/s13073-014-0071-9. eCollection 2014.

DOI:10.1186/s13073-014-0071-9
PMID:25328540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4198698/
Abstract

As many personal genomes are being sequenced, collaborative analysis of those genomes has become essential. However, analysis of personal genomic data raises important privacy and confidentiality issues. We propose a methodology for federated analysis of sequence variants from personal genomes. Specific base-pair positions and/or regions are queried for samples to which the user has access but also for the whole population. The statistics results do not breach data confidentiality but allow further exploration of the data; researchers can negotiate access to relevant samples through pseudonymous identifiers. This approach minimizes the impact on data confidentiality while enabling powerful data analysis by gaining access to important rare samples. Our methodology is implemented in an open source tool called NGS-Logistics, freely available at https://ngsl.esat.kuleuven.be.

摘要

随着越来越多的个人基因组被测序,对这些基因组进行协作分析已变得至关重要。然而,个人基因组数据的分析引发了重要的隐私和保密性问题。我们提出了一种用于对个人基因组序列变体进行联合分析的方法。针对用户可访问的样本以及整个群体,对特定的碱基对位置和/或区域进行查询。统计结果不会泄露数据机密性,但允许对数据进行进一步探索;研究人员可以通过匿名标识符协商访问相关样本。这种方法最大限度地减少了对数据保密性的影响,同时通过访问重要的稀有样本,实现了强大的数据分析。我们的方法在一个名为 NGS-Logistics 的开源工具中实现,可在 https://ngsl.esat.kuleuven.be 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/c4a4eceab50c/13073_2014_71_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/3c1e92eaeb00/13073_2014_71_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/4523815e78ba/13073_2014_71_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/0649b9f2d3a6/13073_2014_71_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/788ce27b5a30/13073_2014_71_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/0cb3a4031400/13073_2014_71_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/c4a4eceab50c/13073_2014_71_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/3c1e92eaeb00/13073_2014_71_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/4523815e78ba/13073_2014_71_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/0649b9f2d3a6/13073_2014_71_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/788ce27b5a30/13073_2014_71_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/0cb3a4031400/13073_2014_71_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a522/4198698/c4a4eceab50c/13073_2014_71_Fig6_HTML.jpg

相似文献

1
NGS-Logistics: federated analysis of NGS sequence variants across multiple locations.NGS-物流:跨多个地点的 NGS 序列变体的联合分析。
Genome Med. 2014 Sep 17;6(9):71. doi: 10.1186/s13073-014-0071-9. eCollection 2014.
2
Huvariome: a web server resource of whole genome next-generation sequencing allelic frequencies to aid in pathological candidate gene selection.Huvariome:一个用于辅助病理候选基因选择的全基因组下一代测序等位基因频率的网络服务器资源。
J Clin Bioinforma. 2012 Nov 19;2(1):19. doi: 10.1186/2043-9113-2-19.
3
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses.一种用于对小RNA病毒的完整基因组进行测序和分析的强大且经济高效的方法。
Virol J. 2017 Apr 7;14(1):72. doi: 10.1186/s12985-017-0741-5.
4
Balancing Accuracy and Privacy in Federated Queries of Clinical Data Repositories: Algorithm Development and Validation.联邦查询临床数据存储库中的准确性和隐私平衡:算法的开发和验证。
J Med Internet Res. 2020 Nov 3;22(11):e18735. doi: 10.2196/18735.
5
Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results.使用隐私保护联邦学习和域适应的多站点功能磁共振成像分析:ABIDE研究结果
Med Image Anal. 2020 Oct;65:101765. doi: 10.1016/j.media.2020.101765. Epub 2020 Jul 2.
6
GenomeChronicler: The Personal Genome Project UK Genomic Report Generator Pipeline.基因组编年史家:英国个人基因组计划基因组报告生成管道。
Front Genet. 2020 Sep 24;11:518644. doi: 10.3389/fgene.2020.518644. eCollection 2020.
7
NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation Sequencing (Illumina) Data.用于大规模新一代测序(Illumina)数据并行、自动化和快速质量控制分析的NGS-QCbox与树莓派
PLoS One. 2015 Oct 13;10(10):e0139868. doi: 10.1371/journal.pone.0139868. eCollection 2015.
8
TIARA: a database for accurate analysis of multiple personal genomes based on cross-technology.TIARA:一个基于跨技术对多个人类基因组进行精确分析的数据库。
Nucleic Acids Res. 2011 Jan;39(Database issue):D883-8. doi: 10.1093/nar/gkq1101. Epub 2010 Nov 4.
9
VCF-Server: A web-based visualization tool for high-throughput variant data mining and management.VCF-Server:一个基于网络的高通量变异数据挖掘和管理的可视化工具。
Mol Genet Genomic Med. 2019 Jul;7(7):e00641. doi: 10.1002/mgg3.641. Epub 2019 May 24.
10
MetaSeq: privacy preserving meta-analysis of sequencing-based association studies.MetaSeq:基于测序的关联研究的隐私保护元分析
Pac Symp Biocomput. 2013:356-67.

引用本文的文献

1
Leveraging real-world data to investigate multiple sclerosis disease behavior, prognosis, and treatment.利用真实世界数据研究多发性硬化症的疾病行为、预后和治疗。
Mult Scler. 2020 Jan;26(1):23-37. doi: 10.1177/1352458519892555. Epub 2019 Nov 28.
2
Pathogenic variants in CDC45 on the remaining allele in patients with a chromosome 22q11.2 deletion result in a novel autosomal recessive condition.患者染色体 22q11.2 缺失的另一条等位基因中存在 CDC45 的致病性变异,导致一种新的常染色体隐性疾病。
Genet Med. 2020 Feb;22(2):326-335. doi: 10.1038/s41436-019-0645-4. Epub 2019 Sep 2.
3
Real-World Evidence Gathering in Oncology: The Need for a Biomedical Big Data Insight-Providing Federated Network.

本文引用的文献

1
NCBI's Database of Genotypes and Phenotypes: dbGaP.NCBI 的基因型和表型数据库:dbGaP。
Nucleic Acids Res. 2014 Jan;42(Database issue):D975-9. doi: 10.1093/nar/gkt1211. Epub 2013 Dec 1.
2
The Genome of the Netherlands: design, and project goals.荷兰基因组计划:设计与项目目标。
Eur J Hum Genet. 2014 Feb;22(2):221-7. doi: 10.1038/ejhg.2013.118. Epub 2013 May 29.
3
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
肿瘤学中的真实世界证据收集:对提供生物医学大数据洞察的联邦网络的需求。
Front Med (Lausanne). 2019 Mar 8;6:43. doi: 10.3389/fmed.2019.00043. eCollection 2019.
4
Towards practical privacy-preserving genome-wide association study.迈向实用的隐私保护全基因组关联研究。
BMC Bioinformatics. 2018 Dec 20;19(1):537. doi: 10.1186/s12859-018-2541-3.
5
Novel CASK mutations in cases with syndromic microcephaly.综合征性小头畸形病例中的新型 CASK 突变。
Hum Mutat. 2018 Jul;39(7):993-1001. doi: 10.1002/humu.23536. Epub 2018 May 11.
6
Novel Compound Heterozygous Mutations Cause Severe Fetal Microcephaly and Centriolar Lengthening.新型复合杂合突变导致严重胎儿小头畸形和中心粒延长。
Mol Syndromol. 2017 Nov;8(6):282-293. doi: 10.1159/000479666. Epub 2017 Sep 27.
7
Tracing the origin of disseminated tumor cells in breast cancer using single-cell sequencing.利用单细胞测序追踪乳腺癌中播散肿瘤细胞的起源
Genome Biol. 2016 Dec 9;17(1):250. doi: 10.1186/s13059-016-1109-7.
8
Identification of Intellectual Disability Genes in Female Patients with a Skewed X-Inactivation Pattern.具有偏态X染色体失活模式的女性患者中智力残疾基因的鉴定。
Hum Mutat. 2016 Aug;37(8):804-11. doi: 10.1002/humu.23012. Epub 2016 May 25.
9
MutAid: Sanger and NGS Based Integrated Pipeline for Mutation Identification, Validation and Annotation in Human Molecular Genetics.MutAid:基于桑格测序法和新一代测序技术的综合流程,用于人类分子遗传学中的突变鉴定、验证及注释
PLoS One. 2016 Feb 3;11(2):e0147697. doi: 10.1371/journal.pone.0147697. eCollection 2016.
10
ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research.ClinQC:临床研究中用于Sanger测序和二代测序(NGS)数据质量控制与清理的工具
BMC Bioinformatics. 2016 Feb 2;17:56. doi: 10.1186/s12859-016-0915-y.
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
4
Heterozygous missense mutations in SMARCA2 cause Nicolaides-Baraitser syndrome.SMARCA2 中的杂合错义突变导致 Nicolaides-Baraitser 综合征。
Nat Genet. 2012 Feb 26;44(4):445-9, S1. doi: 10.1038/ng.1105.
5
The variant call format and VCFtools.变异调用格式和 VCFtools。
Bioinformatics. 2011 Aug 1;27(15):2156-8. doi: 10.1093/bioinformatics/btr330. Epub 2011 Jun 7.
6
On the future of genomic data.论基因组数据的未来。
Science. 2011 Feb 11;331(6018):728-9. doi: 10.1126/science.1197891.
7
Human genome: Genomes by the thousand.人类基因组:数以千计的基因组。
Nature. 2010 Oct 28;467(7319):1026-7. doi: 10.1038/4671026a.
8
The UCSC Genome Browser database: update 2011.加州大学圣克鲁兹分校基因组浏览器数据库:2011年更新
Nucleic Acids Res. 2011 Jan;39(Database issue):D876-82. doi: 10.1093/nar/gkq963. Epub 2010 Oct 18.
9
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
10
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.