• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SeeCiTe:一种利用三联体数据评估单核苷酸多态性阵列中拷贝数变异检测结果的方法。

SeeCiTe: a method to assess CNV calls from SNP arrays using trio data.

作者信息

Lavrichenko Ksenia, Helgeland Øyvind, Njølstad Pål R, Jonassen Inge, Johansson Stefan

机构信息

Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway.

Department of Clinical Science, University of Bergen, Bergen, Norway.

出版信息

Bioinformatics. 2021 Jul 27;37(13):1876-1883. doi: 10.1093/bioinformatics/btab028.

DOI:10.1093/bioinformatics/btab028
PMID:33459766
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8317106/
Abstract

MOTIVATION

Single nucleotide polymorphism (SNP) genotyping arrays remain an attractive platform for assaying copy number variants (CNVs) in large population-wide cohorts. However, current tools for calling CNVs are still prone to extensive false positive calls when applied to biobank scale arrays. Moreover, there is a lack of methods exploiting cohorts with trios available (e.g. nuclear family) to assist in quality control and downstream analyses following the calling.

RESULTS

We developed SeeCiTe (Seeing CNVs in Trios), a novel CNV-quality control tool that postprocesses output from current CNV-calling tools exploiting child-parent trio data to classify calls in quality categories and provide a set of visualizations for each putative CNV call in the offspring. We apply it to the Norwegian Mother, Father and Child Cohort Study (MoBa) and show that SeeCiTe improves the specificity and sensitivity compared to the common empiric filtering strategies. To our knowledge, it is the first tool that utilizes probe-level CNV data in trios (and singletons) to systematically highlight potential artifacts and visualize signal intensities in a streamlined fashion suitable for biobank scale studies.

AVAILABILITY AND IMPLEMENTATION

The software is implemented in R with the source code freely available at https://github.com/aksenia/SeeCiTe.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

单核苷酸多态性(SNP)基因分型阵列仍然是在大规模人群队列中检测拷贝数变异(CNV)的一个有吸引力的平台。然而,当前用于检测CNV的工具在应用于生物样本库规模的阵列时,仍然容易产生大量的假阳性结果。此外,缺乏利用包含三人组(如核心家庭)的队列来协助呼叫后的质量控制和下游分析的方法。

结果

我们开发了SeeCiTe(在三人组中检测CNV),这是一种新颖的CNV质量控制工具,它对当前CNV呼叫工具的输出进行后处理,利用子-父三人组数据将呼叫分类到质量类别中,并为后代中的每个假定CNV呼叫提供一组可视化。我们将其应用于挪威母婴队列研究(MoBa),结果表明与常见的经验性过滤策略相比,SeeCiTe提高了特异性和敏感性。据我们所知,它是第一个利用三人组(和单例)中的探针级CNV数据来系统地突出潜在伪影并以适合生物样本库规模研究的简化方式可视化信号强度的工具。

可用性和实现方式

该软件用R实现,源代码可在https://github.com/aksenia/SeeCiTe上免费获取。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/88d5303b7b8f/btab028f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/f14a711d2dc3/btab028f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/ee0a6e653ca3/btab028f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/5e684716948d/btab028f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/88d5303b7b8f/btab028f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/f14a711d2dc3/btab028f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/ee0a6e653ca3/btab028f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/5e684716948d/btab028f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4976/8317106/88d5303b7b8f/btab028f4.jpg

相似文献

1
SeeCiTe: a method to assess CNV calls from SNP arrays using trio data.SeeCiTe:一种利用三联体数据评估单核苷酸多态性阵列中拷贝数变异检测结果的方法。
Bioinformatics. 2021 Jul 27;37(13):1876-1883. doi: 10.1093/bioinformatics/btab028.
2
Fast detection of de novo copy number variants from SNP arrays for case-parent trios.基于 SNP 芯片的先证者-父母三体型检测新发拷贝数变异的快速方法。
BMC Bioinformatics. 2012 Dec 12;13:330. doi: 10.1186/1471-2105-13-330.
3
Joint detection of copy number variations in parent-offspring trios.亲子三联体中拷贝数变异的联合检测。
Bioinformatics. 2016 Apr 15;32(8):1130-7. doi: 10.1093/bioinformatics/btv707. Epub 2015 Dec 7.
4
New quality measure for SNP array based CNV detection.基于 SNP 芯片的 CNV 检测的新质量度量。
Bioinformatics. 2016 Nov 1;32(21):3298-3305. doi: 10.1093/bioinformatics/btw477. Epub 2016 Jul 10.
5
Rare CNVs in Suicide Attempt include Schizophrenia-Associated Loci and Neurodevelopmental Genes: A Pilot Genome-Wide and Family-Based Study.自杀未遂中的罕见拷贝数变异包括精神分裂症相关基因座和神经发育基因:一项全基因组和基于家系的初步研究。
PLoS One. 2016 Dec 28;11(12):e0168531. doi: 10.1371/journal.pone.0168531. eCollection 2016.
6
Using family data as a verification standard to evaluate copy number variation calling strategies for genetic association studies.利用家系数据作为验证标准,评估遗传关联研究中拷贝数变异 calling 策略。
Genet Epidemiol. 2012 Apr;36(3):253-62. doi: 10.1002/gepi.21618.
7
Identification of Copy Number Variants from SNP Arrays Using PennCNV.使用PennCNV从SNP阵列中鉴定拷贝数变异
Methods Mol Biol. 2018;1833:1-28. doi: 10.1007/978-1-4939-8666-8_1.
8
CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data.CLAMMS:一种用于从外显子组测序数据中检测常见和罕见拷贝数变异的可扩展算法。
Bioinformatics. 2016 Jan 1;32(1):133-5. doi: 10.1093/bioinformatics/btv547. Epub 2015 Sep 17.
9
Accurate and Effective Detection of Recurrent Copy Number Variants in Large SNP Genotype Datasets.准确有效地检测大型 SNP 基因型数据中的复发性拷贝数变异。
Curr Protoc. 2022 Dec;2(12):e621. doi: 10.1002/cpz1.621.
10
CNVfilteR: an R/Bioconductor package to identify false positives produced by germline NGS CNV detection tools.CNVfilteR:一个用于识别由种系 NGS CNV 检测工具产生的假阳性的 R/Bioconductor 包。
Bioinformatics. 2021 Nov 18;37(22):4227-4229. doi: 10.1093/bioinformatics/btab356.

引用本文的文献

1
MarkerMatch: A Proximity-Based Probe-Matching Algorithm for Joint Analysis of Copy-Number Variants from Different Genotyping Arrays.MarkerMatch:一种基于邻近性的探针匹配算法,用于联合分析来自不同基因分型阵列的拷贝数变异
bioRxiv. 2025 Jul 4:2025.06.30.662249. doi: 10.1101/2025.06.30.662249.
2
Rare copy number variant analysis in case-control studies using snp array data: a scalable and automated data analysis pipeline.基于 SNP 芯片数据的病例对照研究中的罕见拷贝数变异分析:一种可扩展和自动化的数据分析流程。
BMC Bioinformatics. 2024 Nov 15;25(1):357. doi: 10.1186/s12859-024-05979-0.
3
Genomic analysis of the rare British Lop pig and identification of distinctive genomic markers.
英国稀有长白猪的基因组分析及独特基因组标记的鉴定。
PLoS One. 2022 Aug 12;17(8):e0271053. doi: 10.1371/journal.pone.0271053. eCollection 2022.
4
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.充分利用 SNP 阵列:提取潜在基因组结构的工具的系统评价。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac043.
5
Comprehensive characterization of copy number variation (CNV) called from array, long- and short-read data.综合分析来自于芯片、长读和短读测序数据的拷贝数变异(CNV)。
BMC Genomics. 2021 Nov 17;22(1):826. doi: 10.1186/s12864-021-08082-3.