
StrandScript: evaluation of Illumina genotyping array design and strand correction.

Affiliations

Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville TN, USA 37232.

Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville TN, USA, 37232.

Publication information

Bioinformatics. 2017 Aug 1;33(15):2399-2401. doi: 10.1093/bioinformatics/btx186.

DOI:10.1093/bioinformatics/btx186
PMID:28402386
Abstract

SUMMARY

After the introduction of high-throughput sequencing, genotyping arrays continue to be a viable source for conducting large-scale genetic studies. Currently, Illumina is one of the largest genotyping array manufacturers. One technical issue that has always plagued the post-processing of Illumina genotyping array data is the strand definition. Against convention, Illumina uses their own definition of strand, which is inconsistent with the standard reference forward and reverse definition. This issue has been a major obstacle in the consistency of reporting, meta-analysis and correct interpretation of phenotype association results. To date, the strand issue has not been adequately addressed, prompting us to develop StrandScript, a tool that can convert all genotyping data generated from Illumina genotyping arrays to the reference forward strand. StrandScript works independently of the Illumina array version and is future proof for newer Illumina array designs. Furthermore, StrandScript can examine an Illumina genotyping array manifest file and can detect all problematic SNPs, including SNPs with wrong RS ID and SNPs with mismatched probe sequences. Here, we introduce StrandScript's design and development, and demonstrate its effectiveness using real genotyping data.
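The abstract does not spell out how the conversion is done, so the following is only a minimal sketch of the general idea behind strand correction, not StrandScript's actual implementation. It assumes that a probe's orientation can be inferred by matching the probe sequence (or its reverse complement) against a reference forward-strand window, and that a probe matching neither is flagged as mismatched. All function names (`reverse_complement`, `infer_probe_orientation`, `to_forward_strand`) are hypothetical.

```python
# Illustrative sketch only; names and logic are assumptions, not StrandScript code.
from typing import Optional, Tuple

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence made of A/C/G/T."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def infer_probe_orientation(probe: str, ref_forward_window: str) -> Optional[bool]:
    """Guess the probe's strand relative to the reference.

    Returns True if the probe matches the forward-strand window, False if
    only its reverse complement matches, and None if neither matches
    (a candidate "mismatched probe sequence").
    """
    probe = probe.upper()
    window = ref_forward_window.upper()
    if probe in window:
        return True
    if reverse_complement(probe) in window:
        return False
    return None


def to_forward_strand(alleles: Tuple[str, str], on_forward: bool) -> Tuple[str, str]:
    """Report single-base alleles on the reference forward strand.

    Alleles from a forward-strand probe are returned unchanged; alleles from
    a reverse-strand probe are complemented.
    """
    if on_forward:
        return tuple(a.upper() for a in alleles)
    return tuple(COMPLEMENT[a.upper()] for a in alleles)


if __name__ == "__main__":
    window = "GATTACAGGC"  # made-up reference forward-strand sequence
    for probe in ("TTACAG", "CTGTAA", "AAAAAA"):
        print(probe, infer_probe_orientation(probe, window))
```

Note that complementing cannot resolve A/T and C/G SNPs from the alleles alone, since those pairs look the same on both strands; this is why the sketch relies on the probe sequence rather than on the allele pair.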

AVAILABILITY AND IMPLEMENTATION

https://github.com/seasky002002/Strandscript.

CONTACT

yan.guo.1@vanderbilt.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.


Similar articles

1
StrandScript: evaluation of Illumina genotyping array design and strand correction.
Bioinformatics. 2017 Aug 1;33(15):2399-2401. doi: 10.1093/bioinformatics/btx186.
2
Strategies for processing and quality control of Illumina genotyping arrays.
Brief Bioinform. 2018 Sep 28;19(5):765-775. doi: 10.1093/bib/bbx012.
3
Illumina human exome genotyping array clustering and quality control.
Nat Protoc. 2014 Nov;9(11):2643-62. doi: 10.1038/nprot.2014.174. Epub 2014 Oct 16.
4
Simple, rapid and accurate genotyping-by-sequencing from aligned whole genomes with ArrayMaker.
Bioinformatics. 2015 Feb 15;31(4):599-601. doi: 10.1093/bioinformatics/btu691. Epub 2014 Oct 21.
5
Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi.
Bioinformatics. 2017 Feb 15;33(4):558-560. doi: 10.1093/bioinformatics/btw691.
6
Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.
BMC Res Notes. 2016 Aug 26;9(1):418. doi: 10.1186/s13104-016-2209-x.
7
Design of a 9K illumina BeadChip for polar bears (Ursus maritimus) from RAD and transcriptome sequencing.
Mol Ecol Resour. 2015 May;15(3):587-600. doi: 10.1111/1755-0998.12327. Epub 2014 Sep 15.
8
argyle: An R Package for Analysis of Illumina Genotyping Arrays.
G3 (Bethesda). 2015 Dec 18;6(2):281-6. doi: 10.1534/g3.115.023739.
9
Developing a 670k genotyping array to tag ~2M SNPs across 24 horse breeds.
BMC Genomics. 2017 Jul 27;18(1):565. doi: 10.1186/s12864-017-3943-8.
10
The COPILOT Raw Illumina Genotyping QC Protocol.
Curr Protoc. 2022 Apr;2(4):e373. doi: 10.1002/cpz1.373.

Cited by

1
Longitudinal single cell atlas identifies complex temporal relationship between type I interferon response and COVID-19 severity.
Nat Commun. 2024 Jan 18;15(1):567. doi: 10.1038/s41467-023-44524-0.
2
A Pipeline for Phasing and Genotype Imputation on Mixed Human Data (Parents-Offspring Trios and Unrelated Subjects) by Reviewing Current Methods and Software.
Life (Basel). 2022 Dec 5;12(12):2030. doi: 10.3390/life12122030.
3
GTQC: Automated Genotyping Array Quality Control and Report.
J Genomics. 2022 Feb 14;10:39-44. doi: 10.7150/jgen.69860. eCollection 2022.
4
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac043.
5
Association Between rs12037447, rs146732504, rs151078858, rs55723436, and rs6094136 Polymorphisms and Kawasaki Disease in the Population of Polish Children.
Front Pediatr. 2021 Feb 22;9:624798. doi: 10.3389/fped.2021.624798. eCollection 2021.
6
Y-Chromosome Genetic Analysis of Modern Polish Population.
Front Genet. 2020 Oct 23;11:567309. doi: 10.3389/fgene.2020.567309. eCollection 2020.
7
Alternative Applications of Genotyping Array Data Using Multivariant Methods.
Trends Genet. 2020 Nov;36(11):857-867. doi: 10.1016/j.tig.2020.07.006. Epub 2020 Aug 6.
8
Phenome-wide association analysis suggests the APOL1 linked disease spectrum primarily drives kidney-specific pathways.
Kidney Int. 2020 May;97(5):1032-1041. doi: 10.1016/j.kint.2020.01.027. Epub 2020 Feb 17.