• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

STRScan:全基因组测序数据中短串联重复序列的靶向分析

STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data.

作者信息

Tang Haixu, Nzabarushimana Etienne

机构信息

School of Informatics and Computing, Indiana University, 150 S. Woodlawn Avenue, Bloomington, 47405, IN, USA.

出版信息

BMC Bioinformatics. 2017 Oct 3;18(Suppl 11):398. doi: 10.1186/s12859-017-1800-z.

DOI:10.1186/s12859-017-1800-z
PMID:28984185
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5629557/
Abstract

BACKGROUND

Short tandem repeats (STRs) are found in many prokaryotic and eukaryotic genomes, and are commonly used as genetic markers, in particular for identity and parental testing in DNA forensics. The unstable expansion of some STRs was associated with various genetic disorders (e.g., the Huntington disease), and thus was used in genetic testing for screening individuals at high risk. Traditional STR analyses were based on the PCR amplification of STR loci followed by gel electrophoresis. With the availability of massive whole genome sequencing data, it becomes practical to mine STR profiles in silico from genome sequences. Software tools such as lobSTR and STR-FM have been developed to address these demands, which are, however, built upon whole genome reads mapping tools, and thus may not be sensitive enough.

RESULTS

In this paper, we present a standalone software tool STRScan that uses a greedy algorithm for targeted STR profiling in next-generation sequencing (NGS) data. STRScan was tested on the whole genome sequencing data from Venter genome sequencing and 1000 Genomes Project. The results showed that STRScan can profile 20% more STRs in the target set that are missed by lobSTR.

CONCLUSION

STRScan is particularly useful for the NGS-based targeted STR profiling, e.g., in genetic and human identity testing. STRScan is available as open-source software at http://darwin.informatics.indiana.edu/str/ .

摘要

背景

短串联重复序列(STRs)存在于许多原核生物和真核生物基因组中,常用于作为遗传标记,尤其是在DNA法医鉴定中的身份识别和亲权测试。一些STRs的不稳定扩增与多种遗传疾病(如亨廷顿舞蹈症)相关,因此被用于遗传检测以筛查高危个体。传统的STR分析基于STR位点的PCR扩增,随后进行凝胶电泳。随着大量全基因组测序数据的可得性,从基因组序列中进行STR图谱的计算机挖掘变得可行。诸如lobSTR和STR-FM等软件工具已被开发以满足这些需求,然而,它们是基于全基因组读段映射工具构建的,因此可能不够灵敏。

结果

在本文中,我们展示了一个独立的软件工具STRScan,它使用贪心算法在下一代测序(NGS)数据中进行靶向STR图谱分析。STRScan在来自文特尔基因组测序和千人基因组计划的全基因组测序数据上进行了测试。结果表明,STRScan能够在目标集合中识别出比lobSTR多20%的STRs。

结论

STRScan对于基于NGS的靶向STR图谱分析特别有用,例如在遗传和人类身份测试中。STRScan作为开源软件可在http://darwin.informatics.indiana.edu/str/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd86/5629557/dc8cedd38752/12859_2017_1800_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd86/5629557/dc8cedd38752/12859_2017_1800_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd86/5629557/dc8cedd38752/12859_2017_1800_Fig1_HTML.jpg

相似文献

1
STRScan: targeted profiling of short tandem repeats in whole-genome sequencing data.STRScan:全基因组测序数据中短串联重复序列的靶向分析
BMC Bioinformatics. 2017 Oct 3;18(Suppl 11):398. doi: 10.1186/s12859-017-1800-z.
2
lobSTR: A short tandem repeat profiler for personal genomes.lobSTR:个人基因组的短串联重复序列分析工具。
Genome Res. 2012 Jun;22(6):1154-62. doi: 10.1101/gr.135780.111. Epub 2012 Apr 20.
3
Profiling short tandem repeats from short reads.从短读段中分析短串联重复序列。
Methods Mol Biol. 2013;1038:113-35. doi: 10.1007/978-1-62703-514-9_7.
4
Profiling of Short-Tandem-Repeat Disease Alleles in 12,632 Human Whole Genomes.12632个人类全基因组中短串联重复疾病等位基因的分析
Am J Hum Genet. 2017 Nov 2;101(5):700-715. doi: 10.1016/j.ajhg.2017.09.013.
5
Search for More Effective Microsatellite Markers for Forensics With Next-Generation Sequencing.利用下一代测序技术寻找更有效的法医微卫星标记物。
IEEE Trans Nanobioscience. 2017 Jul;16(5):375-381. doi: 10.1109/TNB.2017.2712795. Epub 2017 Jun 7.
6
Genome-wide sequencing as a first-tier screening test for short tandem repeat expansions.全基因组测序作为短串联重复扩展的一线筛查试验。
Genome Med. 2021 Aug 9;13(1):126. doi: 10.1186/s13073-021-00932-9.
7
STRsearch: a new pipeline for targeted profiling of short tandem repeats in massively parallel sequencing data.STRsearch:一种用于大规模平行测序数据中短串联重复序列靶向分析的新管道。
Hereditas. 2020 Mar 16;157(1):8. doi: 10.1186/s41065-020-00120-6.
8
Developmental validation of the MiSeq FGx Forensic Genomics System for Targeted Next Generation Sequencing in Forensic DNA Casework and Database Laboratories.MiSeq FGx法医基因组学系统在法医DNA案件工作和数据库实验室中用于靶向新一代测序的发育验证。
Forensic Sci Int Genet. 2017 May;28:52-70. doi: 10.1016/j.fsigen.2017.01.011. Epub 2017 Jan 27.
9
A unified STR profiling system across multiple species with whole genome sequencing data.具有全基因组测序数据的跨多种物种的统一 STR 分析系统。
BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):671. doi: 10.1186/s12859-019-3246-y.
10
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.从全基因组测序数据中准确分型短串联重复序列及其应用。
Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30.

引用本文的文献

1
A genome-wide approach for the discovery of novel repeat expansion disorders in the Undiagnosed Diseases Network cohort.一种用于在未确诊疾病网络队列中发现新型重复序列扩张疾病的全基因组方法。
Genet Med. 2025 May 22;27(8):101462. doi: 10.1016/j.gim.2025.101462.
2
STRsensor: a computationally efficient method for STR allele-typing from massively parallel sequencing data.STR传感器:一种从大规模平行测序数据中进行STR等位基因分型的计算高效方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae637.
3
BigFiRSt: A Software Program Using Big Data Technique for Mining Simple Sequence Repeats From Large-Scale Sequencing Data.

本文引用的文献

1
Abundant contribution of short tandem repeats to gene expression variation in humans.短串联重复序列对人类基因表达变异的巨大贡献。
Nat Genet. 2016 Jan;48(1):22-9. doi: 10.1038/ng.3461. Epub 2015 Dec 7.
2
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.从全基因组测序数据中准确分型短串联重复序列及其应用。
Genome Res. 2015 May;25(5):736-49. doi: 10.1101/gr.185892.114. Epub 2015 Mar 30.
3
MIPSTR: a method for multiplex genotyping of germline and somatic STR variation across many individuals.
BigFiRSt:一种使用大数据技术从大规模测序数据中挖掘简单序列重复序列的软件程序。
Front Big Data. 2022 Jan 18;4:727216. doi: 10.3389/fdata.2021.727216. eCollection 2021.
4
An Introductory Overview of Open-Source and Commercial Software Options for the Analysis of Forensic Sequencing Data.开源和商业软件选项在法医测序数据分析中的应用概述
Genes (Basel). 2021 Oct 29;12(11):1739. doi: 10.3390/genes12111739.
5
Large scale in silico characterization of repeat expansion variation in human genomes.大规模的人类基因组中重复扩展变异的计算机模拟分析。
Sci Data. 2020 Sep 8;7(1):294. doi: 10.1038/s41597-020-00633-9.
6
The Potential of HTS Approaches for Accurate Genotyping in Grapevine ( L.).高通量筛选方法在葡萄(L.)准确基因型鉴定中的潜力。
Genes (Basel). 2020 Aug 10;11(8):917. doi: 10.3390/genes11080917.
7
Accuracy of short tandem repeats genotyping tools in whole exome sequencing data.短串联重复序列基因分型工具在全外显子组测序数据中的准确性。
F1000Res. 2020 Mar 23;9:200. doi: 10.12688/f1000research.22639.1. eCollection 2020.
8
MultiplexSSR: A pipeline for developing multiplex SSR-PCR assays from resequencing data.多重简单序列重复(MultiplexSSR):一种从重测序数据开发多重简单序列重复聚合酶链反应(SSR-PCR)检测方法的流程。
Ecol Evol. 2020 Mar 4;10(6):3055-3067. doi: 10.1002/ece3.6121. eCollection 2020 Mar.
9
STRsearch: a new pipeline for targeted profiling of short tandem repeats in massively parallel sequencing data.STRsearch:一种用于大规模平行测序数据中短串联重复序列靶向分析的新管道。
Hereditas. 2020 Mar 16;157(1):8. doi: 10.1186/s41065-020-00120-6.
10
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.利用新一代测序(NGS)数据在植物中挖掘和开发新型 SSR 标记。
Molecules. 2018 Feb 13;23(2):399. doi: 10.3390/molecules23020399.
MIPSTR:一种用于对多个个体的种系和体细胞STR变异进行多重基因分型的方法。
Genome Res. 2015 May;25(5):750-61. doi: 10.1101/gr.182212.114. Epub 2015 Feb 6.
4
The landscape of human STR variation.人类短串联重复序列(STR)变异图谱。
Genome Res. 2014 Nov;24(11):1894-904. doi: 10.1101/gr.177774.114. Epub 2014 Aug 18.
5
Short tandem repeat typing on the 454 platform: strategies and considerations for targeted sequencing of common forensic markers.454 平台上的短串联重复序列分型:常见法医标记物靶向测序的策略和考虑因素。
Forensic Sci Int Genet. 2014 Sep;12:107-19. doi: 10.1016/j.fsigen.2014.04.010. Epub 2014 May 4.
6
Large-scale analysis of tandem repeat variability in the human genome.人类基因组中串联重复序列变异的大规模分析。
Nucleic Acids Res. 2014 May;42(9):5728-41. doi: 10.1093/nar/gku212. Epub 2014 Mar 20.
7
Identifying personal genomes by surname inference.姓氏推断识别个人基因组。
Science. 2013 Jan 18;339(6117):321-4. doi: 10.1126/science.1229566.
8
lobSTR: A short tandem repeat profiler for personal genomes.lobSTR:个人基因组的短串联重复序列分析工具。
Genome Res. 2012 Jun;22(6):1154-62. doi: 10.1101/gr.135780.111. Epub 2012 Apr 20.
9
A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。
Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.
10
Short tandem repeat typing technologies used in human identity testing.用于人类身份鉴定的短串联重复序列分型技术。
Biotechniques. 2007 Oct;43(4):ii-v. doi: 10.2144/000112582.