• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Discretized Gaussian mixture for genotyping of microsatellite loci containing homopolymer runs.含重复序列微卫星位点基因分型的离散高斯混合模型
Bioinformatics. 2014 Mar 1;30(5):652-9. doi: 10.1093/bioinformatics/btt595. Epub 2013 Oct 17.
2
megasat: automated inference of microsatellite genotypes from sequence data.Megasat:从序列数据自动推断微卫星基因型
Mol Ecol Resour. 2017 Mar;17(2):247-256. doi: 10.1111/1755-0998.12561. Epub 2016 Jul 19.
3
Seq2Sat and SatAnalyzer toolkit: Towards comprehensive microsatellite genotyping from sequencing data.Seq2Sat 和 SatAnalyzer 工具包:从测序数据中实现全面的微卫星基因分型。
Mol Ecol Resour. 2024 Apr;24(3):e13929. doi: 10.1111/1755-0998.13929. Epub 2024 Jan 30.
4
ReviSTER: an automated pipeline to revise misaligned reads to simple tandem repeats.ReviSTER:一种自动流水线,用于修正未对齐的读取到简单串联重复序列。
Bioinformatics. 2013 Jul 15;29(14):1734-41. doi: 10.1093/bioinformatics/btt277. Epub 2013 May 15.
5
Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results.在序列等位基因命名中使用 LUS 以促进基于 NGS 的 STR 分型结果的概率基因分型。
Forensic Sci Int Genet. 2018 May;34:197-205. doi: 10.1016/j.fsigen.2018.02.016. Epub 2018 Feb 21.
6
Profiling of Short-Tandem-Repeat Disease Alleles in 12,632 Human Whole Genomes.12632个人类全基因组中短串联重复疾病等位基因的分析
Am J Hum Genet. 2017 Nov 2;101(5):700-715. doi: 10.1016/j.ajhg.2017.09.013.
7
MicNeSs: genotyping microsatellite loci from a collection of (NGS) reads.MicNeSs:从一组(二代测序)读段中对微卫星位点进行基因分型。
Mol Ecol Resour. 2016 Mar;16(2):524-33. doi: 10.1111/1755-0998.12467. Epub 2015 Oct 14.
8
Resolving complex tandem repeats with long reads.用长读解决复杂串联重复序列
Bioinformatics. 2014 Dec 15;30(24):3491-8. doi: 10.1093/bioinformatics/btu437. Epub 2014 Jul 15.
9
Dante: genotyping of known complex and expanded short tandem repeats.对已知的复杂和扩展短串联重复序列进行基因分型。
Bioinformatics. 2019 Apr 15;35(8):1310-1317. doi: 10.1093/bioinformatics/bty791.
10
Long microsatellite alleles in Drosophila melanogaster have a downward mutation bias and short persistence times, which cause their genome-wide underrepresentation.黑腹果蝇中的长微卫星等位基因具有向下的突变偏向性和较短的持续时间,这导致它们在全基因组中的代表性不足。
Genetics. 2000 Jul;155(3):1213-20. doi: 10.1093/genetics/155.3.1213.

引用本文的文献

1
A detailed analysis of second and third-generation sequencing approaches for accurate length determination of short tandem repeats and homopolymers.用于精确测定短串联重复序列和同聚物长度的第二代和第三代测序方法的详细分析。
Nucleic Acids Res. 2025 Feb 27;53(5). doi: 10.1093/nar/gkaf131.
2
Molecular Surveillance of Malaria Using the PF AmpliSeq Custom Assay for Parasites from Dried Blood Spot DNA Isolates from Peru.利用PF AmpliSeq定制检测法对秘鲁干血斑DNA分离株中的疟原虫进行疟疾分子监测。
Bio Protoc. 2023 Mar 5;13(5):e4621. doi: 10.21769/BioProtoc.4621.
3
Assessment of Microsatellite Instability from Next-Generation Sequencing Data.基于下一代测序数据的微卫星不稳定性评估。
Adv Exp Med Biol. 2022;1361:75-100. doi: 10.1007/978-3-030-91836-1_5.
4
Abundance of ethnically biased microsatellites in human gene regions.人类基因区域中存在大量种族偏见的微卫星。
PLoS One. 2019 Dec 12;14(12):e0225216. doi: 10.1371/journal.pone.0225216. eCollection 2019.
5
Low temperature isothermal amplification of microsatellites drastically reduces stutter artifact formation and improves microsatellite instability detection in cancer.低温等温扩增微卫星可大大减少卫星不稳定现象形成的伪迹,并提高癌症中微卫星不稳定性的检测。
Nucleic Acids Res. 2019 Dec 2;47(21):e141. doi: 10.1093/nar/gkz811.
6
Germline microsatellite genotypes differentiate children with medulloblastoma.胚系微卫星基因型可区分髓母细胞瘤患儿。
Neuro Oncol. 2020 Jan 11;22(1):152-162. doi: 10.1093/neuonc/noz179.
7
Detecting Expansions of Tandem Repeats in Cohorts Sequenced with Short-Read Sequencing Data.检测短读测序数据序列化队列中的串联重复扩展。
Am J Hum Genet. 2018 Dec 6;103(6):858-873. doi: 10.1016/j.ajhg.2018.10.015. Epub 2018 Nov 29.
8
CAGm: a repository of germline microsatellite variations in the 1000 genomes project.CAGm:1000 基因组计划中的种系微卫星变异知识库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D39-D45. doi: 10.1093/nar/gky969.
9
Whole-exome sequencing reveals microsatellite DNA markers for response to dofetilide initiation in patients with persistent atrial fibrillation: A pilot study.全外显子组测序揭示持续性心房颤动患者起始使用多非利特治疗反应的微卫星DNA标记:一项试点研究。
Clin Cardiol. 2018 Jun;41(6):849-854. doi: 10.1002/clc.22969. Epub 2018 Jun 11.
10
as a Risk and Mortality Marker for Breast Cancer in African American Women.作为非裔美国女性乳腺癌的风险和死亡率标志物。
Cancer Inform. 2017 Dec 13;16:1176935117746644. doi: 10.1177/1176935117746644. eCollection 2017.

本文引用的文献

1
Accurate human microsatellite genotypes from high-throughput resequencing data using informed error profiles.利用知情误差模型从高通量重测序数据中准确获取人类微卫星基因型。
Nucleic Acids Res. 2013 Jan 7;41(1):e32. doi: 10.1093/nar/gks981. Epub 2012 Oct 22.
2
lobSTR: A short tandem repeat profiler for personal genomes.lobSTR:个人基因组的短串联重复序列分析工具。
Genome Res. 2012 Jun;22(6):1154-62. doi: 10.1101/gr.135780.111. Epub 2012 Apr 20.
3
pIRS: Profile-based Illumina pair-end reads simulator.pIRS:基于谱的 Illumina 双端读取模拟器。
Bioinformatics. 2012 Jun 1;28(11):1533-5. doi: 10.1093/bioinformatics/bts187. Epub 2012 Apr 15.
4
The Drosophila melanogaster Genetic Reference Panel.黑腹果蝇遗传参考面板。
Nature. 2012 Feb 8;482(7384):173-8. doi: 10.1038/nature10811.
5
Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.Illumina HiSeq 和基因组分析仪系统生成的基因组高通量测序数据评估。
Genome Biol. 2011 Nov 8;12(11):R112. doi: 10.1186/gb-2011-12-11-r112.
6
Evaluation of microsatellite variation in the 1000 Genomes Project pilot studies is indicative of the quality and utility of the raw data and alignments.对 1000 基因组计划试点研究中微卫星变异的评估表明了原始数据和比对的质量和实用性。
Genomics. 2011 Apr;97(4):193-9. doi: 10.1016/j.ygeno.2011.01.001. Epub 2011 Jan 9.
7
Dindel: accurate indel calls from short-read data.Dindel:从短读数据中进行精确的插入缺失突变(Indel)调用。
Genome Res. 2011 Jun;21(6):961-73. doi: 10.1101/gr.112326.110. Epub 2010 Oct 27.
8
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
9
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
10
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.

含重复序列微卫星位点基因分型的离散高斯混合模型

Discretized Gaussian mixture for genotyping of microsatellite loci containing homopolymer runs.

机构信息

Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA 24061 and Office of Biostatistics Research, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA.

出版信息

Bioinformatics. 2014 Mar 1;30(5):652-9. doi: 10.1093/bioinformatics/btt595. Epub 2013 Oct 17.

DOI:10.1093/bioinformatics/btt595
PMID:24135263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3933874/
Abstract

MOTIVATION

Inferring lengths of inherited microsatellite alleles with single base pair resolution from short sequence reads is challenging due to several sources of noise caused by the repetitive nature of microsatellites and the technologies used to generate raw sequence data.

RESULTS

We have developed a program, GenoTan, using a discretized Gaussian mixture model combined with a rules-based approach to identify inherited variation of microsatellite loci from short sequence reads without paired-end information. It effectively distinguishes length variants from noise including insertion/deletion errors in homopolymer runs by addressing the bidirectional aspect of insertion and deletion errors in sequence reads. Here we first introduce a homopolymer decomposition method which estimates error bias toward insertion or deletion in homopolymer sequence runs. Combining these approaches, GenoTan was able to genotype 94.9% of microsatellite loci accurately from simulated data with 40x sequence coverage quickly while the other programs showed <90% correct calls for the same data and required 5∼30× more computational time than GenoTan. It also showed the highest true-positive rate for real data using mixed sequence data of two Drosophila inbred lines, which was a novel validation approach for genotyping.

AVAILABILITY

GenoTan is open-source software available at http://genotan.sourceforge.net.

摘要

动机

由于微卫星的重复性质和用于生成原始序列数据的技术所导致的多种噪声源,从短序列读取推断具有单碱基分辨率的遗传微卫星等位基因的长度具有挑战性。

结果

我们使用离散高斯混合模型结合基于规则的方法开发了一个程序 GenoTan,用于从没有配对末端信息的短序列读取中识别微卫星基因座的遗传变异。它通过解决序列读取中插入/缺失错误的双向方面,有效地将长度变体与噪声(包括同源多聚体序列中的插入/缺失错误)区分开来。在这里,我们首先介绍了一种同源多聚体分解方法,该方法估计了同源多聚体序列运行中插入或缺失的错误偏向。结合这些方法,GenoTan 能够从模拟数据中以 40x 序列覆盖率快速准确地对 94.9%的微卫星基因座进行基因分型,而其他程序对相同数据的正确调用率<90%,并且比 GenoTan 多花费 5∼30 倍的计算时间。它还使用两种果蝇近交系的混合序列数据显示了真实数据的最高真阳性率,这是一种用于基因分型的新验证方法。

可用性

GenoTan 是一个开源软件,可在 http://genotan.sourceforge.net 上获得。