PSR: polymorphic SSR retrieval.

Authors

Cantarella Concita, D'Agostino Nunzio

Affiliation

Consiglio per la ricerca in agricoltura e l'analisi dell'economia agraria - Centro di ricerca per l'orticoltura, Via Cavalleggeri 25, 84098, Pontecagnano Faiano, Italy.

Publication information

BMC Res Notes. 2015 Oct 1;8:525. doi: 10.1186/s13104-015-1474-4.

DOI: 10.1186/s13104-015-1474-4
PMID: 26428628
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4591729/
Abstract

BACKGROUND

With the advent of high-throughput sequencing technologies, large-scale identification of microsatellites became affordable and was especially directed to non-model species. By contrast, few efforts have been published toward the automatic identification of polymorphic microsatellites by exploiting sequence redundancy. Few tools for genotyping microsatellite repeats have been implemented so far that are able to manage huge amounts of sequence data and handle the SAM/BAM file format. Most of them have been developed for and tested on human or model organisms with high-quality reference genomes.

RESULTS

In this note we describe polymorphic SSR retrieval (PSR), a read counter and simple sequence repeat (SSR) length polymorphism detection tool. It is written in Perl and was developed to identify length polymorphisms in perfect microsatellites by exploiting next-generation sequencing (NGS) data. PSR has been developed with plant non-model species in mind, for which a de novo transcriptome assembly is generally the first sequence resource available for SSR mining. PSR is divided into two modules: the read-counting module (PSR_read_retrieval) identifies all the reads that cover the full length of perfect microsatellites; the comparative module (PSR_poly_finder) detects both heterozygous and homozygous alleles at each microsatellite locus across all genotypes under investigation. The user can define two threshold values to call a length polymorphism and reduce the number of false positives: the minimum number of reads overlapping the repetitive stretch and the minimum read depth. The first parameter determines whether the microsatellite-containing sequence is processed at all, while the second is decisive for the identification of minor alleles. PSR was tested on two different case studies. The first aims at the identification of polymorphic SSRs in a set of de novo assembled transcripts defined by RNA sequencing of two different plant genotypes. The second aims to investigate sequence variations within a collection of newly sequenced chloroplast genomes. In both cases, PSR results are in agreement with those obtained by capillary gel separation.
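The two-step logic described above can be illustrated with a minimal Python sketch. This is not the PSR code (PSR is written in Perl and works on alignment data); it is a simplified illustration under stated assumptions. The function names, the use of exact flanking sequences to decide whether a read covers the full repeat, and the default threshold values are all hypothetical. It also assumes that the first threshold applies to the total number of spanning reads at a locus and the second to the number of reads supporting each allele, which is one plausible reading of the abstract.

```python
import re
from collections import Counter


def count_repeat_lengths(reads, motif, left_flank, right_flank, min_units=2):
    """Count spanning reads per repeat-unit number (hypothetical helper).

    A read is counted only if it contains both flanks with an uninterrupted
    run of `motif` between them, i.e. it covers the full perfect repeat.
    """
    pattern = re.compile(
        re.escape(left_flank)
        + "((?:" + re.escape(motif) + "){" + str(min_units) + ",})"
        + re.escape(right_flank)
    )
    counts = Counter()
    for read in reads:
        match = pattern.search(read)
        if match:
            counts[len(match.group(1)) // len(motif)] += 1
    return counts


def call_alleles(counts, min_reads=10, min_depth=3):
    """Apply the two thresholds (assumed semantics, illustrative defaults).

    min_reads: minimum total spanning reads for the locus to be processed.
    min_depth: minimum reads supporting an allele for it to be reported.
    Returns a sorted list of repeat-unit numbers, or None if the locus is skipped.
    """
    if sum(counts.values()) < min_reads:
        return None
    return sorted(n for n, c in counts.items() if c >= min_depth)


def is_polymorphic(alleles_by_genotype):
    """A locus is polymorphic if more than one allele length is seen overall."""
    called = [set(a) for a in alleles_by_genotype.values() if a]
    return len(set().union(*called)) > 1


# Toy data: genotype A is homozygous (5 units), genotype B is heterozygous (5/7).
LEFT, RIGHT, MOTIF = "GGATC", "TTCGA", "AG"
reads_a = [LEFT + MOTIF * 5 + RIGHT] * 12
reads_b = (
    [LEFT + MOTIF * 5 + RIGHT] * 8
    + [LEFT + MOTIF * 7 + RIGHT] * 6
    + [LEFT + MOTIF * 6 + RIGHT] * 1   # below min_depth, treated as noise
    + [LEFT + MOTIF * 5] * 4           # truncated reads, do not span the repeat
)
alleles = {
    "A": call_alleles(count_repeat_lengths(reads_a, MOTIF, LEFT, RIGHT)),
    "B": call_alleles(count_repeat_lengths(reads_b, MOTIF, LEFT, RIGHT)),
}
print(alleles, is_polymorphic(alleles))
```

In this sketch a genotype with a single reported allele is treated as homozygous and one with two or more as heterozygous; raising `min_depth` makes the call more conservative for minor alleles at the cost of missing low-coverage ones.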

CONCLUSION

PSR was developed specifically to automate the gene-based and genome-wide identification of polymorphic microsatellites from NGS data. It overcomes the limitations of existing, time-consuming approaches based on tools developed in the pre-NGS era.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4591729/ea70f8655dfb/13104_2015_1474_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4591729/c8930043ffce/13104_2015_1474_Fig1_HTML.jpg
