• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

-QR分类器:一种基于模式的植物物种识别方法。

-QR classifier: a patterns based approach for plant species identification.

作者信息

More Ravi Prabhakar, Mane Rupali Chandrashekhar, Purohit Hemant J

机构信息

Environmental Genomics Division, CSIR-National Environmental Engineering Research Institute, Nagpur, 440020 Maharashtra India ; Present Institute: Division of Molecular Entomology, ICAR- National Bureau of Agricultural Insect Resources (NBAIR), Hebbal, Bengaluru, 560024 Karnataka India.

MDS Bio-Analytics, Nagpur, 440020 Maharashtra India.

出版信息

BioData Min. 2016 Dec 9;9:39. doi: 10.1186/s13040-016-0120-6. eCollection 2016.

DOI:10.1186/s13040-016-0120-6
PMID:27990177
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5148893/
Abstract

BACKGROUND

DNA barcoding is widely used and most efficient approach that facilitates rapid and accurate identification of plant species based on the short standardized segment of the genome. The nucleotide sequences of maturaseK () and ribulose-1, 5-bisphosphate carboxylase () marker loci are commonly used in plant species identification. Here, we present a new and highly efficient approach for identifying a unique set of discriminating nucleotide patterns to generate a signature (i.e. regular expression) for plant species identification.

METHODS

In order to generate molecular signatures, we used and loci datasets, which encompass 125 plant species in 52 genera reported by the CBOL plant working group. Initially, we performed Multiple Sequence Alignment (MSA) of all species followed by Position Specific Scoring Matrix (PSSM) for both loci to achieve a percentage of discrimination among species. Further, we detected Discriminating Patterns (DP) at genus and species level using PSSM for the dataset. Combining DP and consecutive pattern distances, we generated molecular signatures for each species. Finally, we performed a comparative assessment of these signatures with the existing methods including BLASTn, Support Vector Machines (SVM), Jrip-RIPPER, J48 (C4.5 algorithm), and the Naïve Bayes (NB) methods against NCBI-GenBank dataset.

RESULTS

Due to the higher discrimination success obtained with the as compared to the , we selected gene for signature generation. We generated signatures for 60 species based on identified discriminating patterns at genus and species level. Our comparative assessment results suggest that a total of 46 out of 60 species could be correctly identified using generated signatures, followed by BLASTn (34 species), SVM (18 species), C4.5 (7 species), NB (4 species) and RIPPER (3 species) methods As a final outcome of this study, we converted signatures into QR codes and developed a software -QR Classifier (http://www.neeri.res.in/matk_classifier/index.htm), which search signatures in the query gene sequences and predict corresponding plant species.

CONCLUSIONS

This novel approach of employing pattern-based signatures opens new avenues for the classification of species. In addition to existing methods, we believe that -QR Classifier would be a valuable tool for molecular taxonomists enabling precise identification of plant species.

摘要

背景

DNA条形码技术是一种广泛应用且高效的方法,它基于基因组的短标准化片段,有助于快速准确地鉴定植物物种。成熟酶K(matK)和核酮糖-1,5-二磷酸羧化酶(rbcL)标记基因座的核苷酸序列常用于植物物种鉴定。在此,我们提出了一种全新且高效的方法,用于识别一组独特的区分性核苷酸模式,以生成用于植物物种鉴定的特征码(即正则表达式)。

方法

为了生成分子特征码,我们使用了matK和rbcL基因座数据集,这些数据集涵盖了CBOL植物工作组报告的52个属中的125种植物。最初,我们对所有物种进行了多序列比对(MSA),随后对两个基因座进行了位置特异性评分矩阵(PSSM)分析,以实现物种间的区分率。此外,我们使用PSSM对matK数据集在属和种水平上检测区分模式(DP)。结合DP和连续模式距离,我们为每个物种生成了分子特征码。最后,我们将这些特征码与包括BLASTn、支持向量机(SVM)、Jrip - RIPPER、J48(C4.5算法)和朴素贝叶斯(NB)方法在内的现有方法针对NCBI - GenBank matK数据集进行了比较评估。

结果

由于matK相比rbcL获得了更高的区分成功率,我们选择matK基因进行特征码生成。我们基于在属和种水平上确定的区分模式为60个物种生成了特征码。我们的比较评估结果表明,使用生成的特征码能够正确鉴定60个物种中的46个,其次是BLASTn(34个物种)方法、SVM(18个物种)方法、C4.5(7个物种)方法、NB(4个物种)方法和RIPPER(3个物种)方法。作为本研究的最终成果,我们将特征码转换为二维码,并开发了一款软件 - QR分类器(http://www.neeri.res.in/matk_classifier/index.htm),该软件可在查询的matK基因序列中搜索特征码并预测相应的植物物种。

结论

这种采用基于模式的特征码的新方法为物种分类开辟了新途径。除了现有方法外,我们认为QR分类器将成为分子分类学家精确鉴定植物物种的有价值工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/4ec50e60c78b/13040_2016_120_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/a11bbf169f65/13040_2016_120_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/3a51b8eb275a/13040_2016_120_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/01184535e52f/13040_2016_120_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/f397f61d9dac/13040_2016_120_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/4ec50e60c78b/13040_2016_120_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/a11bbf169f65/13040_2016_120_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/3a51b8eb275a/13040_2016_120_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/01184535e52f/13040_2016_120_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/f397f61d9dac/13040_2016_120_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2ab/5148893/4ec50e60c78b/13040_2016_120_Fig5_HTML.jpg

相似文献

1
-QR classifier: a patterns based approach for plant species identification.-QR分类器:一种基于模式的植物物种识别方法。
BioData Min. 2016 Dec 9;9:39. doi: 10.1186/s13040-016-0120-6. eCollection 2016.
2
Evaluation of Arabian Vascular Plant Barcodes (rbcL and matK): Precision of Unsupervised and Supervised Learning Methods towards Accurate Identification.阿拉伯维管植物条形码(rbcL和matK)评估:无监督和监督学习方法对准确识别的精度
Plants (Basel). 2021 Dec 13;10(12):2741. doi: 10.3390/plants10122741.
3
The Identification of Discriminating Patterns from 16S rRNA Gene to Generate Signature for Bacillus Genus.从16S rRNA基因中识别区分模式以生成芽孢杆菌属的特征标记
J Comput Biol. 2016 Aug;23(8):651-61. doi: 10.1089/cmb.2016.0002. Epub 2016 Apr 22.
4
[Establishment of DNA Genetic Marker Identification System for Plant Evidence].[植物物证DNA遗传标记鉴定体系的建立]
Fa Yi Xue Za Zhi. 2018 Apr;34(2):138-141. doi: 10.3969/j.issn.1004-5619.2018.02.006. Epub 2018 Apr 25.
5
Development of DNA barcodes for selected species by using and DNA markers.利用[具体DNA标记]为选定物种开发DNA条形码。 (你原文中“using and DNA markers”表述有误,这里推测应该是“using [specific DNA markers]”之类的,你可检查下原文并修正后再让我翻译更准确)
Saudi J Biol Sci. 2020 Dec;27(12):3735-3742. doi: 10.1016/j.sjbs.2020.08.020. Epub 2020 Aug 19.
6
Identification of species in the angiosperm family Apiaceae using DNA barcodes.利用DNA条形码鉴定被子植物伞形科中的物种。
Mol Ecol Resour. 2014 Nov;14(6):1231-8. doi: 10.1111/1755-0998.12262. Epub 2014 May 14.
7
Analyzing multi-locus plant barcoding datasets with a composition vector method based on adjustable weighted distance.基于可调加权距离的组合向量方法分析多基因座植物条形码数据集。
PLoS One. 2012;7(7):e42154. doi: 10.1371/journal.pone.0042154. Epub 2012 Jul 27.
8
Analyses of Burial Codon Bias Among the Species of Dipterocarpaceae Through Molecular and Phylogenetic Data.通过分子和系统发育数据对龙脑香科物种间埋藏密码子偏好性的分析
Evol Bioinform Online. 2019 Mar 26;15:1176934319834888. doi: 10.1177/1176934319834888. eCollection 2019.
9
The first initiative of DNA barcoding of ornamental plants from Egypt and potential applications in horticulture industry.埃及观赏植物DNA条形码的首个倡议及其在园艺产业中的潜在应用。
PLoS One. 2017 Feb 15;12(2):e0172170. doi: 10.1371/journal.pone.0172170. eCollection 2017.
10
Comparison of matK and rbcL DNA barcodes for genetic classification of jewel orchid accessions in Vietnam.越南宝石兰种质资源遗传分类中matK和rbcL DNA条形码的比较
J Genet Eng Biotechnol. 2021 Jun 21;19(1):93. doi: 10.1186/s43141-021-00188-1.

引用本文的文献

1
Quantification of adulteration in traded ayurvedic raw drugs employing machine learning approaches with DNA barcode database.利用机器学习方法和DNA条形码数据库对交易中的阿育吠陀生药掺假情况进行量化分析。
3 Biotech. 2021 Nov;11(11):463. doi: 10.1007/s13205-021-03001-5. Epub 2021 Oct 18.
2
Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood.机器学习方法在基于 DNA 条形码的紫檀木鉴定中优于距离和基于树的方法。
Planta. 2019 May;249(5):1617-1625. doi: 10.1007/s00425-019-03116-3. Epub 2019 Mar 1.

本文引用的文献

1
The Identification of Discriminating Patterns from 16S rRNA Gene to Generate Signature for Bacillus Genus.从16S rRNA基因中识别区分模式以生成芽孢杆菌属的特征标记
J Comput Biol. 2016 Aug;23(8):651-61. doi: 10.1089/cmb.2016.0002. Epub 2016 Apr 22.
2
LAF: Logic Alignment Free and its application to bacterial genomes classification.LAF:无逻辑比对及其在细菌基因组分类中的应用。
BioData Min. 2015 Dec 8;8:39. doi: 10.1186/s13040-015-0073-1. eCollection 2015.
3
An algorithm of discovering signatures from DNA databases on a computer cluster.
一种在计算机集群上从DNA数据库中发现特征序列的算法。
BMC Bioinformatics. 2014 Oct 5;15(1):339. doi: 10.1186/1471-2105-15-339.
4
DNA barcode authentication of wood samples of threatened and commercial timber trees within the tropical dry evergreen forest of India.印度热带干燥常绿森林中濒危及商用木材树种木材样本的DNA条形码鉴定
PLoS One. 2014 Sep 26;9(9):e107669. doi: 10.1371/journal.pone.0107669. eCollection 2014.
5
Classification of selectively constrained DNA elements using feature vectors and rule-based classifiers.使用特征向量和基于规则的分类器对选择性受限DNA元件进行分类。
Genomics. 2014 Aug;104(2):79-86. doi: 10.1016/j.ygeno.2014.07.004. Epub 2014 Jul 22.
6
Supervised DNA Barcodes species classification: analysis, comparisons and results.有监督 DNA 条形码物种分类:分析、比较和结果。
BioData Min. 2014 Apr 11;7(1):4. doi: 10.1186/1756-0381-7-4.
7
Plant DNA barcoding: from gene to genome.植物 DNA 条形码:从基因到基因组。
Biol Rev Camb Philos Soc. 2015 Feb;90(1):157-66. doi: 10.1111/brv.12104. Epub 2014 Mar 26.
8
A reliable DNA barcode reference library for the identification of the North European shelf fish fauna.用于鉴定北欧大陆架鱼类区系的可靠 DNA 条码参考文库。
Mol Ecol Resour. 2014 Sep;14(5):1060-71. doi: 10.1111/1755-0998.12238. Epub 2014 Mar 12.
9
Prospects for discriminating Zingiberaceae species in India using DNA barcodes.利用 DNA 条形码鉴别印度姜科植物种类的前景。
J Integr Plant Biol. 2014 Aug;56(8):760-73. doi: 10.1111/jipb.12189. Epub 2014 Apr 7.
10
DNA barcoding of medicinal plant material for identification.药用植物材料的 DNA 条形码鉴定。
Curr Opin Biotechnol. 2014 Feb;25:103-10. doi: 10.1016/j.copbio.2013.09.010. Epub 2013 Nov 16.