• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

4SpecID:法医应用参考 DNA 文库审核和注释系统。

4SpecID: Reference DNA Libraries Auditing and Annotation System for Forensic Applications.

机构信息

ALGORITMI, Campus de Gualtar, University of Minho, Rua da Universidade, 4710-057 Braga, Portugal.

Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal.

出版信息

Genes (Basel). 2021 Jan 2;12(1):61. doi: 10.3390/genes12010061.

DOI:10.3390/genes12010061
PMID:33401773
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7824288/
Abstract

Forensic genetics is a fast-growing field that frequently requires DNA-based taxonomy, namely, when evidence are parts of specimens, often highly processed in food, potions, or ointments. Reference DNA-sequences libraries, such as BOLD or GenBank, are imperative tools for taxonomic assignment, particularly when morphology is inadequate for classification. The auditing and curation of these datasets require reliable mechanisms, preferably with automated data preprocessing. Software tools were developed to grade these datasets considering as primary criterion the number of records, which is not compliant with forensic standards, where the priority is validation from independent sources. Moreover, 4SpecID is an efficient software tool developed to audit and annotate reference libraries, specifically designed for forensic applications. Its intuitive user-friendly interface virtually accesses any database and includes specific data mining functions tuned for the widespread BOLD repositories. The built tool was evaluated in laptop MacBook and a dual-Xeon server with a large BOLD dataset ( 36,115 records), and the best execution time to grade the dataset on the laptop was 0.28 s. Datasets of and families were used to evaluate the quality of the tool and the relevance of independent sources validation.

摘要

法医遗传学是一个快速发展的领域,经常需要基于 DNA 的分类学,即在证据是标本的一部分时,通常是经过高度加工的食品、药水或药膏。BOLD 或 GenBank 等参考 DNA 序列库是分类学分配的必要工具,特别是在形态学不足以进行分类时。这些数据集的审核和管理需要可靠的机制,最好具有自动化的数据预处理。开发了软件工具来对这些数据集进行评分,主要标准是记录的数量,这不符合法医标准,法医标准优先考虑来自独立来源的验证。此外,4SpecID 是一种高效的软件工具,用于审核和注释参考库,专门为法医应用而设计。它直观的用户友好界面可以访问任何数据库,并包括针对广泛的 BOLD 存储库进行了优化的数据挖掘功能。在带有大型 BOLD 数据集(36115 条记录)的笔记本 MacBook 和双 Xeon 服务器上对构建的工具进行了评估,在笔记本电脑上对数据集进行评分的最佳执行时间为 0.28 秒。使用 和 家族的数据集来评估工具的质量和独立来源验证的相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/309dc708a4af/genes-12-00061-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/4f401781c29a/genes-12-00061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/45e16a0ae00b/genes-12-00061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/00c4c2de9bbb/genes-12-00061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/bc52e78835e3/genes-12-00061-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/8c9570b1016e/genes-12-00061-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/d10201569588/genes-12-00061-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/dcad1cd077c0/genes-12-00061-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/309dc708a4af/genes-12-00061-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/4f401781c29a/genes-12-00061-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/45e16a0ae00b/genes-12-00061-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/00c4c2de9bbb/genes-12-00061-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/bc52e78835e3/genes-12-00061-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/8c9570b1016e/genes-12-00061-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/d10201569588/genes-12-00061-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/dcad1cd077c0/genes-12-00061-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/82fa/7824288/309dc708a4af/genes-12-00061-g008.jpg

相似文献

1
4SpecID: Reference DNA Libraries Auditing and Annotation System for Forensic Applications.4SpecID:法医应用参考 DNA 文库审核和注释系统。
Genes (Basel). 2021 Jan 2;12(1):61. doi: 10.3390/genes12010061.
2
BAGS: An automated Barcode, Audit & Grade System for DNA barcode reference libraries.BAGS:一个用于 DNA 条码参考文库的自动条码、审计和分级系统。
Mol Ecol Resour. 2021 Feb;21(2):573-583. doi: 10.1111/1755-0998.13262. Epub 2020 Oct 28.
3
Management of DNA reference libraries for barcoding and metabarcoding studies with the R package refdb.使用R包refdb对用于条形码和宏条形码研究的DNA参考文库进行管理。
Mol Ecol Resour. 2023 Feb;23(2):511-518. doi: 10.1111/1755-0998.13723. Epub 2022 Oct 28.
4
Guidelines for the Analysis of DNA Barcoding/Metabarcoding Sequencing Data and Interpretation of Publicly Available Databases.DNA 条形码/代谢组条形码测序数据分析指南和公共可用数据库的解释。
Methods Mol Biol. 2024;2744:391-402. doi: 10.1007/978-1-0716-3581-0_25.
5
VIP Barcoding: composition vector-based software for rapid species identification based on DNA barcoding.VIP条形码:基于组成向量的软件,用于基于DNA条形码的快速物种鉴定。
Mol Ecol Resour. 2014 Jul;14(4):871-81. doi: 10.1111/1755-0998.12235. Epub 2014 Mar 7.
6
EMBL2checklists: A Python package to facilitate the user-friendly submission of plant and fungal DNA barcoding sequences to ENA.EMBL2checklists:一个方便用户向 ENA 提交植物和真菌 DNA 条形码序列的 Python 包。
PLoS One. 2019 Jan 10;14(1):e0210347. doi: 10.1371/journal.pone.0210347. eCollection 2019.
7
crabs-A software program to generate curated reference databases for metabarcoding sequencing data.Crabs——一个用于为元条形码测序数据生成经过整理的参考数据库的软件程序。
Mol Ecol Resour. 2023 Apr;23(3):725-738. doi: 10.1111/1755-0998.13741. Epub 2022 Dec 11.
8
BOLD and GenBank revisited - Do identification errors arise in the lab or in the sequence libraries?BOLD 和 GenBank 再探——鉴定错误是发生在实验室还是序列文库中?
PLoS One. 2020 Apr 16;15(4):e0231814. doi: 10.1371/journal.pone.0231814. eCollection 2020.
9
DNA barcoding in the Southeast Pacific marine realm: Low coverage and geographic representation despite high diversity.东南太平洋海域的 DNA 条形码:尽管多样性高,但覆盖范围和地理代表性低。
PLoS One. 2020 Dec 28;15(12):e0244323. doi: 10.1371/journal.pone.0244323. eCollection 2020.
10
Taxonomic identification accuracy from BOLD and GenBank databases using over a thousand insect DNA barcodes from Colombia.利用来自哥伦比亚的超过一千个昆虫 DNA 条形码对 BOLD 和 GenBank 数据库进行分类鉴定准确性研究。
PLoS One. 2023 Apr 24;18(4):e0277379. doi: 10.1371/journal.pone.0277379. eCollection 2023.

引用本文的文献

1
Benchmarking and Validation of a Bioinformatics Workflow for Meat Species Identification Using 16S rDNA Metabarcoding.使用16S rDNA宏条形码技术进行肉类物种鉴定的生物信息学工作流程的基准测试与验证
Foods. 2023 Feb 24;12(5):968. doi: 10.3390/foods12050968.

本文引用的文献

1
Phylogenomics and species delimitation for effective conservation of manta and devil rays.系统基因组学与物种界定在有效保护蝠鲼和魟鱼中的应用。
Mol Ecol. 2020 Dec;29(24):4783-4796. doi: 10.1111/mec.15683. Epub 2020 Nov 9.
2
New insights into the genetic diversity of the stone crayfish: taxonomic and conservation implications.对石蟹遗传多样性的新认识:分类学和保护意义。
BMC Evol Biol. 2020 Nov 6;20(1):146. doi: 10.1186/s12862-020-01709-1.
3
DNA Barcoding unveils cryptic lineages of Hoplias malabaricus from Northeastern Brazil.
DNA 条形码揭示了来自巴西东北部的马拉巴丽脂鲤的隐存谱系。
Braz J Biol. 2021 Oct-Dec;81(4):917-927. doi: 10.1590/1519-6984.231598. Epub 2021 Nov 30.
4
BAGS: An automated Barcode, Audit & Grade System for DNA barcode reference libraries.BAGS:一个用于 DNA 条码参考文库的自动条码、审计和分级系统。
Mol Ecol Resour. 2021 Feb;21(2):573-583. doi: 10.1111/1755-0998.13262. Epub 2020 Oct 28.
5
Species assignment in forensics and the challenge of hybrids.法庭科学中的物种鉴定与杂种的挑战。
Forensic Sci Int Genet. 2020 Sep;48:102333. doi: 10.1016/j.fsigen.2020.102333. Epub 2020 Jun 17.
6
DNA barcode reference libraries for the monitoring of aquatic biota in Europe: Gap-analysis and recommendations for future work.用于监测欧洲水生物种的 DNA 条码参考图书馆:差距分析和未来工作建议。
Sci Total Environ. 2019 Aug 15;678:499-524. doi: 10.1016/j.scitotenv.2019.04.247. Epub 2019 Apr 27.
7
The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications.UNITE 数据库用于真菌的分子鉴定:处理暗类群和并行的分类学分类。
Nucleic Acids Res. 2019 Jan 8;47(D1):D259-D264. doi: 10.1093/nar/gky1022.
8
GenBank.基因银行
Nucleic Acids Res. 2017 Jan 4;45(D1):D37-D42. doi: 10.1093/nar/gkw1070. Epub 2016 Nov 28.
9
Assembling and auditing a comprehensive DNA barcode reference library for European marine fishes.构建和审核欧洲海洋鱼类的综合DNA条形码参考文库。
J Fish Biol. 2016 Dec;89(6):2741-2754. doi: 10.1111/jfb.13169. Epub 2016 Oct 14.
10
R-Syst::diatom: an open-access and curated barcode database for diatoms and freshwater monitoring.R-Syst::硅藻:一个用于硅藻和淡水监测的开放获取且经过整理的条形码数据库。
Database (Oxford). 2016 Mar 17;2016. doi: 10.1093/database/baw016. Print 2016.