An empirical study of choosing efficient discriminative seeds for oligonucleotide design.

Affiliation

Department of Computer Engineering, Kyungpook National University, Daegu 702-701, South Korea.

Publication information

BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-10-S3-S3.

DOI:10.1186/1471-2164-10-S3-S3
PMID:19958494
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2788383/
Abstract

BACKGROUND

Oligonucleotide design is known to be a time-consuming task in bioinformatics. To accelerate the design process, one widely used approach is to prescreen unreliable regions with a hashing (or seeding) algorithm. Because seeding algorithms were originally proposed to increase the sensitivity of local alignment, oligonucleotide design must consider specificity as well as sensitivity. However, no measure has yet been proposed for evaluating how adequate and efficient seeds are for oligo design. Here, we propose novel measures for evaluating seeding algorithms based on discriminability and efficiency.
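The prescreening idea described above can be illustrated with a minimal sketch. It assumes the simplest possible seed, a contiguous k-mer, and a plain Python set as the hash table; the function names (`kmer_set`, `flag_unreliable`, `candidate_windows`) and the toy sequences are hypothetical and are not taken from the paper or its software package.

```python
def kmer_set(seq, k):
    """Return the set of all contiguous k-mers (seeds) in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def flag_unreliable(target, non_targets, k):
    """Flag every target position covered by a k-mer that also occurs
    in at least one non-target sequence (a possible cross-hybridization site)."""
    background = set()
    for seq in non_targets:
        background |= kmer_set(seq, k)
    flagged = [False] * len(target)
    for i in range(len(target) - k + 1):
        if target[i:i + k] in background:
            for j in range(i, i + k):
                flagged[j] = True
    return flagged


def candidate_windows(target, flagged, oligo_len):
    """Start positions of oligo_len windows that contain no flagged position."""
    return [
        i
        for i in range(len(target) - oligo_len + 1)
        if not any(flagged[i:i + oligo_len])
    ]


# Toy example (illustrative sequences, not real data).
target = "ACGTACGTTTGGCCAA"
non_targets = ["GGGACGTACCC"]
flagged = flag_unreliable(target, non_targets, k=5)
starts = candidate_windows(target, flagged, oligo_len=8)
print(starts)
```

Only the windows that survive the prescreen would be passed on to slower, more accurate specificity checks, which is where the time saving comes from.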

RESULTS

To evaluate the proposed measures, we examined five seeding algorithms for oligonucleotide design and carried out a series of experiments to compare them. The spaced seed was the most efficient discriminative seed for oligo design, and the transition-constrained seed performed slightly worse. Because the BLAT and Vector seeding algorithms scored poorly in specificity and efficiency, we conclude that they are not adequate for designing oligos. Consequently, we recommend spaced seeds or transition-constrained seeds of weight 15 to 18 for designing 50-mer oligos. Empirical experiments on real biological data show that the recommended seeds also perform well. We also provide a software package that lets users obtain adequate seeds under their own experimental conditions.
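For readers unfamiliar with spaced seeds, the following is a minimal sketch of how such a seed differs from a contiguous one. It assumes the common mask notation in which `1` marks a position that must match and `0` a position that is ignored, with the weight defined as the number of `1` positions. The masks, sequences, and function names are hypothetical toy choices; the weights of 15 to 18 recommended above are much larger than those used here.

```python
def seed_weight(mask):
    """Weight of a seed: the number of positions that must match."""
    return mask.count("1")


def spaced_key(seq, pos, mask):
    """Hash key for the window starting at pos: only characters under a '1'."""
    return "".join(seq[pos + i] for i, c in enumerate(mask) if c == "1")


def seed_hits(a, b, mask):
    """All (i, j) pairs where a at i and b at j agree on every '1' position."""
    span = len(mask)
    index = {}
    for j in range(len(b) - span + 1):
        index.setdefault(spaced_key(b, j, mask), []).append(j)
    hits = []
    for i in range(len(a) - span + 1):
        for j in index.get(spaced_key(a, i, mask), []):
            hits.append((i, j))
    return hits


# Two toy sequences that differ only at index 3 (T versus A).
a = "ACGTTGC"
b = "ACGATGC"

contiguous = "11111"    # weight 5, span 5
spaced = "1110111"      # weight 6, span 7, one don't-care position

print(seed_hits(a, b, contiguous))  # no window of 5 avoids the mismatch
print(seed_hits(a, b, spaced))      # the don't-care position absorbs it
```

In this toy case the spaced seed finds the similarity even though it has a higher weight than the contiguous seed, which hints at why seed shape, and not only weight, matters when trading sensitivity against specificity.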

CONCLUSION

Our study is valuable in two respects. First, it can be applied to oligo design programs to improve their performance by suggesting experiment-specific seeds. Second, it is useful for improving the performance of mapping assembly in Next-Generation Sequencing. Our proposed measures were originally designed for oligo design, but we expect that the study will be helpful for other genomic tasks.
