• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HELIOS:光学中的高速序列比对。

HELIOS: High-speed sequence alignment in optics.

机构信息

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.

出版信息

PLoS Comput Biol. 2022 Nov 21;18(11):e1010665. doi: 10.1371/journal.pcbi.1010665. eCollection 2022 Nov.

DOI:10.1371/journal.pcbi.1010665
PMID:36409684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9678324/
Abstract

In response to the imperfections of current sequence alignment methods, originated from the inherent serialism within their corresponding electrical systems, a few optical approaches for biological data comparison have been proposed recently. However, due to their low performance, raised from their inefficient coding scheme, this paper presents a novel all-optical high-throughput method for aligning DNA, RNA, and protein sequences, named HELIOS. The HELIOS method employs highly sophisticated operations to locate character matches, single or multiple mutations, and single or multiple indels within various biological sequences. On the other hand, the HELIOS optical architecture exploits high-speed processing and operational parallelism in optics, by adopting wavelength and polarization of optical beams. For evaluation, the functionality and accuracy of the HELIOS method are approved through behavioral and optical simulation studies, while its complexity and performance are estimated through analytical computation. The accuracy evaluations indicate that the HELIOS method achieves a precise pairwise alignment of two sequences, highly similar to those of Smith-Waterman, Needleman-Wunsch, BLAST, MUSCLE, ClustalW, ClustalΩ, T-Coffee, Kalign, and MAFFT. According to our performance evaluations, the HELIOS optical architecture outperforms all alternative electrical and optical algorithms in terms of processing time and memory requirement, relying on its highly sophisticated method and optical architecture. Moreover, the employed compact coding scheme highly escalates the number of input characters, and hence, it offers reduced time and space complexities, compared to the electrical and optical alternatives. It makes the HELIOS method and optical architecture highly applicable for biomedical applications.

摘要

针对当前序列比对方法的不完善之处,源于其相应电子系统中的固有序列化,最近提出了几种用于生物数据比较的光学方法。然而,由于其低效的编码方案导致性能低下,本文提出了一种新颖的全光学高通量方法,用于对齐 DNA、RNA 和蛋白质序列,名为 HELIOS。HELIOS 方法采用高度复杂的操作来定位各种生物序列中的字符匹配、单个或多个突变以及单个或多个插入缺失。另一方面,HELIOS 光学架构利用光学中的高速处理和操作并行性,通过采用光束的波长和偏振。为了进行评估,通过行为和光学模拟研究验证了 HELIOS 方法的功能和准确性,同时通过分析计算估计了其复杂性和性能。准确性评估表明,HELIOS 方法实现了两个序列的精确两两比对,高度类似于 Smith-Waterman、Needleman-Wunsch、BLAST、MUSCLE、ClustalW、ClustalΩ、T-Coffee、Kalign 和 MAFFT 的比对。根据我们的性能评估,HELIOS 光学架构在处理时间和内存需求方面优于所有替代的电气和光学算法,这依赖于其高度复杂的方法和光学架构。此外,所采用的紧凑编码方案极大地增加了输入字符的数量,因此与电气和光学替代方案相比,它提供了减少的时间和空间复杂度。这使得 HELIOS 方法和光学架构非常适用于生物医学应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/9aeeceaab279/pcbi.1010665.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/1da1650bb9b9/pcbi.1010665.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/414756ac8d82/pcbi.1010665.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/fd9dc4d69e1c/pcbi.1010665.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/f2bd9823569f/pcbi.1010665.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/89732069492f/pcbi.1010665.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/55fe637e3815/pcbi.1010665.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/60d5f3c85f3f/pcbi.1010665.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/9aeeceaab279/pcbi.1010665.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/1da1650bb9b9/pcbi.1010665.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/414756ac8d82/pcbi.1010665.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/fd9dc4d69e1c/pcbi.1010665.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/f2bd9823569f/pcbi.1010665.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/89732069492f/pcbi.1010665.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/55fe637e3815/pcbi.1010665.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/60d5f3c85f3f/pcbi.1010665.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f083/9678324/9aeeceaab279/pcbi.1010665.g008.jpg

相似文献

1
HELIOS: High-speed sequence alignment in optics.HELIOS:光学中的高速序列比对。
PLoS Comput Biol. 2022 Nov 21;18(11):e1010665. doi: 10.1371/journal.pcbi.1010665. eCollection 2022 Nov.
2
OptCAM: An ultra-fast all-optical architecture for DNA variant discovery.OptCAM:用于 DNA 变体发现的超快速全光架构。
J Biophotonics. 2020 Jan;13(1):e201900227. doi: 10.1002/jbio.201900227. Epub 2019 Aug 29.
3
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters.基于至强融核集群的大规模生物序列比对并行算法
BMC Bioinformatics. 2016 Jul 19;17 Suppl 9(Suppl 9):267. doi: 10.1186/s12859-016-1128-0.
4
Optimization and Performance Analysis of CAT Method for DNA Sequence Similarity Searching and Alignment.CAT 方法在 DNA 序列相似性搜索和比对中的优化与性能分析。
Genes (Basel). 2024 Mar 7;15(3):341. doi: 10.3390/genes15030341.
5
Protein alignment algorithms with an efficient backtracking routine on multiple GPUs.在多个 GPU 上使用高效回溯例程的蛋白质比对算法。
BMC Bioinformatics. 2011 May 20;12:181. doi: 10.1186/1471-2105-12-181.
6
Grammar-based distance in progressive multiple sequence alignment.渐进多序列比对中基于语法的距离
BMC Bioinformatics. 2008 Jul 10;9:306. doi: 10.1186/1471-2105-9-306.
7
MUSCLE: multiple sequence alignment with high accuracy and high throughput.MUSCLE:具有高精度和高吞吐量的多序列比对。
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004.
8
Pairwise alignment for very long nucleic acid sequences.非常长的核酸序列的两两比对。
Biochem Biophys Res Commun. 2018 Jul 20;502(3):313-317. doi: 10.1016/j.bbrc.2018.05.134. Epub 2018 May 29.
9
Improvement in the accuracy of multiple sequence alignment program MAFFT.多重序列比对程序MAFFT准确性的提高。
Genome Inform. 2005;16(1):22-33.
10
[Estimation of the quality of global alignment of amino acid sequences based on evolution criterion].基于进化标准的氨基酸序列全局比对质量评估
Biofizika. 2013 Mar-Apr;58(2):197-202.

引用本文的文献

1
PC-mer: An Ultra-fast memory-efficient tool for metagenomics profiling and classification.PC-mer:一种用于宏基因组学分析和分类的超快速、内存高效的工具。
PLoS One. 2024 Aug 1;19(8):e0307279. doi: 10.1371/journal.pone.0307279. eCollection 2024.

本文引用的文献

1
The Influence of Optical Alignment Error on Compression Coding Superresolution Imaging.光学对准误差对压缩编码超分辨率成像的影响
Sensors (Basel). 2022 Apr 1;22(7):2717. doi: 10.3390/s22072717.
2
Optical pattern generator for efficient bio-data encoding in a photonic sequence comparison architecture.用于在光子序列比较架构中高效生物数据编码的光学模式发生器。
PLoS One. 2021 Jan 15;16(1):e0245095. doi: 10.1371/journal.pone.0245095. eCollection 2021.
3
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease-2019 (COVID-19): The epidemic and the challenges.
严重急性呼吸综合征冠状病毒 2(SARS-CoV-2)和 2019 年冠状病毒病(COVID-19):疫情和挑战。
Int J Antimicrob Agents. 2020 Mar;55(3):105924. doi: 10.1016/j.ijantimicag.2020.105924. Epub 2020 Feb 17.
4
Measuring Similarity among Protein Sequences Using a New Descriptor.使用新描述符衡量蛋白质序列之间的相似性。
Biomed Res Int. 2019 Nov 22;2019:2796971. doi: 10.1155/2019/2796971. eCollection 2019.
5
Kalign 3: multiple sequence alignment of large data sets.Kalign 3:大型数据集的多序列比对
Bioinformatics. 2019 Oct 26;36(6):1928-9. doi: 10.1093/bioinformatics/btz795.
6
OptCAM: An ultra-fast all-optical architecture for DNA variant discovery.OptCAM:用于 DNA 变体发现的超快速全光架构。
J Biophotonics. 2020 Jan;13(1):e201900227. doi: 10.1002/jbio.201900227. Epub 2019 Aug 29.
7
Topology-independent and global protein structure alignment through an FFT-based algorithm.基于 FFT 的算法实现拓扑无关和全局的蛋白质结构比对。
Bioinformatics. 2020 Jan 15;36(2):478-486. doi: 10.1093/bioinformatics/btz609.
8
Magic-BLAST, an accurate RNA-seq aligner for long and short reads.Magic-BLAST,一款适用于长读长和短读长的精确 RNA-seq 比对工具。
BMC Bioinformatics. 2019 Jul 25;20(1):405. doi: 10.1186/s12859-019-2996-x.
9
MAFFT-DASH: integrated protein sequence and structural alignment.MAFFT-DASH:集成蛋白质序列和结构比对。
Nucleic Acids Res. 2019 Jul 2;47(W1):W5-W10. doi: 10.1093/nar/gkz342.
10
Physical Origin of Early Failure for Contaminated Optics.受污染光学元件早期失效的物理根源
Sci Rep. 2019 Jan 24;9(1):635. doi: 10.1038/s41598-018-37337-5.