• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length between genome pairs.一种用于枚举基因组对之间所有最短长度最长公共子串的平均情况高效两阶段算法。
Proc (IEEE Int Conf Healthc Inform). 2024 Jun;2024:93-102. doi: 10.1109/ichi61247.2024.00020. Epub 2024 Aug 22.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
5
Sexual Harassment and Prevention Training性骚扰与预防培训
6
Short-Term Memory Impairment短期记忆障碍
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
9
Impact of residual disease as a prognostic factor for survival in women with advanced epithelial ovarian cancer after primary surgery.原发性手术后晚期上皮性卵巢癌患者残留病灶对生存预后的影响。
Cochrane Database Syst Rev. 2022 Sep 26;9(9):CD015048. doi: 10.1002/14651858.CD015048.pub2.
10
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.

本文引用的文献

1
Fast and exact quantification of motif occurrences in biological sequences.快速准确地定量生物序列中的基序出现次数。
BMC Bioinformatics. 2021 Sep 18;22(1):445. doi: 10.1186/s12859-021-04355-6.
2
MUMmer4: A fast and versatile genome alignment system.MUMmer4:一种快速且通用的基因组比对系统。
PLoS Comput Biol. 2018 Jan 26;14(1):e1005944. doi: 10.1371/journal.pcbi.1005944. eCollection 2018 Jan.
3
Occurrence probability of structured motifs in random sequences.
J Comput Biol. 2002;9(6):761-73. doi: 10.1089/10665270260518254.

一种用于枚举基因组对之间所有最短长度最长公共子串的平均情况高效两阶段算法。

An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length between genome pairs.

作者信息

Prosperi Mattia, Marini Simone, Boucher Christina

机构信息

dept. of Epidemiology, University of Florida, Gainesville, FL (USA).

dept. of CISE, University of Florida, Gainesville, FL (USA).

出版信息

Proc (IEEE Int Conf Healthc Inform). 2024 Jun;2024:93-102. doi: 10.1109/ichi61247.2024.00020. Epub 2024 Aug 22.

DOI:10.1109/ichi61247.2024.00020
PMID:39308639
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11412151/
Abstract

A problem extension of the longest common substring (LCS) between two texts is the enumeration of all LCSs given a minimum length (ALCS- ), along with their positions in each text. In bioinformatics, an efficient solution to the ALCS- for very long texts -genomes or metagenomes- can provide useful insights to discover genetic signatures responsible for biological mechanisms. The ALCS- problem has two additional requirements compared to the LCS problem: one is the minimum length , and the other is that all common strings longer than must be reported. We present an efficient, two-stage ALCS- algorithm exploiting the spectrum of text substrings of length ( -mers). Our approach yields a worst-case time complexity loglinear in the number of -mers for the first stage, and an average-case loglinear in the number of common -mers for the second stage (several orders of magnitudes smaller than the total -mer spectrum). The space complexity is linear in the first phase (disk-based), and on average linear in the second phase (disk- and memory-based). Tests performed on genomes for different organisms (including viruses, bacteria and animal chromosomes) show that run times are consistent with our theoretical estimates; further, comparisons with MUMmer4 show an asymptotic advantage with divergent genomes.

摘要

两个文本之间最长公共子串(LCS)的一个问题扩展是,在给定最小长度(ALCS - )的情况下,枚举所有LCS及其在每个文本中的位置。在生物信息学中,针对非常长的文本(基因组或宏基因组)的ALCS - 的有效解决方案,可以为发现负责生物机制的遗传特征提供有用的见解。与LCS问题相比,ALCS - 问题有两个额外的要求:一个是最小长度 ,另一个是必须报告所有长度大于 的公共字符串。我们提出了一种高效的两阶段ALCS - 算法,该算法利用长度为 ( - 聚体)的文本子串频谱。我们的方法在第一阶段产生的最坏情况时间复杂度与 - 聚体数量成对数线性关系,在第二阶段与公共 - 聚体数量成平均情况对数线性关系(比总 - 聚体频谱小几个数量级)。空间复杂度在第一阶段是线性的(基于磁盘),在第二阶段平均是线性的(基于磁盘和内存)。对不同生物体(包括病毒、细菌和动物染色体)的基因组进行的测试表明,运行时间与我们的理论估计一致;此外,与MUMmer4的比较表明,在分歧基因组方面具有渐近优势。