• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对于具有高序列变异的靶点,最佳探针长度会有所不同:对用于重测序高变基因的探针文库设计的启示。

Optimal probe length varies for targets with high sequence variation: implications for probe library design for resequencing highly variable genes.

作者信息

Haslam Niall J, Whiteford Nava E, Weber Gerald, Prügel-Bennett Adam, Essex Jonathan W, Neylon Cameron

机构信息

School of Chemistry, University of Southampton, Southhampton, United Kingdom.

出版信息

PLoS One. 2008 Jun 18;3(6):e2500. doi: 10.1371/journal.pone.0002500.

DOI:10.1371/journal.pone.0002500
PMID:18563203
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2430613/
Abstract

BACKGROUND

Sequencing by hybridisation is an effective method for obtaining large amounts of DNA sequence information at low cost. The efficiency of SBH depends on the design of the probe library to provide the maximum information for minimum cost. Long probes provide a higher probability of non-repeated sequences but lead to an increase in the number of probes required whereas short probes may not provide unique sequence information due to repeated sequences. We have investigated the effect of probe length, use of reference sequences, and thermal filtering on the design of probe libraries for several highly variable target DNA sequences.

RESULTS

We designed overlapping probe libraries for a range of highly variable drug target genes based on known sequence information and develop a formal terminology to describe probe library design. We find that for some targets these libraries can provide good coverage of a previously unseen target whereas for others the coverage is less than 30%. The optimal probe length varies from as short at 12 nt to as large as 19 nt and depends on the sequence, its variability, and the stringency of thermal filtering. It cannot be determined from inspection of an example gene sequence.

CONCLUSIONS

Optimal probe length and the optimal number of reference sequences used to design a probe library are highly target specific for highly variable sequencing targets. The optimum design cannot be determined simply by inspection of input sequences or of alignments but only by detailed analysis of the each specific target. For highly variable sequences, shorter probes can in some cases provide better information than longer probes. Probe library design would benefit from a general purpose tool for analysing these issues. The formal terminology developed here and the analysis approaches it is used to describe will contribute to the development of such tools.

摘要

背景

杂交测序是一种以低成本获取大量DNA序列信息的有效方法。杂交测序(SBH)的效率取决于探针文库的设计,以便用最低成本提供最大信息。长探针提供非重复序列的概率更高,但会导致所需探针数量增加,而短探针由于存在重复序列可能无法提供唯一的序列信息。我们研究了探针长度、参考序列的使用以及热过滤对几个高度可变的目标DNA序列的探针文库设计的影响。

结果

我们基于已知序列信息为一系列高度可变的药物靶基因设计了重叠探针文库,并开发了一套正式术语来描述探针文库设计。我们发现,对于某些靶标,这些文库可以很好地覆盖以前未见过的靶标,而对于其他靶标,覆盖率则低于30%。最佳探针长度从短至12个核苷酸到长达19个核苷酸不等,这取决于序列、其变异性以及热过滤的严格程度。无法通过检查示例基因序列来确定。

结论

用于设计探针文库的最佳探针长度和最佳参考序列数量对于高度可变的测序靶标具有高度的靶标特异性。最佳设计不能简单地通过检查输入序列或比对来确定,而只能通过对每个特定靶标的详细分析来确定。对于高度可变的序列,在某些情况下,较短的探针可以比较长的探针提供更好的信息。探针文库设计将受益于一种用于分析这些问题的通用工具。这里开发的正式术语及其用于描述的分析方法将有助于此类工具的开发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/8688db1bda6a/pone.0002500.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/045a2468b879/pone.0002500.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/04bf9ba1895c/pone.0002500.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/3f6df6114dda/pone.0002500.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/da9d748a55b8/pone.0002500.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/8688db1bda6a/pone.0002500.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/045a2468b879/pone.0002500.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/04bf9ba1895c/pone.0002500.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/3f6df6114dda/pone.0002500.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/da9d748a55b8/pone.0002500.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9116/2430613/8688db1bda6a/pone.0002500.g005.jpg

相似文献

1
Optimal probe length varies for targets with high sequence variation: implications for probe library design for resequencing highly variable genes.对于具有高序列变异的靶点,最佳探针长度会有所不同:对用于重测序高变基因的探针文库设计的启示。
PLoS One. 2008 Jun 18;3(6):e2500. doi: 10.1371/journal.pone.0002500.
2
A Universal Probe Set for Targeted Sequencing of 353 Nuclear Genes from Any Flowering Plant Designed Using k-Medoids Clustering.基于 k-中值聚类设计的用于靶向测序任何开花植物中 353 个核基因的通用探针集。
Syst Biol. 2019 Jul 1;68(4):594-606. doi: 10.1093/sysbio/syy086.
3
Optimal reconstruction of a sequence from its probes.根据其探测值对序列进行最优重建。
J Comput Biol. 1999 Fall-Winter;6(3-4):361-8. doi: 10.1089/106652799318328.
4
Quantitative assessment of LASSO probe assembly and long-read multiplexed cloning.LASSO 探针组装和长读长片段多重克隆的定量评估。
BMC Biotechnol. 2019 Jul 24;19(1):50. doi: 10.1186/s12896-019-0547-1.
5
Selective and flexible depletion of problematic sequences from RNA-seq libraries at the cDNA stage.在cDNA阶段从RNA测序文库中选择性且灵活地去除有问题的序列。
BMC Genomics. 2014 May 26;15(1):401. doi: 10.1186/1471-2164-15-401.
6
Rational design of HIV-1 fluorescent hydrolysis probes considering phylogenetic variation and probe performance.考虑到系统发育变异和探针性能,对 HIV-1 荧光水解探针进行合理设计。
J Virol Methods. 2010 May;165(2):151-60. doi: 10.1016/j.jviromet.2010.01.012. Epub 2010 Jan 29.
7
Single nucleotide polymorphism genotyping by two colour melting curve analysis using the MGB Eclipse Probe System in challenging sequence environment.在具有挑战性的序列环境中,使用MGB Eclipse探针系统通过双色熔解曲线分析进行单核苷酸多态性基因分型。
Hum Genomics. 2004 Mar;1(3):209-17. doi: 10.1186/1479-7364-1-3-209.
8
Assembly of Long-Adapter Single-Strand Oligonucleotide (LASSO) Probes for Massively Parallel Capture of Kilobase Size DNA Targets.长接头单链寡核苷酸 (LASSO) 探针的组装,用于大规模平行捕获千碱基大小的 DNA 靶标。
Curr Protoc. 2021 Nov;1(11):e278. doi: 10.1002/cpz1.278.
9
Detection of short repeated genomic sequences on metaphase chromosomes using padlock probes and target primed rolling circle DNA synthesis.使用锁式探针和靶标引发滚环DNA合成技术检测中期染色体上的短重复基因组序列。
BMC Mol Biol. 2007 Nov 13;8:103. doi: 10.1186/1471-2199-8-103.
10
Rapid and highly-specific generation of targeted DNA sequencing libraries enabled by linking capture probes with universal primers.通过将捕获探针与通用引物连接,实现了靶向 DNA 测序文库的快速和高度特异性生成。
PLoS One. 2018 Dec 5;13(12):e0208283. doi: 10.1371/journal.pone.0208283. eCollection 2018.

引用本文的文献

1
CRISPR-based point-of-care diagnostics incorporating Cas9, Cas12, and Cas13 enzymes advanced for SARS-CoV-2 detection.基于 CRISPR 的即时诊断,整合 Cas9、Cas12 和 Cas13 酶,用于 SARS-CoV-2 检测的进展。
J Biochem Mol Toxicol. 2022 Aug;36(8):e23113. doi: 10.1002/jbt.23113. Epub 2022 Jun 1.

本文引用的文献

1
The resequencing imperative.重测序的必要性。
Nat Genet. 2007 Apr;39(4):439-40. doi: 10.1038/ng0407-439.
2
Identifying influenza viruses with resequencing microarrays.利用重测序微阵列鉴定流感病毒。
Emerg Infect Dis. 2006 Apr;12(4):638-46. doi: 10.3201/eid1204.051441.
3
Design of microarray probes for virus identification and detection of emerging viruses at the genus level.用于病毒鉴定及在属水平检测新兴病毒的微阵列探针设计。
BMC Bioinformatics. 2006 Apr 28;7:232. doi: 10.1186/1471-2105-7-232.
4
Evolutionary dynamics of HIV-1 and the control of AIDS.人类免疫缺陷病毒1型的进化动力学与艾滋病的控制
Curr Top Microbiol Immunol. 2006;299:171-92. doi: 10.1007/3-540-26397-7_6.
5
Gene sequencing. The race for the $1000 genome.基因测序。千元基因组竞赛。
Science. 2006 Mar 17;311(5767):1544-6. doi: 10.1126/science.311.5767.1544.
6
An analysis of the feasibility of short read sequencing.短读长测序的可行性分析
Nucleic Acids Res. 2005 Nov 7;33(19):e171. doi: 10.1093/nar/gni170.
7
Emerging drug targets for antiretroviral therapy.抗逆转录病毒疗法的新兴药物靶点
Drugs. 2005;65(13):1747-66. doi: 10.2165/00003495-200565130-00002.
8
Thermodynamic properties of DNA sequences: characteristic values for the human genome.DNA序列的热力学性质:人类基因组的特征值。
Bioinformatics. 2005 Aug 15;21(16):3333-9. doi: 10.1093/bioinformatics/bti530. Epub 2005 Jun 9.
9
Advances in sequencing technology.测序技术的进展。
Mutat Res. 2005 Jun 3;573(1-2):13-40. doi: 10.1016/j.mrfmmm.2005.01.004.
10
Applications of DNA tiling arrays for whole-genome analysis.DNA 平铺阵列在全基因组分析中的应用。
Genomics. 2005 Jan;85(1):1-15. doi: 10.1016/j.ygeno.2004.10.005.