• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多因素降维分析确定了促进遗传多态性的特定核苷酸模式。

Multifactor dimensionality reduction analysis identifies specific nucleotide patterns promoting genetic polymorphisms.

机构信息

Department of Pharmacology and Toxicology, Dartmouth Medical School, Hanover, NH, USA.

Computational Genetics Laboratory, Department of Genetics, Norris-Cotton Cancer Center, Dartmouth Medical School, Lebanon, NH, USA.

出版信息

BioData Min. 2009 Mar 30;2(1):2. doi: 10.1186/1756-0381-2-2.

DOI:10.1186/1756-0381-2-2
PMID:19331672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2669078/
Abstract

BACKGROUND

The fidelity of DNA replication serves as the nidus for both genetic evolution and genomic instability fostering disease. Single nucleotide polymorphisms (SNPs) constitute greater than 80% of the genetic variation between individuals. A new theory regarding DNA replication fidelity has emerged in which selectivity is governed by base-pair geometry through interactions between the selected nucleotide, the complementary strand, and the polymerase active site. We hypothesize that specific nucleotide combinations in the flanking regions of SNP fragments are associated with mutation.

RESULTS

We modeled the relationship between DNA sequence and observed polymorphisms using the novel multifactor dimensionality reduction (MDR) approach. MDR was originally developed to detect synergistic interactions between multiple SNPs that are predictive of disease susceptibility. We initially assembled data from the Broad Institute as a pilot test for the hypothesis that flanking region patterns associate with mutagenesis (n = 2194). We then confirmed and expanded our inquiry with human SNPs within coding regions and their flanking sequences collected from the National Center for Biotechnology Information (NCBI) database (n = 29967) and a control set of sequences (coding region) not associated with SNP sites randomly selected from the NCBI database (n = 29967). We discovered seven flanking region pattern associations in the Broad dataset which reached a minimum significance level of p </= 0.05. Significant models (p << 0.001) were detected for each SNP type examined in the larger NCBI dataset. Importantly, the flanking region models were elongated or truncated depending on the nucleotide change. Additionally, nucleotide distributions differed significantly at motif sites relative to the type of variation observed. The MDR approach effectively discerned specific sites within the flanking regions of observed SNPs and their respective identities, supporting the collective contribution of these sites to SNP genesis.

CONCLUSION

The present study represents the first use of this computational methodology for modeling nonlinear patterns in molecular genetics. MDR was able to identify distinct nucleotide patterning around sites of mutations dependent upon the observed nucleotide change. We discovered one flanking region set that included five nucleotides clustered around a specific type of SNP site. Based on the strongly associated patterns identified in this study, it may become possible to scan genomic databases for such clustering of nucleotides in order to predict likely sites of future SNPs, and even the type of polymorphism most likely to occur.

摘要

背景

DNA 复制的保真度是遗传进化和促进疾病发生的基因组不稳定性的核心。单核苷酸多态性(SNP)构成了个体之间超过 80%的遗传变异。一种新的 DNA 复制保真度理论已经出现,该理论认为选择性受碱基对几何形状的控制,通过所选核苷酸、互补链和聚合酶活性位点之间的相互作用。我们假设 SNP 片段侧翼区域的特定核苷酸组合与突变有关。

结果

我们使用新颖的多因素维度缩减(MDR)方法对 DNA 序列与观察到的多态性之间的关系进行建模。MDR 最初是为了检测与疾病易感性相关的多个 SNP 之间的协同作用而开发的。我们首先组装了 Broad 研究所的数据作为假设的试点测试,即侧翼区域模式与诱变相关(n=2194)。然后,我们使用来自 National Center for Biotechnology Information (NCBI) 数据库的人类 SNP 及其侧翼序列(n=29967)和从 NCBI 数据库随机选择的与 SNP 位点无关的序列(n=29967)扩展并确认了我们的研究。我们在 Broad 数据集发现了七个侧翼区域模式关联,达到了 p </= 0.05 的最小显著性水平。在更大的 NCBI 数据集检查的每种 SNP 类型中都检测到了显著模型(p << 0.001)。重要的是,侧翼区域模型根据核苷酸变化而延长或缩短。此外,与观察到的变异类型相比, motif 位点的核苷酸分布有显著差异。MDR 方法能够有效地辨别观察到的 SNP 及其各自身份的侧翼区域内的特定位点,支持这些位点对 SNP 生成的集体贡献。

结论

本研究代表了该计算方法在分子遗传学中用于建模非线性模式的首次应用。MDR 能够识别依赖于观察到的核苷酸变化的突变部位周围的独特核苷酸模式。我们发现了一组侧翼区域,其中包含五个核苷酸,聚集在特定类型的 SNP 位点周围。基于本研究中发现的强烈关联模式,可能有可能扫描基因组数据库以寻找此类核苷酸聚类,从而预测未来 SNP 的可能位点,甚至可能发生的多态性类型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/d0a2f747fab0/1756-0381-2-2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/91bb717d4e93/1756-0381-2-2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/1b487c340cf8/1756-0381-2-2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/fa51206240e1/1756-0381-2-2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/1417f81d1688/1756-0381-2-2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/d0a2f747fab0/1756-0381-2-2-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/91bb717d4e93/1756-0381-2-2-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/1b487c340cf8/1756-0381-2-2-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/fa51206240e1/1756-0381-2-2-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/1417f81d1688/1756-0381-2-2-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6afa/2669078/d0a2f747fab0/1756-0381-2-2-5.jpg

相似文献

1
Multifactor dimensionality reduction analysis identifies specific nucleotide patterns promoting genetic polymorphisms.多因素降维分析确定了促进遗传多态性的特定核苷酸模式。
BioData Min. 2009 Mar 30;2(1):2. doi: 10.1186/1756-0381-2-2.
2
Novel polymorphisms in UTR and coding region of inducible heat shock protein 70.1 gene in tropically adapted Indian zebu cattle (Bos indicus) and riverine buffalo (Bubalus bubalis).热带适应性印度瘤牛(Bos indicus)和沼泽型水牛(Bubalus bubalis)中诱导型热休克蛋白 70.1 基因 UTR 和编码区的新型多态性。
Gene. 2013 Sep 25;527(2):606-15. doi: 10.1016/j.gene.2013.05.078. Epub 2013 Jun 19.
3
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
4
Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.印度不同繁殖力山羊品种中 BMP4 基因的多态性。
Gene. 2013 Dec 10;532(1):140-5. doi: 10.1016/j.gene.2013.08.086. Epub 2013 Sep 5.
5
Evaluation of potential novel variations and their interactions related to bipolar disorders: analysis of genome-wide association study data.双相情感障碍相关潜在新变异及其相互作用的评估:全基因组关联研究数据分析
Neuropsychiatr Dis Treat. 2016 Nov 24;12:2997-3004. doi: 10.2147/NDT.S112558. eCollection 2016.
6
Enabling personal genomics with an explicit test of epistasis.通过明确的上位性检验实现个人基因组学。
Pac Symp Biocomput. 2010:327-36. doi: 10.1142/9789814295291_0035.
7
Breast cancer-associated high-order SNP-SNP interaction of CXCL12/CXCR4-related genes by an improved multifactor dimensionality reduction (MDR-ER).通过改进的多因素降维法(MDR-ER)分析CXCL12/CXCR4相关基因与乳腺癌相关的高阶单核苷酸多态性-单核苷酸多态性相互作用
Oncol Rep. 2016 Sep;36(3):1739-47. doi: 10.3892/or.2016.4956. Epub 2016 Jul 21.
8
Genetic Influences in Traumatic Brain Injury创伤性脑损伤中的遗传影响
9
SNP@Promoter: a database of human SNPs (single nucleotide polymorphisms) within the putative promoter regions.SNP@启动子:一个关于假定启动子区域内人类单核苷酸多态性(SNP)的数据库。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2105-9-S1-S2.
10
Seven novel single nucleotide polymorphisms identified within river buffalo (Bubalus bubalis) lactoferrin gene.在河水牛(水牛属)乳铁蛋白基因中鉴定出七个新的单核苷酸多态性。
Trop Anim Health Prod. 2010 Jun;42(5):1021-6. doi: 10.1007/s11250-009-9516-3. Epub 2009 Dec 10.

本文引用的文献

1
Integrating biogeographic and genetic approaches to investigate the history of bioluminescent colour alleles in the Jamaican click beetle, Pyrophorus plagiophthalamus.整合生物地理学和遗传学方法以探究牙买加叩甲(Pyrophorus plagiophthalamus)生物发光颜色等位基因的历史。
Mol Ecol. 2006 Apr;15(5):1393-404. doi: 10.1111/j.1365-294X.2005.02793.x.
2
A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility.一种灵活的计算框架,用于在人类疾病易感性的遗传研究中检测、表征和解释上位性的统计模式。
J Theor Biol. 2006 Jul 21;241(2):252-61. doi: 10.1016/j.jtbi.2005.11.036. Epub 2006 Feb 2.
3
A novel method to identify gene-gene effects in nuclear families: the MDR-PDT.
一种在核心家庭中识别基因-基因效应的新方法:多因素降维-惩罚判别分析。
Genet Epidemiol. 2006 Feb;30(2):111-23. doi: 10.1002/gepi.20128.
4
Sequence context analysis in the mouse genome: single nucleotide polymorphisms and CpG island sequences.小鼠基因组中的序列上下文分析:单核苷酸多态性和CpG岛序列。
Genomics. 2006 Jan;87(1):68-74. doi: 10.1016/j.ygeno.2005.09.012. Epub 2005 Nov 28.
5
To slip or skip, visualizing frameshift mutation dynamics for error-prone DNA polymerases.为了滑动或跳跃,可视化易错DNA聚合酶的移码突变动态。
J Biol Chem. 2004 Oct 29;279(44):45360-8. doi: 10.1074/jbc.M408600200. Epub 2004 Aug 31.
6
Critical role of magnesium ions in DNA polymerase beta's closing and active site assembly.镁离子在DNA聚合酶β的构象闭合及活性位点组装中的关键作用
J Am Chem Soc. 2004 Jul 14;126(27):8441-53. doi: 10.1021/ja049412o.
7
Ideal discrimination of discrete clinical endpoints using multilocus genotypes.利用多位点基因型对离散临床终点进行理想的判别。
In Silico Biol. 2004;4(2):183-94.
8
Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid.核酸的分子结构;脱氧核糖核酸的一种结构。
Nature. 1953 Apr 25;171(4356):737-8. doi: 10.1038/171737a0.
9
Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions.用于检测基因-基因和基因-环境相互作用的多因素降维软件。
Bioinformatics. 2003 Feb 12;19(3):376-82. doi: 10.1093/bioinformatics/btf869.
10
Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity.在存在基因分型错误、数据缺失、表型模拟和遗传异质性的情况下,多因素降维法检测基因-基因相互作用的效能。
Genet Epidemiol. 2003 Feb;24(2):150-7. doi: 10.1002/gepi.10218.