• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.在存在错误的 DNA 序列的情况下估计群体遗传参数和比较模型拟合优度。
Genome Res. 2010 Jan;20(1):101-9. doi: 10.1101/gr.097543.109. Epub 2009 Dec 1.
2
Inferring population mutation rate and sequencing error rate using the SNP frequency spectrum in a sample of DNA sequences.利用DNA序列样本中的单核苷酸多态性(SNP)频率谱推断群体突变率和测序错误率。
Mol Biol Evol. 2009 Jul;26(7):1479-90. doi: 10.1093/molbev/msp059. Epub 2009 Mar 24.
3
A novel adaptive method for the analysis of next-generation sequencing data to detect complex trait associations with rare variants due to gene main effects and interactions.一种用于分析下一代测序数据的新自适应方法,用于检测由于基因主效应和相互作用而导致的复杂性状关联的罕见变异体。
PLoS Genet. 2010 Oct 14;6(10):e1001156. doi: 10.1371/journal.pgen.1001156.
4
Usefulness of single nucleotide polymorphism data for estimating population parameters.单核苷酸多态性数据在估计群体参数方面的实用性。
Genetics. 2000 Sep;156(1):439-47. doi: 10.1093/genetics/156.1.439.
5
A composite-likelihood approach for detecting directional selection from DNA sequence data.一种用于从DNA序列数据中检测定向选择的复合似然方法。
Genetics. 2005 Jul;170(3):1411-21. doi: 10.1534/genetics.104.035097. Epub 2005 May 6.
6
Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND).选择对非洲、美洲印第安人和欧洲血统有信息价值的单核苷酸多态性:应用于肾病与糖尿病家族调查(FIND)。
BMC Genomics. 2016 May 4;17:325. doi: 10.1186/s12864-016-2654-x.
7
Accurate and fast methods to estimate the population mutation rate from error prone sequences.从易错序列中准确快速地估计群体突变率的方法。
BMC Bioinformatics. 2009 Aug 11;10:247. doi: 10.1186/1471-2105-10-247.
8
Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL.基于人群的血管生成素样蛋白4重测序揭示了降低甘油三酯并升高高密度脂蛋白的变异。
Nat Genet. 2007 Apr;39(4):513-6. doi: 10.1038/ng1984. Epub 2007 Feb 25.
9
Composite likelihood estimation of demographic parameters.人口统计学参数的复合似然估计
BMC Genet. 2009 Nov 12;10:72. doi: 10.1186/1471-2156-10-72.
10
Estimation of individual admixture: analytical and study design considerations.个体混合比例的估计:分析与研究设计考量
Genet Epidemiol. 2005 May;28(4):289-301. doi: 10.1002/gepi.20064.

引用本文的文献

1
: An R package for the analysis of very low frequency variants in DNA sequences.用于分析DNA序列中极低频率变异的R软件包。
Biodivers Data J. 2023 Jan 26;11:e96480. doi: 10.3897/BDJ.11.e96480. eCollection 2023.
2
Bridgehead effect and multiple introductions shape the global invasion history of a termite.桥头堡效应和多次引入塑造了一种白蚁的全球入侵历史。
Commun Biol. 2021 Feb 12;4(1):196. doi: 10.1038/s42003-021-01725-x.
3
Properties of Samples With Segregating Polymerase Chain Reaction (PCR) Dropout Mutations Within a Species.物种内具有聚合酶链反应(PCR)缺失突变分离的样本特性。
Evol Bioinform Online. 2019 Nov 5;15:1176934319883612. doi: 10.1177/1176934319883612. eCollection 2019.
4
Efficiency of ddRAD target enriched sequencing across spiny rock lobster species (Palinuridae: Jasus).ddRAD 靶向富集测序在刺龙虾物种(短尾派蝦科:真龙虾属)中的效率。
Sci Rep. 2017 Jul 28;7(1):6781. doi: 10.1038/s41598-017-06582-5.
5
Population genetic studies in the genomic sequencing era.基因组测序时代的群体遗传学研究。
Dongwuxue Yanjiu. 2015 Jul 18;36(4):223-32. doi: 10.13918/j.issn.2095-8137.2015.4.223.
6
Exploring population size changes using SNP frequency spectra.利用单核苷酸多态性频率谱探索种群大小变化。
Nat Genet. 2015 May;47(5):555-9. doi: 10.1038/ng.3254. Epub 2015 Apr 6.
7
Characterizing bias in population genetic inferences from low-coverage sequencing data.从低覆盖测序数据中推断群体遗传时的偏差特征分析。
Mol Biol Evol. 2014 Mar;31(3):723-35. doi: 10.1093/molbev/mst229. Epub 2013 Nov 27.
8
Calculation of Tajima's D and other neutrality test statistics from low depth next-generation sequencing data.从低深度下一代测序数据中计算 Tajima's D 和其他中性检验统计量。
BMC Bioinformatics. 2013 Oct 2;14:289. doi: 10.1186/1471-2105-14-289.
9
Theoretical framework of population genetics with somatic mutations taken into account: application to copy number variations in humans.考虑体细胞突变的群体遗传学理论框架:在人类中拷贝数变异的应用。
Heredity (Edinb). 2013 Nov;111(5):364-74. doi: 10.1038/hdy.2013.59. Epub 2013 Aug 28.
10
jPopGen Suite: population genetic analysis of DNA polymorphism from nucleotide sequences with errors.jPopGen套件:对含错误的核苷酸序列中的DNA多态性进行群体遗传学分析。
Methods Ecol Evol. 2012 Aug 1;3(4):624-627. doi: 10.1111/j.2041-210X.2012.00194.x. Epub 2012 Mar 2.

本文引用的文献

1
Accurate and fast methods to estimate the population mutation rate from error prone sequences.从易错序列中准确快速地估计群体突变率的方法。
BMC Bioinformatics. 2009 Aug 11;10:247. doi: 10.1186/1471-2105-10-247.
2
Inferring population mutation rate and sequencing error rate using the SNP frequency spectrum in a sample of DNA sequences.利用DNA序列样本中的单核苷酸多态性(SNP)频率谱推断群体突变率和测序错误率。
Mol Biol Evol. 2009 Jul;26(7):1479-90. doi: 10.1093/molbev/msp059. Epub 2009 Mar 24.
3
Estimation of allele frequencies from high-coverage genome-sequencing projects.从高覆盖度基因组测序项目中估计等位基因频率。
Genetics. 2009 May;182(1):295-301. doi: 10.1534/genetics.109.100479. Epub 2009 Mar 16.
4
Rare loss-of-function mutations in ANGPTL family members contribute to plasma triglyceride levels in humans.血管生成素样蛋白(ANGPTL)家族成员中罕见的功能丧失突变会影响人类血浆甘油三酯水平。
J Clin Invest. 2009 Jan;119(1):70-9. doi: 10.1172/JCI37118. Epub 2008 Dec 15.
5
Accommodating the effect of ancient DNA damage on inferences of demographic histories.适应古代DNA损伤对人口历史推断的影响。
Mol Biol Evol. 2009 Feb;26(2):245-8. doi: 10.1093/molbev/msn256. Epub 2008 Nov 11.
6
Population genetic inference from resequencing data.基于重测序数据的群体遗传推断。
Genetics. 2009 Jan;181(1):187-97. doi: 10.1534/genetics.107.080630. Epub 2008 Nov 3.
7
Next-generation DNA sequencing.下一代DNA测序
Nat Biotechnol. 2008 Oct;26(10):1135-45. doi: 10.1038/nbt1486.
8
Estimation of nucleotide diversity, disequilibrium coefficients, and mutation rates from high-coverage genome-sequencing projects.从高覆盖度基因组测序项目中估算核苷酸多样性、不平衡系数和突变率。
Mol Biol Evol. 2008 Nov;25(11):2409-19. doi: 10.1093/molbev/msn185. Epub 2008 Aug 25.
9
Testing for neutrality in samples with sequencing errors.检测存在测序错误的样本中的中性
Genetics. 2008 Jul;179(3):1409-24. doi: 10.1534/genetics.107.082198. Epub 2008 Jun 18.
10
Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals.来自多个个体的基因组序列鸟枪法组装的群体遗传分析。
Genome Res. 2008 Jul;18(7):1020-9. doi: 10.1101/gr.074187.107. Epub 2008 Apr 14.

在存在错误的 DNA 序列的情况下估计群体遗传参数和比较模型拟合优度。

Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.

机构信息

Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, Texas 77030, USA.

出版信息

Genome Res. 2010 Jan;20(1):101-9. doi: 10.1101/gr.097543.109. Epub 2009 Dec 1.

DOI:10.1101/gr.097543.109
PMID:19952140
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2798822/
Abstract

It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate theta = 4N(e)micro, population exponential growth rate R, and error rate epsilon, simultaneously. Using simulation, we show the combined effects of the parameters, theta, n, epsilon, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of theta with other theta estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals.

摘要

据了解,测序错误会影响进化或群体遗传参数的估计。在深度重测序研究中,由于样本量 n 较大,每个核苷酸位点出错的概率更高,因此这个问题更为突出。我们提出了一种新的方法,基于观察到的 SNP 构型的复合似然,同时推断群体突变率 theta = 4N(e)micro、群体指数增长率 R 和错误率 epsilon。通过模拟,我们展示了参数 theta、n、epsilon 和 R 的综合效应,以及它们对参数估计准确性的影响。我们比较了我们的最大复合似然估计器(MCLE)与其他考虑错误的 theta 估计器。结果表明,当样本量较大或错误率较高时,MCLE 表现良好。通过参数 bootstrap,复合似然也可以用作检验观测 DNA 序列模型拟合优度的统计量。MCLE 方法应用于 1832 名非裔美国人和 1045 名欧洲裔美国人的 ANGPTL4 基因序列数据。