• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Numericware i:状态相同矩阵计算器。

Numericware i: Identical by State Matrix Calculator.

作者信息

Kim Bongsong, Beavis William D

机构信息

Department of Agronomy, Iowa State University, Ames, IA, USA.

出版信息

Evol Bioinform Online. 2017 Mar 10;13:1176934316688663. doi: 10.1177/1176934316688663. eCollection 2017.

DOI:10.1177/1176934316688663
PMID:28469375
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5395260/
Abstract

We introduce software, Numericware i, to compute identical by state (IBS) matrix based on genotypic data. Calculating an IBS matrix with a large dataset requires large computer memory and takes lengthy processing time. Numericware i addresses these challenges with 2 algorithmic methods: multithreading and forward chopping. The multithreading allows computational routines to concurrently run on multiple central processing unit (CPU) processors. The forward chopping addresses memory limitation by dividing a dataset into appropriately sized subsets. Numericware i allows calculation of the IBS matrix for a large genotypic dataset using a laptop or a desktop computer. For comparison with different software, we calculated genetic relationship matrices using Numericware i, SPAGeDi, and TASSEL with the same genotypic dataset. Numericware i calculates IBS coefficients between 0 and 2, whereas SPAGeDi and TASSEL produce different ranges of values including negative values. The Pearson correlation coefficient between the matrices from Numericware i and TASSEL was high at .9972, whereas SPAGeDi showed low correlation with Numericware i (.0505) and TASSEL (.0587). With a high-dimensional dataset of 500 entities by 10 000 000 SNPs, Numericware i spent 382 minutes using 19 CPU threads and 64 GB memory by dividing the dataset into 3 pieces, whereas SPAGeDi and TASSEL failed with the same dataset. Numericware i is freely available for Windows and Linux under CC-BY 4.0 license at https://figshare.com/s/f100f33a8857131eb2db.

摘要

我们引入了Numericware i软件,用于根据基因型数据计算状态相同(IBS)矩阵。使用大型数据集计算IBS矩阵需要大量计算机内存,且处理时间较长。Numericware i通过两种算法方法应对这些挑战:多线程和前向分割。多线程允许计算例程在多个中央处理器(CPU)上并发运行。前向分割通过将数据集划分为大小合适的子集来解决内存限制问题。Numericware i允许使用笔记本电脑或台式计算机为大型基因型数据集计算IBS矩阵。为了与不同软件进行比较,我们使用相同的基因型数据集,通过Numericware i、SPAGeDi和TASSEL计算了遗传关系矩阵。Numericware i计算的IBS系数在0到2之间,而SPAGeDi和TASSEL产生的取值范围不同,包括负值。Numericware i和TASSEL生成的矩阵之间的Pearson相关系数很高,为0.9972,而SPAGeDi与Numericware i的相关性较低(0.0505),与TASSEL的相关性也较低(0.0587)。对于一个包含500个实体和1000万个单核苷酸多态性(SNP)的高维数据集,Numericware i通过将数据集分成3块,使用19个CPU线程和64GB内存花费了382分钟,而SPAGeDi和TASSEL在相同数据集上运行失败。Numericware i根据CC-BY 4.0许可在https://figshare.com/s/f100f33a8857131eb2db上可免费用于Windows和Linux系统。

相似文献

1
Numericware i: Identical by State Matrix Calculator.Numericware i:状态相同矩阵计算器。
Evol Bioinform Online. 2017 Mar 10;13:1176934316688663. doi: 10.1177/1176934316688663. eCollection 2017.
2
Numericware N: Numerator Relationship Matrix Calculator.Numericware N:分子关系矩阵计算器。
J Hered. 2016;107(7):686-690. doi: 10.1093/jhered/esw057. Epub 2016 Oct 11.
3
ICGRM: integrative construction of genomic relationship matrix combining multiple genomic regions for big dataset.ICGRM:整合多个基因组区域构建基因组关系矩阵的综合方法,用于大数据集。
BMC Bioinformatics. 2019 Dec 26;20(1):731. doi: 10.1186/s12859-019-3319-y.
4
High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算:迈向自适应放射治疗的新范式。
Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.
5
Fully 3D list-mode time-of-flight PET image reconstruction on GPUs using CUDA.基于 CUDA 的 GPU 上完全 3D 列表模式飞行时间 PET 图像重建。
Med Phys. 2011 Dec;38(12):6775-86. doi: 10.1118/1.3661998.
6
NMF-mGPU: non-negative matrix factorization on multi-GPU systems.NMF-mGPU:多GPU系统上的非负矩阵分解
BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.
7
XuvTools: free, fast and reliable stitching of large 3D datasets.XuvTools:免费、快速且可靠地拼接大型3D数据集。
J Microsc. 2009 Jan;233(1):42-60. doi: 10.1111/j.1365-2818.2008.03094.x.
8
Accuracy of genomic selection for a sib-evaluated trait using identity-by-state and identity-by-descent relationships.利用状态一致性和系谱一致性关系对同胞评估性状进行基因组选择的准确性。
Genet Sel Evol. 2015 Feb 25;47(1):9. doi: 10.1186/s12711-014-0084-2.
9
Message passing interface and multithreading hybrid for parallel molecular docking of large databases on petascale high performance computing machines.消息传递接口和多线程混合技术在大规模高性能计算机构架下并行对接大型分子数据库。
J Comput Chem. 2013 Apr 30;34(11):915-27. doi: 10.1002/jcc.23214. Epub 2013 Jan 23.
10
GradientOptimizer: an open-source graphical environment for calculating optimized gradients in reversed-phase liquid chromatography.梯度优化器:一种用于计算反相液相色谱中优化梯度的开源图形环境。
Proteomics. 2014 Jun;14(12):1464-6. doi: 10.1002/pmic.201400036. Epub 2014 May 15.

引用本文的文献

1
Molecular markers and molecular basis of plant type related traits in maize.玉米株型相关性状的分子标记及分子基础
Front Genet. 2024 Nov 1;15:1487700. doi: 10.3389/fgene.2024.1487700. eCollection 2024.
2
Linkage disequilibrium and population structure in a core collection of Brassica napus (L.).甘蓝型油菜核心种质的连锁不平衡与群体结构。
PLoS One. 2022 Mar 1;17(3):e0250310. doi: 10.1371/journal.pone.0250310. eCollection 2022.
3
Genetic diversity analysis of a flax (Linum usitatissimum L.) global collection.亚麻(Linum usitatissimum L.)全球种质资源的遗传多样性分析
BMC Genomics. 2020 Aug 14;21(1):557. doi: 10.1186/s12864-020-06922-2.
4
Classifying accessions into and using logistic regression model with phenotypic data.利用表型数据将种质分类并使用逻辑回归模型。
PeerJ. 2019 Nov 7;7:e7259. doi: 10.7717/peerj.7259. eCollection 2019.

本文引用的文献

1
Numericware N: Numerator Relationship Matrix Calculator.Numericware N:分子关系矩阵计算器。
J Hered. 2016;107(7):686-690. doi: 10.1093/jhered/esw057. Epub 2016 Oct 11.
2
Shrinkage estimation of the genomic relationship matrix can improve genomic estimated breeding values in the training set.基因组关系矩阵的收缩估计可以提高训练集中的基因组估计育种值。
Theor Appl Genet. 2015 Apr;128(4):693-703. doi: 10.1007/s00122-015-2464-6. Epub 2015 Mar 4.
3
Genome-wide association study for grain yield and related traits in an elite spring wheat population grown in temperate irrigated environments.在温带灌溉环境下种植的优质春小麦群体中进行的全基因组关联研究,用于研究粒重和相关性状。
Theor Appl Genet. 2015 Feb;128(2):353-63. doi: 10.1007/s00122-014-2435-3. Epub 2014 Dec 10.
4
Association mapping for important agronomic traits in core collection of rice (Oryza sativa L.) with SSR markers.利用SSR标记对水稻(Oryza sativa L.)核心种质重要农艺性状进行关联分析。
PLoS One. 2014 Oct 31;9(10):e111508. doi: 10.1371/journal.pone.0111508. eCollection 2014.
5
Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis.芒草全基因组关联研究及与物候、生物量和细胞壁组成相关的17个性状预测
New Phytol. 2014 Mar;201(4):1227-1239. doi: 10.1111/nph.12621. Epub 2013 Dec 6.
6
Genome-wide association study identifies multiple susceptibility loci for pulmonary fibrosis.全基因组关联研究鉴定出多个肺纤维化易感性位点。
Nat Genet. 2013 Jun;45(6):613-20. doi: 10.1038/ng.2609. Epub 2013 Apr 14.
7
Shrinkage estimation of the realized relationship matrix.实现关系矩阵的收缩估计。
G3 (Bethesda). 2012 Nov;2(11):1405-13. doi: 10.1534/g3.112.004259. Epub 2012 Nov 1.
8
GAPIT: genome association and prediction integrated tool.GAPIT:基因组关联和预测综合工具。
Bioinformatics. 2012 Sep 15;28(18):2397-9. doi: 10.1093/bioinformatics/bts444. Epub 2012 Jul 13.
9
Common SNPs explain a large proportion of the heritability for human height.常见的单核苷酸多态性解释了人类身高遗传的很大一部分。
Nat Genet. 2010 Jul;42(7):565-9. doi: 10.1038/ng.608. Epub 2010 Jun 20.
10
A relationship matrix including full pedigree and genomic information.一个包含完整谱系和基因组信息的关系矩阵。
J Dairy Sci. 2009 Sep;92(9):4656-63. doi: 10.3168/jds.2009-2061.