用于涉及家族和部落关系的STR-DNA相似度计算的高斯模糊数

Gaussian Fuzzy Number for STR-DNA Similarity Calculation Involving Familial and Tribal Relationships.

作者信息

Anggreainy Maria Susan, Widyanto M Rahmat, Widjaja Belawati H, Soedarsono Nurtami

机构信息

Faculty of Computer Science, Universitas Indonesia, Depok Campus, West Java 16424, Indonesia.

Faculty of Dentistry, Universitas Indonesia, Salemba Campus, Jakarta 10430, Indonesia.

出版信息

Adv Bioinformatics. 2018 Jul 29;2018:8602513. doi: 10.1155/2018/8602513. eCollection 2018.

DOI:10.1155/2018/8602513

PMID:30151007

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6087604/

Abstract

We performed locus similarity calculation by measuring fuzzy intersection between individual locus and reference locus and then performed CODIS STR-DNA similarity calculation. The fuzzy intersection calculation enables a more robust CODIS STR-DNA similarity calculation due to imprecision caused by noise produced by PCR machine. We also proposed shifted convoluted Gaussian fuzzy number (SCGFN) and Gaussian fuzzy number (GFN) to represent each locus value as improvement of triangular fuzzy number (TFN) as used in previous research. Compared to triangular fuzzy number (TFN), GFN is more realistic to represent uncertainty of locus information because the distribution is assumed to be Gaussian. Then, the original Gaussian fuzzy number (GFN) is convoluted with distribution of certain ethnic locus information to produce the new SCGFN which more represents ethnic information compared to original GFN. Experiments were done for the following cases: people with family relationships, people of the same tribe, and certain tribal populations. The statistical test with analysis of variance (ANOVA) shows the difference in similarity between SCGFN, GFN, and TFN with a significant level of 95%. The Tukey method in ANOVA shows that SCGFN yields a higher similarity which means being better than the GFN and TFN methods. The proposed method enables CODIS STR-DNA similarity calculation which is more robust to noise and performed better CODIS similarity calculation involving familial and tribal relationships.

摘要

我们通过测量个体基因座与参考基因座之间的模糊交集来进行基因座相似度计算，然后进行联合DNA索引系统（CODIS）短串联重复序列（STR）-DNA相似度计算。由于聚合酶链式反应（PCR）机器产生的噪声导致的不精确性，模糊交集计算能够实现更稳健的CODIS STR-DNA相似度计算。我们还提出了移位卷积高斯模糊数（SCGFN）和高斯模糊数（GFN），以表示每个基因座值，作为对先前研究中使用的三角模糊数（TFN）的改进。与三角模糊数（TFN）相比，GFN在表示基因座信息的不确定性方面更现实，因为其分布假定为高斯分布。然后，将原始高斯模糊数（GFN）与特定种族基因座信息的分布进行卷积，以产生新的SCGFN，与原始GFN相比，SCGFN更能代表种族信息。针对以下情况进行了实验：有亲属关系的人、同一部落的人以及特定部落群体。采用方差分析（ANOVA）的统计检验显示，SCGFN、GFN和TFN之间的相似度存在差异，显著性水平为95%。方差分析中的Tukey方法表明，SCGFN产生的相似度更高，这意味着它比GFN和TFN方法更好。所提出的方法能够实现对噪声更稳健的CODIS STR-DNA相似度计算，并且在涉及家族和部落关系的CODIS相似度计算中表现更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86b5/6087604/ab03f0a8aefe/ABI2018-8602513.001.jpg

相似文献

Gaussian Fuzzy Number for STR-DNA Similarity Calculation Involving Familial and Tribal Relationships.用于涉及家族和部落关系的STR-DNA相似度计算的高斯模糊数

Adv Bioinformatics. 2018 Jul 29;2018:8602513. doi: 10.1155/2018/8602513. eCollection 2018.

Fuzzy peer groups for reducing mixed gaussian-impulse noise from color images.用于减少彩色图像中混合高斯脉冲噪声的模糊对等组

IEEE Trans Image Process. 2009 Jul;18(7):1452-66. doi: 10.1109/TIP.2009.2019305. Epub 2009 May 12.

ScientificWorldJournal. 2014 Mar 20;2014:215047. doi: 10.1155/2014/215047. eCollection 2014.

An Adaptive Feature Selection Algorithm for Fuzzy Clustering Image Segmentation Based on Embedded Neighbourhood Information Constraints.一种基于嵌入邻域信息约束的模糊聚类图像分割自适应特征选择算法

Sensors (Basel). 2020 Jul 3;20(13):3722. doi: 10.3390/s20133722.

Characterization of medical time series using fuzzy similarity-based fractal dimensions.基于模糊相似性的分形维数对医学时间序列的特征描述

Artif Intell Med. 2003 Feb;27(2):201-22. doi: 10.1016/s0933-3657(02)00114-8.

A Local Neighborhood Robust Fuzzy Clustering Image Segmentation Algorithm Based on an Adaptive Feature Selection Gaussian Mixture Model.一种基于自适应特征选择高斯混合模型的局部邻域鲁棒模糊聚类图像分割算法。

Sensors (Basel). 2020 Apr 22;20(8):2391. doi: 10.3390/s20082391.

Interval-Valued Model Level Fuzzy Aggregation-Based Background Subtraction.基于区间值模型级模糊聚合的背景减除。

IEEE Trans Cybern. 2017 Sep;47(9):2544-2555. doi: 10.1109/TCYB.2016.2585600. Epub 2016 Jul 29.

Comprehensive Eutrophication Assessment Based on Fuzzy Matter Element Model and Monte Carlo-Triangular Fuzzy Numbers Approach.基于模糊物元模型和蒙特卡罗-三角模糊数方法的综合富营养化评价。

Int J Environ Res Public Health. 2019 May 19;16(10):1769. doi: 10.3390/ijerph16101769.

Automated analysis of sequence polymorphism in STR alleles by PCR and direct electrospray ionization mass spectrometry.通过 PCR 和直接电喷雾电离质谱法对 STR 等位基因中的序列多态性进行自动化分析。

Forensic Sci Int Genet. 2012 Sep;6(5):594-606. doi: 10.1016/j.fsigen.2012.02.002. Epub 2012 Mar 8.

An evaluation method of risk grades for prostate cancer using similarity measure of cubic hesitant fuzzy sets.基于立方犹豫模糊集相似度的前列腺癌风险等级评估方法。

J Biomed Inform. 2018 Nov;87:131-137. doi: 10.1016/j.jbi.2018.10.003. Epub 2018 Oct 16.

本文引用的文献

Population genomics of bacterial host adaptation.细菌宿主适应性的群体基因组学。

Nat Rev Genet. 2018 Sep;19(9):549-565. doi: 10.1038/s41576-018-0032-z.

Allele frequency data for 15 autosomal STR loci in eight Indonesian subpopulations.印度尼西亚八个亚群体中15个常染色体STR基因座的等位基因频率数据。

Forensic Sci Int Genet. 2016 Jan;20:45-52. doi: 10.1016/j.fsigen.2015.09.014. Epub 2015 Oct 3.

Genetic variation of 15 autosomal short tandem repeat (STR) loci in the Palestinian population of Gaza Strip.加沙地带巴勒斯坦人群中15个常染色体短串联重复序列（STR）位点的基因变异。

Leg Med (Tokyo). 2009 Jul;11(4):203-4. doi: 10.1016/j.legalmed.2009.02.072. Epub 2009 Apr 11.

Inferring ethnic origin by means of an STR profile.

Forensic Sci Int. 2001 Jun 1;119(1):17-22. doi: 10.1016/s0379-0738(00)00387-x.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于涉及家族和部落关系的STR-DNA相似度计算的高斯模糊数

Gaussian Fuzzy Number for STR-DNA Similarity Calculation Involving Familial and Tribal Relationships.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献