• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数据依赖性对说话人识别评估的影响。

The Impact of Data Dependence on Speaker Recognition Evaluation.

作者信息

Wu Jin Chu, Martin Alvin F, Greenberg Craig S, Kacker Raghu N

机构信息

National Institute of Standards and Technology, Gaithersburg, MD 20899 USA.

出版信息

IEEE/ACM Trans Audio Speech Lang Process. 2017 Jan;25(1):5-18. doi: 10.1109/TASLP.2016.2614725. Epub 2016 Sep 30.

DOI:10.1109/TASLP.2016.2614725
PMID:28660231
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5484007/
Abstract

The data dependency due to multiple use of the same subjects has impact on the standard error (SE) of the detection cost function (DCF) in speaker recognition evaluation. The DCF is defined as a weighted sum of the probabilities of type I and type II errors at a given threshold. A two-layer data structure is constructed: target scores are grouped into target sets based on the dependency, and likewise for non-target scores. On account of the needed equal probabilities for scores being selected when resampling, target sets must contain the same number of target scores, and so must non-target sets. In addition to the bootstrap method with i.i.d. assumption, the nonparametric two-sample one-layer and two-layer bootstrap methods are carried out based on whether the resampling takes place only on sets, or subsequently on scores within the sets. Due to the stochastic nature of the bootstrap, the distributions of the SEs of the DCF estimated using the three different bootstrap methods are created and compared. After performing hypothesis testing, it is found that data dependency increases not only the SE but also the variation of the SE, and the two-layer bootstrap is more conservative than the one-layer bootstrap. The rationale regarding the different impacts of the three bootstrap methods on the estimated SEs is investigated.

摘要

在说话人识别评估中,由于同一主体的多次使用所导致的数据依赖性会对检测成本函数(DCF)的标准误差(SE)产生影响。DCF被定义为在给定阈值下第一类错误和第二类错误概率的加权和。构建了一种两层数据结构:基于依赖性将目标分数分组为目标集,非目标分数也同样处理。由于重采样时选择分数所需的概率相等,目标集必须包含相同数量的目标分数,非目标集也必须如此。除了具有独立同分布假设的自助法之外,基于重采样是仅在集合上进行还是随后在集合内的分数上进行,还实施了非参数双样本单层和两层自助法。由于自助法的随机性,创建并比较了使用三种不同自助法估计的DCF的SE分布。进行假设检验后发现,数据依赖性不仅会增加SE,还会增加SE的变化,并且两层自助法比单层自助法更保守。研究了三种自助法对估计SE的不同影响的原理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/4dcbf8ccdbab/nihms859729f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/ce09dcda12e6/nihms859729f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/ffb59ee00482/nihms859729f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/95dd98be14e1/nihms859729f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/4dcbf8ccdbab/nihms859729f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/ce09dcda12e6/nihms859729f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/ffb59ee00482/nihms859729f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/95dd98be14e1/nihms859729f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6330/5484007/4dcbf8ccdbab/nihms859729f4.jpg

相似文献

1
The Impact of Data Dependence on Speaker Recognition Evaluation.数据依赖性对说话人识别评估的影响。
IEEE/ACM Trans Audio Speech Lang Process. 2017 Jan;25(1):5-18. doi: 10.1109/TASLP.2016.2614725. Epub 2016 Sep 30.
2
Monte Carlo studies of bootstrap variability in ROC analysis with data dependency.基于数据相关性的ROC分析中自举法变异性的蒙特卡罗研究。
Commun Stat Simul Comput. 2018;48(2). doi: 10.1080/03610918.2018.1521974.
3
Analysis of small sample size studies using nonparametric bootstrap test with pooled resampling method.使用合并重采样方法的非参数自助检验对小样本量研究进行分析。
Stat Med. 2017 Jun 30;36(14):2187-2205. doi: 10.1002/sim.7263. Epub 2017 Mar 9.
4
Validation of Nonparametric Two-Sample Bootstrap in ROC Analysis on Large Datasets.大型数据集ROC分析中基于非参数双样本自助法的验证
Commun Stat Simul Comput. 2016;45(5):1689-1703. doi: 10.1080/03610918.2015.1065327. Epub 2015 Aug 31.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Classifier performance prediction for computer-aided diagnosis using a limited dataset.使用有限数据集对计算机辅助诊断的分类器性能进行预测。
Med Phys. 2008 Apr;35(4):1559-70. doi: 10.1118/1.2868757.
7
Two bootstrapping routines for obtaining imprecision estimates for nonparametric parameter distributions in nonlinear mixed effects models.两种自举法在非线性混合效应模型中获取非参数参数分布不精确估计的应用。
J Pharmacokinet Pharmacodyn. 2011 Feb;38(1):63-82. doi: 10.1007/s10928-010-9177-x. Epub 2010 Nov 13.
8
Comparison of bootstrap approaches for estimation of uncertainties of DTI parameters.用于估计扩散张量成像(DTI)参数不确定性的自助法比较
Neuroimage. 2006 Nov 1;33(2):531-41. doi: 10.1016/j.neuroimage.2006.07.001. Epub 2006 Aug 28.
9
Standard errors in covariance structure models: asymptotics versus bootstrap.协方差结构模型中的标准误差:渐近法与自助法
Br J Math Stat Psychol. 2006 Nov;59(Pt 2):397-417. doi: 10.1348/000711005X85896.
10
Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability.系统发育可靠性的贝叶斯和最大似然自展法测度的比较
Mol Biol Evol. 2003 Feb;20(2):248-54. doi: 10.1093/molbev/msg042.

引用本文的文献

1
Monte Carlo studies of bootstrap variability in ROC analysis with data dependency.基于数据相关性的ROC分析中自举法变异性的蒙特卡罗研究。
Commun Stat Simul Comput. 2018;48(2). doi: 10.1080/03610918.2018.1521974.
2
A novel measure and significance testing in data analysis of cell image segmentation.细胞图像分割数据分析中的一种新测量方法及显著性检验
BMC Bioinformatics. 2017 Mar 14;18(1):168. doi: 10.1186/s12859-017-1527-x.

本文引用的文献

1
Validation of Nonparametric Two-Sample Bootstrap in ROC Analysis on Large Datasets.大型数据集ROC分析中基于非参数双样本自助法的验证
Commun Stat Simul Comput. 2016;45(5):1689-1703. doi: 10.1080/03610918.2015.1065327. Epub 2015 Aug 31.
2
Measures, Uncertainties, and Significance Test in Operational ROC Analysis.操作特征曲线(ROC)分析中的测量、不确定性与显著性检验
J Res Natl Inst Stand Technol. 2011 Feb 1;116(1):517-37. doi: 10.6028/jres.116.003. Print 2011 Jan-Feb.
3
Performance generalization in biometric authentication using joint user-specific and sample bootstraps.
使用联合用户特定和样本自举法的生物特征认证中的性能泛化
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):492-8. doi: 10.1109/TPAMI.2007.55.
4
Comparison of three methods for estimating the standard error of the area under the curve in ROC analysis of quantitative data.定量数据ROC分析中三种估计曲线下面积标准误方法的比较。
Acad Radiol. 2002 Nov;9(11):1278-85. doi: 10.1016/s1076-6332(03)80561-5.
5
Comparison of quantitative diagnostic tests: type I error, power, and sample size.定量诊断试验的比较:I型错误、检验效能和样本量。
Stat Med. 1987 Mar;6(2):147-58. doi: 10.1002/sim.4780060207.