Likelihood ratio estimation for authorship text evidence: An empirical comparison of score- and feature-based methods.

Affiliation

Speech and Language Laboratory, the Australian National University, Canberra, Australia; Linguistics Program, School of Culture, History and Language, College of Asia and the Pacific, the Australian National University, Building #110, Canberra, ACT 2600, Australia.

Publication information

Forensic Sci Int. 2022 May;334:111268. doi: 10.1016/j.forsciint.2022.111268. Epub 2022 Mar 10.

DOI: 10.1016/j.forsciint.2022.111268
PMID: 35334288
Abstract

This study compares score- and feature-based methods for estimating forensic likelihood ratios for text evidence. Three feature-based methods built on different Poisson-based models with logistic regression fusion are introduced and evaluated: a one-level Poisson model, a one-level zero-inflated Poisson model and a two-level Poisson-gamma model. These are compared with a score-based method that employs the cosine distance as a score-generating function. The two types of methods are compared using the same data (i.e., documents attributable to 2,157 authors) and the same feature set, which is a bag-of-words model using the 400 most frequently occurring words. Their performances are evaluated via the log-likelihood ratio cost (C_llr) and its components: discrimination cost (C_llr^min) and calibration cost (C_llr^cal). The results show that (1) the feature-based methods outperform the score-based method by a C_llr value of 0.14-0.2 when their best results are compared and (2) a feature selection procedure can further improve performance for the feature-based methods. Some distinctive performance characteristics associated with likelihood ratios produced using the feature-based methods are described, and their implications are discussed with real forensic casework in mind.
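
For readers unfamiliar with the evaluation metric, the log-likelihood ratio cost can be sketched as follows. This is a minimal illustration of the standard C_llr definition and is not taken from the paper; the function name `cllr` and the sample likelihood ratios are assumptions made for the example. Lower values indicate better performance, and a system that always outputs LR = 1 (no information) scores exactly 1.

```python
import math


def cllr(same_author_lrs, diff_author_lrs):
    """Log-likelihood ratio cost (standard definition; illustrative sketch).

    same_author_lrs: likelihood ratios from same-author comparisons
    diff_author_lrs: likelihood ratios from different-author comparisons
    """
    # Same-author comparisons are penalised when the LR is small.
    same_cost = sum(math.log2(1 + 1 / lr) for lr in same_author_lrs)
    same_cost /= len(same_author_lrs)
    # Different-author comparisons are penalised when the LR is large.
    diff_cost = sum(math.log2(1 + lr) for lr in diff_author_lrs)
    diff_cost /= len(diff_author_lrs)
    return 0.5 * (same_cost + diff_cost)


# Hypothetical likelihood ratios, for illustration only.
uninformative = cllr([1.0, 1.0], [1.0, 1.0])
informative = cllr([100.0, 50.0], [0.01, 0.02])
```

Under this definition, the reported difference of 0.14-0.2 between the two types of methods is a difference on this cost scale, so the method with the smaller C_llr is the better one.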


Similar articles

1
Likelihood ratio estimation for authorship text evidence: An empirical comparison of score- and feature-based methods.
Forensic Sci Int. 2022 May;334:111268. doi: 10.1016/j.forsciint.2022.111268. Epub 2022 Mar 10.
2
Score-based likelihood ratios for linguistic text evidence with a bag-of-words model.
Forensic Sci Int. 2021 Oct;327:110980. doi: 10.1016/j.forsciint.2021.110980. Epub 2021 Aug 25.
3
Strength of linguistic text evidence: A fused forensic text comparison system.
Forensic Sci Int. 2017 Sep;278:184-197. doi: 10.1016/j.forsciint.2017.06.040. Epub 2017 Jul 8.
4
Weight of authorship evidence with multiple categories of stylometric features: A multinomial-based discrete model.
Sci Justice. 2023 Mar;63(2):181-199. doi: 10.1016/j.scijus.2022.12.007. Epub 2023 Jan 3.
5
An overview of log likelihood ratio cost in forensic science - Where is it used and what values can we expect?
Forensic Sci Int Synerg. 2024 Apr 17;8:100466. doi: 10.1016/j.fsisyn.2024.100466. eCollection 2024.
6
Calibration of score based likelihood ratio estimation in automated forensic facial image comparison.
Forensic Sci Int. 2022 May;334:111239. doi: 10.1016/j.forsciint.2022.111239. Epub 2022 Mar 7.
7
Improved likelihood ratios for face recognition in surveillance video by multimodal feature pairing.
Forensic Sci Int Synerg. 2024 Feb 29;8:100458. doi: 10.1016/j.fsisyn.2024.100458. eCollection 2024.
8
Fusing linguistic and acoustic information for automated forensic speaker comparison.
Sci Justice. 2024 Sep;64(5):485-497. doi: 10.1016/j.scijus.2024.07.001. Epub 2024 Jul 9.
9
Zero-inflated Poisson model based likelihood ratio test for drug safety signal detection.
Stat Methods Med Res. 2017 Feb;26(1):471-488. doi: 10.1177/0962280214549590. Epub 2016 Jul 11.
10
Likelihood-ratio forensic voice comparison using parametric representations of the formant trajectories of diphthongs.
J Acoust Soc Am. 2009 Apr;125(4):2387-97. doi: 10.1121/1.3081384.

Cited by

1
An overview of log likelihood ratio cost in forensic science - Where is it used and what values can we expect?
Forensic Sci Int Synerg. 2024 Apr 17;8:100466. doi: 10.1016/j.fsisyn.2024.100466. eCollection 2024.
2
Interpol questioned documents review 2019-2022.
Forensic Sci Int Synerg. 2023 Feb 24;6:100300. doi: 10.1016/j.fsisyn.2022.100300. eCollection 2023.