• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用“混合粘贴”命令检测伪造音频文件:一种基于韩语音素特征的深度学习方法

Detecting Forged Audio Files Using "Mixed Paste" Command: A Deep Learning Approach Based on Korean Phonemic Features.

作者信息

Son Yeongmin, Park Jae Wan

机构信息

Department of Digital Media, Soongsil University, Seoul 07027, Republic of Korea.

Global School of Media, Soongsil University, Seoul 07027, Republic of Korea.

出版信息

Sensors (Basel). 2024 Mar 14;24(6):1872. doi: 10.3390/s24061872.

DOI:10.3390/s24061872
PMID:38544136
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10975275/
Abstract

The ubiquity of smartphones today enables the widespread utilization of voice recording for diverse purposes. Consequently, the submission of voice recordings as digital evidence in legal proceedings has notably increased, alongside a rise in allegations of recording file forgery. This trend highlights the growing significance of audio file authentication. This study aims to develop a deep learning methodology capable of identifying forged files, particularly those altered using "Mixed Paste" commands, a technique not previously addressed. The proposed deep learning framework is a composite model, integrating a convolutional neural network and a long short-term memory model. It is designed based on the extraction of features from spectrograms and sequences of Korean consonant types. The training of this model utilizes an authentic dataset of forged audio recordings created on an iPhone, modified via "Mixed Paste", and encoded. This hybrid model demonstrates a high accuracy rate of 97.5%. To validate the model's efficacy, tests were conducted using various manipulated audio files. The findings reveal that the model's effectiveness is not contingent on the smartphone model or the audio editing software employed. We anticipate that this research will advance the field of audio forensics through a novel hybrid model approach.

摘要

如今智能手机的普及使得语音记录被广泛用于各种目的。因此,在法律程序中作为数字证据提交语音记录的情况显著增加,同时录音文件伪造指控也有所上升。这一趋势凸显了音频文件认证日益重要。本研究旨在开发一种深度学习方法,能够识别伪造文件,特别是那些使用“混合粘贴”命令更改的文件,这是一种此前未涉及的技术。所提出的深度学习框架是一个复合模型,集成了卷积神经网络和长短期记忆模型。它基于从声谱图和韩语音素类型序列中提取特征进行设计。该模型的训练使用了在iPhone上创建的伪造音频记录的真实数据集,通过“混合粘贴”进行修改并编码。这种混合模型展示了97.5%的高准确率。为验证模型的有效性,使用各种经过处理的音频文件进行了测试。结果表明,该模型的有效性并不取决于智能手机型号或所使用的音频编辑软件。我们预计这项研究将通过一种新颖的混合模型方法推动音频取证领域的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/1a5510beb209/sensors-24-01872-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/408d5b54ef58/sensors-24-01872-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/3910f7018c36/sensors-24-01872-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/777036b52f2a/sensors-24-01872-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/3e229cb646e9/sensors-24-01872-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/154353b64f9b/sensors-24-01872-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/cfc37b0968a5/sensors-24-01872-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/ac3235c5bb1a/sensors-24-01872-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/1a5510beb209/sensors-24-01872-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/408d5b54ef58/sensors-24-01872-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/3910f7018c36/sensors-24-01872-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/777036b52f2a/sensors-24-01872-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/3e229cb646e9/sensors-24-01872-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/154353b64f9b/sensors-24-01872-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/cfc37b0968a5/sensors-24-01872-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/ac3235c5bb1a/sensors-24-01872-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0af/10975275/1a5510beb209/sensors-24-01872-g008.jpg

相似文献

1
Detecting Forged Audio Files Using "Mixed Paste" Command: A Deep Learning Approach Based on Korean Phonemic Features.使用“混合粘贴”命令检测伪造音频文件:一种基于韩语音素特征的深度学习方法
Sensors (Basel). 2024 Mar 14;24(6):1872. doi: 10.3390/s24061872.
2
Advanced forensic procedure for the authentication of audio recordings generated by Voice Memos application of iOS14.用于验证 iOS14 版语音备忘录应用程序生成的录音的高级法医程序
J Forensic Sci. 2022 Jul;67(4):1534-1549. doi: 10.1111/1556-4029.15016. Epub 2022 Mar 1.
3
A method of forensic authentication of audio recordings generated using the Voice Memos application in the iPhone.一种使用 iPhone 中的语音备忘录应用程序生成的音频录音的法医认证方法。
Forensic Sci Int. 2021 Mar;320:110702. doi: 10.1016/j.forsciint.2021.110702. Epub 2021 Jan 23.
4
Forensic authentication method for audio recordings generated by Voice Recorder application on Samsung Galaxy Watch4 series.用于三星 Galaxy Watch4 系列上的 Voice Recorder 应用程序生成的录音的法医认证方法。
J Forensic Sci. 2023 Jan;68(1):139-153. doi: 10.1111/1556-4029.15158. Epub 2022 Oct 22.
5
Prediction of Sleep Stages Via Deep Learning Using Smartphone Audio Recordings in Home Environments: Model Development and Validation.基于智能手机音频记录的深度学习在家庭环境中预测睡眠阶段:模型开发与验证。
J Med Internet Res. 2023 Jun 1;25:e46216. doi: 10.2196/46216.
6
Authenticity examination of compressed audio recordings using detection of multiple compression and encoders' identification.利用多重压缩检测和编码器识别对压缩音频记录进行真实性检验。
Forensic Sci Int. 2014 May;238:33-46. doi: 10.1016/j.forsciint.2014.02.008. Epub 2014 Feb 18.
7
A Meta-Learning Approach for Few-Shot Face Forgery Segmentation and Classification.元学习方法在少样本人脸伪造分割与分类中的应用。
Sensors (Basel). 2023 Mar 31;23(7):3647. doi: 10.3390/s23073647.
8
HornBase: An audio dataset of car horns in different scenarios and positions.HornBase:一个包含不同场景和位置汽车喇叭声的音频数据集。
Data Brief. 2024 Jul 14;55:110678. doi: 10.1016/j.dib.2024.110678. eCollection 2024 Aug.
9
Detection of Audio Tampering Based on Electric Network Frequency Signal.基于电网频率信号的音频篡改检测
Sensors (Basel). 2023 Aug 8;23(16):7029. doi: 10.3390/s23167029.
10
Neural noiseprint transfer: a generic noiseprint-based counter forensics framework.神经噪声指纹转移:一种基于噪声指纹的通用反取证框架。
PeerJ Comput Sci. 2023 Apr 27;9:e1359. doi: 10.7717/peerj-cs.1359. eCollection 2023.

引用本文的文献

1
Data trace as the scientific foundation for trusted metrological data: a review for future metrology direction.数据溯源作为可信计量数据的科学基础:对未来计量方向的综述
PeerJ Comput Sci. 2025 Aug 14;11:e3106. doi: 10.7717/peerj-cs.3106. eCollection 2025.

本文引用的文献

1
Comparative Analysis of CNN and RNN for Voice Pathology Detection.卷积神经网络(CNN)和循环神经网络(RNN)在语音病理学检测中的比较分析。
Biomed Res Int. 2021 Apr 14;2021:6635964. doi: 10.1155/2021/6635964. eCollection 2021.
2
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.