• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

挖掘软件见解:揭示低评分软件应用中频繁出现的问题。

Mining software insights: uncovering the frequently occurring issues in low-rating software applications.

作者信息

Khan Nek Dil, Khan Javed Ali, Li Jianqiang, Ullah Tahir, Zhao Qing

机构信息

Faculty of Information Technology, Beijing University of Technology, Beijing, China.

Department of Computer Science, School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield, United Kingdom.

出版信息

PeerJ Comput Sci. 2024 Jul 10;10:e2115. doi: 10.7717/peerj-cs.2115. eCollection 2024.

DOI:10.7717/peerj-cs.2115
PMID:39145243
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11323132/
Abstract

In today's digital world, app stores have become an essential part of software distribution, providing customers with a wide range of applications and opportunities for software developers to showcase their work. This study elaborates on the importance of end-user feedback for software evolution. However, in the literature, more emphasis has been given to high-rating & popular software apps while ignoring comparatively low-rating apps. Therefore, the proposed approach focuses on end-user reviews collected from 64 low-rated apps representing 14 categories in the Amazon App Store. We critically analyze feedback from low-rating apps and developed a grounded theory to identify various concepts important for software evolution and improving its quality including user interface (UI) and user experience (UX), functionality and features, compatibility and device-specific, performance and stability, customer support and responsiveness and security and privacy issues. Then, using a grounded theory and content analysis approach, a novel research dataset is curated to evaluate the performance of baseline machine learning (ML), and state-of-the-art deep learning (DL) algorithms in automatically classifying end-user feedback into frequently occurring issues. Various natural language processing and feature engineering techniques are utilized for improving and optimizing the performance of ML and DL classifiers. Also, an experimental study comparing various ML and DL algorithms, including multinomial naive Bayes (MNB), logistic regression (LR), random forest (RF), multi-layer perception (MLP), k-nearest neighbors (KNN), AdaBoost, Voting, convolutional neural network (CNN), long short-term memory (LSTM), bidirectional long short term memory (BiLSTM), gated recurrent unit (GRU), bidirectional gated recurrent unit (BiGRU), and recurrent neural network (RNN) classifiers, achieved satisfactory results in classifying end-user feedback to commonly occurring issues. Whereas, MLP, RF, BiGRU, GRU, CNN, LSTM, and Classifiers achieved average accuracies of 94%, 94%, 92%, 91%, 90%, 89%, and 89%, respectively. We employed the SHAP approach to identify the critical features associated with each issue type to enhance the explainability of the classifiers. This research sheds light on areas needing improvement in low-rated apps and opens up new avenues for developers to improve software quality based on user feedback.

摘要

在当今的数字世界中,应用商店已成为软件分发的重要组成部分,为客户提供了广泛的应用程序,并为软件开发人员提供了展示其作品的机会。本研究阐述了终端用户反馈对软件演进的重要性。然而,在文献中,更多的重点放在了高评分和流行的软件应用程序上,而忽略了评分相对较低的应用程序。因此,所提出的方法侧重于从亚马逊应用商店中代表14个类别的64个低评分应用程序收集的终端用户评论。我们对低评分应用程序的反馈进行了批判性分析,并开发了一种扎根理论,以识别对软件演进和提高其质量重要的各种概念,包括用户界面(UI)和用户体验(UX)、功能和特性、兼容性和特定设备、性能和稳定性、客户支持和响应能力以及安全和隐私问题。然后,使用扎根理论和内容分析方法,精心策划了一个新颖的研究数据集,以评估基线机器学习(ML)和最先进的深度学习(DL)算法在将终端用户反馈自动分类为常见问题方面的性能。利用各种自然语言处理和特征工程技术来提高和优化ML和DL分类器的性能。此外,一项比较各种ML和DL算法的实验研究,包括多项式朴素贝叶斯(MNB)、逻辑回归(LR)、随机森林(RF)、多层感知器(MLP)、k近邻(KNN)、AdaBoost、投票、卷积神经网络(CNN)、长短期记忆(LSTM)、双向长短期记忆(BiLSTM)、门控循环单元(GRU)、双向门控循环单元(BiGRU)和递归神经网络(RNN)分类器。在将终端用户反馈分类为常见问题方面取得了令人满意的结果。而MLP、RF、BiGRU、GRU、CNN、LSTM和分类器的平均准确率分别达到了94%、94%、92%、91%、90%、89%和89%。我们采用SHAP方法来识别与每种问题类型相关的关键特征,以增强分类器的可解释性。这项研究揭示了低评分应用程序中需要改进的领域,并为开发人员根据用户反馈提高软件质量开辟了新途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/0747f22a7d67/peerj-cs-10-2115-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/7f2a58484f3e/peerj-cs-10-2115-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/a4003618da40/peerj-cs-10-2115-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/6f174cd3e00c/peerj-cs-10-2115-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/5b74b69a5bf0/peerj-cs-10-2115-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/58a15225aaa1/peerj-cs-10-2115-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/2807990715c1/peerj-cs-10-2115-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/f441bf2b7884/peerj-cs-10-2115-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/0747f22a7d67/peerj-cs-10-2115-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/7f2a58484f3e/peerj-cs-10-2115-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/a4003618da40/peerj-cs-10-2115-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/6f174cd3e00c/peerj-cs-10-2115-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/5b74b69a5bf0/peerj-cs-10-2115-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/58a15225aaa1/peerj-cs-10-2115-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/2807990715c1/peerj-cs-10-2115-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/f441bf2b7884/peerj-cs-10-2115-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/571d/11323132/0747f22a7d67/peerj-cs-10-2115-g008.jpg

相似文献

1
Mining software insights: uncovering the frequently occurring issues in low-rating software applications.挖掘软件见解:揭示低评分软件应用中频繁出现的问题。
PeerJ Comput Sci. 2024 Jul 10;10:e2115. doi: 10.7717/peerj-cs.2115. eCollection 2024.
2
Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods.使用深度学习方法对COVID-19的新增病例和新增死亡率进行时间序列预测。
Results Phys. 2021 Aug;27:104495. doi: 10.1016/j.rinp.2021.104495. Epub 2021 Jun 26.
3
Algorithms for automated diagnosis of cardiovascular diseases based on ECG data: A comprehensive systematic review.基于心电图数据的心血管疾病自动诊断算法:一项全面的系统综述。
Heliyon. 2023 Feb 10;9(2):e13601. doi: 10.1016/j.heliyon.2023.e13601. eCollection 2023 Feb.
4
Contradiction in text review and apps rating: prediction using textual features and transfer learning.文本评论与应用评分中的矛盾:使用文本特征和迁移学习进行预测
PeerJ Comput Sci. 2024 Jan 3;10:e1722. doi: 10.7717/peerj-cs.1722. eCollection 2024.
5
Listening to Stakeholders Involved in Speech-Language Therapy for Children With Communication Disorders: Content Analysis of Apple App Store Reviews.倾听参与沟通障碍儿童言语治疗的利益相关者的意见:苹果应用商店评论的内容分析
JMIR Pediatr Parent. 2022 Jan 21;5(1):e28661. doi: 10.2196/28661.
6
Machine learning for mHealth apps quality evaluation: An approach based on user feedback analysis.用于移动健康应用程序质量评估的机器学习:一种基于用户反馈分析的方法。
Softw Qual J. 2023 May 23:1-31. doi: 10.1007/s11219-023-09630-8.
7
User Perspectives of Diet-Tracking Apps: Reviews Content Analysis and Topic Modeling.用户对饮食追踪应用程序的看法:评论内容分析与主题建模
J Med Internet Res. 2021 Apr 22;23(4):e25160. doi: 10.2196/25160.
8
A feature-oriented analysis of developers' descriptions and user reviews of top mHealth applications for diabetes and hypertension.面向特征的分析:开发者对糖尿病和高血压顶级移动医疗应用的描述,以及用户对此的评价。
Int J Med Inform. 2021 Dec;156:104598. doi: 10.1016/j.ijmedinf.2021.104598. Epub 2021 Sep 28.
9
Machine learning and deep learning-based approach to categorize Bengali comments on social networks using fused dataset.基于机器学习和深度学习的方法,使用融合数据集对孟加拉语社交网络评论进行分类。
PLoS One. 2024 Oct 3;19(10):e0308862. doi: 10.1371/journal.pone.0308862. eCollection 2024.
10
ACCU3RATE: A mobile health application rating scale based on user reviews.ACCU3RATE:基于用户评价的移动健康应用评级量表。
PLoS One. 2021 Dec 16;16(12):e0258050. doi: 10.1371/journal.pone.0258050. eCollection 2021.

本文引用的文献

1
Emotion detection from handwriting and drawing samples using an attention-based transformer model.使用基于注意力的变压器模型从手写和绘图样本中进行情绪检测。
PeerJ Comput Sci. 2024 Mar 29;10:e1887. doi: 10.7717/peerj-cs.1887. eCollection 2024.
2
Crowd-based requirements elicitation via pull feedback: method and case studies.基于拉式反馈的群体需求 elicitation:方法与案例研究。(注:“elicitation”直译为“引出、启发”等,这里因语境不明暂保留英文)
Requir Eng. 2022;27(4):429-455. doi: 10.1007/s00766-022-00384-6. Epub 2022 Aug 20.
3
Grounded theory research: A design framework for novice researchers.
扎根理论研究:新手研究者的设计框架。
SAGE Open Med. 2019 Jan 2;7:2050312118822927. doi: 10.1177/2050312118822927. eCollection 2019.
4
Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.加权kappa系数:用于衡量名义尺度上的一致性,并考虑了尺度不一致或部分得分的情况。
Psychol Bull. 1968 Oct;70(4):213-20. doi: 10.1037/h0026256.