Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, Slovenia.
Semantika, Maribor, Slovenia.
Sci Prog. 2022 Jan-Mar;105(1):368504211029777. doi: 10.1177/00368504211029777.
Machine Learning is an increasingly important technology dealing with the growing complexity of the digitalised world. Despite the fact, that we live in a 'Big data' world where, almost 'everything' is digitally stored, there are many real-world situations, where researchers are still faced with small data samples. The present bibliometric knowledge synthesis study aims to answer the research question 'What is the small data problem in machine learning and how it is solved?' The analysis a positive trend in the number of research publications and substantial growth of the research community, indicating that the research field is reaching maturity. Most productive countries are China, United States and United Kingdom. Despite notable international cooperation, the regional concentration of research literature production in economically more developed countries was observed. Thematic analysis identified four research themes. The themes are concerned with to dimension reduction in complex big data analysis, data augmentation techniques in deep learning, data mining and statistical learning on small datasets.
机器学习是一项日益重要的技术,用于应对数字化世界日益复杂的挑战。尽管我们生活在一个“大数据”的世界中,几乎“所有”东西都以数字形式存储,但在许多现实情况下,研究人员仍然面临着小数据样本的问题。本文献计量知识综合研究旨在回答研究问题“机器学习中的小数据问题是什么,以及如何解决它?”分析表明,研究出版物的数量呈积极趋势,研究社区也在大幅增长,这表明该研究领域正在走向成熟。最具生产力的国家是中国、美国和英国。尽管国际合作显著,但在经济较发达的国家,研究文献的区域集中生产仍然存在。主题分析确定了四个研究主题。这些主题涉及复杂大数据分析中的降维、深度学习中的数据增强技术、小数据集的数据挖掘和统计学习。
J Nurs Scholarsh. 2024-5
Int J Environ Res Public Health. 2022-12-22
Front Endocrinol (Lausanne). 2022
BMC Med Inform Decis Mak. 2025-9-1
Nat Commun. 2025-7-1
Turk Gogus Kalp Damar Cerrahisi Derg. 2025-4-30
Front Vet Sci. 2025-3-12
Front Res Metr Anal. 2019-4-30
Int J Nurs Stud. 2020-8