• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

社会科学中的词嵌入:跨学科综述。

Word embedding for social sciences: an interdisciplinary survey.

作者信息

Matsui Akira, Ferrara Emilio

机构信息

College of Business Administration, Yokohama National University, Yokohama, Kanagawa, Japan.

Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, California, United States.

出版信息

PeerJ Comput Sci. 2024 Dec 5;10:e2562. doi: 10.7717/peerj-cs.2562. eCollection 2024.

DOI:10.7717/peerj-cs.2562
PMID:39896392
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11784867/
Abstract

Machine learning models learn low-dimensional representations from complex high-dimensional data. Not only computer science but also social science has benefited from the advancement of these powerful tools. Within such tools, word embedding is one of the most popular methods in the literature. However, we have no particular documentation of this emerging trend because this trend overlaps different social science fields. To well compile this fragmented knowledge, we survey recent studies that apply word embedding models to human behavior mining. Our taxonomy built on the surveyed article provides a concise but comprehensive overview of this emerging trend of intersection between computer science and social science and guides scholars who are going to navigate the use of word embedding algorithms in their voyage of social science research.

摘要

机器学习模型从复杂的高维数据中学习低维表示。不仅计算机科学,社会科学也从这些强大工具的发展中受益。在这些工具中,词嵌入是文献中最流行的方法之一。然而,我们没有关于这一新兴趋势的具体文献记录,因为这一趋势跨越了不同的社会科学领域。为了很好地整合这些零散的知识,我们调查了最近将词嵌入模型应用于人类行为挖掘的研究。我们基于所调查文章构建的分类法对计算机科学和社会科学交叉的这一新兴趋势提供了简洁而全面的概述,并指导学者在其社会科学研究之旅中运用词嵌入算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6208/11784867/0c511bc4054f/peerj-cs-10-2562-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6208/11784867/0d7fad6c4dab/peerj-cs-10-2562-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6208/11784867/0c511bc4054f/peerj-cs-10-2562-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6208/11784867/0d7fad6c4dab/peerj-cs-10-2562-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6208/11784867/0c511bc4054f/peerj-cs-10-2562-g002.jpg

相似文献

1
Word embedding for social sciences: an interdisciplinary survey.社会科学中的词嵌入:跨学科综述。
PeerJ Comput Sci. 2024 Dec 5;10:e2562. doi: 10.7717/peerj-cs.2562. eCollection 2024.
2
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
3
A new word embedding model integrated with medical knowledge for deep learning-based sentiment classification.一种集成医学知识的新词嵌入模型,用于基于深度学习的情感分类。
Artif Intell Med. 2024 Feb;148:102758. doi: 10.1016/j.artmed.2023.102758. Epub 2024 Jan 8.
4
Word embedding empowered topic recognition in news articles.词嵌入助力新闻文章中的主题识别。
PeerJ Comput Sci. 2024 Dec 11;10:e2300. doi: 10.7717/peerj-cs.2300. eCollection 2024.
5
Identifying health related occupations of Twitter users through word embedding and deep neural networks.通过词嵌入和深度神经网络识别 Twitter 用户的健康相关职业。
BMC Bioinformatics. 2022 Sep 28;22(Suppl 10):630. doi: 10.1186/s12859-022-04933-2.
6
Impact of word embedding models on text analytics in deep learning environment: a review.词嵌入模型对深度学习环境下文本分析的影响:综述
Artif Intell Rev. 2023 Feb 22:1-81. doi: 10.1007/s10462-023-10419-1.
7
Theory In, Theory Out: The Uses of Social Theory in Machine Learning for Social Science.理论进,理论出:社会理论在机器学习中用于社会科学的应用
Front Big Data. 2020 May 19;3:18. doi: 10.3389/fdata.2020.00018. eCollection 2020.
8
Empowering health geography research with location-based social media data: innovative food word expansion and energy density prediction via word embedding and machine learning.利用基于位置的社交媒体数据增强健康地理学研究:通过词嵌入和机器学习进行创新的食品词汇扩展和能量密度预测。
Int J Health Geogr. 2023 Sep 16;22(1):22. doi: 10.1186/s12942-023-00344-5.
9
Optimizing word embeddings for small dataset: a case study on patient portal messages from breast cancer patients.优化小数据集的词向量:以乳腺癌患者的患者门户消息为例的研究。
Sci Rep. 2024 Jul 12;14(1):16117. doi: 10.1038/s41598-024-66319-z.
10
The language of proteins: NLP, machine learning & protein sequences.蛋白质的语言:自然语言处理、机器学习与蛋白质序列
Comput Struct Biotechnol J. 2021 Mar 25;19:1750-1758. doi: 10.1016/j.csbj.2021.03.022. eCollection 2021.

引用本文的文献

1
Quantifying the influence of vocational education and training with text embedding and similarity-based networks.利用文本嵌入和基于相似度的网络量化职业教育与培训的影响。
PLoS One. 2025 Aug 21;20(8):e0329405. doi: 10.1371/journal.pone.0329405. eCollection 2025.

本文引用的文献

1
Human languages with greater information density have higher communication speed but lower conversation breadth.人类语言的信息密度越大,其交流速度就越快,但对话广度就越低。
Nat Hum Behav. 2024 Apr;8(4):644-656. doi: 10.1038/s41562-024-01815-w. Epub 2024 Feb 16.
2
Disrupted routines anticipate musical exploration.常规被打乱,预示着音乐探索的开始。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2306549121. doi: 10.1073/pnas.2306549121. Epub 2024 Feb 1.
3
Unsupervised embedding of trajectories captures the latent structure of scientific migration.
无监督轨迹嵌入捕获了科学移民的潜在结构。
Proc Natl Acad Sci U S A. 2023 Dec 26;120(52):e2305414120. doi: 10.1073/pnas.2305414120. Epub 2023 Dec 22.
4
Local similarity and global variability characterize the semantic space of human languages.局部相似性和全局变异性是人类语言语义空间的特征。
Proc Natl Acad Sci U S A. 2023 Dec 19;120(51):e2300986120. doi: 10.1073/pnas.2300986120. Epub 2023 Dec 11.
5
Divergent semantic integration (DSI): Extracting creativity from narratives with distributional semantic modeling.发散语义整合(DSI):通过分布语义建模从叙事中提取创造力。
Behav Res Methods. 2023 Oct;55(7):3726-3759. doi: 10.3758/s13428-022-01986-2. Epub 2022 Oct 17.
6
Using word embeddings to investigate cultural biases.利用词嵌入来研究文化偏见。
Br J Soc Psychol. 2023 Jan;62(1):617-629. doi: 10.1111/bjso.12560. Epub 2022 Jul 23.
7
Historical representations of social groups across 200 years of word embeddings from Google Books.200 年谷歌书籍语料库中的词嵌入技术对社会群体的历史描述。
Proc Natl Acad Sci U S A. 2022 Jul 12;119(28):e2121798119. doi: 10.1073/pnas.2121798119. Epub 2022 Jul 5.
8
Semantic projection recovers rich human knowledge of multiple object features from word embeddings.语义投射从词嵌入中恢复了人类对多个对象特征的丰富知识。
Nat Hum Behav. 2022 Jul;6(7):975-987. doi: 10.1038/s41562-022-01316-8. Epub 2022 Apr 14.
9
Quantifying social organization and political polarization in online platforms.量化在线平台中的社会组织与政治两极分化。
Nature. 2021 Dec;600(7888):264-268. doi: 10.1038/s41586-021-04167-x. Epub 2021 Dec 1.
10
How quantifying the shape of stories predicts their success.如何通过量化故事的形状来预测其成功。
Proc Natl Acad Sci U S A. 2021 Jun 29;118(26). doi: 10.1073/pnas.2011695118.