• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于神经网络改进社交媒体文本中作者画像的特征表示

Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts.

作者信息

Gómez-Adorno Helena, Markov Ilia, Sidorov Grigori, Posadas-Durán Juan-Pablo, Sanchez-Perez Miguel A, Chanona-Hernandez Liliana

机构信息

Instituto Politécnico Nacional (IPN), Centro de Invetigación en Computación (CIC), Mexico City, Mexico.

Instituto Politécnico Nacional (IPN), Escuela Superior de Ingeniería Mecánica y Eléctrica Unidad Zacatenco (ESIME-Zacatenco), Mexico City, Mexico.

出版信息

Comput Intell Neurosci. 2016;2016:1638936. doi: 10.1155/2016/1638936. Epub 2016 Oct 3.

DOI:10.1155/2016/1638936
PMID:27795703
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5066026/
Abstract

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.

摘要

我们介绍了一种用于预处理社交媒体数据的词汇资源。我们表明,使用该资源可以增强基于神经网络的特征表示。我们在PAN 2015和PAN 2016作者剖析语料库上进行了实验,当使用开发的词汇资源进行数据预处理时,取得了更好的结果。该资源包括社交媒体中常用的俚语、缩写词、缩略语和表情符号词典。每个词典都是针对英语、西班牙语、荷兰语和意大利语构建的。该资源可免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/725b/5066026/104b20d0f9ea/CIN2016-1638936.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/725b/5066026/104b20d0f9ea/CIN2016-1638936.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/725b/5066026/104b20d0f9ea/CIN2016-1638936.001.jpg

相似文献

1
Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts.基于神经网络改进社交媒体文本中作者画像的特征表示
Comput Intell Neurosci. 2016;2016:1638936. doi: 10.1155/2016/1638936. Epub 2016 Oct 3.
2
Medical concept normalization in social media posts with recurrent neural networks.社交媒体帖子中的医学概念规范化:基于递归神经网络的方法
J Biomed Inform. 2018 Aug;84:93-102. doi: 10.1016/j.jbi.2018.06.006. Epub 2018 Jun 12.
3
Typical and delayed lexical development in Italian.意大利语中典型和延迟的词汇发展。
J Speech Lang Hear Res. 2014 Oct;57(5):1792-803. doi: 10.1044/2014_JSLHR-L-13-0242.
4
Automatically Expanding the Synonym Set of SNOMED CT using Wikipedia.利用维基百科自动扩展医学系统命名法临床术语(SNOMED CT)的同义词集
Stud Health Technol Inform. 2015;216:619-23.
5
Model-based semantic dictionaries for medical language understanding.用于医学语言理解的基于模型的语义词典。
Proc AMIA Symp. 1999:122-6.
6
Incidence Rate of Canonical vs. Derived Medical Terminology in Natural Language.自然语言中规范医学术语与衍生医学术语的发生率
Stud Health Technol Inform. 2015;210:5-9.
7
Acquiring concepts and features of novel words by two types of learning: direct mapping and inference.通过两种学习方式获取新词的概念和特征:直接映射和推理。
Neuropsychologia. 2014 Apr;56:204-18. doi: 10.1016/j.neuropsychologia.2014.01.012. Epub 2014 Jan 27.
8
Sublanguage analysis of medical weblogs.医学网络日志的子语言分析
Stud Health Technol Inform. 2014;205:565-9.
9
Novel word integration in the mental lexicon: evidence from unmasked and masked semantic priming.心理词库中的新型词汇整合:来自无掩蔽和掩蔽语义启动的证据。
Q J Exp Psychol (Hove). 2013;66(5):1001-25. doi: 10.1080/17470218.2012.724694. Epub 2012 Oct 4.
10
Activation Patterns throughout the Word Processing Network of L1-dominant Bilinguals Reflect Language Similarity and Language Decisions.母语占主导地位的双语者在整个文字处理网络中的激活模式反映了语言相似性和语言决策。
J Cogn Neurosci. 2015 Nov;27(11):2197-214. doi: 10.1162/jocn_a_00853. Epub 2015 Jul 30.

引用本文的文献

1
A normalization model for repeated letters in social media hate speech text based on rules and spelling correction.基于规则和拼写纠正的社交媒体仇恨言论文本中重复字母的归一化模型。
PLoS One. 2024 Mar 21;19(3):e0299652. doi: 10.1371/journal.pone.0299652. eCollection 2024.
2
Comparing COVID-19 metaphors in Chinese and English social media with critical metaphor analysis.运用批评隐喻分析比较中英文社交媒体中的新冠疫情隐喻
Front Psychol. 2023 Jun 1;14:1198265. doi: 10.3389/fpsyg.2023.1198265. eCollection 2023.
3
Automated Detection of Vaping-Related Tweets on Twitter During the 2019 EVALI Outbreak Using Machine Learning Classification.
在2019年电子烟相关肺损伤(EVALI)爆发期间,利用机器学习分类法在推特上自动检测与电子烟相关的推文。
Front Big Data. 2022 Feb 10;5:770585. doi: 10.3389/fdata.2022.770585. eCollection 2022.
4
Neuromarketing as an Emotional Connection Tool Between Organizations and Audiences in Social Networks. A Theoretical Review.神经营销学作为社交网络中组织与受众之间的情感连接工具:理论综述
Front Psychol. 2020 Jul 21;11:1787. doi: 10.3389/fpsyg.2020.01787. eCollection 2020.