• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

姓名到性别的推理服务的比较与基准测试

Comparison and benchmark of name-to-gender inference services.

作者信息

Santamaría Lucía, Mihaljević Helena

机构信息

Amazon Development Center, Berlin, Germany.

University of Applied Sciences, Berlin, Germany.

出版信息

PeerJ Comput Sci. 2018 Jul 16;4:e156. doi: 10.7717/peerj-cs.156. eCollection 2018.

DOI:10.7717/peerj-cs.156
PMID:33816809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7924484/
Abstract

The increased interest in analyzing and explaining gender inequalities in tech, media, and academia highlights the need for accurate inference methods to predict a person's gender from their name. Several such services exist that provide access to large databases of names, often enriched with information from social media profiles, culture-specific rules, and insights from sociolinguistics. We compare and benchmark five name-to-gender inference services by applying them to the classification of a test data set consisting of 7,076 manually labeled names. The compiled names are analyzed and characterized according to their geographical and cultural origin. We define a series of performance metrics to quantify various types of classification errors, and define a parameter tuning procedure to search for optimal values of the services' free parameters. Finally, we perform benchmarks of all services under study regarding several scenarios where a particular metric is to be optimized.

摘要

对科技、媒体和学术界性别不平等现象进行分析和解释的兴趣日益浓厚,这凸显了使用准确推理方法从名字预测一个人性别的必要性。有几种这样的服务,它们可以访问大型名字数据库,这些数据库通常还丰富了来自社交媒体资料、特定文化规则和社会语言学见解的信息。我们通过将五种名字到性别的推理服务应用于由7076个手动标注名字组成的测试数据集的分类,对它们进行比较和基准测试。对汇编的名字根据其地理和文化起源进行分析和特征描述。我们定义了一系列性能指标来量化各种类型的分类错误,并定义了一个参数调整程序来搜索服务自由参数的最优值。最后,我们针对几个要优化特定指标的场景,对所有研究中的服务进行基准测试。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/c216edaee650/peerj-cs-04-156-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/8fed5609adfd/peerj-cs-04-156-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/c7e92fafd7a3/peerj-cs-04-156-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/ef0e4ff6d166/peerj-cs-04-156-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/c216edaee650/peerj-cs-04-156-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/8fed5609adfd/peerj-cs-04-156-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/c7e92fafd7a3/peerj-cs-04-156-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/ef0e4ff6d166/peerj-cs-04-156-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1dbc/7924484/c216edaee650/peerj-cs-04-156-g004.jpg

相似文献

1
Comparison and benchmark of name-to-gender inference services.姓名到性别的推理服务的比较与基准测试
PeerJ Comput Sci. 2018 Jul 16;4:e156. doi: 10.7717/peerj-cs.156. eCollection 2018.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。
J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.
4
Performance of gender detection tools: a comparative study of name-to-gender inference services.性别检测工具的性能:姓名到性别推断服务的比较研究。
J Med Libr Assoc. 2021 Jul 1;109(3):414-421. doi: 10.5195/jmla.2021.1185.
5
Name-based demographic inference and the unequal distribution of misrecognition.基于姓名的人口统计推断与错误识别的不平等分布。
Nat Hum Behav. 2023 Jul;7(7):1084-1095. doi: 10.1038/s41562-023-01587-9. Epub 2023 Apr 17.
6
Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference.使用 genderize.io 推断名字的性别:如何提高推断的准确性。
J Med Libr Assoc. 2021 Oct 1;109(4):609-612. doi: 10.5195/jmla.2021.1252.
7
Erratum to "Performance of gender detection tools: a comparative study of name-to-gender inference services," 2021;109(3):414-21 and "Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference," 2021;109(4):609-12.《性别检测工具的性能:姓名到性别的推理服务的比较研究》(2021年;109(3):414 - 21)及《使用genderize.io推断名字的性别:如何提高推断的准确性》(2021年;109(4):609 - 12)的勘误
J Med Libr Assoc. 2022 Apr 1;110(2):E32. doi: 10.5195/jmla.2022.1528.
8
An ensemble heterogeneous classification methodology for discovering health-related knowledge in social media messages.一种用于在社交媒体消息中发现健康相关知识的集成异构分类方法。
J Biomed Inform. 2014 Jun;49:255-68. doi: 10.1016/j.jbi.2014.03.005. Epub 2014 Mar 16.
9
Attentional load and the consciousness of one's own name.注意负荷与对自己名字的意识。
Conscious Cogn. 2014 May;26:197-203. doi: 10.1016/j.concog.2014.03.008. Epub 2014 Apr 22.
10
Time lagged information theoretic approaches to the reverse engineering of gene regulatory networks.时滞信息论方法在基因调控网络反向工程中的应用。
BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S19. doi: 10.1186/1471-2105-11-S6-S19.

引用本文的文献

1
Marked gender inequity in the invited speakers at the European College of Veterinary Surgeons annual scientific congress 2012-2022.2012年至2022年欧洲兽医外科学会年度科学大会受邀演讲者中存在明显的性别不平等现象。
PLoS One. 2025 Sep 2;20(9):e0329147. doi: 10.1371/journal.pone.0329147. eCollection 2025.
2
Global Trends and Cross-Country Differences in Authorship by Women in Academic Anaesthesiology Since 1996: A Repeated Cross-Sectional Analysis.1996年以来学术麻醉学领域女性作者的全球趋势与跨国差异:重复横断面分析
J Clin Med. 2025 Aug 21;14(16):5891. doi: 10.3390/jcm14165891.
3
Can ChatGPT Recognize Its Own Writing in Scientific Abstracts?

本文引用的文献

1
The gender gap in science: How long until women are equally represented?科学界的性别差距:女性何时才能平等代表?
PLoS Biol. 2018 Apr 19;16(4):e2004956. doi: 10.1371/journal.pbio.2004956. eCollection 2018 Apr.
2
Women are underrepresented in computational biology: An analysis of the scholarly literature in biology, computer science and computational biology.女性在计算生物学领域的代表性不足:对生物学、计算机科学和计算生物学学术文献的分析。
PLoS Comput Biol. 2017 Oct 12;13(10):e1005134. doi: 10.1371/journal.pcbi.1005134. eCollection 2017 Oct.
3
The Effect of Gender in the Publication Patterns in Mathematics.
ChatGPT能在科学摘要中识别出自己的写作内容吗?
Cureus. 2025 Jul 25;17(7):e88774. doi: 10.7759/cureus.88774. eCollection 2025 Jul.
4
Silent voices: Uncovering women's absence in veterinary surgery publications.无声的声音:揭示兽医外科学出版物中女性的缺席情况。
PLoS One. 2025 Aug 14;20(8):e0330392. doi: 10.1371/journal.pone.0330392. eCollection 2025.
5
Gender equality in leadership of HIV care cascade clinical trials: A methodological study.艾滋病毒治疗级联临床试验领导力中的性别平等:一项方法学研究。
HIV Med. 2025 Sep;26(9):1356-1366. doi: 10.1111/hiv.70062. Epub 2025 Jun 20.
6
Trends and influences in women authorship in randomised controlled trials in rheumatology: a comprehensive analysis of all published RCTs from 2009 to 2023.风湿病学随机对照试验中女性作者的趋势与影响:对2009年至2023年所有已发表随机对照试验的综合分析
RMD Open. 2025 Mar 27;11(1):e005341. doi: 10.1136/rmdopen-2024-005341.
7
Do Manuscripts by Female Evolutionary Biologists Spend Longer Under Review?女性进化生物学家撰写的手稿审稿时间会更长吗?
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf054.
8
Evaluating Covid-19 publications for sex and gender-specific health content: A bibliometric analysis.评估关于新冠病毒病(Covid-19)的出版物中的性别特异性健康内容:一项文献计量分析。
PLoS One. 2025 Feb 19;20(2):e0316812. doi: 10.1371/journal.pone.0316812. eCollection 2025.
9
The Bibliometric Evolution of Neurosurgery Publications From 1977 to 2023.1977年至2023年神经外科出版物的文献计量学演变
Neurosurg Pract. 2025 Jan 30;6(1):e00128. doi: 10.1227/neuprac.0000000000000128. eCollection 2025 Mar.
10
Principal investigator gender and clinical trial success: analysis of over 3000 obstetrics and gynecology trials.主要研究者性别与临床试验成功率:对3000多项妇产科试验的分析
AJOG Glob Rep. 2024 Dec 4;5(1):100427. doi: 10.1016/j.xagr.2024.100427. eCollection 2025 Feb.
性别对数学领域出版模式的影响。
PLoS One. 2016 Oct 25;11(10):e0165367. doi: 10.1371/journal.pone.0165367. eCollection 2016.
4
Trends and comparison of female first authorship in high impact medical journals: observational study (1994-2014).高影响力医学期刊中女性第一作者情况的趋势与比较:观察性研究(1994 - 2014年)
BMJ. 2016 Mar 2;352:i847. doi: 10.1136/bmj.i847.
5
Bibliometrics: global gender disparities in science.文献计量学:科学领域的全球性别差异
Nature. 2013 Dec 12;504(7479):211-3. doi: 10.1038/504211a.
6
The role of gender in scholarly authorship.性别在学术著作中的作用。
PLoS One. 2013 Jul 22;8(7):e66212. doi: 10.1371/journal.pone.0066212. Print 2013.