性别检测工具的性能：姓名到性别推断服务的比较研究。

Performance of gender detection tools: a comparative study of name-to-gender inference services.

出版信息

J Med Libr Assoc. 2021 Jul 1;109(3):414-421. doi: 10.5195/jmla.2021.1185.

DOI:10.5195/jmla.2021.1185

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8485937/

Abstract

OBJECTIVE

To evaluate the performance of gender detection tools that allow the uploading of files (e.g., Excel or CSV files) containing first names, are usable by researchers without advanced computer skills, and are at least partially free of charge.

METHODS

The study was conducted using four physician datasets (total number of physicians: 6,131; 50.3% female) from Switzerland, a multilingual country. Four gender detection tools met the inclusion criteria: three partially free (Gender API, NamSor, and genderize.io) and one completely free (Wiki-Gendersort). For each tool, we recorded the number of correct classifications (i.e., correct gender assigned to a name), misclassifications (i.e., wrong gender assigned to a name), and nonclassifications (i.e., no gender assigned). We computed three metrics: the proportion of misclassifications excluding nonclassifications (errorCodedWithoutNA), the proportion of nonclassifications (naCoded), and the proportion of misclassifications and nonclassifications (errorCoded).

RESULTS

The proportion of misclassifications was low for all four gender detection tools (errorCodedWithoutNA between 1.5 and 2.2%). By contrast, the proportion of unrecognized names (naCoded) varied: 0% for NamSor, 0.3% for Gender API, 4.5% for Wiki-Gendersort, and 16.4% for genderize.io. Using errorCoded, which penalizes both types of error equally, we obtained the following results: Gender API 1.8%, NamSor 2.0%, Wiki-Gendersort 6.6%, and genderize.io 17.7%.

CONCLUSIONS

Gender API and NamSor were the most accurate tools. Genderize.io led to a high number of nonclassifications. Wiki-Gendersort may be a good compromise for researchers wishing to use a completely free tool. Other studies would be useful to evaluate the performance of these tools in other populations (e.g., Asian).

摘要

目的

评估允许上传包含名字的文件（如 Excel 或 CSV 文件）的性别检测工具的性能，这些工具可供没有高级计算机技能的研究人员使用，且至少部分免费。

方法

本研究使用了来自瑞士（一个多语言国家）的四个医生数据集（医生总数：6131 人；女性占 50.3%）。有四个性别检测工具符合纳入标准：三个部分免费（Gender API、NamSor 和 genderize.io）和一个完全免费（Wiki-Gendersort）。对于每个工具，我们记录了正确分类的数量（即正确分配给一个名字的性别）、错误分类的数量（即错误分配给一个名字的性别）和未分类的数量。我们计算了三个指标：排除未分类的错误分类比例（errorCodedWithoutNA）、未分类的比例（naCoded）和错误分类和未分类的比例（errorCoded）。

结果

所有四个性别检测工具的错误分类比例都较低（errorCodedWithoutNA 在 1.5%到 2.2%之间）。相比之下，未识别名字的比例（naCoded）有所不同：NamSor 为 0%，Gender API 为 0.3%，Wiki-Gendersort 为 4.5%，genderize.io 为 16.4%。使用同样惩罚两种错误的 errorCoded，我们得到以下结果：Gender API 为 1.8%，NamSor 为 2.0%，Wiki-Gendersort 为 6.6%，genderize.io 为 17.7%。

结论

Gender API 和 NamSor 是最准确的工具。genderize.io 导致了大量的未分类。对于希望使用完全免费工具的研究人员来说，Wiki-Gendersort 可能是一个很好的折中方案。其他研究将有助于评估这些工具在其他人群（如亚洲）中的性能。

相似文献

Performance of gender detection tools: a comparative study of name-to-gender inference services.性别检测工具的性能：姓名到性别推断服务的比较研究。

J Med Libr Assoc. 2021 Jul 1;109(3):414-421. doi: 10.5195/jmla.2021.1185.

Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference.使用 genderize.io 推断名字的性别：如何提高推断的准确性。

J Med Libr Assoc. 2021 Oct 1;109(4):609-612. doi: 10.5195/jmla.2021.1252.

How accurate are gender detection tools in predicting the gender for Chinese names? A study with 20,000 given names in Pinyin format.性别检测工具在预测中文名字的性别方面有多准确？一项针对 20000 个拼音形式的名字的研究。

J Med Libr Assoc. 2022 Apr 1;110(2):205-211. doi: 10.5195/jmla.2022.1289.

How well does NamSor perform in predicting the country of origin and ethnicity of individuals based on their first and last names?基于一个人的名字，NamSor 在预测其原籍国和种族方面的表现如何？

PLoS One. 2023 Nov 16;18(11):e0294562. doi: 10.1371/journal.pone.0294562. eCollection 2023.

Erratum to "Performance of gender detection tools: a comparative study of name-to-gender inference services," 2021;109(3):414-21 and "Using genderize.io to infer the gender of first names: how to improve the accuracy of the inference," 2021;109(4):609-12.《性别检测工具的性能：姓名到性别的推理服务的比较研究》（2021年；109(3):414 - 21）及《使用genderize.io推断名字的性别：如何提高推断的准确性》（2021年；109(4):609 - 12）的勘误

J Med Libr Assoc. 2022 Apr 1;110(2):E32. doi: 10.5195/jmla.2022.1528.

What Is the Performance of ChatGPT in Determining the Gender of Individuals Based on Their First and Last Names?ChatGPT根据名字确定个人性别的表现如何？

JMIR AI. 2024 Mar 13;3:e53656. doi: 10.2196/53656.

Comparison and benchmark of name-to-gender inference services.姓名到性别的推理服务的比较与基准测试

PeerJ Comput Sci. 2018 Jul 16;4:e156. doi: 10.7717/peerj-cs.156. eCollection 2018.

Development and initial Experience of an online Exchange Platform on Sex and Gender Aspects in Medicine: "GenderMed-Wiki".医学中性别与性方面在线交流平台“性别医学维基”的开发及初步经验

GMS J Med Educ. 2018 Aug 15;35(3):Doc32. doi: 10.3205/zma001178. eCollection 2018.

Building a protein name dictionary from full text: a machine learning term extraction approach.从全文构建蛋白质名称词典：一种机器学习术语提取方法。

BMC Bioinformatics. 2005 Apr 7;6:88. doi: 10.1186/1471-2105-6-88.

Difficult name, cold man: Chinese names, gender stereotypicality and trustworthiness.难名、冷男：中国人的名字、性别刻板印象与可信度。

Int J Psychol. 2021 Jun;56(3):349-360. doi: 10.1002/ijop.12727. Epub 2020 Dec 7.

引用本文的文献

Female first and senior authorship in high-impact critical care journals 2005-2024.2005年至2024年高影响力重症医学期刊中的女性第一作者和资深作者情况

Crit Care. 2025 Sep 8;29(1):395. doi: 10.1186/s13054-025-05649-4.

Marked gender inequity in the invited speakers at the European College of Veterinary Surgeons annual scientific congress 2012-2022.2012年至2022年欧洲兽医外科学会年度科学大会受邀演讲者中存在明显的性别不平等现象。

PLoS One. 2025 Sep 2;20(9):e0329147. doi: 10.1371/journal.pone.0329147. eCollection 2025.

Global Trends and Cross-Country Differences in Authorship by Women in Academic Anaesthesiology Since 1996: A Repeated Cross-Sectional Analysis.1996年以来学术麻醉学领域女性作者的全球趋势与跨国差异：重复横断面分析

J Clin Med. 2025 Aug 21;14(16):5891. doi: 10.3390/jcm14165891.

Can ChatGPT Recognize Its Own Writing in Scientific Abstracts?ChatGPT能在科学摘要中识别出自己的写作内容吗？

Cureus. 2025 Jul 25;17(7):e88774. doi: 10.7759/cureus.88774. eCollection 2025 Jul.

A Bibliometric Analysis of Publications on the Prevalence of Chronic Pain in Children and Adolescents From 2009 to 2023.2009年至2023年儿童和青少年慢性疼痛患病率相关出版物的文献计量分析

Paediatr Neonatal Pain. 2025 Aug 18;7(3):e70013. doi: 10.1002/pne2.70013. eCollection 2025 Sep.

Trends in paediatric anaesthesia research publications and the impact of author sex, country of origin, topic, and external funding.儿科麻醉研究出版物的趋势以及作者性别、原籍国、主题和外部资金的影响。

BJA Open. 2025 Apr 15;14:100397. doi: 10.1016/j.bjao.2025.100397. eCollection 2025 Jun.

Trends and influences in women authorship in randomised controlled trials in rheumatology: a comprehensive analysis of all published RCTs from 2009 to 2023.风湿病学随机对照试验中女性作者的趋势与影响：对2009年至2023年所有已发表随机对照试验的综合分析

RMD Open. 2025 Mar 27;11(1):e005341. doi: 10.1136/rmdopen-2024-005341.

Gender, race and ethnicity biases experienced by hospital physicians: an umbrella review to explore emerging biases in the evidence base.医院医生所经历的性别、种族和族裔偏见：一项系统性综述，以探索证据基础中出现的新偏见。

BMJ Open. 2025 Feb 16;15(2):e094549. doi: 10.1136/bmjopen-2024-094549.

Use of ChatGPT to Explore Gender and Geographic Disparities in Scientific Peer Review.使用ChatGPT探索科学同行评审中的性别和地域差异。

J Med Internet Res. 2024 Dec 9;26:e57667. doi: 10.2196/57667.

Comparative analysis of automatic gender detection from names: evaluating the stability and performance of ChatGPT Namsor, and Gender-API.从名字进行自动性别检测的比较分析：评估ChatGPT、Namsor和Gender-API的稳定性和性能。

PeerJ Comput Sci. 2024 Oct 17;10:e2378. doi: 10.7717/peerj-cs.2378. eCollection 2024.

本文引用的文献

Comparison and benchmark of name-to-gender inference services.姓名到性别的推理服务的比较与基准测试

PeerJ Comput Sci. 2018 Jul 16;4:e156. doi: 10.7717/peerj-cs.156. eCollection 2018.

Gender disparities in coronavirus disease 2019 clinical trial leadership.2019 年冠状病毒病临床试验领导中的性别差异。

Clin Microbiol Infect. 2021 Jul;27(7):1007-1010. doi: 10.1016/j.cmi.2020.12.025. Epub 2021 Jan 5.

Representation of Women Authors in International Heart Failure Guidelines and Contemporary Clinical Trials.国际心力衰竭指南和当代临床试验中女性作者的代表性。

Circ Heart Fail. 2020 Aug;13(8):e006605. doi: 10.1161/CIRCHEARTFAILURE.119.006605. Epub 2020 Aug 6.

Sex Distribution of Editorial Board Members Among Emergency Medicine Journals.急诊医学期刊编辑委员会成员的性别分布。

Ann Emerg Med. 2021 Jan;77(1):117-123. doi: 10.1016/j.annemergmed.2020.03.027. Epub 2020 May 4.

Gender differences in how scientists present the importance of their research: observational study.科学家呈现研究重要性的方式存在性别差异：观察性研究。

BMJ. 2019 Dec 16;367:l6573. doi: 10.1136/bmj.l6573.

Sex and gender reporting in global health: new editorial policies.全球健康领域中的性别与性取向报告：新编辑政策

BMJ Glob Health. 2018 Jul 26;3(4):e001038. doi: 10.1136/bmjgh-2018-001038. eCollection 2018.

Gender disparities in high-quality dermatology research: a descriptive bibliometric study on scientific authorships.高质量皮肤病学研究中的性别差异：一项关于科学作者身份的描述性文献计量学研究

BMJ Open. 2018 Apr 13;8(4):e020089. doi: 10.1136/bmjopen-2017-020089.

Gendermetrics of cancer research: results from a global analysis on lung cancer.癌症研究的性别指标：肺癌全球分析结果

Oncotarget. 2017 Oct 26;8(60):101911-101921. doi: 10.18632/oncotarget.22089. eCollection 2017 Nov 24.

Differences in incomes of physicians in the United States by race and sex: observational study.美国不同种族和性别的医生收入差异：观察性研究。

BMJ. 2016 Jun 7;353:i2923. doi: 10.1136/bmj.i2923.

Trends and comparison of female first authorship in high impact medical journals: observational study (1994-2014).高影响力医学期刊中女性第一作者情况的趋势与比较：观察性研究（1994 - 2014年）

BMJ. 2016 Mar 2;352:i847. doi: 10.1136/bmj.i847.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验