Wang Zifu, Li Meng-Hao, Baxter Patrick, Zhorayev Olzhas, Wei Jiaxin, Kovacs Valerie, Zhao Qiuhan, Yang Chaowei, Koizumi Naoru
Department of Geography and Geoinformation Science, NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, USA.
Schar School of Policy and Government, George Mason University, Fairfax, USA.
Int J Health Geogr. 2025 Apr 28;24(1):10. doi: 10.1186/s12942-025-00397-8.
Illicit kidney trade networks, operating globally, involve intricate interactions among various players, most notably buyers, sellers, brokers, and surgeons. A comprehensive understanding of these trade networks is, however, hindered by the lack of systematically amassed data for analysis. Further, extracting the geographic locations of buyers, sellers, brokers, transplant surgeons, and medical facilities in all relevant publications often involves extensive, time-consuming, manual labelling that is very costly. Although current techniques such as Named Entity Recognition (NER) tools can potentially automate the process, they are limited to identifying country names and often fail to associate the roles (i.e., offering buyer, seller, broker and/or surgery) that each country played.
This study employed state-of-the-art technologies, including Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Trained Transformers (GPT) model Llama3.3 from Meta in developing a kidney trade country database. We first extracted news articles reporting illicit kidney trade from the LexisNexis database (2000-2022). BERT and Llama3.3 with chain-of-thought prompt tuning strategies were then applied to the materials to determine the relevance of articles to the illegal kidney trade and to identify the roles those different countries played in kidney trade cases over the past 23 years. The specific country classes recorded in the final kidney trade database included: a) countries of origin for kidney sellers; b) countries of origin of kidney buyers; c) countries performing illegal transplant surgeries; and d) countries of origin of organ trafficking brokers.
The BERT classification model achieved an accuracy of 88.75%, ensuring that only relevant articles were analyzed. Additionally, the Llama3.3-70B model with chain-of-thought prompt tuning strategies extracted location-based roles with an accuracy of 86.30% for sellers, 88.89% for buyers, 93.33% for brokers, and 95.93% for surgeries, supporting these observed patterns. We observed in the final database that the kidney trade networks change and evolve dynamically where the primary role played by each country (as a host of either sellers, buyers or surgeries) change over time. About half of the top 10 countries playing each role gets replaced by other countries within a decade. The final database also demonstrated that developing countries were more likely to be a host of kidney sellers while that played by developed countries was a host of kidney buyers.
The current study developed a geospatial database describing transnational kidney trade country networks over the past two decades. The new approach for geographic location extraction that is more precise compared to conventional NER and machine learning methods.
非法肾脏交易网络在全球范围内运作,涉及各种参与者之间的复杂互动,其中最主要的是买家、卖家、中介和外科医生。然而,由于缺乏系统收集的分析数据,对这些交易网络的全面理解受到阻碍。此外,在所有相关出版物中提取买家、卖家、中介、移植外科医生和医疗设施的地理位置通常需要大量耗时的人工标注,成本非常高。尽管当前的技术,如命名实体识别(NER)工具,有可能使这一过程自动化,但它们仅限于识别国家名称,并且常常无法关联每个国家所扮演的角色(即提供肾脏的买家、卖家、中介和/或进行手术的一方)。
本研究采用了先进技术,包括来自Transformer的双向编码器表征(BERT)和Meta公司的生成式预训练Transformer(GPT)模型Llama3.3,来开发一个肾脏交易国家数据库。我们首先从LexisNexis数据库(2000 - 2022年)中提取报道非法肾脏交易的新闻文章。然后将带有思维链提示调整策略的BERT和Llama3.3应用于这些材料,以确定文章与非法肾脏交易的相关性,并识别不同国家在过去23年的肾脏交易案例中所扮演的角色。最终肾脏交易数据库中记录的具体国家类别包括:a)肾脏卖家的原籍国;b)肾脏买家的原籍国;c)进行非法移植手术的国家;d)器官贩运中介的原籍国。
BERT分类模型的准确率达到88.75%,确保只对相关文章进行分析。此外,采用思维链提示调整策略的Llama3.3 - 70B模型提取基于位置的角色时,卖家的准确率为86.30%,买家为88.89%,中介为93.33%,手术为95.93%,支持了这些观察到的模式。我们在最终数据库中观察到,肾脏交易网络动态变化和演变,每个国家(作为卖家、买家或手术的所在地)所扮演的主要角色随时间而变化。在十年内,扮演每个角色的前10个国家中约有一半被其他国家取代。最终数据库还表明,发展中国家更有可能是肾脏卖家的所在地,而发达国家则是肾脏买家的所在地。
本研究开发了一个地理空间数据库,描述了过去二十年跨国肾脏交易的国家网络。与传统的NER和机器学习方法相比,这种新的地理位置提取方法更加精确。