A stochastic generative model for citation networks among academic papers.

Affiliations

Department of Statistical Science, School of Multidisciplinary Sciences, The Graduate University for Advanced Studies, SOKENDAI, Tokyo, Japan.

Department of Global Management, Chuo University, Tokyo, Japan.

Publication information

PLoS One. 2022 Jun 29;17(6):e0269845. doi: 10.1371/journal.pone.0269845. eCollection 2022.

DOI:10.1371/journal.pone.0269845

PMID:35767539

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9242511/

Abstract

We propose a stochastic generative model for the directed graph formed by citations among academic papers, in which nodes represent papers with discrete publication times and directed edges represent citations. Like existing models, the proposed model assumes that a citation between two papers occurs with a probability that depends on the type of the citing paper, the importance of the cited paper, and the difference between their publication times. We take the out-degree of a citing paper as its type because, for example, a survey paper cites many papers. We approximate the importance of a cited paper by its in-degree. The model uses three functions: a logistic function for the number of papers published at each discrete time, an inverse Gaussian probability distribution function for the aging effect based on the difference between publication times, and an exponential distribution (or a generalized Pareto distribution) for the out-degree distribution. We consider our model more reasonable and appropriate than other existing stochastic models, and it can perform complete simulations without using the original data. In this paper, we first use the Web of Science database to examine the features used in our model. Using the proposed model, we generate simulated graphs and demonstrate that they are similar to the original data in their in- and out-degree distributions and node triangle participation. In addition, we analyze two other citation networks derived from physics papers in the arXiv database and verify the effectiveness of the model.
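
The three ingredients named in the abstract can be combined in a small simulation. The following is only a minimal sketch under stated assumptions, not the authors' implementation. The abstract does not say how the factors are combined, so the multiplicative weight `(in_degree + 1) * aging` is an assumption, as are all parameter values (`cap`, `rate`, `midpoint`, `mu`, `lam`, `mean_out_degree`) and all function names. The generalized Pareto variant is not shown.

```python
import heapq
import math
import random


def logistic_count(t, cap=50.0, rate=0.5, midpoint=10.0):
    """Number of papers published at discrete time t (logistic curve).
    All parameter values are illustrative assumptions."""
    return max(1, round(cap / (1.0 + math.exp(-rate * (t - midpoint)))))


def inverse_gaussian_pdf(x, mu=5.0, lam=10.0):
    """Inverse Gaussian density, used as the aging weight for a time gap x."""
    if x <= 0:
        return 0.0
    return math.sqrt(lam / (2.0 * math.pi * x ** 3)) * math.exp(
        -lam * (x - mu) ** 2 / (2.0 * mu ** 2 * x)
    )


def generate_citation_graph(n_steps=20, mean_out_degree=5.0, seed=0):
    """Return (pub_time, edges), where edges are (citing, cited) pairs."""
    rng = random.Random(seed)
    pub_time, in_degree, edges = [], [], []
    for t in range(n_steps):
        n_old = len(pub_time)  # papers published strictly before time t
        n_new = logistic_count(t)
        for paper in range(n_old, n_old + n_new):
            # Out-degree: exponential draw, truncated to an integer and
            # capped by the number of papers available to cite.
            k = min(int(rng.expovariate(1.0 / mean_out_degree)), n_old)
            if k == 0:
                continue
            # Assumed weight: (in-degree + 1) times the aging density.
            # Weighted sampling without replacement via random keys
            # log(u) / w; the k largest keys are selected.
            keys = []
            for j in range(n_old):
                w = (in_degree[j] + 1) * inverse_gaussian_pdf(t - pub_time[j])
                keys.append((math.log(1.0 - rng.random()) / w, j))
            for _, cited in heapq.nlargest(k, keys):
                edges.append((paper, cited))
                in_degree[cited] += 1
        # Papers of the current step become citable from the next step on.
        pub_time.extend([t] * n_new)
        in_degree.extend([0] * n_new)
    return pub_time, edges


pub_time, edges = generate_citation_graph(seed=1)
print(len(pub_time), "papers,", len(edges), "citations")
```

Because each paper cites only papers from strictly earlier time steps, the generated graph is acyclic by construction. Adding 1 to the in-degree is a common device that lets papers with no citations still be cited; whether the original model does this is not stated in the abstract.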
