Suppr超能文献

文献网络的演变

Evolution of document networks.

作者信息

Menczer Filippo

机构信息

School of Informatics, Indiana University, Bloomington, IN 47408, USA.

出版信息

Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5261-5. doi: 10.1073/pnas.0307554100. Epub 2004 Jan 27.

Abstract

How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.

摘要

一个没有集中控制的文档网络是如何增长的?随着我们试图解释万维网出现的无标度拓扑结构并使用链接分析来识别重要信息资源,这个问题变得至关重要。现有的信息网络增长模型侧重于链接结构,却忽略了节点的内容。在这里我表明,当前的模型无法再现信息网络的一个关键特征,即链接文档之间文本相似度的分布。我提出了一个更现实的模型,该模型通过同时使用流行度和内容来生成链接。这个模型对网页和科学文献网络中的度分布和相似度分布都产生了非常准确的预测。

相似文献

1
Evolution of document networks.文献网络的演变
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5261-5. doi: 10.1073/pnas.0307554100. Epub 2004 Jan 27.
2
Mixed-membership models of scientific publications.科学出版物的混合成员模型。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5220-7. doi: 10.1073/pnas.0307760101. Epub 2004 Mar 12.
3
User-controlled mapping of significant literatures.用户控制的重要文献映射。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5297-302. doi: 10.1073/pnas.0307630100.
9
Finding scientific topics.寻找科学主题。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5228-35. doi: 10.1073/pnas.0307752101. Epub 2004 Feb 10.

引用本文的文献

2
Foundations of Temporal Text Networks.时间文本网络基础
Appl Netw Sci. 2018;3(1):25. doi: 10.1007/s41109-018-0082-3. Epub 2018 Aug 13.
3
A generative model for scientific concept hierarchies.一种用于科学概念层次结构的生成模型。
PLoS One. 2018 Feb 23;13(2):e0193331. doi: 10.1371/journal.pone.0193331. eCollection 2018.
4
Popularity versus similarity in growing networks.在不断发展的网络中,受欢迎程度和相似度。
Nature. 2012 Sep 27;489(7417):537-40. doi: 10.1038/nature11459. Epub 2012 Sep 12.
5
Modeling statistical properties of written text.书面文本的统计特性建模。
PLoS One. 2009;4(4):e5372. doi: 10.1371/journal.pone.0005372. Epub 2009 Apr 29.
6
7
Topical interests and the mitigation of search engine bias.局部兴趣与搜索引擎偏差的缓解
Proc Natl Acad Sci U S A. 2006 Aug 22;103(34):12684-9. doi: 10.1073/pnas.0605525103. Epub 2006 Aug 10.
8
The simultaneous evolution of author and paper networks.作者网络与论文网络的同步演化。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5266-73. doi: 10.1073/pnas.0307625100. Epub 2004 Feb 19.

本文引用的文献

2
From paragraph to graph: latent semantic analysis for information visualization.从段落到图表:用于信息可视化的潜在语义分析
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5214-9. doi: 10.1073/pnas.0400341101. Epub 2004 Mar 22.
3
The simultaneous evolution of author and paper networks.作者网络与论文网络的同步演化。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5266-73. doi: 10.1073/pnas.0307625100. Epub 2004 Feb 19.
4
Tracking evolving communities in large linked networks.在大型关联网络中追踪不断演变的群落。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5249-53. doi: 10.1073/pnas.0307750100. Epub 2004 Feb 2.
5
Coauthorship networks and patterns of scientific collaboration.共同作者网络与科学合作模式。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5200-5. doi: 10.1073/pnas.0307545100. Epub 2004 Jan 26.
6
Extracting knowledge from the World Wide Web.从万维网中提取知识。
Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1(Suppl 1):5186-91. doi: 10.1073/pnas.0307528100. Epub 2004 Jan 26.
7
NETWORKS OF SCIENTIFIC PAPERS.科学论文网络
Science. 1965 Jul 30;149(3683):510-5. doi: 10.1126/science.149.3683.510.
8
Growing and navigating the small world Web by local content.通过本地内容发展并在小世界网络中导航。
Proc Natl Acad Sci U S A. 2002 Oct 29;99(22):14014-9. doi: 10.1073/pnas.212348399. Epub 2002 Oct 14.
9
Network analysis. The structure of the Web.网络分析。网络的结构。
Science. 2001 Nov 30;294(5548):1849-50. doi: 10.1126/science.1067014.
10
Structure of growing networks with preferential linking.具有优先连接的增长网络结构。
Phys Rev Lett. 2000 Nov 20;85(21):4633-6. doi: 10.1103/PhysRevLett.85.4633.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验