Yan Tao, Li Shengnan, Kraner Benjamin, Zhang Luyao, Tessone Claudio J
Blockchain & Distributed Ledger Technologies Group at Department of Informatics and UZH Blockchain Center, University of Zurich, 8050, Zurich, Switzerland.
Data Science Research Center and Social Science Division, Duke Kunshan University, Suzhou, 215316, China.
Sci Data. 2025 Mar 28;12(1):519. doi: 10.1038/s41597-025-04623-7.
Ethereum, one of the leading smart contract blockchain platforms, currently operates on a Proof-of-Stake (PoS) consensus mechanism designed to secure the network while incentivizing desired validator behaviors. Despite blockchain technology's promise of decentralization, limitations and gaps in decentralization persist, posing challenges for analysis and optimization. This study introduces a comprehensive dataset of validator rewards from the Ethereum Beacon chain, categorized into attestation, proposer, and sync committee rewards. By providing granular, transparent, and auditable records of validator activities, the dataset addresses the fragmentation of raw blockchain data and enables robust evaluations of PoS incentive structures. Researchers can leverage this dataset to assess enforceable rules, verify protocol compliance, and analyze long-term validator behavior. In addition, we apply decentralization metrics such as the Shannon entropy, Gini Index, Nakamoto Coefficient, and Herfindahl-Hirschman Index (HHI) to showcase the dataset's utility in studying decentralization trends. Publicly available on Harvard Dataverse and accompanied by open-source analytical tools on GitHub, this dataset facilitates future research aimed at enhancing blockchain systems' decentralization, security, and efficiency.