Dong Hao, Campbell Cameron, Kurosu Satomi, Yang Wenshan, Lee James Z
Division of Social Science, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China,
Demography. 2015 Jun;52(3):1061-88. doi: 10.1007/s13524-015-0397-y.
Comparison and comparability lie at the heart of any comparative social science. Still, precise comparison is virtually impossible without using similar methods and similar data. In recent decades, social demographers, historians, and economic historians have compiled and made available a large number of micro-level data sets of historical populations for North America and Europe. Studies using these data have already made important contributions to many academic disciplines. In a similar spirit, we introduce five new micro-level historical panel data sets from East Asia, including the China Multi-Generational Panel Dataset-Liaoning (CMGPD-LN) 1749-1909, the China Multi-Generational Panel Dataset-Shuangcheng (CMGPD-SC) 1866-1913, the Japanese Ninbetsu-Aratame-Cho Population Register Database-Shimomoriya and Niita (NAC-SN) 1716-1870, the Korea Multi-Generational Panel Dataset-Tansung (KMGPD-TS) 1678-1888, and the Colonial Taiwan Household Registration Database (CTHRD) 1906-1945. These data sets in total contain more than 3.7 million linked observations of 610,000 individuals and are the first such Asian data to be made available online or by application. We discuss the key features and historical institutions that originally collected these data; the subsequent processes by which the data were reconstructed into individual-level panels; their particular data limitations and strengths; and their potential for comparative social scientific research.
比较和可比性是任何比较社会科学的核心。然而,不使用相似的方法和相似的数据,精确的比较几乎是不可能的。近几十年来,社会人口学家、历史学家和经济史学家已经汇编并提供了大量关于北美和欧洲历史人口的微观层面数据集。使用这些数据的研究已经对许多学术学科做出了重要贡献。本着类似的精神,我们引入了来自东亚的五个新的微观层面历史面板数据集,包括1749 - 1909年的中国多代面板数据集 - 辽宁(CMGPD - LN)、1866 - 1913年的中国多代面板数据集 - 双城(CMGPD - SC)、1716 - 1870年的日本实名抄录人口登记数据库 - 下守谷和新田(NAC - SN)、1678 - 1888年的韩国多代面板数据集 - 潭阳(KMGPD - TS)以及1906 - 1945年的殖民台湾户籍数据库(CTHRD)。这些数据集总共包含超过370万个与61万人相关的观测数据,是首批以在线或申请方式提供的此类亚洲数据。我们讨论了最初收集这些数据的关键特征和历史制度;随后将数据重建为个人层面面板的过程;它们特定的数据局限性和优势;以及它们在比较社会科学研究中的潜力。