Suppr超能文献

从1850年至1950年的历史人口普查数据中得出统计推断。

Drawing statistical inferences from historical census data, 1850-1950.

作者信息

Davern Michael, Ruggles Steven, Swenson Tami, Alexander J Trent, Oakes J Michael

机构信息

Minnesota Population Center, University of Minnesota, USA.

出版信息

Demography. 2009 Aug;46(3):589-603. doi: 10.1353/dem.0.0062.

Abstract

Virtually all quantitative microdata used by social scientists derive from samples that incorporate clustering, stratification, and weighting adjustments (Kish 1965, 1992). Such data can yield standard error estimates that differ dramatically from those derived from a simple random sample of the same size. Researchers using historical U.S. census microdata, however, usually apply methods designed for simple random samples. The resulting p values and confidence intervals could be inaccurate and could lead to erroneous research conclusions. Because U.S. census microdata samples are among the most widely used sources for social science and policy research, the need for reliable standard error estimation is critical. We evaluate the historical microdata samples of the Integrated Public Use Microdata Series (IPUMS) project from 1850 to 1950 in order to determine (1) the impact of sample design on standard error estimates, and (2) how to apply modern standard error estimation software to historical census samples. We exploit a unique new data source from the 1880 census to validate our methods for standard error estimation, and then we apply this approach to the 1850-1870 and 1900-1950 decennial censuses. We conclude that Taylor series estimation can be used effectively with the historical decennial census microdata samples and should be applied in research analyses that have the potential for substantial clustering effects.

摘要

几乎所有社会科学家使用的定量微观数据都来自包含聚类、分层和加权调整的样本(基什,1965年,1992年)。这样的数据可能会产生与来自相同规模的简单随机样本的标准误差估计值有显著差异的结果。然而,使用美国历史人口普查微观数据的研究人员通常采用为简单随机样本设计的方法。由此产生的p值和置信区间可能不准确,并可能导致错误的研究结论。由于美国人口普查微观数据样本是社会科学和政策研究中使用最广泛的数据源之一,因此可靠的标准误差估计至关重要。我们评估了综合公共使用微观数据系列(IPUMS)项目1850年至1950年的历史微观数据样本,以确定:(1)样本设计对标准误差估计的影响;(2)如何将现代标准误差估计软件应用于历史人口普查样本。我们利用1880年人口普查中一个独特的新数据源来验证我们的标准误差估计方法,然后将这种方法应用于1850 - 1870年和1900 - 1950年的十年一次人口普查。我们得出结论,泰勒级数估计可以有效地用于历史十年一次人口普查微观数据样本,并且应该应用于可能存在大量聚类效应的研究分析中。

相似文献

1
Drawing statistical inferences from historical census data, 1850-1950.
Demography. 2009 Aug;46(3):589-603. doi: 10.1353/dem.0.0062.
5
The shortcomings of synthetic census microdata.
Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2424655122. doi: 10.1073/pnas.2424655122. Epub 2025 Mar 6.

引用本文的文献

1
Intergenerational Coresidence and Family Transitions in the United States, 1850 - 1880.
J Marriage Fam. 2011 Feb;73(1):138-148. doi: 10.1111/j.1741-3737.2010.00794.x.
2
Big Data: Large-Scale Historical Infrastructure from the Minnesota Population Center.
Hist Methods. 2011 Jan 1;44(2):61-68. doi: 10.1080/01615440.2011.564572.
3
Prevalence and characteristics of indoor tanning use among men and women in the United States.
Arch Dermatol. 2010 Dec;146(12):1356-61. doi: 10.1001/archdermatol.2010.355.

本文引用的文献

1
The Decline of Intergenerational Coresidence in the United States, 1850 to 2000.
Am Sociol Rev. 2007 Dec 1;72(6):964-989. doi: 10.1177/000312240707200606.
3
Fertility in New York State in the pre-Civil War era.
Demography. 2008 May;45(2):345-61. doi: 10.1353/dem.0.0009.
4
Estimating regression standard errors with data from the Current Population Survey's public use file.
Inquiry. 2007 Summer;44(2):211-24. doi: 10.5034/inquiryjrnl_44.2.211.
6
Trends in educational assortative marriage from 1940 to 2003.
Demography. 2005 Nov;42(4):621-46. doi: 10.1353/dem.2005.0036.
7
Immigration and the American century.
Demography. 2005 Nov;42(4):595-620. doi: 10.1353/dem.2005.0031.
8
Race, gender, and marriage: destination selection during the Great Migration.
Demography. 2005 May;42(2):215-41. doi: 10.1353/dem.2005.0019.
9
Rethinking the "early" decline of marital fertility in the United States.
Demography. 2003 Nov;40(4):605-20. doi: 10.1353/dem.2003.0035.
10
The racial crossover in family complexity in the United States.
Demography. 2003 Aug;40(3):569-87. doi: 10.1353/dem.2003.0022.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验