The Census Place Project: A Method for Geolocating Unstructured Place Names
Principal Investigator(s): View help for Principal Investigator(s) Enrico Berkes, Ohio State University; Ezra Karger, Chicago Federal Reserve Bank; Peter Nencka, Miami University
Version: View help for Version V2
Name | File Type | Size | Last Modified |
---|---|---|---|
1790_csv.zip | application/zip | 1.2 MB | 03/27/2023 10:57:AM |
1800_csv.zip | application/zip | 1.6 MB | 03/01/2023 01:03:PM |
1810_csv.zip | application/zip | 2.4 MB | 03/01/2023 01:02:PM |
1820_csv.zip | application/zip | 3.8 MB | 03/01/2023 01:02:PM |
1830_csv.zip | application/zip | 5.2 MB | 03/01/2023 01:02:PM |
1840_csv.zip | application/zip | 7.4 MB | 03/01/2023 01:02:PM |
1850_csv.zip | application/zip | 390.2 MB | 03/01/2023 01:03:PM |
1860_csv.zip | application/zip | 539.2 MB | 03/22/2023 09:39:AM |
1870_csv.zip | application/zip | 755.7 MB | 03/22/2023 09:57:AM |
1880_csv.zip | application/zip | 981.2 MB | 03/22/2023 10:34:AM |
- Total of 17 records. Records per page
- « previous Page of 2
- next »
Project Citation:
Berkes, Enrico, Karger, Ezra, and Nencka, Peter. The Census Place Project: A Method for Geolocating Unstructured Place Names. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2023-03-28. https://doi.org/10.3886/E179401V2
Project Description
Summary:
View help for Summary
Researchers use microdata to study the economic development of the United States and the causal effects of historical policies. Much of this research focuses on county- and state-level patterns and policies because comprehensive sub-county data is not consistently available. We describe a new method that geocodes and standardizes the towns and cities of residence for individuals and households in decennial census microdata from 1790--1940. We release public crosswalks linking individuals and households to consistently-defined place names, longitude-latitude pairs, counties, and states. Our method dramatically increases the number of individuals and households assigned to a sub-county location relative to standard publicly available data: we geocode an average of 83% of the individuals and households in 1790--1940 census microdata, compared to 23% in widely-used crosswalks. In years with individual-level microdata (1850--1940), our average match rate is 94% relative to 33% in widely-used crosswalks. To illustrate the value of our crosswalks, we measure place-level population growth across the United States between 1870 and 1940 at a sub-county level, confirming predictions of Zipf's Law and Gibrat's Law for large cities but rejecting similar predictions for small towns. We describe how our approach can be used to accurately geocode other historical datasets.
Related Publications
Published Versions
Report a Problem
Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.
This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.