University of Minnesota
University Relations
http://www.umn.edu/urelate
612-624-6868

Minnesota Supercomputing Institute


Log out of MyMSI
RugglesS

Research Abstracts Online
January - December 2011

Main TOC

University of Minnesota Twin Cities
Office of the Vice President for Research
Minnesota Population Center

PI: Steven Ruggles

Historical Census Record Linkage

This project entails linking records between complete-count datasets and samples from various 19th and early-20th century censuses. Each pair of datasets to be linked is divided on the basis of birthplace and gender. Additionally, datasets sometimes are constructed for married couples. The resulting “demographic perspectives” are then processed independently, e.g., males are processed together, but separately from females. Relatively static or predictably dynamic features are selected for comparison, including names (relatively static over the lifespan for certain subsets of the population) and ages (relatively predictably dynamic). Distance functions indicate similarity between pairs of feature values, which constitute test data for Support Vector Machines. The researchers use MSI resources for the generation of distance functions and the assignment of previously generated distance functions to pairs of records (potential links).

Group Members

Ron Goeken, Faculty Collaborator
Lap Huynh, Staff
Tom Lynch, Staff
Rebecca Vick, Staff