College of Science & Engineering
Outdoor air pollution kills approximately three million people per year. Better estimates of how air pollutant concentrations vary in space and time will allow the predicting the number of deaths with greater precision and accuracy. Many existing concentration estimates that are used to predict air pollution health effects are based on the linear regression of land use attributes (e.g., roads, impervious surfaces) against measured pollution concentrations to predict concentrations in locations where there are no measurements. Based on the hypothesis that geographical attributes (e.g., restaurants, power plants, factories) are a better predictor of air pollution than land use types are, these researchers propose to create new concentration estimates based on a regression of geographical attributes from the OpenStreetMap database against measured concentrations. They propose to use advanced machine-learning techniques and Google’s TensorFlow machine learning library to handle the large number of feature categories that are available in the OpenStreetMap data.