Top of Page - CEHTP Color Logo

California Environmental Health Tracking Program :: Information for Action


DATA & METHODS OVERVIEW < PROJECT FINDINGS < ALAMEDA COUNTY PILOT PROJECT HOME

Alameda County Pilot Project:  Data and Methods Overview


 

An important aspect of the pilot project was to pilot test the use of existing datasets for Tracking, as well as to development and use various methods for analysis.  See below for descriptions of some of the data and methods used in the pilot project.

 

                   Birth outcomes data source

                   Asthma data sources

                   Estimating pollution levels

                   Linking health records to pollution exposure

                   Map making protocols

                   Publications

 

 

Birth outcomes data source

All birth data are drawn from Vital Records, which is the birth certificate information compiled by the State of California.  To protect confidentiality, the only identifying information we used from these files was the mothers' addresses, which were used to make the "smooth surface" maps (see Map making protocols).  Birth certificates in California record basic information such as birthweight and estimated gestational age, along with demographic data such as race and ethnicity.  For this project, we used data from 2001. (back to the top)

 

 

Asthma data sources

One of the goals of the Alameda County Pilot Project was to explore the potential of using health event information from administrative and billing records for asthma surveillance.  We combined databases from Kaiser Permanente of Northern California and the fee-for-service portion of Medi-Cal, the state's Medicaid program.  Kaiser Permanente is the region's largest healthcare provider and represents a relatively complete cross-section of the county population, while Medi-Cal would provide representation for very low-income families who experience a disproportionate burden of asthma.

 

The rationale of combining the two datasets was that we hoped to get as accurate a picture of the asthma-related health events as possible.  One should note, however, that this is not the same thing as collecting information from a random sample of Alameda County residents, which is an ideal way to do this.  The combined dataset is useful for making comparisons about rates of asthma-related events between different parts of the county, but it is difficult to use the information to compare these rates to those in other counties, the state, or the country (see Publications).  The dataset includes information on about 176,000 children under 17 in the county, which is about half of the total that live there.  For this project, we used data from 2001. 

(back to the top)

 

 

Estimating pollution levels

To estimate levels of traffic pollution for Alameda County in 2001, we used a method called land use regression.  In summary, this involved placing sampling tubes in various locations in the county during the spring and fall to measure levels of pollutants in these locations.  Then we compared the amount of pollution at each location with the characteristics of the location (such as distance from roadways, how many cars drove on nearby roads, etc).  We used this information to create a mathematical model that could predict amounts of NO2 for other locations in the county based on these characteristics.  (back to the top)

 

 

Linking health records to pollution exposures

Ideally, the way to understand the health effects of air pollution is to chemically analyze the air that people are breathing while monitoring them for changes in their health status, but this is very difficult and resource intensive on a large scale.  For this study, we only knew the residence address recorded by Medi-Cal or Kaiser Permanente for asthma and maternal address for birth outcomes.  Not only were we unable to directly monitor people's health, we were only able to make assumptions about their pollution exposure based on knowledge of where they lived.

 

As part of the Alameda County pilot project, we experimented with ways to show pollution levels in different parts of the county based on traffic patterns and variations in land use categories.  While different methods appear to have met with different levels of success, one should keep in mind that individual exposure levels are assigned based on the addresses of residence we have recorded for people.  People are of course exposed to different air at work, school, or en route between home and other places.  Therefore, no matter how good our exposure estimates are, assigning exposure based on home addresses should still be considered a very crude approach to this problem.  (back to the top)

 

 

Map making protocols

Making maps of health outcomes can be a very effective way of communicating information.  In choosing how to make these maps, we wanted a way to make high-resolution images, so that people could look at variations smaller than cities or ZIP codes and think about what was going on in the actual communities they knew.  At the same time, we wanted to preserve confidentiality so that there was no way to look at the maps and learn the health information about any one person. 

 

The process we used is called density estimation mapping, which was made possible by first converting all the addresses in the dataset into X and Y coordinates similar to longitude and latitude numbers.  Once this was done, a computer could be used to draw circles on a map of Alameda County, in this case circles a single mile across.  The computer could then calculate the rate of any event (such as preterm births or emergency room visits) for the people living within that circle.  By moving this circle in half-mile increments across the county, we could calculate rates for overlapping areas throughout the county for any place that had sufficient population density  to calculate a rate.

 

This process resulted in "grids" of health event rates containing all the information we needed regarding variations throughout the county, but protected confidential information for any single person involved.  Conventional software packages were then used to create "smoothed surface" images showing the variations in rates across the entire space covered by the grids we had created.  See Publications for more details.  (back to the top)

 

 

Publications

 

Progress in Pediatric Asthma Surveillance I: The Application of Health Care Use Data in Alameda County, California  

Roberts EM, English PB, Van den Eeden SK, Ray GT. Progress in pediatric asthma surveillance I: the application of health care use data in Alameda County, California. Prev Chronic Dis [serial online] 2006 Jul [date cited]. Available from: URL: http://www.cdc.gov/pcd/issues/2006/jul/05_0186.htm.  

 

Progress in Pediatric Asthma Surveillance II: Geospatial Patterns of Asthma in Alameda County, California 

Roberts EM, English PB, Wong M, Wolff C, Valdez S, Van den Eeden SK, et al. Progress in pediatric asthma surveillance II: geospatial patterns of asthma in Alameda County, California. Prev Chronic Dis [serial online] 2006 Jul [date cited]. Available from: URL: http://www.cdc.gov/pcd/issues/2006/jul/05_0187.htm. 

 

(back to the top)

 

 

For questions or more information, please contact:

Eric Roberts, MD PhD

erobert1@dhs.ca.gov

 

 

 

  Return to the pilot project findings page

 

 

 

 

 

CEHTP HomeOverview of Environmental Health TrackingWhat's NewNewsletterAbout UsProgram StructurePartner RolesResourcesGet InvolvedContact UsDirections to our OfficeProgram ActivitiesPlanning ConsortiumNeeds AssessmentOutreach & TrainingStakeholder InvolvementCapacity Building Mini-GrantsAlameda County Pilot ProjectCentral Valley/South Coast Pilot ProjectTechnical AssessmentSB702 ReportSB189 NewsSite Search & IndexCDHS-EHIB

This website is supported by Cooperative Agreement Number U50/CCU922449 from the Centers for Disease Control and Prevention (CDC). Its contents are solely the responsibility of the authors and do not necessarily represent the official views of CDC.