Printer Friendly
The Free Library
14,503,364 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Error and bias in determining exposure potential of children at school locations using proximity-based GIS techniques.


Advances in geographic information systems geographic information system (GIS)

Computerized system that relates and displays data collected from a geographic entity in the form of a map. The ability of GIS to overlay existing data with new information and display it in colour on a computer screen is used primarily to
 (GIS (1) (Geographic Information System) An information system that deals with spatial information. Often called "mapping software," it links attributes and characteristics of an area to its geographic location. ), statistical methodology, and availability of high-resolution georeferenced health and environmental data have created unprecedented opportunities for spatial epidemiology Spatial epidemiology is the study of the spatial distribution of disease.  to investigate local geographic variation in disease (Elliot and Wartenberg 2004). GIS has become widely used to locate the study population by geo-coding addresses, using proximity analysis of pollution sources as a surrogate surrogate n. 1) a person acting on behalf of another or a substitute, including a woman who gives birth to a baby of a mother who is unable to carry the child. 2) a judge in some states (notably New York) responsible only for probates, estates, and adoptions.  for exposure, and integrating environmental monitoring data into the analysis of health outcomes (Nuckols et al. 2004). As the capabilities of GIS have improved, address geo-coding has become a very accessible research methodology, and as a result the individual address is becoming a standard level of spatial investigation. Geo-coding can introduce bias and error (Rushton et al. 2006) but the effect this has on the results of epidemiologic studies epidemiologic study A study that compares 2 groups of people who are alike except for one factor, such as exposure to a chemical or the presence of a health effect; the investigators try to determine if any factor is associated with the health effect  has received limited attention. In this study we explored the effect of positional error in geo-coding using a case study of the exposure potential of children at school locations to traffic-related air pollution.

There are many potential problems with geo-coding, which have been well described in the literature (Harries 1999; Krieger et al. 2001; Ratcliffe 2001; Rushton et al. 2006; Whitsel et al. 2004). Research on the quality of geo-coding has emphasized a consideration of completeness, positional accuracy, and repeatability (Whitsel et al. 2004). The potential bias and error introduced by variability in match rates has received most attention (Hurley Hurley has become the English version of at least three distinct original Irish names: the Ó hUirthile, part of the Dál gCais tribal group, based in Clare and North Tipperary; the Ó Muirthile, based around Kilbritain in west Cork; and the OhIarlatha, from the district of  et al. 2003; Oliver et al. 2005). The effects of the positional accuracy and repeatability of geo-coding has received limited attention and is therefore the subject of this study.

Several studies have determined quantitative estimates of the positional accuracy of geo-coding. Estimates of typical positional errors range from 38 to 75 m (Bonner et al. 2003; Cayo and Talbot 2003; Dearwent et al. 2001; Karimi and Durcik 2004; Ratcliffe 2001; Ward et al. 2005) based on mean or median values Noun 1. median value - the value below which 50% of the cases fall
median

statistics - a branch of applied mathematics concerned with the collection and interpretation of quantitative data and the use of probability theory to estimate population
. Results in urban areas are generally more accurate than in rural areas (Bonner et al. 2003; Cayo and Talbot 2003; Ward et al. 2005). This suggests that the positional error of geo-coding can be substantial and needs to be characterized char·ac·ter·ize  
tr.v. character·ized, character·iz·ing, character·iz·es
1. To describe the qualities or peculiarities of: characterized the warden as ruthless.

2.
 in a meaningful manner relevant to the use of the geocoded locations. Particularly with reference to epidemiologic studies, when short distances are associated with health effects, the geo-coding result must have a positional accuracy that is sufficient to resolve whether such effects are present (Rushton et al. 2006).

Vehicular traffic-related emissions are a major source of air pollution, especially in urban areas. Proximity to busy roads has been associated with health effects in children, particularly respiratory symptoms and asthma (Brauer et al. 2002; Brunekreef et al. 1997; Ciccone et al. 1998; Edwards et al. 1994; Gauderman et al. 2005; Janssen et al. 2001; Kim et al. 2004; Lewis et al. 2005; Morris et al. 2000; Nicolai et al. 2003; van Vliet et al. 1997; Venn et al. 2000, 2001; Zmirnou et al. 2004). Several studies have also found associations between proximity to traffic and higher rates of childhood cancer (Pearson et al. 2000; Raaschou-Nielsen et al. 2001; Savitz and Feingold 1989), but not all studies have been conclusive Determinative; beyond dispute or question. That which is conclusive is manifest, clear, or obvious. It is a legal inference made so peremptorily that it cannot be overthrown or contradicted.  in this regard (Langholz et al. 2002; Reynolds et al. 2002).

Children were chosen as the subject of our study because they represent the largest portion of the population that is susceptible to environmental health risks, and air pollution in particular (Kim 2004; Schwartz 2004). The selection of school locations reflects a longstanding interest to consider time-activity patterns in exposure assessment (Duan 1982; Sexton sex·ton  
n.
An employee or officer of a church who is responsible for the care and upkeep of church property and sometimes for ringing bells and digging graves.
 and Ryan 1998). Many factors affect the exact nature of time-activity patterns (McCurdy and Graham 2003), but several studies confirm that for children schools represent the second most important location (after the home) to consider in environmental exposure analysis (Klepeis et al. 2001; Leech leech, predacious or parasitic annelid worm of the class Hirudinea, characterized by a cylindrical or slightly flattened body with suckers at either end for attaching to prey.  et al. 2002; Schwab et al. 1992; Xue et al. 2004).

Many studies have documented that the concentration of traffic pollution drops off rapidly with increasing distance from the road (Briggs et al. 2000; Gilbert et al. 2003, 2005; Hitchins et al. 2000; Kuhler et al. 1988; Morawska et al. 1999; Ross et al. 2006; Van Vliet et al. 1997; Venn et al. 2001; Wrobel et al. 2000; Zhu et al. 2002a, 2002b). Concentrations are highest near roadways, decrease rapidly after an exponential function exponential function

In mathematics, a function in which a constant base is raised to a variable power. Exponential functions are used to model changes in population size, in the spread of diseases, and in the growth of investments.
, and reach near-background levels at approximately 300-500 m from the road. Based on this strong spatial gradient gradient

In mathematics, a differential operator applied to a three-dimensional vector-valued function to yield a vector whose three components are the partial derivatives of the function with respect to its three variables. The symbol for gradient is ∇.
 in pollutant pol·lut·ant
n.
Something that pollutes, especially a waste material that contaminates air, soil, or water.
 concentrations, measuring proximity to major roads using GIS has become a widely employed alternative to actual exposure monitoring. In a typical analysis scenario, one or more buffer sizes are used to determine whether geocoded locations fall within certain distances from major roads. Most studies use only a single buffer distance In nuclear warfare:1. The horizontal distance which, when added to the radius of safety, will give the desired assurance that the specified degree of risk will not be exceeded. The buffer distance is normally expressed quantitatively in multiples of the delivery error.
2.
, including 100 m (Giordian et al. 2006), 150 m (Gauderman et al. 2005; Green et al. 2004), 169 m (English et al. 1999), 229 m (Wilhelm and Ritz Ritz

elegant and luxurious hotel opened in Paris in 1898 by César Ritz; hence, ‘ritzy, putting on the ritz.’ [Fr. Hist.: Wentworth, 429]

See : Luxury
 2003), 300 m (Zmirnou et al. 2004), and 457 m (Langholz et al. 2002). Several other studies have used multiple distances ranging from 30 to 300 m (Hoek et al. 2002; Lewis et al. 2005; McConnell et al. 2006; Ong et al. 2006). Although the use of discrete buffer distances has been criticized for not capturing the true distance-exposure relationship (Maantay 2002; Zandbergen and Chakraborty 2006), their use is justified by the strong empirical evidence that pollutant concentrations follow a relatively predictable and rapid decrease with distance.

Studies on the effect of traffic-related air pollution have also considered traffic volume in the determination of environmental exposure conditions; adverse effects are observed for traffic counts starting at about 25,000 vehicles per day (Edwards et al.1994; English et al. 1999; Wijst et al. 1993). This value has become the lower exposure threshold used in studies that have modeled the potential exposure based on traffic counts and proximity (Green et al. 2004; Houston et al. 2006; Ong et al. 2006).

Various proximity-based metrics metrics Managed care A popular term for standards by which the quality of a product, service, or outcome of a particular form of Pt management is evaluated. See TQM.  have been employed to relate traffic counts to exposure, including distance to the nearest major roadway with a high traffic count per day (Gauderman et al. 2005; Green et al. 2004; Lewis et al. 2005), the sum of traffic count within a buffer (Ong et al. 2006), distance-weighted traffic density (English et al., 1999; Gauderman et al. 2005; Pearson et al. 2000; Wilhelm and Ritz, 2003; Zmirnou et al. 2004), and traffic count at the nearest road (Raaschou-Nielsen et al. 2001). Studies comparing these traffic metrics to actual exposure to traffic-related pollutants pollutants

see environmental pollution.
 have been few (Briggs et al. 2000; Gauderman et al. 2005), but they suggest pollutant concentrations correlate with distance from nearest road, traffic counts, and modeled air pollution. The distance to major road metric has been suggested as a reasonable, relatively easy-to-visualize metric for descriptive purposes (Green et al. 2004). Proximity to major roads is also computationally com·pu·ta·tion  
n.
1.
a. The act or process of computing.

b. A method of computing.

2. The result of computing.

3. The act of operating a computer.
 easy to estimate from data that are readily available compared with the meteorologic me·te·or·ol·o·gy  
n.
The science that deals with the phenomena of the atmosphere, especially weather and weather conditions.



[French météorologie, from Greek
 and traffic volume data required to model exposure conditions.

Several types of measurement error exist in exposure assessment to traffic-related air pollution (Van Atten et al. 2005). For example, Molitor et al. (2006) documented the effect of missing exposure data at the individual level and determined that the methodology chosen to account for this missing data influences the conclusions regarding the observed health effects. Another type of measurement error is the positional error in the various spatial data Data that is represented as 2D or 3D images. A geographic information system (GIS) is one of the primary applications of spatial data (land maps). See spatial analysis, spatial resolution and GIS glossary.  sets used to derive exposure estimates, which is the focus of this study.

Given that most studies on the exposure of children to air pollution from traffic have used relatively short distances of 50-500 m to majorroadways with traffic counts of ??25,000 vehicles, the question arises whether the geocoded locations of schools are accurate enough to allow for this type of proximity analysis. Several types of positional errors can be identified, including error in the major road network used for vehicle counts, error in the street reference data used for geo-coding, and error introduced by the geo-coding process.

The positional error of street reference data is closely related to the scale of the data. For example, data at a scale of 1:24,000 will be accurate to within 12 m 90% of the time based on National Map Accuracy Standards (NMAS NMAS Novell Modular Authentication Service
NMAS National Map Accuracy Standards
NMAS Nursing and Midwifery Admissions Service (UK)
NMAS North Mississippi Allstars (band)
NMAS Network Management System
) (U.S. Geological Survey The term geological survey can be used to describe both the conduct of a survey for geological purposes and an institution holding geological information.

A geological survey
 1999). The widely used Topologically Integrated Geographic Encoding and Referencing
TIGER redirects here. For other uses see Tiger (disambiguation).


Topologically Integrated Geographic Encoding and Referencing, or TIGER, or TIGER/Line
 (TIGER) street data from the U.S. Census Bureau Noun 1. Census Bureau - the bureau of the Commerce Department responsible for taking the census; provides demographic information and analyses about the population of the United States
Bureau of the Census
 meets the standard for 1:100,000 scale maps and will be accurate to within 50 m 90% of the time, although the most recent versions of TIGER data are expected to be of greater accuracy (U.S. Census Bureau 2000).

These errors are potentially additive additive

In foods, any of various chemical substances added to produce desirable effects. Additives include such substances as artificial or natural colourings and flavourings; stabilizers, emulsifiers, and thickeners; preservatives and humectants (moisture-retainers); and
, presenting a major challenge to fine-scale analysis which relies on small positional error. Of the several types of errors listed above, only the positional error of the major roads has received attention in the literature on the effects of traffic-related air pollution on children (Ong et al. 2006; Wu et al. 2005). Both these studies determined the reliability of using moderately accurate street reference data for geo-coding by manually realigning it with higher quality reference data. Geo-coding results were found to be very unreliable for analysis using short distances.

The main objective of our study was to determine the influence of the quality of geo-coding on the analysis of the effect of traffic- related air pollution on children at school locations. Two aspects of geo-coding quality are considered: positional accuracy (difference between gecoded locations and the actual school locations) and repeatability (difference between the results of different geo-coding techniques). We selected school locations for this study because they represent the second most important location where children spend their time, after the home residence, and because geo-coding has been widely employed in studies that have tried to determine exposure potential to air pollution at school locations (e.g., Chakraborty 2001; Green et al. 2004; Ong et al. 2006). We selected multiple geo-coding techniques to determine the sensitivity of the results to variability in geo-coding quality. The first technique consists of using county street centerlines. This represents the highest quality street reference network available for free in most areas. The second technique consists of TIGER roads from the U.S. Census Bureau (2000). Although its positional and attribute accuracy is often inferior INFERIOR. One who in relation to another has less power and is below him; one who is bound to obey another. He who makes the law is the superior; he who is bound to obey it, the inferior. 1 Bouv. Inst. n. 8.  to that of other data sources, it is a very widely used data source for geo-coding because it is free and covers all of the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. . The third and fourth techniques consist of using the services of two commercial geo-coding firms, a common practice among environmental exposure researchers who need to geocode ge·o·code  
n.
The demographic characterization of a neighborhood or locality, especially as used in marketing.
 addresses.

Methods

The study design relies on a comparison between the results of geo-coding and the actual school locations. We obtained addresses for all public schools in Orange County, Florida Orange County is a county located in the U.S. state of Florida and is part of the Orlando-Kissimmee Metropolitan Statistical Area (MSA). As of 2006 Census Bureau estimates, the population is 1,043,500. [1] The county seat is Orlando. , for 2005 from the Orange County School Board (2005). We determined the actual location of each school by using a detailed 1:2,000 digital parcel database for 2005 for Orange County (Orange County Property Appraiser A person selected or appointed by a competent authority or an interested party to evaluate the financial worth of property.

Appraisers are frequently appointed in probate and condemnation proceedings and are also used by banks and real estate concerns to determine the market
 2005). We identified all 153 schools in this database through manual searches based on the fields for physical address and ownership (i.e., Orange County School Board). After we identified the correct property, we overlaid o·ver·laid  
v.
Past tense and past participle of overlay1.
 the parcel boundaries on digital 1-m color orthophotography for 2005 in ArcGIS 9 (ESRI (Environmental Systems Research Institute, Inc., Redlands, CA, www.esri.com) The world's leading developer of geographic information systems (GIS) software, including programs that plot ZIP codes and addresses, demographic information and detailed, color-coded data.  Inc., Redlands, CA). In this overlay (1) A preprinted, precut form placed over a screen, key or tablet for identification purposes. See keyboard template.

(2) A program segment called into memory when required.
 the parcel boundaries are shown and the aligned imagery provides a detailed look at building(s) at the school site. We used this overlay to manually digitize To convert an image or signal into digital code by scanning, tracing on a graphics tablet or using an analog to digital conversion device. 3D objects can be digitized by a device with a mechanical arm that is moved onto all the corners.  the exact outline of the school building(s). We then created a single centroid centroid

In geometry, the centre of mass of a two-dimensional figure or three-dimensional solid. Thus the centroid of a two-dimensional figure represents the point at which it could be balanced if it were cut out of, for example, sheet metal.
 from the digitized building(s) for each school to represent the "true" location.

We geocoded the schools using a 1:5,000 street centerline cen·ter·line  
n.
1. A line that bisects something into equal parts.

2. A painted line running along the center of a road or highway that divides it into two sections for traffic moving in opposite directions, or, in the case of
 network from Orange County for 2005 (Orange County Growth Management 2005) and the TIGER 2000 streets from the U.S. Census Bureau (2000), both using ArcGIS 9. Identical settings were employed, including address locator style, field types used in the address locator, minimum match score, and spelling sensitivity. For any records that did not automatically produce a reliable match, exhaustive manual interactive matching was carried out to achieve the highest possible number of reliable matches. We used a perpendicular offset of 10 m in the placement of the geocoded locations, based on the typical width of the right-of-way of local roads of 15-20 m. We also sent out the address file for processing to two commercial geo-coding firms, which we refer to as Firm A and Firm B. It was not possible to specify geo-coding settings, but we used the final match codes to identify the high-quality matches and used only those in further analysis. Match rates for the four techniques were 94.8% for street centerlines, 89.5% for TIGER roads, 90.2% for Firm A, and 90.9% for Firm B. For the remainder of the analysis, we used only records that could reliably be geocoded using all four techniques (n = 126).

We determined the positional accuracy of the geocoded locations by measuring the straight-line distance between the geocoded location and the actual location of the particular school. This was repeated for each of the four sets of geocoded results. We characterized error distributions using descriptive statistics descriptive statistics

see statistics.
 and cumulative distribution functions.

We determined exposure potential to traffic-related air pollution using proximity to high traffic intensity roads. We obtained a detailed 1:24,000 road network for the State of Florida from the Florida Department of Transportation The Florida Department of Transportation (FDOT) is a decentralized agency charged with the establishment, maintenance, and regulation of public transportation in the state of Florida[1].  (FDOT FDOT Florida Department Of Transportation  2005) with average annual daily traffic (AADT AADT Annual Average Daily Traffic ) values for 2005 for each road segment. We selected road segments with an AADT of ?????25,000 for further analysis. For each school we determined the straight-line distance to the nearest road segment using ArcGIS 9 for the actual location as well as for the four geocoded locations. We also created straight-line buffer zones buffer zone
n.
A neutral area between hostile or belligerent forces that serves to prevent conflict.

Noun 1. buffer zone
 around the road segments with an AADT of 25,000 as discrete representations of distances commonly used in studies on traffic-related air pollution. We used buffer radii ra·di·i  
n.
A plural of radius.


radii
Noun

a plural of radius
 of 50, 100, 150, 250, 500, and 1,000 m.

Figure 1 shows the actual locations of the schools in Orange County as well as the major road network with AADT values of ?????25,000 vehicles per day.

We determined bias and error introduced by geo-coding by comparing the results of proximity analysis using the actual school locations and the fours sets of geo-coding results. Specifically, we determined the number of correctly and incorrectly classified schools using geo-coding for the buffer zones described above. This required determining for each buffer zone which schools are actually located within that distance, which schools are correctly classified as being located within that distance using geo-coding (confirmed positives), which schools are incorrectly classified as being located outside that distance (false negatives), which schools are incorrectly classified as being located within that distance (false positives), and which schools are correctly classified as being located outside that distance (confirmed negatives). We determined the overall agreement between the results for actual school locations and geocoded locations for each distance using percentage false negatives, percentage false positives, sensitivity, and specificity. We repeated this analysis for each of the four sets of geocoded locations.

Results

Figure 2 shows the cumulative distribution functions of the positional error in the geo-coding results, and Table 1 provides descriptive statistics. The first characteristic to notice is the non-normal distribution of errors; the mean is much higher than the median in all four distributions, and the distributions are highly skewed skewed

curve of a usually unimodal distribution with one tail drawn out more than the other and the median will lie above or below the mean.

skewed Epidemiology adjective Referring to an asymmetrical distribution of a population or of data
 due to the occurrence of a small number of very large error values.
Table 1. Summary statistics for the positional error (in meters) of
geocoded locations of schools (n = 126) in Orange County, Florida,
using four different techniques.

Statistics           Centerlines  TIGER  Firm A  Firm B

Mean                        219    351     300     461
Median                      155    178     153     151
Standard deviation          272    604     602   2,330
Minimum                      50     49      48      39
Maximum                   2,302  4,379   5,565   2,596
90th percentile             211    271     238     218
95th percentile             227    302     255     237
95% RMSEa                   196    306     235     210

(a) 95% RMSE is the root mean square error of the error distribution
after removing 5% outliers. It is more common to use the 100% RMSE,
but for non-normally distributed data the removal of 5% outliers before
determining the RMSE value produces a more robust accuracy statistic.


The distributions are relatively similar for the four techniques considered, but the error is consistently larger for the TIGER results. For example, the 50th percentile percentile,
n the number in a frequency distribution below which a certain percentage of fees will fall. E.g., the ninetieth percentile is the number that divides the distribution of fees into the lower 90% and the upper 10%, or that fee level
 is 155 m for street centerlines, 178 m for TIGER roads, 153 m for Firm A, and 151 m for Firm B. At higher percentiles, the curves are a bit further apart, suggesting that street centerline geo-coding is the most accurate technique. For example, the 95th percentiles are 227 m for street centerlines, 302 m for TIGER roads, 255 m for Firm A, and 237 m for Firm B. To characterize the overall distribution the 95% root mean square error (RMSE RMSE Root Mean Square Error
RMSE Root Mean Squared Error
) is a robust statistic statistic,
n a value or number that describes a series of quantitative observations or measures; a value calculated from a sample.


statistic

a numerical value calculated from a number of observations in order to summarize them.
 for non-normally distributed positional errors; the 95% RMSE is 196 m for street centerlines, 306 m for TIGER roads, 235 m for Firm A, and 210 m for Firm B. These results strongly suggest that the positional error in school locations for all types of geo-coding is quite large, and that the error is much higher when using TIGER roads.

Figure 3 shows the cumulative distribution functions of the distance to the nearest major road for the actual school locations and the four sets of geocoded locations. These distributions can be used to examine bias. "Bias" is defined here as a consistent over-or underestimation of the number of schools at risk. If there were no bias, the curves would be nearly identical, and any consistent difference would indicate over-or underestimation. Figure 3 reveals that the curves for the four geo-coding techniques are consistently higher than the curve for the actual school locations at most distance values &lt 1,000 m. This indicates that the geo-coding results provide a consistent overestimation o·ver·es·ti·mate  
tr.v. o·ver·es·ti·mat·ed, o·ver·es·ti·mat·ing, o·ver·es·ti·mates
1. To estimate too highly.

2. To esteem too greatly.
 of the potential schools at risk. Comparing the results for the four sets of geocoded locations reveals that the use of TIGER roads results in the largest bias at distances of up to 500 m.

Although the results in Figure 3 reveal the bias introduced by geo-coding, they do not show the occurrence of error in the form of false positives and negatives. Therefore, for every buffer radius considered, we determined the number of schools within that distance using actual locations and the geocoded locations, as well as the agreement between the results in terms of correctly classifying schools. Table 2 shows the results of this analysis.
Table 2. Bias and error in determining schools (n = 126) at risk based
on proximity to major roads in Orange County, Florida.

                          No. of schools within buffer zone

Geocoding      School       Street     Confirmed    False      False
type,         buildings  geocoding     positives  negatives  positives
buffer
radius (m)

Street
centerlines

50                   1          3          0          1          3

100                  3          5          1          2          4

150                  6          9          4          2          5

250                 17         20         12          5          8

500                 44         44         42          2          2

1,000               69         71         66          3          5

TIGER roads

50                   1          9          0          1          9

100                  3         15          1          2         14

150                  6         20          4          2         16

250                 17         29         13          4         16

500                 44         46         40          4          6

1,000               69         70         67          2          3

Commercial
Firm A

50                   1          5          0          1          5

100                  3         11          1          2         10

150                  6         14          4          2         10

250                 17         23         11          6         12

500                 44         46         39          5          7

1,000               69         72         66          3          6

Commercial
Firm B

50                   1          3          0          1          3

100                  3          8          2          1          6

150                  6         12          5          1          7

250                 17         22         10          7         12

500                 44         48         43          1          5

1,000               69         72         67          2          5

             No. of schools within buffer zone

Geocoding    Confirmed
type,        negatives
buffer
radius (m)

Street
centerlines

50                 122

100                119

150                115

250                101

500                 80

1,000               52

TIGER roads

50                 116

100                109

150                104

250                 93

500                 76

1,000               54

Commercial
Firm A

50                 120

100                113

150                110

250                 97

500                 75

1,000               51

Commercial
Firm B

50                 122

100                117

150                113

250                 97

500                 77

1,000               52

                            Measures of agreement

             Prevalance    False    False  Sensitivity  Specificity

Geocoding       (%)a     negatives  (%)d       (%)e
type,                      (%)b
                         positives
                           (%)c

buffer
radius (m)

Street             0.79       0.79   2.38         0.00        97.60
centerlines

50                 2.38       1.59   3.17        33.33        96.75

100                4.76       1.59   3.97        66.67        95.83

150               13.49       3.97   6.35        70.59        92.66

250               34.92       1.59   1.59        95.45        97.56

500               54.76       2.38   3.97        95.65        91.23

1,000

TIGER roads        0.79       0.79   7.14         0.00        92.80

50                 2.38       1.59  11.11        33.33        88.62

100                4.76       1.59  12.70        66.67        86.67

150               13.49       3.17  12.70        76.47        85.32

250               34.92       3.17   4.76        90.91        92.68

500               54.76       1.59   2.38        97.10        94.74

1,000

Commercial         0.79       0.79   3.97         0.00        96.00
Firm A

50                 2.38       1.59   7.94        33.33        91.87

100                4.76       1.59   7.94        66.67        91.67

150               13.49       4.76   9.52        64.71        88.99

250               34.92       3.97   5.56        88.64        91.46

500               54.76       2.38   4.76        95.65        89.47

1,000

Commercial         0.79       0.79   2.38         0.00        97.60
Firm B

50                 2.38       0.79   4.76        66.67        95.12

100                4.76       0.79   5.56        83.33        94.17

150               13.49       5.56   9.52        58.82        88.99

250               34.92       0.79   3.97        97.73        93.90

500               54.76       1.59   3.97        97.10        91.23

1,000

(a) Number of schools residing within the buffer radius as a percentage of all schools within the study area (n = 126). (b) Number of false negatives as a percentage of all schools within the study area (n = 126). (c) Number of false positives as a percentage of all schools within the study area (n = 126). (d) Number of confirmed positives as a percentage of all schools within the study area (n = 126). (e) Number of confirmed negatives as a percentage of all schools within the study area (n = 126).


Table 2 shows the number of schools located within each buffer radius based on actual and geocoded locations. For each of the four sets of geocoded locations and for nearly all distances considered, the number of schools at risk is consistently higher using geocoded locations than using actual locations, confirming the strong bias toward an overestimation of the number of schools at risk as already shown in Figure 3. Table 2 also shows the confirmed positives, false negatives, false positives, and confirmed negatives. The percentage of false negatives and positives is calculated relative to the total sample size (n = 126). The percentage false of positives is consistently higher than the percentage of false negatives, confirming the strong bias toward an overestimation of the number of schools at risk.

Although the number of false positives and negatives as a percentage of the total sample is relatively small, at short distances the error in classification is very substantial. For example, three schools are actually located within 100 m of a major road, and street centerline geo-coding correctly identifies only one of them. Centerline geo-coding identifies four other schools within 100 m, but these are all false positives. This error is more formally expressed in the measure for sensitivity, which is the percentage of schools located within a specific buffer radius that were correctly identified using geo-coding. As can be seen in Table 2, the values for sensitivity are very low at short distances and gradually increase to values of 90% or higher at distances of 500-1,000 m. The very low values for sensitivity at short distances strongly indicate that the results at these distances are very unreliable. A final measure of agreement is the specificity, which is the percentage of schools located outside a specified buffer radius that are correctly identified using geo-coding. Values are consistently high, with most values > 90% at all distances considered. This is largely owing to owing to
prep.
Because of; on account of: I couldn't attend, owing to illness.

owing to prepdebido a, por causa de 
 that fact that although the identification of schools at risk is very unreliable at short distances, their prevalence is quite low.

When comparing the results of the four geo-coding techniques, the results using TIGER roads are the least reliable, with the highest counts of false positives and the lowest values for sensitivity and specificity. Results from Firm A are the second least reliable, with the second highest counts of false positives and the second lowest values for sensitivity and specificity. Results for street centerlines and commercial Firm A are very similar, and a determination of which method is most reliable varies with the exact distance value and parameter (1) Any value passed to a program by the user or by another program in order to customize the program for a particular purpose. A parameter may be anything; for example, a file name, a coordinate, a range of values, a money amount or a code of some kind.  chosen.

The results in Table 2 strongly suggest that the identification of schools at risk based on proximity is unreliable at short distances. This raises an important question: At what distance, if any, the results do become reliable? We use the measure for sensitivity to try to answer this. If a value of 90% for sensitivity is deemed acceptable, the minimum distance needed to achieve reliable results is 500 m for street centerlines, TIGER, and Firm B, and 1,000 m for Firm A. If a value of 95% is used, the minimum distance is 500 m for street centerlines and Firm B, and 1,000 m for TIGER and Firm A. These results suggest that the reliability of proximity-based identification of schools at risk varies with geo-coding quality, but that overall the results are not reliable at distances < 500 m.

Discussion

The positional error in geocoded locations of schools was very high relative to the accuracy requirements for fine-scale proximity analysis; a median error of 155 m for street centerline geo-coding, 178 m for TIGER roads, 153 m for Firm A, and 151 m for Firm B, and 95% RMSE values of 196, 306, 235 and 210 m, respectively. These estimates are substantially higher than those found in previous studies (Bonner et al. 2003; Cayo and Talbot 2003; Dearwent et al. 2001), which have looked mostly at residential addresses. The larger positional errors for schools are a result of the much larger parcels on which most schools are located, relative to residential properties. Both larger parcel sizes and variability in parcel size along a street segment can contribute to larger positional errors.

The amount of bias and error introduced by the positional error in geo-coding is substantial. As a general rule, spatial data must be much more accurate than the minimum distance used in spatial analysis (Data West Research Agency definition: see GIS glossary.) Analytical techniques to determine the spatial distribution of a variable, the relationship between the spatial distribution of variables, and the association of the variables of an area.  for the results to be meaningful (Diggle 1993; Waller 1996); this rule is clearly not met when using the results of geo-coding of schools in fine-scale analysis in the order of 100 or even several hundred meters.

Figure 4 shows the geo-coding results for selected areas that we use to discuss several common scenarios. In each of the four scenarios the school building centroid and the geocoded locations are shown, in addition to the street centerlines, TIGER road, and the major roadways (AADT [greater than or equal to]?25,000). Street reference networks from the commercial firms are not available, but the placement of the geocoded locations suggests they are relatively similar to the street centerlines.

The first general observation from Figure 4 is the influence of the positional accuracy of the street network. The street centerlines are almost perfectly aligned with the aerial imagery, suggesting a very high positional accuracy. This suggests that the positional error in geo-coding using this reference network is largely attributed to the errors in the placement of the geocoded location along the street segments, not the position of the network itself. The positional error in the TIGER roads, however, is substantial, and misalignment mis·a·ligned  
adj.
Incorrectly aligned.



misa·lignment n.
 with the imagery is observed for almost every segment. Errors up to 50 m are very common, and errors of several hundred meters also occur. This explain the much larger positional error found in the geo-coding results using TIGER roads.

A related observation from Figure 4 is the influence of the positional accuracy of the major road network. In most of the study area the major road network is very well aligned with the imagery and is a near-perfect match with the street centerlines. This suggests that, except for TIGER roads, misalignment of spatial data sets is not a significant factor, as it was in previous studies (Ong et al. 2006; Wu et al. 2005). Also, very few schools are located directly on major roads; most are located on side streets of major roads, making the placement along the side street relative to the major roads the critical factor in correctly identifying schools at risk.

Figure 4A shows a typical scenario where the geocoded locations are in relatively close proximity to each other, but at some distance from the actual school. There are several factors at work here. First, there is the "driveway" effect: Many schools are located on a fairly large parcel and a private driveway leads from the road to the actual building. This driveway does not appear in street network data. So even if a geocoded location were directly in front of the school parcel, it would be at some distance from the actual building. Second, there is misplacement mis·place  
tr.v. mis·placed, mis·plac·ing, mis·plac·es
1.
a. To put into a wrong place: misplace punctuation in a sentence.

b.
 along the street network, as evidenced in particular for the result for Firm B in Figure 4A. Geo-coding relies on linear interpolation Linear interpolation is a method of curve fitting using linear polynomials. It is heavily employed in mathematics (particularly numerical analysis), and numerous applications including computer graphics. It is a simple form of interpolation.  of the actual address within the address range for the street segment. If the address range for the segment does not reflect the true addresses, or the parcels along the segment are not uniform, locations are misplaced mis·place  
tr.v. mis·placed, mis·plac·ing, mis·plac·es
1.
a. To put into a wrong place: misplace punctuation in a sentence.

b.
 along the segment relative to the actual location. As a result of these two effects, all geocoded locations in Figure 4A are placed closer to the major road than the actual school building, resulting in a (potential) false positive. This particular scenario accounts for most false positives in our study.

Figure 4B shows a scenario similar to 4A, but in this case the geocoded locations are further away from the major road than the actual school location, resulting in a (potential) false negative. This particular scenario accounts for most false negatives in our study. Figure 4B also illustrates the substantial error in the TIGER roads, which in this case translates in only a minor additional error in the geo-coding results.

Figure 4C illustrates a scenario where the school is located on a very large parcel, resulting in substantial driveway effect. In this case the actual school buildings are much closer to the major roadway, producing a (potential) false negative. This scenario illustrates that even the most accurate geo-coding result (i.e., a location mapped directly in front of or inside the school parcel) does not capture the school location sufficiently accurate for fine-scale proximity analysis. The scenario again illustrates the substantial error in the TIGER results.

Figure 4D illustrates a scenario where the driveway effect is limited, but the placement of the geocoded locations along the segment varies substantially among the four techniques. Results for street centerlines and Firm B are quite accurate, resulting in a correct classification of whether the school is at risk. Results for TIGER and Firm A are placed at several hundred meters from the actual school location, resulting in incorrect classification.

When comparing the four different sets of geocoded locations in Figure 4, some relevant patterns emerge. First, results for street centerlines and Firm B are consistently close to each other and typically the most accurate relative to the actual school location. This suggests a high degree of repeatability of geo-coding when considering only these two techniques. Results for Firm A are typically at some distance from the previous two locations along the same street segment, and typically at a larger distance from the actual location. Results for TIGER show much more variability, with several locations being placed incorrectly due to both positional and attribute errors in the TIGER roads. These examples illustrate the general findings for positional errors in the geo-coding results and the reliability of the proximity-based exposure analysis: Street centerlines and Firm B are most accurate and reliable, followed by Firm A, with TIGER a distant fourth.

One final pattern that emerges from the four scenarios is that most of the error in the classification of schools results from errors in the placement of the geocoded locations along the street segment, not from the error in the distance from the road. This is clearly illustrated in Figures 4A--C, where placing the geocoded locations at the same distance from the road as the actual school location would not improve the correct identification of schools at risk. This is a result of the fact that in our study area very few schools are located directly on a major road with high traffic densities but instead on side streets perpendicular to major roads.

The four scenarios presented in Figure 4 all contribute to errors in the proximity analysis in the form of false positives and negatives. If these errors were strictly random, the number of false positives and negatives would be the same. However, we observed a consistent bias in the results of the proximity analysis with many more false positives than false negatives. The scenarios in Figures 4A and 4D contribute to false positives, and an inspection of the entire data set reveals that these are in fact the most common scenarios for false positives. The observed bias in the proximity analysis, therefore, is a result of a combination of positional and attribute errors in the street reference data, limitations of the linear interpolation algorithm algorithm (ăl`gərĭth'əm) or algorism (–rĭz'əm) [for Al-Khowarizmi], a clearly defined procedure for obtaining the solution to a general type of problem, often numerical.  used in geo-coding, and the inherent limitations of a street network to capture the actual locations of school buildings located on large parcels.

When considering the results for the four sets of geocoded locations, several observations regarding the accuracy and repeatability of geo-coding emerge. First, the results from the TIGER 2000 roads were by far the least accurate, and the use of these data for fine-scale spatial analysis should be discouraged dis·cour·age  
tr.v. dis·cour·aged, dis·cour·ag·ing, dis·cour·ag·es
1. To deprive of confidence, hope, or spirit.

2. To hamper by discouraging; deter.

3.
 despite their widespread availability and low cost. When updated versions of the TIGER data become available for a particular area, they should be checked to determine whether they provide improved accuracy and reliability. Second, the differences between the results from the two commercial geo-coding firms suggest a substantial lack of repeatability, which is not reflected in the cost (Firm B was more accurate and cheaper). Commercial geo-coding is available for most areas at relatively low cost, but results typically do not include a measure of positional accuracy, and their use for finescale analysis needs to be carefully considered. Third, the use of street centerlines produced the most accurate and reliable results (and results for Firm B were very similar), but this type of data may not be available everywhere, and their quality may vary between jurisdictions, limiting usefulness for larger data sets across multiple jurisdictions.

One very important limitation of this study is that school locations are somewhat unusual in that they are typically located on large parcels relative to other types of locations (such a private residences). As a result, even a geocoded location very close to or inside the correct parcel may be at a substantial distance from the actual school building(s). The presence of large parcels along a street segment also causes errors in the linear interpolation used in the placement of geocoded locations along the street segment. In addition, many schools in suburban areas are off-set from the street with a private driveway. Combined, these factors result in large positional errors in geocoded locations. These errors are typically larger than for other types of locations. For example, Cayo and Talbot (2003) determined the positional error of private residences using the same technique (distance between geocoded locations and building centroid) across an urban--rural gradient and found a median positional error of 38 m for urban areas, 78 m for suburban areas, and 201 m for rural areas. Values for the 90th percentile were 96, 306 and 1,544 m, respectively. The schools in our study are located mostly in suburban areas, and results for the most accurate geo-coding using street centerlines produced a mean of 155 m and a 90th percentile of 211 m. The results for the school locations in our study are substantially less accurate than private residences in typical urban and suburban areas, but more accurate than those in rural areas. Based on these error values, therefore, the observed bias and error in the proximity-based exposure analysis is likely to be less for private residences in urban areas, but still substantial at short distances of 100 or possibly several hundred meters. The results for other types of locations, however--such as hospitals, shopping malls, apartment complexes, businesses, office parks--are more likely to be similar to those found in our study for schools because they share the same characteristics of large parcels and private driveways or access roads.

A second limitation is that our study employed only a relatively small sample of schools for a single county. As a result, our findings are limited to similar areas where schools are located primarily in suburban areas on relatively large parcels. Positional errors in geo-coding in higher-density urban areas are likely to be smaller, with corresponding lower bias and error in proximity-based analysis of environmental exposure. A related limitation is the fact that the study area is located in the United States and that the geo-coding results therefore are typical for the street reference data commonly available in the United States. Results in other jurisdictions may be more reliable if more accurate geo-coding techniques are available.

Conclusions

Positional errors in geo-coding of school locations introduce substantial bias and error in the analysis of the effects of traffic-related air pollution on children. Alternatives to street geo-coding need to be considered, including parcel-based geo-coding, address point geo-coding, the use of ortho-imagery, and field observations using global positioning systems Global Positioning System: see navigation satellite.
Global Positioning System (GPS)

Precise satellite-based navigation and location system originally developed for U.S. military use.
. Digital parcel data in GIS format and high resolution ortho-imagery are becoming more widely available, although coverage might be spotty spot·ty  
adj. spot·ti·er, spot·ti·est
1. Lacking consistency; uneven.

2. Having or marked with spots; spotted.



spot
 across a large study area of multiple jurisdictions. The skill level required to use these types of data in GIS is not much higher than that needed for street geo-coding, and the data are mostly available for free. However, the effort required to compile To translate a program written in a high-level programming language into machine language. See compiler.  and process these detailed data is substantial, and presents a major barrier to more widespread adoption. For very large data sets, including those covering more than a few individual counties, street geo-coding is likely to remain a more cost-effective solution.

The widespread availability of powerful geo-coding tools in commercial GIS software This is a list of notable GIS software applications. See also the comparison of GIS software. Open source software
Most widely used open source applications:
  • GRASS – Originally developed by the U.S.
 and the interest in spatial analysis at the individual level have made address geo-coding a widely employed technique in epidemiologic studies. Although some of the limitations of geo-coding have been addressed in recent review articles in public health and epidemiology epidemiology, field of medicine concerned with the study of epidemics, outbreaks of disease that affect large numbers of people. Epidemiologists, using sophisticated statistical analyses, field investigations, and complex laboratory techniques, investigate the cause  journals (McElroy et al. 2003; Rushton et al. 2006), most studies have employed geo-coding without much consideration of its inherent limitations. Match rates have received most recognition, and the positional error has been assumed to be small in magnitude and random in its effect on analysis results. We have shown in this study that the positional error in geo-coding is neither small nor random, and that caution in the use of geo-coding results for epidemiologic studies is warranted. TIGER data in particular are prone to very large errors.

Limitations of this study include the unique nature of school locations, which results in larger errors than are typically encountered for residential locations, and the fact that only a single county in the United States was considered. Despite these limitations, the study provides insights into the nature of the errors that can be expected for other types of locations and jurisdictions.

Geo-coding is very appealing as a data processing data processing or information processing, operations (e.g., handling, merging, sorting, and computing) performed upon data in accordance with strictly defined procedures, such as recording and summarizing the financial transactions of a  step because it provides a high degree of automation, but the results are not accompanied by reliability estimates for its quality other than match scores. The use of street reference data of high positional accuracy and currency is also no guarantee the positional accuracy of geo-coding will be sufficient, and the use of alternatives should be considered when fine-scale analysis is required.

REFERENCES

Bonner MR, Han D, Nie J, Rogerson P, Vena JE, Freudenheim JL. 2003. Positional accuracy of geocoded addresses in epidemiologic ep·i·de·mi·ol·o·gy  
n.
The branch of medicine that deals with the study of the causes, distribution, and control of disease in populations.



[Medieval Latin epid
 research. Epidemiology 14(4):408--412.

Brauer M, Hoek G, van Vliet P, Meliefste K, Fischer PH, Wijga A, et al. 2002. Air pollution from traffic and the development of respiratory infections Noun 1. respiratory infection - any infection of the respiratory tract
respiratory tract infection

infection - the pathological state resulting from the invasion of the body by pathogenic microorganisms
 and asthmatic and allergic al·ler·gic
adj.
1. Of, caused, or characterized by an allergy.

2. Having an allergy or exhibiting an allergic reaction to a substance.



allergic

pertaining to or caused by allergy.
 symptoms in children. Am J Respir Crit Care Med 166(8):1092--1098.

Briggs DJ, de Hoogh C, Gulliver J, Wills J, Elliott P, Kingham S Kingham is a village in Oxfordshire, England. It is a short drive from the more well known town of Chipping Norton.

The village has one shop, one Royal British Legion (club), one pub, two hotels, a railway station and an assorted amount of houses.
, et al. 2000. A regression-based method for mapping trafficrelated air pollution: application and testing in four contrasting urban environments. Sci Total Environ en·vi·ron  
tr.v. en·vi·roned, en·vi·ron·ing, en·vi·rons
To encircle; surround. See Synonyms at surround.



[Middle English envirounen, from Old French environner
 253(1--3):151--167.

Brunekreef B, Janssen NA, de Hartog J, Harssema H, Knape M, van Vliet P. 1997. Air pollution from truck traffic and lung function in children living near motorways. Epidemiology 8:298--303.

Cayo MR, Talbot TO. 2003. Positional error in automated au·to·mate  
v. au·to·mat·ed, au·to·mat·ing, au·to·mates

v.tr.
1. To convert to automatic operation: automate a factory.

2.
 geo-coding of residential addresses. Int J Health Geogr 2:10.

Chakraborty J. 2001. Analyzing exposure of school children to accidental releases of hazardous substances. J Expo Anal anal (a´n'l) relating to the anus.

a·nal
adj.
1. Of, relating to, or near the anus.

2.
 Environ Epidemiol 11: 269--278.

Ciccone G, Forastiere F, Agabiti N, Biggeri A, Bisanti L, Chellini E, et al. 1998. Road traffic and adverse respiratory effects in children. Occup Environ Med 55:771--778.

Dearwent SM, Jacobs RJ, Halbert JB. 2001. Locational uncertainty in georeferencing public health datasets. J Expo Anal Environ Epidemiol 11:329--334.

Diggle PJ. 1993. Point process modeling in epidemiology. In: Statistics for The Environment (Barnety V, Turkman KF, eds). New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
: Wiley, 89--110.

Duan N. 1982. Model for human exposure to air pollution. Environ Int 8:305--309.

Edwards J, Walters S Wal·ters   , Barbara Born 1931.

American television newscaster and reporter. After working for the National Broadcasting Company (1963-1976), she joined the American Broadcasting Company (1976-1979) and became the first woman to anchor the nightly
, Griffiths RK. 1994. Hospital admissions for asthma in preschool children: relationship to major roads in Birmingham, United Kingdom. Arch Environ Health 49:223--227.

Elliot P, Wartenberg D. 2004. Spatial epidemiology: current approaches and future challenges. Environ Health Perspect 112:998--1006.

English P, Neutra R, Scalf R, Sullivan M, Waller L, Zhu L. 1999. Examining associations between childhood asthma and traffic flow using a geographic information system. Environ Health Perspect 107:761--767.

FDOT. 2005. Digital Database of Average Annual Daily Traffic. Tallahassee, FL:Florida Department of Transportation. Available: http://www.dot.state.fl.us/planning/statistics/gis/ [accessed 10 August 2007].

Gauderman WJ, Avol E, Lurmann F, Kuenzli N, Gilliland F, Peters J, et al. 2005. Childhood asthma and exposure to traffic and nitrogen dioxide nitrogen dioxide
n.
A poisonous brown gas, NO2, often found in smog and automobile exhaust fumes and synthesized for use as a nitrating agent, a catalyst, and an oxidizing agent.

Noun 1.
. Epidemiology 16(6):737--743.

Gilbert NL, Goldberg MS, Beckerman B, Brook JR, Jerrett M. 2005. Assessing spatial variability Spatial variability is characterized by different values for an observed attribute or property that are measured at different geographic locations in an area. The geographic locations are recorded using GPS (global positioning systems) while the attribute's spatial variability is  of ambient Surrounding. For example, ambient temperature and humidity are atmospheric conditions that exist at the moment. See ambient lighting.  nitrogen dioxide in Montreal, Canada, with a land-use regression regression, in psychology: see defense mechanism.
regression

In statistics, a process for determining a line or curve that best represents the general trend of a data set.
 model. J Air Waste Manag Assoc 55(8):1059--1063.

Gilbert NL, Woodhouse S Wood´house`

n. 1. A house or shed in which wood is stored, and sheltered from the weather.
, Stieb DM, Brook JR. 2003. Ambient nitrogen dioxide and distance from a major highway. Sci Total Environ 312(1--3):43--46.

Giordian ME, Haneuse S, Wakefield J. 2006. An investigation of the association between traffic exposure and the diagnosis of asthma in children. J Expo Anal Environ Epidemiol 16(1):49--55.

Green RS, Smorodinsky S, Kim JL, McLaughlin R, Ostro B. 2004. Proximity of California public schools to busy roads. Environ Health Perspect 112:61--66.

Harries K. 1999. Mapping Crime: Principles and Practice. Washington, DC:U.S. Department of Justice.

Hitchins J, Morawsaka L, Wolff R, Gilbert D. 2000. Concentrations of submicrometre particles from vehicle emissions near a major road. Atmos Environ 34:51--59.

Hoek G, Brunnekreef B, Goldbohm S, Fischer P, Van den Brandt PA. 2002. Association between mortality and indicators of traffic-related air pollution in the Netherlands: a cohort study A cohort study is a form of longitudinal study used in medicine and social science. It is one type of study design.

In medicine, it is usually undertaken to obtain evidence to try to refute the existence of a suspected association between cause and disease; failure to refute
. Lancet lancet /lan·cet/ (lan´set) a small, pointed, two-edged surgical knife.

lan·cet
n.
 360:1203--1209.

Houston D, Ong PM, Wu J, Winer A. 2006. Proximity of licensed childcare to near-roadway vehicle pollution. Am J Public Health 96(9):1611--1617.

Hurley SE, Saunders Saun´ders

n. 1. See Sandress.
 TM, Nivas R, Hertz hertz (hûrts) [for Heinrich R. Hertz], abbr. Hz, unit of frequency, equal to 1 cycle per second. The term is combined with metric prefixes to denote multiple units such as the kilohertz (1,000 Hz), megahertz (1,000,000 Hz), and gigahertz  A, Reynolds P. 2003. Post office box addresses: a challenge for geographic information system-based studies. Epidemiology 14(4):386--391.

Janssen N, van Vliet P, Aarts F, Harssema H, Brunekreef B. 2001. Assessment of exposure to traffic related air pollution of children attending schools near motorways. Atmos Environ 35:3875--3884.

Karimi HA, Durcik M. 2004. Evaluation of uncertainties associated with geo-coding techniques. Computer-Aided Civil Infrastructural Engineering 19:170--185.

Kim JJ. 2004. Ambient air pollution: health hazards health hazard Occupational safety Any agent or activity posing a potential hazard to health. Cf Physical hazard.  to children. Pediatrics pediatrics (pēdēă`trĭks), branch of medicine dedicated to the attainment of the best physical, emotional, and social health for infants, children, and young people generally.  114:1699--1707.

Kim JJ, Smorodinsky S, Lipsett M, Singer BC, Hodgson AT, Ostro B. 2004. Traffic-related air pollution near busy roads: the East Bay Children's Respiratory Health Study. Am J Respir Crit Care Med 170(5):520--526.

Klepeis N E, Nelson WC, Ott WR, Robinson JP, Tsang AM, Switser P, et al. 2001. The National Human Activity Pattern Survey (NHAPS): a resource for assessing exposure to environmental pollutants environmental pollutants,
n.pl the substances and conditions, including noise, that adversely affect the health and well-being of the people within a community.
. J Expo Anal Environ Epidemiol 11:231-252.

Krieger N, Waterman P, Lemieux K, Zierler S, Hogan hogan

Dwelling of the Navajo Indians of Arizona and New Mexico. The hogan is roughly circular and constructed usually of logs, which are stepped in gradually to create a domed roof.
 JW. 2001. On the wrong side of the tracts? Evaluating the accuracy of geo-coding in public health research. Am J Public Health 91(7):1114-1116.

Kuhler M, Kraft J, Koch W, Windt H. 1988. Dispersion dispersion, in chemistry
dispersion, in chemistry, mixture in which fine particles of one substance are scattered throughout another substance. A dispersion is classed as a suspension, colloid, or solution.
 of car emissions in the vicinity of a highway. In: Environmental Meteorology meteorology, branch of science that deals with the atmosphere of a planet, particularly that of the earth, the most important application of which is the analysis and prediction of weather.  (Grefen K, Lobel J, eds). Dordrecht:Kluwer Academic Publishers, 39-47.

Langholz B, Ebi KL, Thomas DC, Peters JM, London SJ. 2002. Traffic density and the risk of childhood leukemia leukemia (lkē`mēə), cancerous disorder of the blood-forming tissues (bone marrow, lymphatics, liver, spleen) characterized by excessive production of immature or mature  in a Los Angeles Los Angeles (lôs ăn`jələs, lŏs, ăn`jəlēz'), city (1990 pop. 3,485,398), seat of Los Angeles co., S Calif.; inc. 1850.  case-control study case-control study,
n an investigation employing an epidemiologic approach in which previously existing incidents of a medical condition are used in lieu of gathering new information from a randomized population.
. Ann Epidemiology 12:482-487.

Leech JA, Nelson WC, Burnett RT, Aaron S Aaron (âr`ən), in the Bible, the brother of Moses and his spokesman in Egypt, and the first high priest of the Hebrews. He is presented as the instrument of God in performing many signs, such as the turning of his rod into a serpent and , Raizenne ME. 2002. It's about time It's About Time may refer to:

Television
  • It's About Time (TV series), a 1966 American television show.
Theater
  • It's About Time (musical), a 1951 Broadway production.
: a comparison of Canadian and American time-activity patterns. J Expo Anal Environ Epidemiol 12:427-432.

Lewis SA, Antoniak M, Venn AJ, Davies L, Goodwin A, Salfield N, et al. 2005. Secondhand smoke sec·ond·hand smoke
n.
Cigarette, cigar, or pipe smoke that is inhaled unintentionally by nonsmokers and may be injurious to their health if inhaled regularly over a long period. Also called passive smoke.
, dietary fruit intake, road traffic exposures, and the prevalence of asthma: a crosssectional study in young children. Am J Epidemiology 161(5):406-411.

Maantay J. 2002. Mapping environmental injustices: pitfalls and potential of geographic information systems in assessing environmental health and equity. Environ Health Perspect 110(suppl 2):161-171.

McConnell R, Berhan K, Yao L, Jerrett M, Lurmann F, Gilliland F, et al. 2006. Traffic, susceptibility susceptibility

the state of being susceptible. Refers usually to infectious disease but may be to physical factors such as wetting or to psychological factors such as harassment.
, and childhood asthma. Environ Health Perspect 114:766-772.

McCurdy T, Graham SE. 2003. Using human activity data in exposure models: analysis of discriminating dis·crim·i·nat·ing  
adj.
1.
a. Able to recognize or draw fine distinctions; perceptive.

b. Showing careful judgment or fine taste:
 factors. J Expo Anal Environ Epidemiol 13:294-317.

McElroy JA, Remington PL, Trentham-Dietz A, Roberts SA, Newcomber PA. 2003. Geo-coding addresses from a large population-based study: lessons learned. Epidemiology 14(4):399-407.

Molitor J, Molitor N, Jerrett M, McConnell R, Gauderman J, Berhane K, et al. 2006. Bayesian modeling of air pollution effect with missing exposure data. Am J Epidemiol 164:69-76.

Morawska L, Thomas S, Gilbert D, Greenaway C, Rijnders E. 1999. A study of the horizontal and vertical profile of submicrometer particles in relation to a busy road. Atmos Environ 33:1261-1274.

Morris SE, Sale RC, Wakefield JC, Falconer S Falconer

prison where former professor Farragut, who had killed his brother, witnesses the torments and chaos of the penal system. [Am. Lit.: Cheever Falconer in Weiss, 151]

See : Imprisonment
, Elliott P, Boucher BJ. 2000. Hospital admissions for asthma and chronic obstructive airways disease obstructive airways disease Any lung disease–asthma, COPD with airway obstruction, hyperresponsiveness Management Inhaled corticosteroids, maintenance therapy with a β2  in east London East London, city (1991 pop. 240,474), Eastern Cape, SE South Africa, on the Indian Ocean. The city grew around a British military post founded in 1847. Its harbor was developed from 1886, and today it is a leading South African port.  hospitals and proximity of residence to main roads. J Epidemiol Community Health 54(1):75-76.

Nicolai T, Carr CARR Carrier
CARR Customer Acceptance Readiness Review
CARR Carrollton Railroad
CARR Corrective Action Request and Report
CARR City Area Rural Rides (Texas)
CARR Configuration Audit Readiness Review
CARR Customer Acceptance Requirements Review
 D, Weiland SK, Duhme H, von Ehrenstein O, Wagner C, et al. 2003. Urban traffic and pollutant exposure related to respiratory outcomes and atopy atopy /at·o·py/ (at´ah-pe) a genetic predisposition toward the development of immediate hypersensitivity reactions against common environmental antigens (atopic allergy), most commonly manifested as allergic rhinitis but also as  in a large sample of children. Eur Respir J 21(6):956-963.

Nuckols JR, Ward MH, Jarup L. 2004. Using geographic information systems for exposure assessment in environmental epidemiological studies An Epidemiological study is a statistical study on human populations, which attempts to link human health effects to a specified cause. . Environ Health Perspect 112:1007-1015.

Oliver MN, Matthews KA, Siadaty M, Hauck FR, Pickle pickle, general term for fruits or vegetables preserved in vinegar or brine, usually with spices or sugar or both. Vegetables commonly pickled include the beet, cabbage, cauliflower, cucumber, olive, onion, pepper, and tomato.  LW. 2005. Geographic bias related to geo-coding in epidemiologic studies. Int J Health Geographics 4:29.

Ong P, Graham M, Houston D. 2006. Policy and programmatic pro·gram·mat·ic  
adj.
1. Of, relating to, or having a program.

2. Following an overall plan or schedule: a step-by-step, programmatic approach to problem solving.

3.
 importance of spatial alignment of data sources. Am J Public Health 96(3):499-504.

Orange County Growth Management. 2005. Digital Street Centerlines Database. Orlando, FL:Orange County Growth Management. Available: http://www.orangecountyfl.net/ cms/DEPT/growth/maps.htm [accessed 10 August 2007].

Orange County Property Appraiser. 2005. Digital Parcel Database. Orlando, FL:Orange County Property Appraiser. Available: http://www.ocpafl.org/ [accessed 10 August 2007].

Orange County School Board. 2005. Orange County Public Schools. Orlando, FL:Orange County School Board. Available: https://www.ocps.net [accessed 10 August 2007].

Pearson RL, Wachtel H, Ebi KL. 2000. Distance-weighted traffic density in proximity to a home is a risk factor for leukemia and other childhood cancers. J Air Waste Manag Assoc 50:175-180.

Raaschou-Nielsen O, Hertel O, Thomsen BL, Olsen JH. 2001. Air pollution from traffic at the residence of children with cancer. Am J Epidemiology 153:433-443.

Ratcliffe JH. 2001. On the accuracy of TIGER-type geocoded address data in relation to cadastral ca·das·tre also ca·das·ter  
n.
A public record, survey, or map of the value, extent, and ownership of land as a basis of taxation.



[French, from Provençal cadastro, from Italian
 and census areal units. Int J Geogr Inf Sci 15(5):473-485.

Reynolds P, von Behren J, Gunier RB, Goldberg DE, Hertz A, Smith D. 2002. Traffic patterns and childhood cancer incidence rates in California, United States. Cancer Causes Control 13:665-673.

Ross Z, English PB, Scalf R, Gunier R, Smorodinsky S, Wall S, et al. 2006. Nitrogen dioxide prediction in southern California Southern California, also colloquially known as SoCal, is the southern portion of the U.S. state of California. Centered on the cities of Los Angeles and San Diego, Southern California is home to nearly 24 million people and is the nation's second most populated region,  using land use regression modeling: potential for environmental health analysis. J Expo Anal Environ Epidemiol 16(2):106-114.

Rushton G, Armstrong MP, Gittler J, Greene B, Pavlik CE, West MW, et al. 2006. Geo-coding in cancer research: a review.Am J Prev Med 30(2S):S16-S24.

Savitz DA, Feingold L. 1989. Association of childhood cancer with residential traffic density. Scand J Work Environ Health 15:360-363.

Schwab M, McDermott A, Spengler JD. 1992. Using longitudinal lon·gi·tu·di·nal
adj.
Running in the direction of the long axis of the body or any of its parts.
 data to better understand children's activity patterns in an exposure context: data from the Kanaway County health study. Environ Int 18:173-189.

Schwartz J. 2004. Air pollution and children's health Children's Health Definition

Children's health encompasses the physical, mental, emotional, and social well-being of children from infancy through adolescence.
. Pediatrics 113:1037-1043.

Sexton K, Ryan PB. 1998. Assessment of human exposure to air pollution: methods, measurement and model. In: Air Pollution, The Automobile and Public Health (Watson AY, Bates Bates   , Katherine Lee 1859-1929.

American educator and writer best known for her poem "America the Beautiful," written in 1893 and revised in 1904 and 1911.
 RR, Kennedy D, eds). Washington, DC:National Academies Press, 207-238.

U.S. Census Bureau. 2000. Census 2000 TIGER Line Files.Washington, DC: U.S. Census Bureau, Geography Division.Available: http://www.census.gov/geo/www/tiger/[accessed 10 August 2007].

U.S. Geological Survey. 1999. Map Accuracy Standards. Fact Sheet FS-171-99. Reston, VA:U.S. Geological Survey.

Van Atten C, Brauer M, Funk T, Gilber NL, Graham L, Kaden D,et al. 2005. Assessing population exposures to motor vehicle exhaust Exhaust may refer to:

In mathematics:
  • Proof by exhaustion, proof by examining all individual cases
  • Exhaustion by compact sets, in analysis, a sequence of compact sets that converges on a given set
. Rev Environ Health 20(3):195-214.

Van Vliet P, Knape M, de Hartog J, Janssen N, Harssema H, Brunekreef B. 1997. Motor vehicle exhaust and chronic respiratory symptoms in children living near freeways.Environ Res 74:122-132.

Venn A, Lewis S, Cooper M, Hubbard R, Hill I, Boddy R, et al.2000. Local road traffic activity and the prevalence, severity,and persistence (1) In a CRT, the time a phosphor dot remains illuminated after being energized. Long-persistence phosphors reduce flicker, but generate ghost-like images that linger on screen for a fraction of a second.  of wheeze wheeze (hwez) a whistling type of continuous sound.

wheeze
v.
To breathe with difficulty, producing a hoarse whistling sound.

n.
A wheezing sound.
 in school children: combined cross sectional sec·tion·al  
adj.
1. Of, relating to, or characteristic of a particular district.

2. Composed of or divided into component sections.

n.
 and longitudinal study longitudinal study

a chronological study in epidemiology which attempts to establish a relationship between an antecedent cause and a subsequent effect. See also cohort study.
. Occup Environ Med 57:152-158.

Venn AJ, Lewis SA, Cooper M, Hubbard R, Britton J. 2001.Living near a main road and the risk of wheezing Wheezing Definition

Wheezing is a high-pitched whistling sound associated with labored breathing.
Description

Wheezing occurs when a child or adult tries to breathe deeply through air passages that are narrowed or filled with mucus as a
 illness in children. Am J Respir Crit Care Med 164:2177-2180.

Waller LA. 1996. Statistical power and design of focused clustering studies. Stat Med 15:765-782.

Ward MH, Nuckols JR, Giglierano J, Bonner MR, Wolter C,Airola M, et al. 2005. Positional accuracy of two methods of geo-coding. Epidemiology 16(4):542-547.

Whitsel EA., Rose KM, Wood JL, Henley AC, Liao D, Heiss G.2004. Accuracy and repeatability of commercial geo-coding.Am J Epidemiology 160(10):1023-1029.

Wijst M, Reitmer P, Dold S, Wulff A, Nicolai T, Loeffelholz-Collberg EF, et al. 1993. Road traffic and adverse effects on respiratory health in children. BMJ BMJ n abbr (= British Medical Journal) → vom BMA herausgegebene Zeitschrift  307:596-600.

Wilhelm M, Ritz B. 2003. Residential proximity to traffic and adverse birth outcomes in Los Angeles County, California Los Angeles County is a county in California and is by far the most populous county in the United States. Figures from the U.S. Census Bureau give an estimated 2006 population of 9,948,081 residents,[1] while the California State government's population bureau lists a ,1994-1996. Environ Health Perspect 111:207-216.

Wrobel A, Rokita E, Maenhaut. 2000. Transport of traffic-related aerosols in urban areas. Sci Total Environ 257:199-211.

Wu J, Funk T, Lurman FW, Winer AM. 2005. Improving spatial accuracy of roadway networks and geocoded addresses.Transactions in GIS 9(4):585-601.

Xue J, McCurdy T, Spengler J, Ozkaynak H. 2004. Understanding variability in time spent in selected locations for 7-12 year old children. J Expo Anal Environ Epidemiol 14:222-233.

Zandbergen PA, Chakraborty J. 2006. Improving environmental exposure analysis using cumulative distribution functions and individual geo-coding. Int J Health Geogr 5:23.

Zhu Y, Hinds Hinds may refer to:

People with the surname Hinds:
  • Hinds (surname)
In places:
  • Hinds, New Zealand, a small town
  • Hinds County, Mississippi, a US county
In business:
  • F.
 WC, Kim S, Shen Shen, in the Bible, place, perhaps close to Bethel, near which Samuel set up the stone Ebenezer.  S, Sioutas C. 2002a. Study of ultrafine particles near a major highway with heavy-duty diesel traffic. Atmos Environ 36:4323-4335.

Zhu Y, Hinds WC, Kim S, Sioutas C. 2002b. Concentration and size distribution of ultrafine particles near a major highway.J Air Waste Manag Assoc 2:1032-1042.

Zmirnou D, Gauvin S, Pin I, Momas I, Sahraoui F, Just J, et al.2004. Traffic related air pollution and incidence of childhood asthma: Results of the Vesta case-control study.J Epidemiol Community Health 58:18-23.

Address correspondence to P. Zandbergen, Department of Geography, Bandelier West Room 111, MSC (1) (MSC.Software Corporation, Santa Ana, CA, www.mscsoftware.com) Founded in 1963 by Richard H. MacNeal and Robert G. Schwendler, MSC is the world's largest provider of mechanical computer aided engineering (MCAE) strategies, simulation software and services. 01 1110, 1 University of New Mexico The University of New Mexico (UNM) is a public university in Albuquerque, New Mexico. It was founded in 1889. It also offers multiple bachelor's, master's, doctoral, and professional degree programs in all areas of the arts, sciences, and engineering. , Albuquerque, NM 87131 USA. Tel: (505) 277-3105, Fax: (505) 277-3614. E-mail: zandberg@unm.edu

The authors declare they have no competing financial interests.

Received 31 August 2006; accepted 15 May 2007.

(1) Department of Geography, University of New Mexico, Albuquerque, New Mexico “Albuquerque” redirects here. For other uses, see Albuquerque (disambiguation).
Albuquerque (pronounced [ˈæl.bə.kɚ.kiː], Spanish: [al.βu.
, USA; (2) Department of Geography, University of South Florida


    [
, Tampa, Florida “Tampa” redirects here. For other uses, see Tampa (disambiguation).
Tampa is a United States city in Hillsborough County, on the west coast of Florida. It serves as the county seat for Hillsborough County.GR6.
, USA
COPYRIGHT 2007 National Institute of Environmental Health Sciences
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Children's Health
Author:Zandbergen, Paul A.; Green, Joseph W.
Publication:Environmental Health Perspectives
Date:Sep 1, 2007
Words:8894
Previous Article:Meeting report: the role of environmental lighting and circadian disruption in cancer and other diseases.(Research)
Next Article:Arsenic exposure and cognitive performance in Mexican schoolchildren.(Children's Health)



Related Articles
Nonprofits target misplacement of students of color in special education.(BRIEFINGS: News Update)
Workgroup report: developing environmental health indicators for european children: world health organization working group.(Children's Health)
Synergistic effects of traffic-related air pollution and exposure to violence on Urban Asthma Etiology.(Research)
Assessing uncertainty in spatial exposure models for air pollution health effects assessment.(Research)
The relationship between early childhood blood lead levels and performance on end-of-grade tests.(Children's Health)
Methylmercury and the developing brain.(Perspectives Correspondence)
Newly recognized pathways of exposure to lead in the middle-income home.(FEATURES)(Author abstract)
Pennsylvania's Asthma School Project and descriptive pilot investigation: a focus on environmental health tracking.(FEATURES)(Author abstract)
The outdoor air quality flag program in Central California: a school-based educational intervention to potentially help reduce children's exposure to...
Newly recognized pathways of exposure to lead in the middle-income home.(Practical Stuff!)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles