Abstract
A pervasive problem in estimating the costs of pollution is that optimizing individuals may compensate for increases in pollution by reducing their exposure, resulting in estimates that understate the full welfare costs. To account for this issue, measurement error, and environmental confounding, we estimate the health effects of ozone using daily boat traffic at the port of Los Angeles as an instrumental variable for ozone. We estimate that ozone causes at least $44 million in annual costs in Los Angeles from respiratory related hospitalizations alone and that the cost of avoidance behavior is at least $11 million per year.
I. Introduction
Air pollution has long been recognized as a negative externality, but considerable debates have ensued over the optimal level of air quality, with few more contentious than the one surrounding ozone.1 Ozone is presumed to have deleterious effect on health, especially for children, the elderly, and those with existing respiratory illnesses, but the exact magnitude is disputed. Part of this controversy stems from discrepancies over statistical approaches to estimate the health effects of ozone and its associated costs on society.
Estimating this relationship is complicated for several reasons. First, a pervasive problem in the literature on the costs of pollution is that optimizing individuals may compensate for increases in pollution by reducing their exposure to protect their health. Behavioral responses to ozone levels are unlikely to be trivial because daily pollution forecasts are widely available through television and newspapers. Because ozone rapidly breaks down indoors, at-risk individuals can easily reduce short-run exposure by going indoors. Indeed, Neidell (2009) shows that air quality warnings reduce time spent outdoors. Therefore, even if one could isolate random variation in pollution, estimates that do not account for these behavioral responses will understate the full welfare effects of ozone. This is particularly relevant because individuals most at risk have the greatest incentive to adopt compensatory behavior.2
A second challenge in estimating the costs of pollution is confounding from environmental factors that may bias standard estimates. For example, weather directly affects health (for example, see Deschenes and Moretti 2009) but also affects ozone levels. Although some measures of weather conditions are observable and can in principle be controlled for, it is difficult to fully control for all weather factors with the correct functional form. Recent evidence by Knittel, Miller, and Sanders (2009) demonstrates that adding higher order terms for temperature and precipitation and adding second order polynomials for wind speed, humidity, and cloud cover has considerable impacts on estimates of health effects from pollution levels. It is even more difficult to control for allergens, which are related to both asthma and pollution, at a sufficient spatial and temporal level.3 Given that the strong daily variations in pollution are unlikely to be driven solely by anthropogenic sources, it is crucial to properly account for the wide range of environmental factors that vary at a daily level.
A third challenge in estimating the cost of pollution arises from measurement error in assigning pollution exposure to individuals. The most common approach for measuring exposure is to assign data from ambient air pollution monitors to the residential location of the individual using various interpolation techniques. Given the tremendous spatial variation in pollution within finely defined areas (Lin, Yuong, and Wang 2001), this approach is likely to yield considerable measurement error. In fact, Lleras-Muney (2009) finds estimates are quite sensitive to the interpolation technique used.4
In this paper, we seek to identify the short-run effects of ozone on health accounting for avoidance behavior, confounding factors, and measurement error, and to provide estimates of the welfare effects and costs of avoidance behavior from ozone-related hospitalizations in the Los Angeles area. To address the three methodological challenges described above, we use daily data on boat arrivals and departures into the port of Los Angeles as an instrumental variable for ozone levels. Three features make boat traffic an appealing instrument. First, boat traffic represents a major source of pollution for the Los Angeles region. The combined ports of Los Angeles and Long Beach, the largest in the United States and third largest in the world, represent the single most polluting facility in the Los Angeles metropolitan area (Polakovic 2002). Because most of these boats come from countries with much less stringent environmental regulations, they contain less sophisticated emissions technology than their U.S. counterparts and emit unusually high levels of nitrogen dioxides (NOx)—contributing to over 20 percent of all NOx emissions in the Los Angeles area (AQMD 2002)—which gets carried inland to form ozone.5 Our data confirm that boat traffic significantly affects daily ozone levels and, important for our identification, the effect of boat traffic on ozone levels declines monotonically with distance from the port.
Second, daily variation in boat traffic is arguably uncorrelated with other short-run determinants of health. The majority of boats arriving at the port travel from overseas. Several factors, such as the extended length of travel and unpredictable conditions at sea, make the exact date of arrival and departure difficult to predict. Therefore, the influx of pollution due to port activity is arguably a randomly determined event uncorrelated with factors related to health, and we provide several pieces of information to support this.
Third, and crucially, boat traffic is generally unobserved by local residents. The arrivals and departures are not included in ozone forecasts or reported by newspapers and local news outlets. Indeed, we find that ozone forecasts and participation in outdoor activities are empirically orthogonal to boat traffic. Therefore, it is difficult for individuals to respond to ozone levels as affected by boat traffic, suggesting that our instrumental variable estimate holds compensatory behavior fixed.
While it is in theory possible that any difference between instrumental variable estimates and OLS estimates is due to the fact the former reflect a particular local average treatment effect (LATE)—namely, the impact of an increase in pollution due to port traffic on health—this possibility seems highly unlikely in our setting.6 The ozone created by emission from boats is chemically identical to the ozone created by emissions from other sources. Furthermore, ozone levels as affected by our instrument are quite comparable to the overall mean level of ozone at the port.7 It is therefore plausible to think that heterogeneity in the ozone effect is limited in our setting.8
Our findings are striking. OLS estimates of ozone on hospitalizations are statistically significant but small in magnitude: Exposure to ozone causes $11.1 million per year in hospital costs in Los Angeles. In contrast, instrumental variable estimates indicate a significantly larger cost of ozone concentrations of about $44.5 million per year, with several robustness checks supporting our main findings. These estimates represent a lower bound of the total costs of ozone because it ignores health episodes that do not result in hospitalizations. Since for the marginal individual the marginal benefit of avoidance behavior must equal its marginal cost, under several additional assumptions we calculate that the overall cost of avoidance behavior must be at least $11.1 million per year in Los Angeles, noting that this does not include inframarginal individuals. More broadly, these results indicate the importance of accounting for several econometric factors in understanding the full welfare effects from pollution.
II. Background on Air Pollution and Health
Ground-level ozone is a criteria pollutant9 regulated under the Clean Air Acts that affects respiratory morbidity by irritating lung airways, decreasing lung function, and increasing respiratory symptoms, with effects exacerbated for susceptible individuals, such as children, the elderly, and those with existing respiratory conditions like asthma. Symptoms can arise from contemporaneous exposure in as quickly as one hour of exposure. Because it may take time for exacerbation of respiratory illnesses, symptoms may arise from cumulative exposure over several days or several days after exposure. For example, “an asthmatic may be impacted by ozone on the first day of exposure, have further effects triggered on the second day, and then report to the emergency room for an asthmatic attack three days after exposure” (Environmental Protection Agency 2006). Thus it is necessary to control for several lags when estimating the relationship between ozone and health.
The process leading to ozone formation makes it highly predictable and straightforward to avoid. Ozone is not directly emitted but forms from interactions between nitrogen oxides (NOx) and volatile organic compounds (VOC) in the presence of sunlight and heat. Therefore, ozone levels can be predicted fairly accurately using weather forecasts. Furthermore, ozone rapidly breaks down indoors where there are more surfaces to interact with (Chang et al. 2000). Since symptoms from ozone exposure can arise over a short period of time, altering short-run exposure by going indoors can reduce the onset of symptoms.
Given the potential effectiveness of avoidance behavior, two forms of public information are available to inform the public of expected air quality conditions: the pollutant standards index (PSI) and air quality episodes. The PSI, which is forecasted on a daily basis, is computed for five criteria pollutants, and the maximum PSI across pollutants is required by federal law to be reported in major newspapers along with a brief legend summarizing the health concerns (Environmental Protection Agency 1999). California state law requires the announcement of a Stage I air quality episode, or smog alert, when the PSI is at least 200, which corresponds to 0.20 parts per million (ppm) for ozone.10 These episodes, which also occur on a daily basis, are more widely publicized than the PSI; they are announced on both television and radio.11
The agency that provides air quality forecasts and issues smog alerts for Southern California is the South Coast Air Quality Management District (SCAQMD). They produce the following day’s air quality forecast by noon the day before to provide enough time to disseminate the information. Because SCAQMD covers all of Orange County and the most populated parts of Los Angeles, Riverside, and San Bernardino counties—an area with considerable spatial variation in ozone—they provide the forecast for each of the 38 source receptor areas (SRAs) within SCAQMD. When an alert is issued, the staff at SCAQMD contacts a set list of recipients, including local schools and news media, who further circulate the information to the public.
Neidell (2009) provides direct evidence that people respond to information about air quality. He identifies the effect of smog alerts by using a regression discontinuity design that exploits the deterministic selection rule used for issuing alerts. When smog alerts are issued, attendance at major outdoor facilities in Los Angeles decreases by as much as 13 percent.
III. Data
Our final data set consists of several different data sets merged together at the daily level by zip code for the months April-October for the years 1993-2000 for all zip codes in SCAQMD. For health data, we use respiratory related emergency department (ED) visits from the California Hospital Discharge Data (CHDD) for the following age groups: 0-5, 6-14, 15-64, and older than 64.12 There are two practical factors that make the CHDD an attractive option. First, it includes the exact date of admission to the hospital and zip code of residence of the patient, enabling us to readily merge it to the other data. Second, it contains the entire universe of discharges and the primary diagnosis of the patient, providing a large sample for detecting respiratory related admissions at such a high frequency. Table 1 shows the daily number of ED respiratory illness admissions per zip code for the age groups considered, as well as the breakdown across types of respiratory illnesses and other independent variables used in this analysis.13
Summary Statistics for Hospital Admissions, Ozone, and Covariates for Years 1993–2000
We use daily pollution data maintained by the California Air Resources Board. There are roughly 35 continuously operated ozone monitors and roughly 20 for carbon monoxide (CO) and nitrogen dioxide (NO2), two other pollutants necessary to consider because of their correlation with ozone and potential health effects.14 We assign pollution levels to the SRA based on the values for the monitor in that SRA, and when no monitor is present, assign pollution values from the monitor in the nearest SRA.15
Data on boat traffic comes from the marine exchange of southern California. The marine exchange records daily logs of the arrival and departure dates of each individual vessel that enters the port, along with the net tons, length, flag, and cargo type of the vessel.16 We aggregate this information to the total number of tons of boats arriving and departing on a daily basis. Using the latitude and longitude of the centroid of each zip code, we compute the physical distance from each zip code to the port.
Finally, data on weather is obtained from the Surface Summary of the Day (TD3200) from the National Climatic Data Center. Using the 30 weather stations available in SCAQMD, we assign daily maximum and minimum temperature, precipitation, resultant wind speed, maximum relative humidity, and sun cover to each SRA in an analogous manner to pollution.17
IV. Conceptual Framework
To fix ideas on measuring and interpreting the effect of pollution on health, assume the following short-term health production function:
(1)
where h is a measure of health, ozone is ambient ozone levels, and avoid is avoidance behavior. W are other environmental factors that directly affect health, such as weather, allergens, and other pollutants. S are all other behavioral, socioeconomic, and genetic factors affecting health.
There are two main approaches to estimating Equation 1 and determining the welfare effects from changes in pollution, with the key difference arising in how avoidance behavior is treated. The first, and most common, is the dose-response approach, which does not control for avoidance behavior when estimating Equation 1 and thus obtains the total derivative of health with respect to ozone: dh/dozone =δh/δozone+δh/δavoid*δavoid/δozone. Because engaging in avoidance behavior may result in welfare loss, in order to measure the full willingness to pay (WTP) for a reduction in pollution one must not only measure the loss associated with dh/ dozone but also measure the costs associated with any changes in avoidance behavior (Cropper and Freeman 1991; Deschenes and Greenstone 2007). A common expression for WTP using the dose-response approach is WTP=dh/dozone*(m+w)+ pavoid* δavoid/δozone, where m are medical expenditures incurred for any illness, w is the value of lost time from the illness, and pavoid is the price of avoidance behavior (Harrington and Portney 1987).18
The second approach for estimating WTP directly accounts for avoidance behavior to estimate the partial derivate δh/δozone of the health production function, and measures the loss associated with the change in health: WTP=δh/δozone*(m+w) (Harrington and Portney 1987). Previous efforts using this approach attempt to directly control for avoidance behavior in Equation 1 (Neidell 2004; Neidell 2009).19 Since both approaches require measuring changes in typically nonmarket behaviors, valuing the welfare effects from a change in environmental quality remains a perennial challenge.20
Instead of directly observing avoidance behavior to estimate δh/δozone, the strategy used in this paper is to use an instrument that shifts ozone levels but is unrelated to both avoidance behavior and other unobserved determinants of health.21 As described in the introduction, we use boat traffic at the ports as an instrument for ozone levels to obtain estimates of δh/δozone. Below we confirm a strong partial correlation between boat traffic and ozone levels and present evidence to support the assumption that boat traffic is uncorrelated with avoidance behavior and unobserved confounders. Moreover, because we have estimated the partial derivative δh/δozone, we can estimate WTP using estimates of medical expenditures and the value of time and thus do not need to measure avoidance behavior.
Furthermore, if we obtain WTP estimates using the health production approach, then we also can obtain estimates of the costs of avoidance behavior. If we use OLS estimates to obtain dhealth/dozone, then we can obtain the first component of the dose-response WTP, commonly referred to as the cost of illness (COI). If we use the IV estimates to obtain δh/δozone, then we can obtain the full WTP. Therefore, the difference between WTP and COI is pavoid* δavoid/δozone, which is the cost of avoidance behavior. This simple yet remarkable result enables us to measure the costs of a typically nonmarket behavior without directly observing it.
V. Empirical Strategy
We estimate the following equations by two stage least squares:
(2)
(3)
where Equation 2 is the second-stage regression and Equation 3 is the first stage regression. hazst is the number of respiratory related ED admissions for age group a in zip code z of SRA s at date t. To remain consistent with previous ozone time-series studies that often found respiratory effects up to four days after exposure (Environmental Protection Agency 2006), we compute the daily average of ozone, boats, and W over the past five days.22 We average across five days because we are only interested in testing whether an effect of ozone exists, and not the particular timing of the effect. Furthermore, a more flexible structure that allows each lag to enter separately would require us to instrument for each lag of ozone, which greatly reduces the precision of our estimates.23
The vector W includes maximum temperature, minimum temperature, precipitation, resultant wind speed, humidity, sun cover, carbon monoxide, and nitrogen dioxide. Mt contains day of week dummies to account for within week patterns of air quality and health. In addition to limiting the analysis to the months of April through October, f(t) includes year*month dummy variables and a cubic time trend to flexibly capture seasonality and long-run trends in air quality and health.24 αa are age dummy variables. θz are zip code fixed effects designed to capture local time-invariant demographic factors that might affect health as well as time-invariant measurement error.25 u is an error term that includes avoidance behavior, unobserved components of W and S, and an idiosyncratic component.
We instrument ozone in Equation 2 using boat traffic, shown in Equation 3. boatst is a measure of tons of boat arrivals and departures at date t and dists is the distance from SRA s to the port. We interact boatst with distance to allow the effect of the boat arrivals and departure to vary depending on how far the SRA is from the port. In all the empirical models in the paper, we cluster all standard errors by date.26
If there is a homogeneous effect of ozone on health, a necessary assumption for unbiased estimates of β1 is that cov(boatst, uazst)=0 and cov(boatst*dists, uazst)=0. Several factors support the validity of this assumption. One, the supply of commodities is a stochastic process such that variation in the production of goods and services and their loading and unloading at the port cannot be timed perfectly. For instance, on any given day the port averages approximately 15 boat arrivals, but the interquartile range of 12 to 17 suggests considerable variation in the number of arrivals. Two, although one vessel is docked at each berth at the port, there is substantial variation in the tonnage of these boats, an important factor affecting emissions, particularly NOx (Environmental Protection Agency 2000; Gajendran and Clark 2003). In support of this, the average boat tonnage is roughly 17,000, but the interquartile range is from 9,300 to 22,000. Three, given that these boats travel from great distances, conditions at sea and vessel travel speeds are likely to affect their exact arrival date.27 These factors suggest it is reasonable to think of the timing of boat arrivals as virtually random in the short-run. Because of that, we have little reason to expect that short-run variations in boat movements directly affect health in the short-run, and below we present supporting evidence.
VI. Results
A. Validity of Boat Traffic as an Instrument
To support the validity of our instrument, we begin by demonstrating the virtually random day-to-day fluctuations in boat traffic. In Figure 1, we plot daily boat traffic for July, 2000, both unadjusted and adjusted for all covariates used in the analysis.28 Immediately evident is that boat traffic today does not appear to predict boat traffic tomorrow, regardless of whether we adjust for environmental conditions. A positive departure from the mean is almost always followed by a negative departure from the mean. When we more formally test this by computing partial autocorrelations using data from all dates, shown in Figure 2, we again find little evidence of a systematic pattern: Once-lagged boat traffic has a statistically insignificant correlation with current boat traffic of only 0.03. These results suggest boat traffic almost perfectly resembles a random walk.
Daily boat traffic in July, 2000
Note: “Raw” plots the total tons of boat traffic by day in July, 2000, demeaned to have a mean of zero. “Adjusted” plots the residuals from regressing boat traffic against maximum temperature, minimum temperature, precipitation, wind speed, humidity, cloud cover, carbon monoxide, nitrogen dioxide, year-month dummies, day of week dummies, and cubic day trend.
Partial autocorrelation of boat traffic
Note: The plotted partial autocorrelations are the coefficients obtained by regressing boat traffic on 40 lags of boat traffic.
Our instrument may be invalid if people can perfectly observe changes in pollution levels induced by the boats and adjust their exposure accordingly. While people may have a good sense of seasonal variation in pollution, have reliable information on current weather conditions that may affect pollution, and have easy access to pollution forecasts, we think it is unlikely they detect daily changes in pollution levels induced specifically by the boats. To probe this, we assess whether pollution forecasts—the main source of information available to the public—are based on boats movements. We show in the first two columns of Table 2 estimates of the relationship between boat traffic and both smog alerts and ozone forecasts. In Column 1, we regress whether a smog alert was issued anywhere in SCAQMD on our measure of boat traffic and all of the covariates in Equation 3, but only using covariate data for the SRA of the port. In Column 2, we repeat this regression using the ozone forecast for the SRA of the port as the dependent variable.29 In both analyses we only use contemporaneous levels and not a five-day average since this more precisely addresses the question of whether boat traffic is incorporated into air quality forecasts.30 The results indicate a statistically insignificant coefficient on boat traffic for both measures of air quality information, which supports the notion that boat traffic is not used in air quality forecasts and hence is unlikely to be related to avoidance behavior.
Relationship between boat traffic with ozone forecasts and outdoor attendance
Our instrument may also be invalid if people possess private information about boats movement and adjust their exposure based on that information. Specifically, if the information on boats movement induces people to decrease their exposure to ozone by limiting time spent outside and this in turn improves health, we will underestimate the biological effect of ozone on health. We assess this by estimating whether attendance at several outdoor activities is related to boat traffic. If private information is based on boat traffic, then outdoor activities will decrease when boat traffic increases. We use four measures of attendance at outdoor activities in SCAQMD: Two major outdoor attractions, the Los Angeles Zoo and the Griffith Park Observatory, and two major league baseball teams, the Los Angeles Dodgers and California Angels.31 Estimates for each venue, shown in Columns 3-6, are statistically insignificant for three of the four venues. Although we find a statistically significant estimate for attendance at the zoo, this estimate is small in magnitude: A one standard deviation increase in boat traffic is associated with a 1.4 percent increase in attendance. Furthermore, when we estimate these equations simultaneously via seemingly unrelated regression, a joint test of significance reveals a statistically insignificant association between boat traffic and attendance. These results suggest individuals are unlikely to update their private information about pollution levels using boat traffic.
B. The Relationship between Boat Movements and Pollution
To assess the strength of our instrument, in Panel A of Table 3 we present evidence of the relationship between boat arrivals and departures on ozone levels in Los Angeles. It is highly statistically significant, with t-statistics that exceed 150 for both boat traffic in levels and boat traffic interacted with distance from port. Our estimates in the second column imply that each 500,000 tons of boat arrivals and departures (the approximate daily average) at the port increase ozone levels in the immediate area by 0.024 ppm, which is just over 60 percent of the mean ozone level in Long Beach of 0.039. This estimate is consistent with previous reports that suggest the port contributes to 50 percent of smog-forming gases (Polakovic 2002).
OLS and IV regression results for effect of ozone on respiratory illnesses
The interaction term between boat movement and distance from the port allows differential effects of port activity on resultant ozone levels depending on how far the area is from the port. We expect a greater impact on ozone levels in zip codes immediately adjacent to the port, with this effect diminishing as we travel away from the port. The negative interaction term, which is also highly statistically significant, is consistent with this. Figure 3 plots the effects of the average daily boat movement in the port on ozone levels based on the distance from the port. The results imply that the effect of the port on inland pollution levels is cut in half at 11 miles from the port and disappears at 23 miles from the port. We also add boat traffic interacted with a quadratic term in distance to allow a nonlinear decay from port emissions (Column 3 of Table 3). Although the quadratic term is statistically significant, Figure 3 shows that this addition does not appreciably change the spatial decay of pollution.
Effect of average daily port activity on ozone levels
Note: Results are based on regression coefficients from Columns 2 and 3 of Table 3.
These results highlight the strength of our first stage: arrivals and departures at the port have a significant effect on ozone levels in the immediately surrounding areas, and this effect diminishes as one moves away from the port. Boat arrivals and departures appear uncorrelated with other factors related to health, suggesting our second-stage estimates will be consistent estimates of the biological effect of ozone on asthma hospitalizations.
C. The Relationship between Ozone and Hospitalizations
Turning to estimates of the relationship between ozone and health, we present OLS and IV results in Panel B of Table 3.32 OLS results, shown in Column 1, indicate ozone has a statistically significant relationship with respiratory related hospitalizations. A five-day increase in ozone of 0.01 ppm is associated with a modest 1.2 percent increase in hospitalizations. To gauge the sensibility of this estimate, we can compare it to estimates from previous epidemiological studies. A meta-analysis by Thurston and Ito (1999), which also focuses on all respiratory hospital admissions for all ages, finds a 1.5 percent increase in hospitalizations from a 0.01 ppm increase in ozone, an estimate quite comparable to ours.33
When we turn to our IV estimates we find estimates that are nearly four times larger than OLS estimates. The considerably larger estimates, shown in Column 2, imply a 0.01 ppm increase in the five-day average ozone is associated with a 4.7 percent increase in hospitalizations. This difference is statistically significant according to a Hausman test, which has a p-value of 0.028. In Column 3, when we use the quadratic in distance, we find a similar 4.5 percent increase in hospitalizations.
These estimates suggest that accounting for avoidance behavior, measurement error, and confounding increases estimates by a factor of four. Neidell (2009) finds estimates are roughly two times larger when controlling only for public air quality information. This difference is due to the fact that we also correct for measurement error, other unobserved sources of information for avoidance behavior, and potential confounding from other environmental factors, such as weather, suggesting the importance of accounting for these additional sources of bias.
Since environmental factors may be an important potential source of confounding, in Table 4 we assess the sensitivity of our estimates to the weather variables and copollutants. If estimates are unaffected by excluding these variables, it lends support to the idea that our approach is accounting for confounders. Column 1 repeats our baseline estimates. Column 2 omits all weather and copollutant variables. Column 3 omits only the latter while Column 4 omits only the former. Lastly, in Column 5 we interact ozone with all of the weather variables and copollutants, and compute the marginal effect of ozone on health by evaluating δh/δozone using the mean of each weather variable and copollutant.34 Our estimates are clearly insensitive to these alternative specifications, suggesting the strength of our instrument in controlling for potential confounding from environmental factors.
Sensitivity of regression results for effect of ozone on respiratory illnesses to weather and copollutants
These cargo boats primarily emit NOx, but also emit particulate matter (PM), which raises an issue of whether our instrument meets the necessary exclusion restrictions since PM affects health (Chay and Greenstone 2003a; Chay and Greenstone 2003b). Two correlation patterns across pollutants suggest this is not likely to be a major issue. One, the correlation between ozone and PM is very low. Ozone typically peaks in the summer because it forms in the presence of heat, whereas the other “criteria” pollutants, including PM, typically peak in the winter. For example, focusing on the Los Angeles region, Moolgavkar (2000) found a correlation with ozone of 0.20 for PM10 and 0.04 for PM2.5. Furthermore, our analysis focuses on the “ozone season”—the months of April-October—where many of these other pollutants are at considerably lower levels they are unlikely to pose a health threat. Two, although we cannot directly control for PM,35 it is highly correlated with both CO and NO2 (Currie and Neidell 2005) so that including the two is likely to serve as a sufficient statistic for PM. Because our results are insensitive to excluding CO and NO2, we do not suspect the omission of PM to present a problem.
Becausee we have aggregated all respiratory illnesses and ozone may have a differential effect across the type of illnesses, in Table 5 we separately explore the effects of ozone on pneumonia (ICD 480-486), bronchitis and asthma (ICD 466, 490, 491, 493, 494), and other respiratory illnesses. Pneumonia, bronchitis, and asthma are conditions more likely to be exacerbated from current exposure, as opposed to respiratory conditions like emphysema, where the effects from exposure are cumulative over time (Environmental Protection Agency 2006). Therefore, we expect larger effects for pneumonia and bronchitis and asthma than for other respiratory conditions. The OLS results indicate fairly comparable effects across the conditions with a slightly larger effect, if anything, for other respiratory conditions. The IV results, however, paint a different picture. Consistent with expectations, the effects are largest for pneumonia, followed by bronchitis and asthma, and then a small effect for other respiratory illnesses.
Regression results for effect of ozone by type of respiratory illness
We also perform a falsification test by specifying the dependent variable as external injuries (fractures, dislocations, and sprains), an outcome that should not be affected by pollution levels. In our IV model, we find a statistically insignificant estimate of -0.161 with a standard error of 0.109. However, we also find a statistically insignificant effect of -0.008 when we estimate this by OLS. Therefore, while this test is useful in that it supports our preferred model, it is not definitive because it does not rule out a model we believe to be incorrect.
VII. The Cost of Pollution and Avoidance Behavior
Since there are no market prices for air quality, estimates of willingness to pay (WTP) are a key input for designing efficient environmental policy. We use our estimates to quantify the WTP to reduce ozone levels in the Los Angeles area. Based on our expressions for WTP in section IV (WTP = δh/δozone*(m + w)), for δh/δozone we use our estimates from Table 3, for m we use the average hospital bill for any respiratory related admissions ($22,438) and for w we use the average length of stay for this admission (6.5 days) multiplied by the average hourly wage of $16/hour in the Los Angeles area. We recognize that this measure likely understates the full value because it ignores any direct effects on well-being not included in hospital costs and it ignores other health episodes that do not result in hospitalizations.36
Given that the mean eight-hour ozone exposure is 0.05 ppm in the months from April to October (Table 1) and 0.025 ppm in the months from October to April, our OLS estimates, which likely reflects cost of illness (COI) estimates, indicate that ozone costs $11.1 million per year in the Los Angeles region.37 Based on the IV estimates, which more closely resemble WTP estimates, the cost amounts to $44.5 million per year. Comparable estimates for a 0.01 ppm change in ozone are $2.8 million using OLS and $11.2 million using IV.38 These differences gives a sense of how a more simplified analysis based on OLS, and thus COI, would vastly underestimate the full costs of ozone exposure.
We also can use these two estimates along with estimates from Neidell (2009) to present bounds on the cost of avoidance behavior. Since our IV estimates differ from OLS by more than just whether they account for avoidance behavior—they also correct for measurement error and environmental confounding—the simple difference between the WTP and COI estimates of $33.4 million gives an upper bound on the cost of avoidance behavior. Using the estimates from Neidell (2009) that suggest production function estimates two times larger than dose response estimates yields an estimate of WTP of $22.1 million. Since this estimate is likely to understate WTP, the $11.1 million difference from COI gives a lower bound on the cost of avoidance behavior. Although our estimated range of $11.1 to $33.4 million for avoidance behavior is wide, it is a large fraction of total costs, thus demonstrating the importance of accounting for avoidance behavior in understanding the societal costs of pollution.
VIII. Conclusion
A pervasive problem in the literature on the costs of pollution is that optimizing individuals may compensate for increases in pollution by reducing their exposure to protect their health. Furthermore, using ambient monitors to approximate individual exposure to pollution may induce considerable measurement error. Moreover, fully accounting for all environmental factors that correlate with both pollution and health, such as weather and allergens, presents a major obstacle. Estimates of the costs of pollution that do not properly address these issues may be significantly biased.
We propose a novel approach to estimate the effects of ozone on health and its associated costs on society. We isolate the short-term effect of ozone holding compensatory behavior fixed, limiting environmental confounding, and accounting for measurement error by using boat arrivals and departures at the Ports of Los Angeles and Long Beach as an instrument for ozone levels. Since boat traffic is unobserved by most residents, it generates an important source of variation in pollution that is difficult to avoid and cannot easily be offset by residents’ compensatory behavior.
We find that OLS estimates of ozone on hospitalization are small. In contrast, instrumental variable estimates indicate a considerably larger effect of ozone concentrations on respiratory related hospitalizations. Based on our models, we calculate that the cost of ozone in Los Angeles is at least $44 million per year. Furthermore, the cost of avoidance behavior ranges from $11.1 to $33.4 million per year depending on assumptions about measurement error and environmental confounding. While these estimates cover a wide range, they are at least as large as the medical and wage expenditures based on a cost of illness analysis, suggesting considerable costs from this nonmarket behavior.
Our findings have several limitations. One, it is unclear how well these estimates generalize to areas other than Southern California. Southern California has some of the highest ozone levels in the United States, so the possibility of nonlinear effects at lower exposure levels cannot be ascertained. Two, although boat traffic largely emits NOx, which leads to ozone formation, it also may emit other pollutants that affect health, thus invalidating the use of boat traffic as an instrument for identifying the effect of ozone. While we probed the possibility of this concern arising from three common urban air pollutants, it is difficult to assess whether we have adequately covered the range of possible pollutants. Finally, this study only focuses on the short-run effects of ozone on hospitalizations. Ozone may have other short-run effects that do not result in hospitalizations (Bell et al. 2004) and long-run effects as well (Jerret et al. 2009) that may increases the total costs of ozone, suggesting a fruitful area for future research.
Footnotes
Enrico Moretti is a professor of economics at the University of California, Berkeley. Matthew Neidell is an assistant professor of health policy and management at Columbia University. Some of the data used in this article are available from August 2011 through July 2014, while information on obtaining the confidential data will be provided upon request from Matthew Neidell, Columbia University, 600 W. 168th Street, 6th floor, New York, NY 10032, mn2191{at}columbia.edu.
↵1. A proposed ozone standard issued by the EPA in 1997 was finally upheld by the Supreme Court in 2002, but only after endless appeals and lengthy lawsuits initiated by states and industry (Bergman 2004). Furthermore, since higher temperatures favor ozone formation, climate change is expected to increase ozone levels (Racherla and Adams 2009), making regulations surrounding ozone an area of increasing importance.
↵2. Although Neidell (2009) finds support for the existence of avoidance behavior, by controlling for only one source of avoidance behavior the estimates are not necessarily informative on the overall magnitude of avoidance behavior and its ultimate impact on the estimated costs of pollution. It is likely that there are several additional sources of information that at risk individuals use to adjust their behavior. Furthermore, in Neidell (2009) these pieces of information may be measured with error since they are assigned at a broader level than are observed pollution levels.
↵3. For example, Hiltermann et al. (1997) find a correlation of 0.57 between ozone and mugwort pollen, a pollen demonstrated to cause airway inflammation in asthmatics.
↵4. Although several epidemiological field studies address this concern by using personal ambient monitors (Tonne et al. 2004), these studies often involve very small samples that preclude the ability to obtain precise estimates of common outcomes of interest, such as hospitalizations.
↵5. In fact, emissions from the port have lead to numerous contentious debates, and a recent senate committee hearing on port pollution lead by Senator Barbara Boxer from California seeks an urgent, national response to port pollution. Evidence of the passion behind the debate is succinctly summarized by the following quote from Bob Foster, mayor of Long Beach: “We’re not going to have kids in Long Beach contract asthma so someone in Kansas can get a cheaper television set.” (Wald 2007).
↵6. To the extent that the effects of ozone are nonlinear, then our results, whether from OLS or IV models, may not generalize to areas with sufficiently different ozone levels,
↵7. To assess this, we computed the predicted values of ozone (at the port) if boat traffic increased or decreased by one standard deviation (SD) from the mean (based on estimating our first stage equation). The adjusted mean for a one SD decrease in boat traffic is 0.0369 and a one SD increase is 0.0420. The unadjusted mean of ozone at the port is 0.0396, which is bounded by the two adjusted means, so the IV variation is at similar levels of ozone to the overall variation
↵8. Since our instrument is assumed to have a different effect on ozone depending on distance from the port, the above statement is true under the plausible assumption that effect of ozone on human health is the same for residents of Los Angeles who live close and far from the port.
↵9. Criteria pollutants are six common air pollutants with established health-based air quality standards. They include ozone, carbon monoxide, particulate matter, nitrogen dioxide, lead, and sulfur dioxide.
↵10. Additionally, a Stage II air quality episode is issued when the PSI exceeds 250 or ozone forecast exceeds 0.30 ppm, but this only occurred once over the time period studied.
↵11. Although air quality episodes can potentially be issued for any criteria pollutant, they have only been issued for ozone. Because ozone is a major component of urban smog, this has given rise to the term “smog alerts.”
↵12. Respiratory related ED visits include all admissions with ICD-9-CM codes from 460-519. Because nonemergency hospitalizations can be prearranged and may not be an immediate reaction to ozone, we only use emergency room admissions. We find little meaningful difference in estimates, however, when using all hospitalizations.
↵13. For the independent variables, we present five-day averages that correspond with our econometric model, described in more detail below. One-day averages are nearly identical to the five-day averages, with differences solely due to missing values.
↵14. Particulate matter was not included in this analysis because measurements are not taken on a daily basis. We elaborate on the potential impact from omitting this variable below in the results section.
↵15. Given that this may induce measurement error, we also estimated models that limit the sample to only SRAs where an ozone monitor is present, but find no considerable difference in estimates. There are considerable disagreements over how to assign pollution from monitors to individuals. For example, assigning pollution at a finer geographic level, such as the zip code, has potential to improve accuracy, but may also worsen it if people travel beyond their zip code. Using SRAs is justified on the grounds that SRAs were specifically designed to represent an area with common pollution concerns that account for geographic and population differences within SCAQMD, so there is a high degree of uniformity in ozone levels within an SRA.
↵16. Of the main cargo types, 14 percent are classified as bulk carriers, 9 percent as general cargo, 6 percent are passenger, 5 percent as vehicle carriers, 14 percent as tankers, and 40 percent as container carriers.
↵17. For maximum relative humidity and sun cover, we use data from the one weather station in SCAQMD with a complete history of this variable (Los Angeles International Airport) to assign to all of SCAQMD. Since assigning these weather variables may lead to measurement error, we also estimate models excluding all weather variables, shown in Table 4, and find this has little impact on our estimates.
↵18. Note that our expression slightly differs from Harrington and Portney (1987) because we do not allow health to directly affect utility, but instead use all time lost from illness in place of only time lost in the workplace.
↵19. Instead of controlling directly for avoidance behavior, Neidell (2004, 2009) controls for factors that shift the demand for avoidance behavior (ozone forecasts and smog alerts). Note that the dose response and health production estimates are identical if avoidance behavior does not exist.
↵20. Another strand of literature uses stated-preference to elicit WTP; see, for example Alberini and Krupnick (2000).
↵21. This also overcomes the potential concerns in Neidell (2009) regarding measurement error in pollution assignment, potential confounding from omitted variables, such as weather and allergens, and inadequate control for all sources of public information that may affect avoidance behavior.
↵22. We also estimated models that included four- or six-day averages of all variables, and this had minimal impact on our estimates.
↵23. We also estimated models with contemporaneous and the four lags of both pollution and weather entered separately, and the individual coefficients on ozone were all statistically insignificant. The sum of the ozone coefficients, which has the same interpretation as our five-day average, was 0.441, which is quite comparable to estimates using our five-day average.
↵24. We also estimated models with year*week dummy variables and found this had little impact on our estimates.
↵25. Note that by including zip code fixed effects we are controlling for the population at risk (to the extent it remains constant over time within a zip code), so we can interpret our estimates as the impact of ozone on the rate of hospital admissions.
↵26. We also estimated models that allowed for arbitrary auto-correlation of four lags, and this had minimal impact on our standard errors.
↵27. In our sample, 14 percent of vessels have a country of origin in Africa, 19 percent in Asia, 18 percent in Europe, and 47 percent in North America. Nearly 39 percent of the North American boats originate within the United States, with the remainder almost entirely from Panama and the Bahamas.
↵28. Results are comparable if we choose other time periods.
↵29. We can not use whether an alert was issued in the SRA of the port because this never occurred in the time period studied.
↵30. By focusing solely on one geographic location, the maximum number of observations is the number of days between April and October multiplied by the number of years of data (eight).
↵31. For more details on these data, see Neidell (2009). Sample sizes fall for these four columns because the Zoo and Observatory are only open six days a week and attendance at baseball games only includes home games. The California Angels were renamed the Anaheim Angels in 1997 and the Los Angeles Angels of Anaheim in 2005.
↵32. Our sample size of 1,927,187 comes from daily data from April-October from four age groups across roughly 350 zip codes in SCAQMD for eight years (1993-2000), less missing values.
↵33. Thurston and Ito (1999) compute a relative risk of 1.18 for a 0.1 ppm change in one-hour ozone. Translating this to percent change ((1.18-1)/1.18) reveals a 15.25 percent increase from a 0.1 ppm change, or 1.525 percent increase from a 0.01 ppm change.
↵34. In this specification, we also instrument for each of the interaction terms by interacting boat traffic with the weather and copollutant variables.
↵35. PM is only measured roughly every six days, so assigning a daily time series at the SRA level would involve nontrivial assumptions.
↵36. Estimates are comparable if we aggregate the separate impacts for the different respiratory illness explored in Table 5.
↵37. Consistent with our econometric models, we assume a linear effect of ozone on health. To obtain this estimate, we multiply our coefficients by the number of Age Groups 4, the number of ZIP codes in SCAQMD (339), and the number of days in the two seasons (214 and 151, respectively), and divide by the number of days we average ozone levels more than five.
↵38. This ratio of WTP to COI of roughly four corresponds closely with previous estimates that focus on stated preferences to obtain WTP (Alberini and Krupnick 2000).
- Received April 2009.
- Accepted October 2009.