Impact of the Clean Water Act on Pollution
Impact of the Clean Water Act on Pollution
January 2017
Abstract
Since the 1972 U.S. Clean Water Act, government and industry have invested over $1 trillion to abate
water pollution, or $100 per person-year. Over half of U.S. stream and river miles, however, still violate
pollution standards. We use the most comprehensive set of files ever compiled on water pollution and its
determinants, including 50 million pollution readings from 170,000 monitoring sites and a network model
of all U.S. rivers, to study water pollution’s trends, causes, and welfare consequences. We have three
main findings. First, water pollution concentrations have fallen substantially. Between 1972 and 2001,
for example, the share of waters safe for fishing grew by 11 percentage points. Pollution was declining at
even faster rates before 1972. Second, the Clean Water Act’s grants to municipal wastewater treatment
plants, which account for $680 billion in expenditure, caused some of these declines. Through these
grants, it cost around $1.5 million (2014 dollars) to make one river-mile fishable for a year. We find little
displacement of municipal expenditure due to a federal grant. Third, the grants’ estimated effects on
housing values are about a fourth of the grants’ costs; we carefully discuss welfare implications.
1
all of what the costs might be” (Quarles 1976). President Nixon actually vetoed the Clean Water Act
and described its costs as “unconscionable,” though Congress later overruled the veto (Nixon 1972).
Large costs could be outweighed by large benefits, but existing cost-benefit analyses of the Clean
Water Act have not estimated positive benefit/cost ratios. One influential study concluded that in 1985,
the Clean Water Act’s annual costs were twice the Act’s benefits (Freeman 2000). Other researchers have
reached similar conclusions (Lyon and Farrow 1995). Even the U.S. Environmental Projection Agency
(2000a; 2000c)’s evaluation of the Clean Water Act – its own regulation – estimated costs larger than
benefits, though they do mention several types of unmeasured benefits.
These controversies have been largely academic, but they have spilled over into politics. The U.S.
Supreme Court’s 2001 and 2006 SWANCC and Rapanos decisions removed Clean Water Act regulation
for nearly half of U.S. rivers and streams. In 2015, the Obama Administration proposed a Clean Water
Rule (also called the Waters of the United States Rule) which would reinstate many of those regulations.
This Rule has met fierce political opposition, and twenty-seven states have sued to vacate it.
This paper seeks to shed light on these controversies using the most comprehensive set of files ever
compiled in academia or government on water pollution and its determinants. These files include several
datasets that largely have not been used in economic research, including the National Hydrography
Dataset, which is a georeferenced atlas mapping all U.S. surface waters; the Clean Watershed Needs
Survey, which is a panel description of the country’s wastewater treatment plants; a historic extract of
the Grants Information and Control System describing each of 35,000 Clean Water Act grants the federal
government gave cities; the Survey of Water Use in Manufacturing, a confidential plant-level dataset of
large industrial water users which was recently recovered from a decommissioned government mainframe
(Becker 2015); and around 50 million water pollution readings at over 170,000 pollution monitoring
sites during the years 1962-2001 from three data repositories—Storet Legacy, Modern Storet and the
National Water Information System (NWIS). Discovering, obtaining, and compiling these data has been
a serious undertaking involving three Freedom of Information Act requests, software development in
C++ with a full-time programmer specialized in hydrological routing models, and extensive discussions
with engineers and hydrologists from the U.S. Geological Survey (USGS), the EPA, and engineering
consultancies. These data enable a more extensive analysis of water pollution and its regulation than has
previously been possible.
The analysis obtains three sets of results. First, we study how water pollution changed in the period
1962-2001. We find that most types of water pollution declined, though the rate of decrease slowed over
time. Between 1972 and 2001, for example, the share of waters that met standards for fishing grew by
11 percentage points, and dissolved oxygen saturation (an omnibus measure of water quality we describe
in detail later) increased by an average of 5 percentage points. We find similar patterns both in well-
documented networks of a few monitoring sites and in the full sample of 170,000 monitoring sites. Our
finding of decreases in most pollutants implies that the prevalence of such violations was even greater
before the Clean Water Act. Two trends are particularly interesting because they may reflect air and
climate pollution. The pH of rivers and lakes has increased at a similar rate to the pH of rainwater, likely
2
in part due to decreased sulfur air pollution. In other words, less acid rain may have led to less acidic
rivers and lakes. Additionally, the temperature of rivers and lakes increased by about 1 degree F every
40 years, which is consistent with climate change.
Second, the paper asks how the Clean Water Act’s grants to municipal wastewater treatment plants
contributed to these trends. The Clean Water Act had two main components—grants for wastewater
treatment plants and permits for industry. The analysis focuses on the first component though controls
for potential confounding effects of the second. We answer this question using a triple-difference research
design. We compare water pollution before versus after Clean Water Act investments occurred, upstream
versus downstream of recipient plants, and between plants receiving grants in early versus late years. To
define upstream and downstream waters, we analyze a set of 70 million nodes that collectively describe
the entire U.S. river network. We show that in the cross-section, water pollution jumps dramatically as
a river passes a treatment plant.
We then find that each grant decreases the probability that downriver areas violate standards for being
fishable by about half a percentage point. These changes are concentrated within 25 miles downstream
of the treatment plant and they persist for around 30 years. We use the results to study the cost-
effectiveness of these Clean Water Act grants. Through these grants, it cost around $1.5 million ($2014)
per year to make one river-mile fishable, and around $0.5 million per year to increase the dissolved oxygen
saturation in a river-mile by ten percentage points. We do not find substantial heterogeneity in cost-
effectiveness across regions or types of grants, though there is imprecise evidence that grants to states
with decentralized authority to administer Clean Water Act effluent permits were more cost-effective.
We also use annual municipal financial records and find that one dollar of a federal grant project led to
approximately one additional dollar of local sewerage capital spending, implying limited crowding-in or
crowding-out of local expenditure.
Third, the paper asks how residents valued these grants. We analyze housing units within a 25 mile
radius of affected river segments, for two reasons—95 percent of recreational trips have a destination
within this distance, and several other channels for benefits of water quality such as odor and appearance
are likely to affect housing units within this radius. We find that a grant’s estimated effects on home
values are about 25 percent of the grant’s costs. Put another way, while the average grant project cost
around $35 million ($2014), the estimated effect of a grant on the value of housing within 25 miles of
the affected river is around $9 million. We find limited heterogeneity in these numbers across regions
and types of grants, though grants to areas where outdoor fishing or swimming is common, to cities with
high amenity levels, or to states with pro-environmental views have slightly higher estimated ratios of
measured benefits to costs.
We discuss several reasons why these hedonic estimates could be interpreted as a lower bound on
willingness to pay for these grants, including that people may have incomplete information about changes
in water pollution and their welfare (including health) implications; they exclude nonuse (“existence”)
values; they abstract from general equilibrium effects; and they exclude the five percent of most distant
recreational trips. One interpretation of these results is that the benefits of these Clean Water Act
3
grants exceed their costs if these unmeasured components of willingness to pay exceed the components
of willingness to pay that we measure by a factor of three or more.
Two aspects of the research design are worth highlighting. The first deals with comparison groups.
Because water pollution flows in a known direction, areas upstream of a treatment plant provide a
natural counterfactual for areas downstream of a plant. For this reason, our preferred estimates of
how Clean Water Act grants affect water pollution use a triple-difference estimator comparing upstream
and downstream areas. But because residents who live upstream of treatment plants can benefit from
clean water downstream of treatment plants (e.g., by traveling for recreation), upstream homes could
benefit from grants. Hence our preferred housing estimates come from difference-in-difference regressions
analyzing homes within a 25 mile radius of river segments that are downstream of treatment plants.
We do report both the double-difference and triple-difference estimators for both outcomes, and obtain
generally similar conclusions.
Second, as in most settings, our identifying assumptions cannot be tested directly, since we cannot
observe a counterfactual world without Clean Water Act grants. We do, however, provide several indirect
tests of the identifying assumptions, all of which support the validity of the research design. First, we
report event study graphs in time which test for pre-trends in outcomes (pollution or home values) in the
years preceding a grant. Second, we report two different research designs—a triple-difference estimator
which uses upstream areas as a counterfactual for downstream areas, and a differences-in-differences
estimate focused only on downstream areas. Third, we assess whether grants affect pollutants closely
related to municipal waste (e.g., biochemical oxygen demand and dissolved oxygen deficits) more than
they affect pollutants that are less closely related (e.g., lead, mercury, and phenols). Fourth, we separately
estimate the effect of a plant receiving one, two, three, four, or five or more grants, and find a positive
dose-response function for most pollutants. Finally, we estimate specifications controlling for particularly
important potential confounding variables, including industrial water pollution sources, air pollution
regulations, and local population totals.
More broadly, this paper departs from the literature in four primary ways. This is the first study to
quantify national changes in water pollution since before the Clean Water Act using a dense network of
monitoring sites. Trends are important both in their own right and because measuring water pollution
is one step towards measuring its national economic costs (Muller, Mendelsohn, and Nordhaus 2011).
Some studies measure trends in water pollution for small sets of monitoring sites (e.g., Smith, Alexander,
and Wolman 1983; USEPA 2000b). Smith and Wolloh (2012) study one measure of pollution (dissolved
oxygen) in lakes beginning after the Clean Water Act and use data from one of the repositories we
analyze. They conclude that “nothing has changed” since 1975. We find similar trends for the pollutant
they study in lakes, though we show that other pollutants are declining in lakes and that most pollutants
are declining in other types of waters. This is also the first national estimate of temperature trends in
U.S. rivers using a dense and long-term network of monitoring sites, which is relevant for learning about
climate change. One study estimates river temperature trends using 18 sites in the Northwestern U.S.
from the NWIS water quality data network that we use, and finds that air temperatures account for 90
4
percent of this river temperature trend (Isaak, Wollrab, Horan, and Chandler 2012).
This paper also provides the first national estimate of how Clean Water Act investments affected
ambient pollution concentrations. We use these estimates to calculate the cost effectiveness of these
investments. Water pollution research typically uses ex ante engineering simulations, rather than actual
observed data on water pollution, to assess the effects of water quality policies (Wu, Adams, Kling,
and Tanaka 2004; Rabotyagov, Campbell, White, Arnold, Atwood, Norfleet, Kling, Gassman, Valcu,
Richardson, Turner, and Rabalais 2014). Far more economic research focuses on air pollution than on
water pollution regulation.1 A few studies do investigate how water pollution affects emissions of a single
pollutant in specific settings, such as analyses of several dozen wastewater treatment plants in one state
(Earnhart 2004a,b) or a study of a phosphate ban in dishwasher detergent (Cohen and Keiser 2015).
Emissions in those studies are self-reported by the regulated plants. Abroad, water pollution in China
and India is much worse than in the U.S. Recent research finds that India’s water pollution regulations,
which have similar structure to the U.S. Clean Water Act, are ineffective (Ebenstein 2012; Greenstone and
Hanna 2014). It is unclear whether the failure of these water pollution regulations in developing countries
reflects their regulatory context, or whether it reflects broader challenges regulating water pollution.
Several studies find that ambient water pollution increases with political boundaries, though generally
without directly observing regulation (Sigman 2002; Lipscomb and Mobarak Forthcoming; Kahn, Li,
and Zhao 2015). Some work investigates how fracking wells and the pollution they send to wastewater
treatment plants affects water quality (Olmstead, Muehlenbachs, Shih, Chu, and Krupnick 2013).
Third, this study provides the first estimate of the effects of water pollution regulation on home values.
Existing estimates of willingness-to-pay for water quality use travel cost methods, hedonics, or stated
preference (i.e., contingent valuation; Kuwayama and Olmstead (2015) list many individual studies).2
Travel cost methods observe the distance people travel to visit recreational sites and associated time and
pecuniary costs, then infer willingness-to-pay for visiting these sites. Travel cost studies typically rely
on observational and cross-sectional variation in pollution and focus on a single county or other limited
area (e.g., Smith and Desvousges 1986). The literature has emphasized that such studies may suffer from
omitted variable bias because unobserved disamenities like factories or roads contribute to pollution and
1
Olmstead (2010) attributes the imbalance between research on air and water pollution to three explanations: air pollution,
unlike surface water pollution, has large and direct consequences for human health; policymakers have implemented many market-
based mechanisms to regulate air pollution but few for water pollution; and most areas of industrialized countries already have
effective basic drinking water treatment and sanitation. We hypothesize two additional reasons for the imbalance in research
between air and water pollution: ambient air pollution data are much more widely available and better documented than ambient
water pollution data; and air pollution regulation has better-understood variation over time and space which is amenable to
empirical econometric analysis than water pollution regulation has.
2
Some research looks at drinking water pollution and health or home values (Snow 1855; Cutler and Miller 2005; Currie, Zivin,
Meckel, Neidell, and Schlenker 2013; Muehlenbachs, Spiller, and Timmins 2015), but we focus on ambient pollution in rivers and
lakes, which is generally distinct from drinking water quality. The only studies we know on surface water pollution and health
focus on China and India (Ebenstein 2012; Greenstone and Hanna 2014), and a study of the Boston area around 1900 (Alsan and
Goldin 2015). Watson (2006) finds that sanitation investments in U.S. Indian reservations decreased infant mortality, though
the analysis pools all investments (including drinking water treatment, wastewater treatment plant construction and upgrades,
household sanitation infrastructure like latrines, and others) so does not separately identify health consequences of investments
in wastewater treatment plants.
5
directly discourage recreational visits (Leggett and Bockstael 2000; Murdock 2006; Moeltner and von
Haefen 2011). Such omitted variables are important for studying air pollution, though their importance
for water pollution is unknown.3 Several authors note that limited water quality variation within housing
markets has restricted the use of hedonic models (Leggett and Bockstael 2000; Palmquist and Smith 2002).
Most cost-benefit analyses of the Clean Water Act rely on stated preferences (Carson and Mitchell 1993;
Lyon and Farrow 1995; USEPA 2000a), which are controversial (Hausman 2012; Kling, Phaneuf, and
Zhao 2012).
Finally, this work sheds light on regulation and policy design more generally. We believe this is the first
empirical study of the efficiency of subsidizing use of pollution control equipment. This policy instrument
is common in many countries for a variety of pollutants.4 Theoretical research has lamented the poor
incentives of such subsidies (Kohn 1992; Aidt 1998; Fredriksson 1998).5 The only econometric analysis we
know of such policies, however, tests how the French policy of jointly taxing industrial air pollution and
subsidizing abatement technologies affected emissions, using data from 226 plants (Millock and Nauges
2006). That study does not quantify efficiency or separately identify the effect of the pollution tax from
the effect of the abatement subsidy. Our analysis of heterogeneity in cost-effectiveness and benefit-cost
ratios also provides a new domain to consider recent research on spatially differentiated policy (Muller
and Mendelsohn 2009), a design which thus far has focused mostly on air pollution.
The paper proceeds as follows. Section 2 provides a background on the Clean Water Act and water
pollution. Section 3 describes the data sources. Section 4 describes the econometric and economic
models. Section 5 describes water pollution trends. Section 6 analyzes how the Clean Water Act’s grants
to municipal wastewater treatment plants affected water pollution. Section 7 discusses grants’ effects on
housing. Section 8 concludes.
3
The only panel study we know is Hausman, Leonard, and McFadden (1995)’s use of recall questions to study changes in
recreation around the Exxon Valdez spill. Keiser (2016) uses upstream ambient pollution as instruments for local lake pollution
to address omitted variables and measurement error. Concurrent work is studying panel data on lakes in Iowa (Herriges, Kling,
and Phaneuf 2015). A few hedonic studies address these concerns through the use of property or lake fixed-effects (Mendelsohn,
Hellerstein, Huguenin, Unsworth, and Brazee 1992; Walsh, Milon, and Scrogin 2011).
4
Countries including Brazil, France, India, Japan, South Korea, Sweden, and the U.S. subsidize municipal water pollution
abatement investments. Many countries have subsidized air pollution abatement investments for transportation (e.g., adding
catalytic converters), including Cyprus, Germany, Hong Kong, the Netherlands, and Sweden. The U.S. Clean Coal program
and France directly subsidize industrial air pollution equipment. Many countries allow accelerated depreciation of pollution
abatement capital costs for taxes.
5
A larger body of theoretical research studies payments for decreasing the level of pollution emitted (Baumol and Oates
1988), subsidies to the use of clean types of production (Mestelman 1982; Fullerton and Mohr 2003), or subsidies to research
on clean technologies (Acemoglu, Akcigit, Hanley, and Kerr 2016). More empirical research studies these policies, and they can
have similar incentives as subsidies to pollution control equipment.
6
2 Background on the Clean Water Act and Water Pollu-
tion
2.1 Clean Water Act Background
A brief background on the Clean Water Act may help explain our analysis and results.6 Our data
begin in 1962, and policies before the Clean Water Act may contribute to some of the water pollution
patterns we observe before 1972. The U.S. Congress passed major water pollution control laws in 1948,
1956, 1961, 1965, 1966, and 1970. Many earlier laws, like the Clean Water Act, supported municipal
wastewater treatment and industrial abatement, but provided funds an order of magnitude below the
funds distributed by the Clean Water Act. By 1966, all 50 states had passed some type of water pollution
legislation, but the level of enforcement varied greatly across states (Hines 1967).
One impetus for the 1972 Clean Water Act was concern about incentives for local governments to
impose weak regulation in order to attract industry. Another concern was that previous laws had left
much pollution decision-making power to states, and not all states strongly supported or enforced water
pollution regulation (Freeman 2000).
Like many other federal environmental regulations, the Clean Water Act retained large roles for state-
level implementation, and the effectiveness of that implementation most likely varied across states. While
a simple formula determined the level of grant funds that each state received, each state designed the
priority lists determining which plants received grants. States with decentralized authority also oversaw
writing of permits for municipal plants, monitoring and enforcement of violations, and other activities
(Sigman 2003, 2005). We report sensitivity analyses which investigate how the effectiveness of grants
varied with decentralization of authority to manage permits.
Also like other regulation, the Clean Water Act targeted municipal waste treatment and industrial
pollution sources, sometimes called “point sources.” But much water pollution comes from urban runoff
and agriculture, the most important “non-point” pollution sources. The Clean Water Act had minor
provisions for these sources, but has largely exempted them from regulation. We briefly analyze trends in
agricultural pollutants, and comment on the relevance of the relative lack of regulating non-point sources
for interpreting our cost-effectiveness numbers.
Most of this paper focuses on the Clean Water Act grants program, but as mentioned earlier, the
Clean Water Act had a second important component—distributing and enforcing permits limiting indus-
trial water pollution, managed through the National Pollutant Discharge Elimination System (NPDES).
NPDES aims to cover every source which directly discharges pollution into U.S. waters. Some plants
are part of a separate “Pretreatment Program,” in which they discharge untreated or lightly-treated
wastewater through sewers to wastewater treatment plants, then pay fees to the treatment plant.7 The
6
The 1972 law was formally called the Federal Water Pollution Control Amendments, though we follow common practice in
referring to it as the Clean Water Act.
7
The wastewater treatment plants which are the focus of this paper also receive effluent permits through the NPDES program.
Because those permits may be related to the federal grants we study here, our analysis of grants may also reflect NPDES permits
7
permits were distributed in the early 1970s. Because this was a national program affecting all plants and
industries at about the same time, we include a variety of fixed effects in regressions that are designed
to control for this program and other potential concurrent environmental regulations. We also obtain
historic data on large industrial water users around the time of the Clean Water Act, which provides a
control used in some sensitivity analyses.
8
wastewater treatment needs (CBO 1985). Within a state, grants were distributed according to a “priority
list” that each state submitted annually to the EPA. States had to base a priority list on seven criteria
(USEPA 1980, p. 8):
1. [T]he severity of the pollution problem; 2. [T]he existing population affected; 3. [T]he
need for preservation of high quality waters; 4. [A]t the State’s option, the specific category
of need. . . 5. . . . [T]echniques meeting innovative and alternative guidelines. . . 6. [O]ther crite-
ria, consistent with these, may be considered (including the special needs of small and rural
communities). The state may not consider: the project area’s development needs not related
to pollution abatement; the geographical region within the State; or future population growth
projections; and 7. [I]n addition to the criteria listed above, the State must consider . . . total
funds available; and other management criteria.
States had to publicly offer information concerning their priority systems and the ranking of particular
projects, including a public hearing to discuss the lists (USEPA 1976, 1981). Funds moved slowly and
EPA estimated that it took two to ten years from project conception to finishing construction, with a
mean of five years.
This paper’s triple-difference regressions assume that in the absence of a grant, areas downstream and
upstream of plants receiving grants would have had a similar evolution of pollution concentrations and
home values. The statutory requirement quoted above, which forbid states from considering population
growth projections or other development needs, gives some support for this identifying assumption.
Ultimately, however, the allocation of state funds to individual plants is an empirical question. As
discussed in the introduction, the empirical sections provide a number of tests for the validity of the
research design.
9
Clean Water Act and because recreational fishing is believed to be a main reason why people value water
quality. We use a definition of whether waters are “fishable” that was developed in the early 1980s by
William Vaughan for Resources for the Future (RFF), and adapted by the USEPA (2000a) to study
the Clean Water Act. This definition has several appealing features. It distills several published water
quality criteria from between 1966 and 1979, which are appropriate to the period we study. It particu-
larly reflected state water quality standards. We reviewed current state-specific standards of “fishable,”
and found that they are generally close to this definition. Finally, it is a widely-used interpretation of
“fishable.” In this definition, water is “fishable” if pollution is below a threshold, based on four measures:
biochemical oxygen demand (BOD), dissolved oxygen saturation, fecal coliforms, and total suspended
solids (TSS). To implement these definitions in the data, we pool data from these pollutants and define
a dummy for whether a raw pollution reading exceeds the relevant standard.9
We also report estimates for whether waters are swimmable, and we report separate results for the
other pollutants that are part of the “fishable” and “swimmable” definitions—biochemical oxygen demand
(BOD), fecal coliforms, and total suspended solids (TSS). These pollutants merit interest in their own
right because BOD, fecal coliforms, and TSS are a majority of the five “conventional pollutants” the
Clean Water Act targeted. The other “conventional” pollutants are pH, which we analyze in Appendix
Table 4, and oil and grease, a pollutant for which we have little data. We define all pollutants so that
lower levels of the pollutant represent cleaner water (so we report the share of waters that are “not
fishable” or “not swimmable,” and we report dissolved oxygen deficits).
Describing these other pollutants may help interpret results. BOD measures the amount of oxygen
consumed by decomposing organic matter. Fecal coliforms proxy for the presence of pathogenic bacteria,
viruses, and protozoa like E. coli or Enterococcus that cause human illness and are unsafe for human
contact. Pathogens including fecal coliforms are the most common reason why water quality violates
state standards today (USEPA 2016). TSS measures the quantity of solids in water that is trapped by a
filter.10 Municipal sources in the early 1980s were estimated to account for about 20 percent of national
BOD emissions and less than one percent of national TSS emissions (Gianessi and Peskin 1981), though
municipal sources may account for a larger share of emissions in urban areas. Most TSS comes from
agriculture and urban runoff, and TSS concentrations are highest in the Midwest (Appendix Figure 2a).
We also report a few results for three additional groups of pollutants: industrial pollutants like lead,
mercury, and phenols; nutrients like nitrogen and phosphorus; and other general water quality measures
9
“Fishable” readings have BOD below 2.4 mg/L, dissolved oxygen above 64 percent saturation (equivalently, dissolved oxygen
deficits below 36 percent), fecal coliforms below 1000 MPN/100mL, and TSS below 50 mg/L. “Swimmable” waters must have
BOD below 1.5 mg/L, dissolved oxygen above 83 percent saturation (equivalently, dissolved oxygen deficits below 17 percent),
fecal coliforms below 200 MPN/100mL, and TSS below 10 mg/L. The definition also includes standards for boating and drinking
water that we do not analyze.
10
We analyze all these physical pollutants in levels, though Appendix Tables 3 and 6 show results also in logs. Fecal coliforms
are approximately lognormally distributed, and BOD and TSS are somewhat skewed (see Appendix Figure 1). Specifying these
pollutants in logs would implicitly assume that the percentage change in a river’s pollution due to a grant is the same for a river
with a high background concentration as for a river with a low background pollution concentration. We specify these pollutants
in levels since this assumption does not seem accurate. Other water pollution research generally specifies BOD and TSS in levels.
Some papers specify fecal coliforms in logs while others do not.
10
like temperature. We use a standardized criterion, described in Appendix B, to choose pollutants for this
appendix table.11 We show these additional pollutants because they merit interest directly, and because
some like lead, mercury, and phenols help test our research design since grants to treatment plants are
unlikely to affect them.
One important question for defining our analysis sample is how far these pollutants travel downstream.
We focus on a distance of 25 miles for several reasons. First, the only engineering study we found on this
question (USEPA 2001) limited its analysis to 25 miles downstream of point sources for BOD. They chose
this distance to reflect 15 watershed-specific studies designed to remedy pollution problems. Second, an
interview with a wastewater regulation specialist at the Iowa Department of Natural Resources suggested
that effects of treatment plants on dissolved oxygen would be concentrated within 20 miles downriver.
Third, estimated effects of grants on whether rivers are fishable out to 100 miles downstream of a treatment
plant only show effects within 25 miles (Appendix Table 6).
3 Data
We use eight types of data: spatial data on rivers and lakes; municipal sources of water pollution;
Clean Water Act grants; ambient levels of water pollution; census tract-level housing and population
characteristics; recreational travel distances; municipal financial records; and other environmental data.
This section describes each dataset. It then briefly discusses how we measure distances along rivers
between plants, monitoring sites, houses, and city centers. Except where otherwise noted, all data cover
the years 1962 to 2001. This section describes the most important features of these data; Appendix B
provides additional details.
Spatial Data on Rivers and Lakes. We use data from the National Hydrography Dataset Plus,
Version 2.1 (NHD), an electronic atlas mapping all U.S. surface waters. NHD organizes the U.S. into
approximately 200 river basins, 2,000 watersheds, 70,000 named rivers, 3.5 million stream and river miles,
and 70 million river nodes. A river in these data consists of a set of river nodes (i.e., points) connected
by straight lines. NHD forms a network describing the flow direction of each river or stream segment and
helps us follow water pollution upstream or downstream. These data let us determine the exact surface
water (e.g., the Mississippi River) where each pollution monitoring site or treatment plant is located, the
exact location on that surface water, the distance along the river upstream or downstream from other
monitoring sites or plants, and the census tracts nearby. Panel A of Figure 1 shows U.S. streams, rivers,
and lakes, colored by their distance from the ocean, Great Lakes, or other terminus. (See additional
details in Appendix B.2.)
Municipal Sources of Water Pollution. We created a panel census of U.S. municipal water
pollution treatment plants. The data come from the EPA and are called the Clean Watershed Needs
Survey (CWNS). EPA staff sent us complete computer-readable files and codebooks for these data from
11
Applying such a stringent criterion is important—one of our repositories alone, Storet Legacy, includes 16,000 different
measures of water pollution.
11
their internal systems. Most years report federal grant identifying codes and the latitude and longitude
of discharge. The data are complete and have accurate panel identifiers in 1978 and then biennially
beginning in 1984.12 Panel B of Figure 1 shows the location of each treatment plant, and shows that
relatively more treatment plants are located in densely populated areas.
Clean Water Act Grants. We filed two Freedom of Information Act requests to obtain details
on each of the 35,000 Clean Water Act grants that the federal government gave to these plants. These
records come from the EPA’s internal archived version of the Grants Information and Control System
(GICS). The data list the grant award date, grant amount, total project cost (including both federal and
local capital expenditures), name of the overseeing government authority (city, county, state, or special
district), a grant identifier code, and the name of the recipient treatment plant. The data report on grants
given to 8,000 different local governments. The data also include grants in the years 1957-1971 given under
predecessor laws to the Clean Water Act. The grants were concentrated in the 1970s. Grants decreased
throughout the 1980s and nearly stopped after 1987. For simplicity, our analysis counts multiple grants
to a single treatment plant in a given calendar year as a single grant. (See additional detail in Appendix
B.4.)
Ambient Levels of Water Pollution. We use water pollution readings from three federal data
repositories: Storet Legacy, Modern Storet, and the National Water Information System (NWIS).13 Storet
Legacy contains data from the early 20th century through 1998. Storet Legacy is large—the raw data
include 18,000 separate data files and 200 million pollution readings. Modern Storet is similar to Storet
Legacy but covers more recent years. The Storet repositories have data from many local organizations.
USGS national and state offices collect a large share of NWIS readings. In addition to describing these
data in more detail, Appendix B.3 discusses the steps we have taken to clean these data, including limiting
to rivers/streams and lakes, limiting to scientifically comparable methods of measuring each pollutant,
winsorizing at the 99th percentile, excluding readings specific to hurricanes and other non-routine events,
and others.
Basic descriptive statistics help characterize the data (Appendix Table 1). The analysis sample inclu-
des 10.5 million observations on the four main pollutants and 37 million observations on the additional
pollutants discussed in Appendix Table 4. Levels of BOD, fecal coliforms, and dissolved oxygen deficits
are substantially lower in the U.S. than in India or China (Greenstone and Hanna 2014). Among the four
main pollutants, about half the data describe dissolved oxygen. The data come from 170,000 distinct
monitoring sites. If all the data came from monitoring sites that reported data for short periods of time,
it would be more difficult to measure long-run trends accurately. Fortunately, 45 percent of the data
come from monitoring sites that report readings in at least three of the four decades we analyze (the
decade 1962-1971 is most often missing); about a fourth of readings come from monitoring sites that only
12
The 1980 and 1982 surveys have incorrect plant identifier codes. The 1976 survey is incomplete. Plant longitude and latitude
are listed beginning in 1984.
13
We considered a fourth repository, the Sustaining the Earth’s Watersheds: Agricultural Research Data System (STE-
WARDS), managed by the USDA. We did not use these data because they focus on years 1990 and later, mainly measure
pesticides, and have a small sample.
12
report in one decade.
No sampling design explains why certain areas and years were monitored more than others. In
some cases, hydrologists purposefully designed representative samples of U.S. waters. At least four
such networks are in these data: the Hydrologic Benchmark Network, the National Stream Quality
Accounting Network, the National Water-Quality Assessment, and the National Water Quality Network
(HBN, NASQAN, NAWQA, and NWQN). The first three this paper discusses in detail later. The fourth
was collected by the Public Health Service beginning in 1957 (USPHS 1961) and focuses on years before
the Clean Water Act. In other cases, sampling locations and frequency were chosen by local governments
or non-governmental organizations. Some states like Massachusetts have relatively dense monitoring
networks, while others like Texas have less dense networks (Figure 1, Panel C). Individual rivers are also
visible in the pattern of monitoring sites. Cities have visibly greater density of monitoring sites. We
explore the sensitivity of our results to restricting estimates to metro areas only, to long-term monitoring
sites which operate over most of the sample period, and to the networks of monitoring sites that have
well-documented design and good data quality, and find similar pollution trends in these subsamples.
Census Tract Data. We use the Geolytics Neighborhood Change Database (NCDB), which Geo-
lytics built from the 1970, 1980, 1990, and 2000 Censuses of Population and Housing. The 1970 census
only included metro areas in tracts, so these tract-level data for 1970 are restricted to metro areas, and
so much of our analysis is as well.
We use these census data because they have national coverage and also because transaction-level re-
cords that county assessor offices maintain, such as those aggregated by Dataquick or CoreLogic and used
in other economics studies, generally do not extend back to the 1970s when Clean Water Act investments
were concentrated. In addition to describing these data in more detail, Appendix B.5 discusses what is
known about the quality of the Census self-reported data as compared to actual transactions, including
concerns about measurement error and inertia.
Recreational Travel Distances. Improving water quality in a particular river segment may affect
neighboring homes that benefit from the nearby amenity value. This improvement may also affect more
distant homes for residents who travel for recreation. We seek to determine a distance around a river
that covers most individuals who travel to participate in recreation at this river. We obtain estimates
of this distance from the Nationwide Personal Transportation Survey (NPTS) for years 1983, 1990, and
1995. This survey is the only source we know that provides a large nationally representative sample of
recreational activities and travel distances over the period we study.14 The survey picks a day and has
respondents list all trips, their purposes, and the driving distances in miles. We limit trip purposes to
“vacation” or “other social or recreation.” In these data, the 95th percentile of one-way distance from
home to recreational destinations is 33.7 miles. This distance is similar in all three years of the survey,
and we calculate 33.7 from the unweighted mean across years.
This distance represents the distance traveled along roads; but the radius we use to calculate the
14
The National Survey of Recreation and the Environment and its predecessor, the National Recreation Survey, do not
systematically summarize trips taken and travel distances. Many travel demand papers use small surveys that report distance
traveled to a specific lake or for a specific narrow region.
13
distance of homes from rivers represents a great circle distance (i.e., the shortest direct path along
the ground). Making the great circle and road distances comparable requires an estimate of how road
distances typically compare to great circle distances. We are aware of two such estimates. First, the 2009
National Household Travel Survey (the successor to the NPTS) reports both the great-circle distance
between a person’s home and the person’s workplace, and also the distance by road. On average, the
ratio of the road distance to the great circle distance is 1.4. Second, a recent study compared driving
distance versus straight-line (i.e., great circle) distance for travel from a representative sample of about
70,000 locations in the U.S. to the nearest community hospital, and the average ratio was also 1.4 (Boscoe,
Henry, and Zdeb 2012). Considering these two, we estimate that the straight-line (great circle) distance
between homes and rivers which covers ninety-five percent of recreational trips is 25 miles (≈33.7/1.4).
Municipal Financial Records. To examine the pass-through of federal Clean Water Act grants
to municipal spending on wastewater treatment, we use data from the 1970-2001 Annual Survey of State
and Local Government Finances and the Census of Governments. These data report annual capital and
total expenditures for sewerage (a category including wastewater treatment), separately for each local
government.15 For use as a control variable in some specifications, we obtain population data for most
of these cities from the 1970-2000 decennial censuses, then linearly interpolate between years. The final
sample includes 199 cities; in addition to describing these data in more detail, Appendix B.6 discusses
the main sample restrictions, including requiring a balanced panel and accurate links to the grants data.
Given this relatively small sample of cities, we do report a set of estimates which weight by the inverse
propensity score to provide estimates more representative of all cities, but the small sample should be
considered when assessing external validity.
Other Environmental Data. We use a few other environmental data sources. One sensitivity
analysis controls for nearby industrial sources of water pollution. This is a potentially important control
because regulation of industrial sources was the second main component of the Clean Water Act. We
are not aware of any complete data on industrial water pollution sources around the year 1972, so we
use two distinct controls as imperfect proxies. The first is a list of the manufacturing plants that used
large amounts of water in 1972. We obtain these data from the confidential 1973 Survey of Water Use
in Manufacturing (SWUM) microdata, accessed through a Census Research Data Center. The second
control is a count of the cumulative number of plants in a county holding industrial effluent (NPDES)
permits. We filed a Freedom of Information Act request to obtain a historic copy of the EPA database
which keeps records of industrial pollution sources—the Permit Compliance System, now called the
Integrated Compliance Information System. Appendix B.7 describes more information on these sources,
along with additional data on weather and nonattainment designations. Finally, Appendix B.8 describes
data used to consider heterogeneity across different groups of grants by several dimensions: grant size,
baseline abatement technologies, baseline pollution, Clean Water Act state decentralization,16 prevalence
15
The “year” in these data refers to each local government’s fiscal year. We convert the data to calendar years using data from
these surveys on the month when each government’s fiscal year ends, assuming that government expenditure is evenly distributed
across months. For the few governments that don’t report when their fiscal year ends, we assume they report by calendar year.
16
Our measure of decentralization indicates whether a state holds authority to administer National Pollutant Discharge Eli-
14
of local outdoor fishing and swimming, local environmental views, declining older urban areas (Glaeser
and Gyourko 2005), and high amenity areas (Albouy 2016).
Spatial Links. We construct four types of links between datasets. The first involves linking each
pollution monitoring site and treatment plant to the river or lake where it monitors. The second involves
measuring distances along rivers between treatment plants and pollution monitoring sites. The third
involves measuring distances of census tracts from rivers. The fourth involves linking grants to individual
plants in the CWNS. Appendix C provides details of each step.
τ =2001
0
X
Qicy = ατ 1[yy = τ ] + Xicy β + δi + icy (1)
τ =1963
Each observation in this analysis is an individual water pollution reading at monitoring site i, hour and
calendar day-of-year c, and year y. The variable Qicy represents the level of water pollution. We estimate
this equation separately for each pollutant. The matrix Xicy includes cubic polynomials in time of day
and in day of year. In sensitivity analyses, Xicy also includes air temperature and precipitation. The
fixed effects δi control for all time-invariant determinants of water pollution specific to monitoring site i.
The error term icy includes other determinants of water pollution. We plot the year-by-year coefficients
α1963 . . . α2001 plus the constant. The year-specific points in graphs can be interpreted as mean national
patterns of water pollution, controlling for time and monitoring site characteristics.
We also estimate linear water pollution trends using the following equation:
0
Qicy = αyy + Xicy β + δi + icy (2)
15
The main coefficient of interest, α, represents the mean annual change in water pollution, conditional on
the other controls in the regression. We also show specifications which interact the trend term y with an
indicator 1[y ≥ 1972] for whether an observation is year 1972 or later. This interaction measures how
water pollution trends differed after versus before the Clean Water Act. We emphasize graphs based on
equation (1) more than tables based on equation (2) for a few reasons—the graphs are more visually
transparent; the nonlinear trends in graphs are crudely approximated with linear trends; and 30 years is
a long post period.
0
Qpdy = γGpy dd + Xpdy β + ηpd + ηpy + ηdwy + pdy (3)
This regression has two observations for each treatment plant p and year y, one observation describing
mean water quality upstream (d = 0), and the other observation describing mean water quality down-
stream (d = 1). The variable Gpy describes the cumulative number of grants that plant p had received
by year y. This regression measures grants as a cumulative stock because they represent investment
in durable capital. The main coefficient of interest, γ, represents the mean effect of each grant on do-
wnstream water pollution. We also explore other specifications for G, including limiting to grants for
construction and not for planning or design, estimating effects separately for each possible number of
cumulative grants, and others.
Equation (3) includes several important sets of controls. The matrix Xpdy includes temperature and
precipitation controls. The plant-by-downstream fixed effects ηpd allow both upstream and downstream
waters for each treatment plant to have different mean levels of water pollution. These fixed effects
control for time-invariant sources of pollution like factories and farms, which may be only upstream or
only downstream of a plant. The plant-by-year fixed effects ηpy allow for water pollution to differ near
each treatment plant in each year, and they control for forces like the growth of local industries, other
environmental regulations, and changes in population density which affect both upstream and downstream
pollution. The downstream-by-basin-by-year fixed effects ηdwy allow upstream and downstream water
16
quality separately to differ by year in ways that are common to all plants in a river basin.17 These
fixed effects address the possibility that other point source pollutants and regulations are located near
wastewater treatment plants and had water quality trends unrelated to the municipal grants.
Equation (3) focuses on the effect of the number of grants a plant has received, rather than the dollar
value of these grants, for several reasons. (Appendix Table 6 reports similar effects of grant dollars.)
First, it may be easier to think in discrete terms about the effect of a grant, rather than the effect of
an arbitrary amount of money. Second, estimating these regressions in simple discrete terms makes the
regression tables more easily comparable with event study graphs. Third, larger grants tend to go to
more populated areas and larger rivers. Because it takes relatively larger investment to achieve a given
change in pollution concentration for a more populated area and larger river, it is ambiguous whether
grants of larger amount should have larger effects on pollution concentrations. Fourth, the distribution
of cumulative grant amounts is highly skewed, with many zeros. Focusing on the number of grants rather
than grant dollars avoids issues involved in log transformations (or other approaches) in the presence of
many zeros.
A few other details of our approach are worth noting. Because the dependent variable is an average
over different numbers of underlying pollution readings, in all regressions where each observation is
plant-downstream-year tuple, we use generalized least squares weighted by the number of raw underlying
pollution readings.18 To maximize comparability between the treatment plant location and monitoring
sites, we also restrict pollution data to monitoring sites located on the same river as the treatment plant.
Finally, all estimates are limited to plants within 1 kilometer of a river node. Appendix Table 6 shows
results with these assumptions relaxed.
The identifying assumption for equation (3) to provide an unbiased estimate of the parameter γ is
that the grants×downstream interaction Gpy d is independent of the regression error, conditional on other
explanatory variables:
E[Gpy dd · dpy |Xpdy , ηpd , ηpy , ηdwy ] = 0
This assumption would be violated if, for example, grants or permits responded to unobserved shocks to
variables like population which themselves affect pollution concentrations.19
We cannot test this assumption directly. However, we can provide several suggestive tests of its
17
A water basin w is defined as a four-digit hydrologic unit code (HUC).
18
Appendix Table 6 also reports unweighted estimates, which are broadly similar to the generalized least squares (GLS)
estimates. GLS based on the number of underlying pollution readings in each plant×downstream×year is an efficient response
to heteroskedasticity since we have grouped data. GLS estimates the effect for the average pollution reading rather than for the
average plant×downstream×year. It is possible that areas with more pollution data may be of greater interest; for example,
Panel C of Figure 1 shows generally more monitoring sites in more populated areas.
19
This assumption could also fail if changes in governments’ effectiveness at receiving grants correlated with governments’
effectiveness at operating treatment plants. This does not seem consistent with our results since it would likely create pre-trends
in pollution or home values (whereas we observe none). Our finding that benefits last about as long as engineering estimates
suggest (30 years) and for exactly the expected pollutants also are not exactly what this story would predict. We also observe
that each additional grant results in further decreases in pollution (Appendix Table 6), which would be a complicated story for
the timing of government human capital to explain.
17
validity. One test is whether pre-trends are flat in the years before a facility receives a grant.20
τX
=25
0
Qpdy = γτ 1[Gp,y+τ = 1]dd + Xpdy β + ηpd + ηpy + ηdwy + pdy (4)
τ =−10
Here τ indexes years since a grant was received.21 A second test is to compare difference-in-difference
estimates, which resemble equation (3) but only use data on downstream areas, against triple-difference
estimates. A third test is to compare estimates of grants’ effects on pollutants known to be related to
treatment plants (e.g., BOD and dissolved oxygen) against estimated effects on other pollutants (e.g.,
lead, mercury, and phenols). A fourth test is to measure whether each additional grant (one, versus
two, versus three, etc.) that a plant receives is associated with additional decreases in pollution. A fifth
test is to assess sensitivity to adding controls for important potential confounding variables: population,
industrial water pollution sources, and other air pollution regulations.22
18
dollars Dcy this city has received:
The dependent and independent variables are cumulative because capital is a stock variable, and since
local investment could occur in years after the grants are received. The regression includes city fixed
effects υc and year fixed effects πy . We also report specifications with river basin-by-year fixed effects
ηwy . The value β = 1 implies complete pass-through (no crowding out or crowding in). Finding β < 1
implies incomplete pass-through (crowding out), while β > 1 implies more than complete pass-through
(crowding in).
The definitions of these variables are important. Municipal expenditures Ecy include both expendi-
tures funded by federal grants and those funded by other sources of revenue. As mentioned in Section
2.1, most grants require cities to pay 25 percent of the capital cost, though a small share require other
copayments. We therefore report two sets of regressions—one where the variable Dcy includes only fede-
ral grant funds, and another where the variable Dcy includes both federal grant funds and the required
municipal capital contribution. We also report specifications that weight by the inverse propensity score
for inclusion in the balanced panel of cities, and discuss models controlling for population and city-year
revenue.
19
Econometrics: Demand for Water Quality
To analyze how Clean Water Act grants affected home values, we use a differences-in-differences estimate
comparing the change in the log mean value of homes within an 0.25, 1, or 25 mile radius in any direction
of the downstream river, before versus after the plant receives a grant, and between plants receiving
grants in early versus late years. We use this specification because homes upstream of a wastewater
treatment plant may also benefit from improvements in water quality downstream of the treatment
plant. Residents travel for recreation, and while water pollution flows downstream, the benefits of water
quality improvements may affect both downstream and upstream areas. As a sensitivity analysis, we
also report triple-difference estimates comparing homes within a 25 mile radius of the upstream segments
against homes within a 25 mile radius of the downstream segments.
Specifically, we estimate the following regression model:
0
Vpy = γGpy + Xpy β + ηp + ηwy + py (6)
Here Gpy represents the cumulative number of grants received by plant p in year y, Vpy is the log mean
value of homes within a 0.25, 1, or 25 mile radius of the portion of the river that is 25 miles downstream
of treatment plant p, ηp are plant fixed effects, and ηwy are river basin-by-year fixed effects. Some
specifications include controls Xpy for house structure characteristics. Other specifications add year 1970
characteristics interacted with indicators for the year y of the observation.23 We estimate the change in
total housing units and total value of the housing stock.
A few points are worth noting. First, we limit the analysis to housing units within a twenty-five
mile radius of downstream waters. As discussed earlier, this is designed to cover roughly ninety-five
percent of recreational trips. Second, we limit regression estimates to the set of tracts reporting home
values in all four years 1970, 1980, 1990, 2000. When we fit the change in home values, we do so
both for only the balanced panel of tract-years reporting home values, and for all tract-years. Third,
because the differences-in-differences specification used for home values does not use upstream areas as
a counterfactual, it involves the stronger identifying assumption that areas with more and fewer grants
would have had similar home price trends in the absence of the grants.24 Fourth, to obtain regression
estimates for the average housing unit, and to provide an efficient response to heteroskedasticity, we
23
Structure characteristics are allowed to have different coefficients in each year, and include the following: number of bedrooms,
number of housing units in building, number of stories in building, heating fuel, cooking fuel, hot water fuel, heating equipment
type, sewer type, plumbing type, year built, air conditioning, kitchen, number of bathrooms, and water access. All variables are
expressed as share of housing units in a plant-year cell with the indicated characteristic. All categorical variables (e.g., number
of bedrooms) are expressed as the share of housing units with each possible category. The 1970 characteristics are the following:
distance to central business district (see Appendix B.5 for details of definition and construction); share of population that is
black; share of population over age 65; share of population under age 6; share with a college degree; share on public assistance;
income per family; and all the 1970 structure characteristics.
24
We focus on specifications including basin-by-year fixed effects and the interaction of baseline characteristics with year fixed
effects, as shown in equation (6). Estimates without the basin-by-year controls are more positive for home values but not rents,
but they are also more sensitive to specification, which is one indication that the within-basin comparison of equation (6) provides
sharper identification.
20
include generalized least squares weights proportional to the number of total housing units in the plant-
year observation and to the sampling probability.25
Finally, these regressions estimate the effect of improvements to wastewater treatment on home values
within 25 miles of downstream river areas. Wastewater treatment plants themselves may be a disamenity
due to noise, smell, appearance, and other reasons. We report one set of estimates which excludes
homes within a 1-mile radius of the wastewater treatment plant, which is likely to remove potential local
disamenities of the treatment plant itself.
21
relevant to explain the slowing trends in continuous variables like BOD, fecal coliforms, or TSS, each of
which had mean levels far above zero even in 2001.
Regressions with linear trend and trend break specifications underscore these findings, subject to the
caveats mentioned earlier about the linear approximation to a non-linear trend and the long post period.
The share of waters that are not fishable fell on average by about half a percentage point per year, and
the share that are not swimmable fell at the same rate (Table 1, Panel A). In total over the period 1972-
2001, the share of waters that are not fishable and the share not swimmable each fell by 11 percentage
points. Each of the four pollutants which are part of these fishable and swimmable definitions declined
rapidly during this period. Fecal coliforms had the fastest rate of decrease, at 2.8 percent per year. BOD,
dissolved oxygen deficits, and total suspended solids all declined more slowly, at about 1.5 percent per
year.
Trends in all these pollutants since the Clean Water Act are large, but trends before the Clean Water
Act were larger (Table 1, Panel B). For example, BOD was falling by 3 percent per year before the Clean
Water Act and 1.5 percent after it. We find pre/post 1972 trend breaks of comparable magnitudes for
all the other pollutants. We interpret these pre-1972 trends somewhat cautiously since, as discussed
earlier, relatively few monitoring sites recorded data before the 1970s, and fewer long-term monitoring
sites operated in the 1960s (Appendix Table 1).
The graphs show the national mean pollution in each year, but they abstract from heterogeneity across
different parts of the U.S. Appendix Figure 2 shows national maps of each pollutant in the final decade,
1992-2001, and also shows decade-by-decade maps for fecal coliforms. The regional patterns of each
pollutant make sense. Dissolved oxygen deficits are most severe in the South, where temperatures are
hottest. Fecal coliforms have the greatest concentrations in the Northeast and Industrial Midwest, where
population densities are high. TSS has the greatest concentrations in the Midwest, which is unsurprising
since much TSS comes from agriculture. Fecal coliform concentrations were much greater in the 1960s
than in the 1990s. Decreases in fecal coliforms are clearest in the South and Far West, and less clear in
the Industrial Midwest and Northeast.
Our finding of substantial decreases in water pollution is novel and noteworthy, but also for these rea-
sons worth scrutinizing. We estimate an extraordinary array of sensitivity analyses, including restricting
to high-quality subsamples of the data, adding important controls, weighting by population, and many
others. Most of these alternative approaches have similar sign, magnitude, and precision as the main
results. Appendix Table 3 shows these results and Appendix E.1 explains each.
22
decrease of about 11 percent per year may be related to air pollution regulations, such as prohibiting
leaded gasoline. The decline in mercury is noteworthy given the recent controversy of the Mercury and
Air Toxics Standards (MATS) policy that would regulate mercury from coal-fired power plants. Some
nutrients like ammonia and phosphorus are declining, while others like nitrates are unchanged. Nutrients
were not targeted in the original Clean Water Act, but are the focus of current regulation. Temperature
is increasing by about 1 degree F per 40 years, which is consistent with effects from climate change.
Electricity generating units and other sources do contribute to thermal pollution in rivers, but it is
noteworthy that rapidly increasing temperature is such an outlier from decreasing pollution trends in
most other water pollutants.
pH trends are particularly interesting. pH increased by 0.007 pH units per year, meaning that waters
became more basic (less acidic). Rainwater monitors that are not in our data record increases of similar
magnitude in the pH of rainwater over this period, and attribute it to declines in atmospheric sulfur air
pollution (USEPA 2007). This is consistent with the idea that decreases in acidic sulfur air pollution
contributed to decreases in acid water pollution, and could completely explain these pH trends.
23
3.4 percent, fecal coliforms fall by 8.5 percent, and the probability that downstream waters are not
swimmable by about half a percentage point. The point estimate implies that each grant decreases TSS
by one percent, though is imprecise. TSS comes primarily from non-point sources like agriculture and
urban runoff, so is less closely related to municipal wastewater.
Event study graphs support these results. These graphs are estimated from specifications correspon-
ding to equation (4).28 In years before a grant, the coefficients are all statistically indistinguishable from
zero, have modest magnitude, and have no clear trend (Figure 3). This implies that pollution levels in
upstream and downstream waters had similar trends before grants were received. In the years after a
grant, downstream waters have 1-2 percent lower dissolved oxygen deficits, and become 1-2 percent less
likely to violate fishing standards. These effects grow in magnitude over the first ten years, are statisti-
cally significant in this period, and remain negative for about 30 years after a grant. The gradual effect
of the grants is unsurprising since EPA estimates that it took two to ten years after a grant is received for
construction to finish. The 30-year duration of these benefits is also consistent with, though on the lower
end of, engineering predictions. Two studies both report that concrete structures of treatment plants are
expected to have a useful life of 50 years but mechanical and electrical components have a useful life of
15-25 years (American Society of Civil Engineers 2011, p. 15; USEPA 2002, p. 11).
Event study graphs for other pollutants are consistent with these results, though are less precise
(Appendix Figure 6). Before a plant receives a grant, each pollutant has fairly similar trends between
upstream and downstream areas. After a plant receives a grant, pollution visually declines for the
swimmable standard; though period-by-period event study graphs for other pollutants are too imprecise
to be informative.
Appendix Figure 7 shows the effect of a grant by distance downstream from a treatment plant.
Less data is available to estimate effects separately for each 5-mile bin along the river, so estimates are
correspondingly less precise. Each point in these graphs represents the difference in pollution after minus
before a grant is received, at a given distance from the treatment plant.
We also consider a wide variety of sensitivity analyses, including all the sensitivity analyses used for
studying water pollution trends; different estimates by distance downstream; different types of grants and
plants; using grant amounts in dollars rather than number of grants; controlling for important potential
confounding variables; and other rules for sample construction. Appendix Table 6 shows the results, and
E.2 discusses each. Again they generally give similar qualitative conclusions as the main results, though
exact point estimates vary.
28
As discussed earlier, the analysis includes plants that never received a grant (which have all event study indicators 1[Gp,y+τ =
1] equal to zero), plants that received a single grant (which in any observation have only a single event indicator equal to one),
and plants that received more than one grant (which in any observation can have several event indicators equal to one). Since
no reference category is required in this kind of event study setting where one observation can receive multiple treatments, for
ease of interpretation, we recenter the graph line so the coefficient for the year before treatment (τ − 1) equals zero. This implies
that coefficients in the graph can be interpreted as the pollution level in a given year, relative to the pollution level in the period
before the treatment plant received a grant.
24
6.2 Grants’ Effects on Water Pollution: Cost-Effectiveness
We now turn to estimate the cost-effectiveness of these grants. The cost-effectiveness is defined as the
annual public expenditure required to decrease dissolved oxygen deficits in a river-mile by 10 percentage
points or to make a river-mile fishable. These calculations combine our regression estimates and the cost
data.29
These estimates are important. Even without the hedonic estimates of the next section, one can
combine cost-effectiveness numbers with estimates from other studies of the value of clean waters to
obtain a cost-benefit analysis of these grants. Moreover, we are not aware of any existing ex post
estimates of the cost required to make a river-mile fishable or to decrease dissolved oxygen deficits.
Taking the ratio of total costs to improved river-miles yields the cost to increase dissolved oxygen
saturation in a river-mile by 10 percentage points. Column (1) of Table 3 includes only waters on the same
river as the treatment plant, as our main regression sample does. Column (2) includes all downstream
river segments, and column (3) assumes that every treatment plant has 25 miles of downstream waters
affected. The simplest specification of column (1) implies that it cost $0.57 million per year to increase
dissolved oxygen saturation in a river-mile by ten percent; the broadest specification of column (3) implies
that it cost $0.54 million per year. The annual cost to make a river-mile fishable ranges from $1.8 million
in the simplest specification of column (1) to $1.5 million in the richest specification of column (3).
Column (3) implies that the grants program made 16,000 river-miles fishable.
A few notes are important for interpreting these statistics. First, this is the average cost to supply
water quality via Clean Water Act grants; the marginal cost, or the cost for a specific river, may differ.
The next subsection investigates heterogeneity directly. Second, measuring cost-effectiveness is insufficient
to reach conclusions about social welfare. Knowing whether these expenditures increased social welfare
requires knowing peoples’ value for these changes, which is the question of Section 7. Third, if some
grant expenditures were lost to rents (e.g., corruption), then those expenditures represent transfers and
not true economic costs. EPA did audit grants to minimize malfeasance. In the presence of such rents,
our analysis could be interpreted as a cost-effectiveness analysis from the government’s perspective.
25
raction of the main downstream×grants term with a given binary characteristic of grants.30 Appendix
B.8 describes how we measure these characteristics.
Columns (1)-(2) of Appendix Table 7 report these estimates, and columns (5)-(6) use these regres-
sions along with data on grant costs to estimate cost-effectiveness. Row 1 repeats the estimates for all
grants discussed earlier. Row 2 finds that grant projects above the median size ($1.2 million) cause larger
decreases in pollution. Because these larger grants cost more, however, columns (5)-(6) suggest they are
slightly less cost effective. Row 3 analyzes grants for plants that initially had more advanced (secondary
or tertiary) abatement technology. If plants face increasing marginal abatement costs, then grants given
to plants with better initial technology might be less cost-effective.31 Row 3 does not provide evidence
to support this hypothesis, and the point estimates actually suggest that grants to plants with tertiary
technology are more cost-effective. These estimates are imprecise, however, and we interpret them cau-
tiously given the very poor quality and limited availability of the data on abatement technologies (see
Appendix B.8 for details). Row 4 suggests that grants to more polluted areas decrease pollution more
and are slightly more cost-effective, perhaps because areas with low initial pollution cannot decrease
pollution much. Row 5 suggests that grants to state-years with decentralization authority to manage
NPDES permits are more effective, and slightly more cost-effective.
Rows 6-8 of Appendix Table 7 study three additional dimensions of heterogeneity which are more
relevant to housing markets, so are more of a focus of Section 6.5. We discuss them briefly here. Row
6 finds that grants to counties with a large share of people who do outdoor fishing or swimming are
significantly more cost-effective.32 These counties are generally rural, so may face lower wage and con-
struction costs. Row 7 finds that states with pro-environmental views have slightly more cost-effective
grants. Row 8 considers two sets of cities highlighted in the urban economics literature—declining older
cities (Glaeser and Gyourko 2005), and high amenity cities (Albouy 2016). Both groups of cities have
low cost effectiveness. High amenity areas may face high wages and construction costs, while declining
urban areas may have governments which are less effective at managing grants.
Overall, this evidence does not suggest dramatic heterogeneity in cost-effectiveness. Compared to
the mean grant, grants to declining urban areas are significantly less cost-effective, while grants to the
generally rural counties where many people go fishing or swimming are significantly more effective. Most
others are statistically indistinguishable from the mean grant, though there is some moderate (if statis-
tically insignificant) heterogeneity in point estimates.
30
Formally, if Zpy is a characteristic of plant p in year y, we add the controls Gpy dd Zpy and Zpy ηdy to equation (3). The term
Zpy ηdy allows the downstream-by-year fixed effects to vary with the binary characteristic Zpy .
31
Advanced abatement technologies can target pollutants which more basic abatement technologies do not target. So it is
plausible that the marginal abatement cost curve for an individual emitted pollutant is increasing, but the curve for ambient
levels of an omnibus measure of water pollution like dissolved oxygen or fishability is not substantially increasing over the range
of technologies we observe.
32
The measure of swimming includes only natural water bodies and excludes pools.
26
6.4 Pass-Through of Clean Water Act Grants to Municipal Expendi-
ture
We now study how grants affect municipal spending on wastewater treatment. This helps interpret our
cost-effectiveness numbers since crowding out or in would change a grant’s effective cost.
Table 4 reports estimates corresponding to equation (5). In Panel A, the main explanatory variable
excludes required municipal contributions, while Panel B includes them. Column (1) reports a basic
differences-in-differences regression with nominal dollars. Column (2) uses real dollars. A city may
spend a grant in years after it is received, so real pass-through may be lower than nominal pass-through.
Column (3) adds river basin-by-year fixed effects. Column (4) reweights estimates using the inverse of
the estimated propensity score for inclusion in the balanced panel of cities.
The estimates in Table 4 are generally consistent with near complete pass-through, i.e., little or no
crowding out or in beyond the required municipal capital copayment. Panel A does estimate pass-through
above one, but this is expected since Panel A excludes the required municipal copayment. The Panel
A pass-through estimates range from 1.15 to 1.27 in real terms or 1.53 in nominal, which mean that
city expenditure increased by around the amount of the typical copay (which was typically a third of
the federal grant). Panel B of Table 4 includes the local copayment in the main explanatory variable,
and the estimates imply pass-through rates of 0.86 to 0.94 in real terms or 1.09 in nominal terms. The
point estimates are near one, and all of these estimates are within a standard deviation of one, so fail
to reject the hypothesis that the municipal wastewater investment exactly equals the cost listed in the
grant project data.33
We emphasize a few caveats in interpreting Table 4. First, the analysis is based on only 199 cities.
The inverse propensity score reweighted estimates are designed to reflect the entire population of US
cities, though interpretation of these estimates should bear in mind their limited sample. Second, this
city-level difference-in-difference estimate cannot use the upstream-downstream comparison that sharpens
identification for the pollution estimates. The basin-by-year fixed effects and population and city revenue
covariates do provide some additional useful controls.
Third, this analysis is different from the question of what municipal spending (and pollution and home
values) would be in a world without the Clean Water Act. Our estimates are consistent with no crowdout
for an individual grant, but it is plausible that the existence of the Clean Water Act decreases municipal
investment in wastewater treatment in aggregate. Jondrow and Levy (1984) and CBO (1985) report
national time-series patterns of municipal spending on sewerage capital and federal grants. Identification
from a national time series is not especially compelling, since other national shocks like the 1973-5 and
early 1980s recessions, high inflation, high interest rates, and the OPEC crisis make the 1960s a poor
33
We also explored estimates controlling for city-year population or city-year municipal revenue. These controls could help
address possible omitted variables bias due to city growth in these differences-in-differences regressions, but population and
especially city-year total expenditure are potentially a case of bad controls (Angrist and Pischke 2009) since they are potentially
affected by grants. Adding population or city revenue controls to the specification of column (4) in Table 4 gives estimates of
1.22 (0.30) or 0.91 (0.18) for Panel A, and 0.92 (0.22) or 0.68 (0.13) for Panel B. We discuss a range of pass-through estimates
including these for cost-effectiveness and cost-benefit analysis.
27
counterfactual for the 1970s and 1980s.34 But their data suggest that nationally, municipal revenue
dedicated to wastewater capital spending was increasing in the 1960s and decreasing after 1972, which
is consistent with aggregate crowdout. Combining the results of Table 4 with these patterns suggests
that once the Clean Water Act began, cities became less likely to spend municipal funds on wastewater
treatment capital. In this sense, the existence of the Clean Water Act did crowd out aggregate municipal
investment in wastewater treatment. But municipal investments that occurred were closely connected
to federal grant funds, and our point estimates imply that the level of grant costs reported in our data
accurately represents the actual change in spending.35
Given these caveats, we now discuss how our cost-effectiveness estimates would change under different
assumptions about crowd out.36 Table 3 shows that it costs $0.5 million annually to increase dissolved
oxygen saturation in a river-mile by 10 percent, and $1.5 million annually to make a river-mile fishable.
Our real pass-through point estimate of 0.91 from column (4) of Table 3 implies cost-effectiveness numbers
of $0.45 million for oxygen and $1.35 million for the fishable standard. The 95 percent confidence region
for our real pass-through estimate ranges from 0.50 to 1.32, which implies a range of cost-effectiveness
values between $0.25 million and $0.66 million for oxygen, and between $0.75 million and $1.98 million
for fishable. All these estimates represent the cost per year to make a river mile fishable or to increase
dissolved oxygen saturation by 10 percent for a year.
28
column (4) includes homes within 25 miles.
Panel A reports estimates of how grants affect log mean home values. The positive coefficients in the
richer specifications of columns (2) through (4) are consistent with increases in home values, though most
are statistically insignificant. Column (4) implies that each grant increases mean home values within 25
miles of affected waters by three hundredths of a percentage point. The 0.25 or 1.0 mile estimates are
slightly larger than the 25 mile estimate, which is consistent with the idea that residents nearer to the
river benefit more from water quality. Panel B analyzes how grants affect log mean rental values. These
estimates are generally smaller than the estimates for housing. The estimate in column (4), including
homes within a 25 mile radius of downstream rivers, is small but actually negative.
Panels A and B reflect the classic hedonic model, with fixed housing stock. Panels C and D estimate
the effect of grants on log housing units (panel C) or the log of the total value of the housing stock
(panel D). In the presence of elastic housing, measuring only price effects (as in Panels A and B) could
understate willingness-to-pay for local amenities. Moreover, many cities have had substantial waterfront
development, which could be related to water quality.
Panels C and D suggest similar conclusions as Panels A and B. Most of these estimates are small
and actually negative. One is marginally significant (Panel C, column 1), though the precision and point
estimate diminish with the controls of column (2). Column (4) in of Panel D literally implies that each
grant decreases the total value of the housing stock within a 25 mile radius of downstream waters by one
point five hundredths of a percentage point.
Figure 4 shows event study graphs, which suggest similar conclusions as these regressions. Panel A
shows modest evidence that in the years after a plant receives a grant, the values of homes within 0.25
miles of the downstream river increase. The increases are statistically insignificant in most years and
small in magnitude. Panel B shows no evidence that homes within 25 miles of the downstream river
increase after a treatment plant receives a grant.
We also report a range of sensitivity analyses, which are broadly in line with the main results. Esti-
mates appear in Appendix Table 8 and discussion appears in Appendix E.3.
29
1970 and 1980 censuses failed to define tracts for non-metro areas, so we lack home values and rents for
these areas. Column (4) includes these non-metro areas by imputing home values and number of housing
units for each non-metro tract in 1970 and 1980 as fitted values from a panel regression of log mean home
values on year fixed effects and tract fixed effects.
Considering all owner-occupied homes within 25 miles of the river, the estimated ratio of the grants’
aggregate effects on home values to the grants’ costs is 0.25. Adding rental units in column (3) does not
change this estimate out to two decimal points (though does at further decimals). The main regression
sample includes only a balanced panel of tracts that appear in all four censuses between 1970-2000;
imputing values for missing homes hardly changes the ratio in column (4). These confidence regions do
not reject the hypothesis that the ratio of the change in home values to the grants’ costs is zero. All four
columns, however, reject the hypothesis that the change in home values equals the grants’ costs.
We also consider alternative assumptions on the timing of homeowner beliefs. As discussed in Ap-
pendix B.5, evidence suggests that people may slowly update beliefs about housing values to reflect local
public goods. We address this possibility in three alternative estimates. The first applies the change in
rental values, which may more quickly reflect changes in local public goods, to both the value of owner-
occupied and rented housing. Tenants do receive tenure discounts, though we are not aware of evidence
that such discounts are systematically related to changes in local public goods. Second, we allow ten
years for the effect of grants on home values to appear; formally, we define the cumulative number of
grants relevant to home prices or rents in year t as the number of cumulative grants observed in year
t − 10 (Appendix Table 8, row 11). Third, we combine these approaches and apply estimated effects of
grants on log rents to both rented and owner-occupied housing, but allow ten years for rents to reflect the
effect of grants. Under these three approaches, the ratios of measured benefits to costs are -0.11 (0.16),
0.11 (0.31), and 0.11 (0.10), respectively. These are broadly similar to the main results.
30
also have a greater ratio than that of the mean grant. Row 8 finds that grants to declining urban areas
(Glaeser and Gyourko 2005) have slightly lower ratios, while the ratio for high amenity areas (Albouy
2016) is greater. Finally, row 9 tests for differences in the housing market response by census region.
This specification finds that grants to the Northeast have smaller ratios, while grants to the south have
larger ratios around 0.73. None of these ratios in rows 6-9 are significantly different than that of the
mean grant.
The map in Appendix Figure 10 shows heterogeneity in the ratio of measured benefits to costs across
U.S. counties. This map assumes the same hedonic price function nationally and reflects spatial heteroge-
neity in the density of housing units. Specifically, these estimates divide treatment plants into ten deciles
of the number of people in 2000 living within 25 miles of downstream river segments. They then use the
regression estimates from column 4 of Table 5 to calculate the ratio of the change in the value of housing
and grant costs, separately for each decile.39 Finally, we average this ratio across all plants in each county.
The map shows that the ratio of measured benefits to costs is much larger in more populated counties.
The bottom decile of counties, for example, includes ratios of measured benefits to costs of below 0.01.
The top decile of counties includes ratios between 0.31 and 0.45. Grants and population are both highly
skewed—37 percent of grant costs and 54 percent of population are in the top decile.
We take three overall conclusions from this analysis of heterogeneity. First, we find suggestive evidence
that ratios of measured benefits to costs follow sensible patterns, though not all estimates are precise.
Second, none of these subsets of grants considered has a ratio of measured benefits to costs above one,
though many of the confidence regions cannot reject a ratio of one. The largest ratios of estimated benefits
to costs are for areas where outdoor fishing or swimming is common (ratio of 0.57), for high amenity
urban areas (ratio of 0.63), and in the South (ratio of 0.74). Third, the predictable spatial variation in
the net benefits of water quality variation suggests that allowing the stringency of regulation to vary over
space might give it greater net benefits (see related arguments for air pollution in Muller and Mendelsohn
(2009)). For example, U.S. regulators are actively considering cap-and-trade systems for water pollution,
and the heterogeneity described here could give a reason to investigate non-uniform trading ratios which
account for non-uniform spatial benefits of water pollution (see Fowlie and Muller (2013) for another air
pollution application).
31
hedonic estimates as a lower bound on total benefits.40
First, people might have incomplete information about changes in water pollution and their welfare
implications. Cross-sectional and time series analyses do find statistically significant though imperfect
correlation between perceived local water pollution, as reported in surveys, and objectively measured
local water pollution (Faulkner, Green, Pellaumail, and Weaver 2001; Poor, Boyle, Taylor, and Bouchard
2001; Jeon, Herriges, Kling, and Downing 2005; Steinwender, Gundacker, and Wittmann 2008; Artell,
Ahtiainen, and Pouta 2013). Residents may observe indirect signals of polluted water, such as fish kills
(dead fish floating on the water or sitting on the bank), algae blooms, sulfurous odors like rotten eggs,
miscolored waters, and may learn about pollution directly from waterfront warning signs, news coverage,
and publicity. People who go fishing value actually catching fish, so if pollution decreases the number of
fish someone can catch, then water pollution may discourage fishing.
Incomplete information would be especially important if pollution abatement improves health. A
direct analysis of the relationship between surface water quality and health would have more potential
in recent years when more extensive health data with detailed geographic identifiers are available. Mis-
perception would be less important if most benefits of surface water quality accrue through recreation
or ambiance, since failing to perceive water pollution through any means (including the absence of fish)
would mean its effects on recreational demand are limited. Most recent cost-benefit analyses of the Clean
Water Act do estimate that a substantial share of benefits come from recreation and ambiance channels
(Lyon and Farrow 1995; Freeman 2000; USEPA 2000a). Cropper and Oates (1992) in fact describe the
Clean Water Act as the only major environmental regulation of the 1970s and 1980s which does not have
health as its primary goal.
Second, these hedonic regressions may include some reasons for which people value water quality,
including recreation and aesthetics, though not all. One category not counted in these estimates is
“nonuse” or “existence” values. A person may value a clean river even if that person never visits or
lives near that river. As mentioned in the introduction, the measurement of nonuse values has been
controversial and faces severe challenges. The controversy is not over whether nonuse values exist; it
is straightforward to explain their place in economic theory (Diamond and Hausman 1993). Instead,
the controversy is over the extent to which stated preference or contingent valuation methods accurately
reflect nonuse values. We recognize both the potential importance of nonuse values for clean surface waters
and the severe challenges in accurately measuring these values.41 Several other categories potentially not
measured here include the value for commercial fisheries, industrial water supplies, lower treatment costs
for drinking water, and safer drinking water.42 Evidence on the existence and magnitude of the benefits
40
Appendix F discusses several additional reasons which we believe have weaker support. These reasons include: changes in
housing supply or resident characteristics; local taxes or fees; inattention to recreation; or homeowner advanced expectations
about grants.
41
The USEPA’s (2000a) cost-benefit analysis of the Clean Water Act does estimate that nonuse values are about a sixth as
large as use values. A meta-analysis of 131 estimates from 18 studies that compares use and nonuse values for water pollution
also estimates that nonuse values are about a third as large as use values (Houtven, Powers, and Pattanayak 2007). These
analyses, however, are subject to the same kinds of serious concerns about both use and non-use estimates in the underlying
studies.
42
Flint, Michigan, has recently had high lead levels in drinking water due to switching its water source from the Detroit River
32
from these other channels is limited, though as mentioned above, recreation and aesthetics are believed
to account for a large majority of the benefits of clean surface waters.
Third, this analysis abstracts from general equilibrium changes. One possible channel is that wages
change to reflect the improvement in amenities (Roback 1982). A second general equilibrium channel is
that the hedonic price function may have shifted. In the presence of such general equilibrium changes,
our estimates could be interpreted as a lower bound on willingness to pay (Banzhaf 2015).
Other possible general equilibrium channels describe reasons why the effects of cleaning up an entire
river system could differ from summing up the effects of site-specific cleanups. One channel involves
substitution. Rivers can be thought of as a differentiated product. Cleaning up part of a river in an area
with many dirty rivers might have different value than cleaning up a river in an area with many clean
rivers. Another possible channel involves ecology. The health of many aquatic species (so indirectly, the
benefit people derive from a river) may depend nonlinearly on the area of clean water. Ecologists have
well established that the growth rate of a population is an inverted U-shaped function of the population’s
size (Allee 1938), and the population size may depend on the area of clean river. Our approach focuses
on the effects of cleaning up an individual site and is not as well suited to capture the potentially distinct
effects of cleaning up entire river systems, though this is a potentially fruitful topic for future study.
Fourth, the 25 mile radius is only designed to capture 95 percent of recreational trips. The magnitude
of benefits from the remaining 5 percent of trips is unclear; the last 5 percent of trips might account for
more than 5 percent of the total surplus from recreational demand because they represent people willing
to travel great distances for recreation. Alternatively, the most distant travelers might be the marginal
participants in recreation, and the inframarginal people who derive the most surplus from water-based
recreation may be those who choose to live nearer to recreational areas. Our recreation data also represent
all trips, and water-based recreation trips might require different travel distances.
Finally, while our pass-through estimates suggest that an individual grant did not cause substantial
crowd-out or crowd-in of municipal spending, we interpret this estimate cautiously since it reflects only
199 cities, does not use upstream waters as a comparison group, and reflects pass-through of marginal
changes in investment, rather than the entire Clean Water Act.
Considering alternative pass-through values helps interpret these results. The point estimate in column
(4) of Table 4 implies that each dollar of federal grants leads to 91 cents of additional municipal capital
spending. In terms of Table 6, this point estimate of pass-through would imply that the ratio of the
change in housing values to costs is 0.31. The 95 percent confidence interval of our pass-through estimate
ranges from 0.50 to 1.32; in terms of Table 6, this implies the ratio of the change in housing values to
costs ranges between 0.21 and 0.56. Alternatively, one way to assess the importance of crowdout is to
ask: what value of pass-through would be needed to make the change in housing values exceed costs?
Table 6 implies that for any pass-through rate above 0.28, costs exceed the change in housing values.
to the Flint River. The Flint River has high chloride levels due partly to natural salt levels and to road salt. The chloride
causes service lines to leech lead. Flint potentially could have prevented these problems by adding corrosion inhibitors (e.g.,
orthophosphate), which are used in many cities including the Detroit water that Flint previously used, at cost of around $100/day.
Drinking water treatment falls under a separate set of regulations, the Safe Drinking Water Act.
33
8 Conclusions
The 1972 Clean Water Act is among the largest and most controversial regulations in U.S. history. The
U.S. has spent more money abating water pollution than abating air pollution, even though economic
research on the willingness to pay for air quality regulation is well-established and research on the value
of water pollution regulation is more limited.
This paper assembles an array of new data to assess water pollution’s trends, causes, and welfare
consequences. We find that by most measures, U.S. water pollution has declined since 1972, though was
declining at an even faster rate before 1972. The share of waters that are fishable has grown by over 10
percentage points since the Clean Water Act.
We study $680 billion in expenditure due to 35,000 grants the federal government gave cities to improve
wastewater treatment plants. Each grant significantly decreased pollution for 25 miles downstream, and
these benefits last for around 30 years. We find weak evidence that local residents value these grants,
though estimates of increases in housing values are generally smaller than costs of grant projects.
These estimates may provide a lower bound on willingness to pay for water quality since they abstract
from several other potential channels of demand for surface water quality, including nonuse (“existence”)
values, general equilibrium effects, and the roughly five percent longest recreational trips. The point
estimates imply that the benefits of the Clean Water Act’s municipal grants exceed their costs if these
unmeasured components of willingness to pay are three or more times the components of willingness to
pay that we measure. As mentioned in the introduction, other recent analyses estimate benefits of the
Clean Water Act that are smaller than its costs, though these other estimates note that they may also
provide a lower bound on benefits. For example, the U.S. Environmental Projection Agency (2000a;
2000c) suggests that the cost-benefit ratio of the Clean Water Act is below 1, though the EPA’s preferred
estimate of the cost-benefit ratio of the Clean Air Act is 42 (USEPA 1997).43
Our analysis does not directly explain such different findings for the Clean Water and Clean Air
Acts. But it may be useful to highlight differences in how these laws answer four important questions
about environmental regulation. These comparisons also highlight features of the Clean Water Act which
are not widely recognized and could lead it to have lower net benefits than some other environmental
regulation.
First is the choice of policy instrument. Market-based instruments are believed to be more cost-
effective than alternatives. Parts of the Clean Air Act use cap-and-trade systems, but nearly none of
the Clean Water Act does. The grants we study actually subsidize the adoption of pollution control
equipment, which is a common policy globally that has undergone little empirical economic analysis.
A second question is scope. Cost-effective regulation equates marginal abatement costs across sources,
which requires regulating all sources. The Clean Air Act covers essentially all major polluting sectors.
The Clean Water Act, by contrast, mostly ignores “non-point” pollution sources like agriculture. Ignoring
such a large source of pollution can make aggregate abatement more costly.
43
Analyses of the Clean Air Act relying solely on hedonic estimates generally have smaller cost-benefit ratios; the EPA’s
cost-benefit numbers for air pollution rely heavily on estimated mortality impacts.
34
A third question involves substitution. Optimizing consumers should equate the marginal disutility
of pollution to the marginal cost of protection from pollution. People breathe the air quality where they
live, and relocating to another airshed or some other defenses against air pollution are costly (Deschenes,
Greenstone, and Shapiro 2013). For water pollution, however, people can more easily substitute between
nearby clean and dirty rivers for recreation.
A fourth question involves health. Air is typically unfiltered when it is inhaled, so air pollution
is believed to have large mortality consequences that account for much of the benefits of air pollution
regulation. Surface waters, by contrast, are typically filtered through a drinking water treatment plant
before people drink them. Most analyses of recent U.S. water quality regulation count little or no direct
benefit from improving human health (Lyon and Farrow 1995; Freeman 2000; USEPA 2000a; Olmstead
2010).44
Finally, we note one similarity between and air water pollution that may be relevant to policy design.
We find some evidence that the net benefits of Clean Water Act grants vary over space in tandem with
population density and the popularity of water-based recreation. Related patterns have been found for
air pollution, and suggest that allowing the stringency of pollution regulation to vary over space might
have potential to produce welfare gains.
References
Acemoglu, D., U. Akcigit, D. Hanley, and W. Kerr (2016): “Transition to Clean Technology,”
Journal of Political Economy, 124(1), 52–104.
Adler, R. W., J. C. Landman, and D. M. Cameron (1993): The Clean Water Act 20 Years Later.
NRDC.
Aidt, T. S. (1998): “Political internalization of economic externalities and environmental policy,” Jour-
nal of Public Economics, 69, 1–16.
Albouy, D. (2016): “What are Cities Worth? Land Rents, Local Productivity, and the Total Value of
Amenities,” Review of Economics and Statistics, 98(3), 477–487.
Alsan, M., and C. Goldin (2015): “Watersheds in Infant Mortality: The Role of Effective Water and
Sewerage Infrastructure, 1880 to 1915,” NBER Working Paper 21263.
American Society of Civil Engineers (2011): “Failure to Act: The Economic Impact of Current
Investment Trends in Water and Wastewater Treatment Infrastructure,” Discussion paper, ASCE.
44
This may contrast with the regulation of surface water quality in developing countries and in the historic U.S. (Ebenstein,
2012; Alsan and Goldin, 2015), both cases where drinking water was less well filtered, piped water access less widespread, and
stringent regulatory standards for coliforms and other bacteria in drinking water were less common or less well enforced.
35
Angrist, J. D., and J.-S. Pischke (2009): Mostly Harmless Econometrics: An Empiricist’s Compa-
nion. Princeton University Press.
Artell, J., H. Ahtiainen, and E. Pouta (2013): “Subjective vs. objective measures in the valuation
of water quality,” Journal of Environmental Economics and Management, 130, 288–296.
Banzhaf, H., and O. Farooque (2013): “Interjurisdictional housing prices and spatial amenities:
Which measures of housing prices reflect local public goods?,” Regional Science and Urban Economics,
43, 635–648.
Banzhaf, H. S. (2015): “Panel Data Hedonics: Rosen’s First Stage and Difference-in-Differences as
“Sufficient Statistics”,” Mimeo, Georgia State.
Baumol, W. J., and W. Oates (1988): The Theory of Environmental Policy. Cambridge University
Press.
Bayer, P., F. Ferreira, and R. McMillan (2007): “A Unified Framework for Measuring Preferences
for Schools and Neighborhoods,” Journal of Political Economy, 115(4), 588–638.
Becker, R. (2015): “Water Use and Conservation in Manufacturing: Evidence from U.S. Microdata,”
CES Discussion Paper 15-16.
Benı́tez-Silva, H., S. Eren, F. Heiland, and S. Jiménez-Martı́n (2015): “How well do individuals
predict the selling prices of their homes?,” Journal of Housing Economics, 29, 12–25.
Boscoe, F. P., K. A. Henry, and M. S. Zdeb (2012): “A Nationwide Comparison of Driving Distance
Versus Straight-Line Distance to Hospitals,” Professional Geographer, 64(2), 188–96.
Bound, J., and A. B. Krueger (1991): “The Extent of Measurement Error in Longitudinal Earnings
Data: Do Two Wrongs Make a Right?,” Journal of Labor Economics, 9(1), 1–24.
Carson, R. T., and R. C. Mitchell (1993): “The Value of clean water: The public’s willingness to
pay for boatable, fishable, and swimmable quality water,” Water Resources Research, 29(7), 2445–2454.
CBO (1985): “Efficient Investment in Wastewater Treatment Plants,” Discussion paper, Congressional
Budget Office.
Cohen, A., and D. A. Keiser (2015): “The Effectiveness of Overlapping Pollution Regulation: Evi-
dence from the Ban on Phosphate in Dishwasher Detergent,” Mimeo, Iowa State.
Cremeans, J. E., and F. W. Segel (1975): “National Expenditures for Pollution Abatement and
Control, 1972,” Survey of Current Business, 55(2), 8–11, 35.
Cropper, M., and W. E. Oates (1992): “Environmental Economics: A Survey,” Journal of Economic
Literature, 30(2), 675–740.
36
Cutler, D., and G. Miller (2005): “The Role of Public Health Improvements in Health Advances:
The Twentieth-Century United States,” Demography, 42(1), 1–22.
Davis, M. A., and E. Quintin (2016): “On the Nature of Self-Assessed House Prices,” Mimeo, Rutgers.
Deschenes, O., M. Greenstone, and J. S. Shapiro (2013): “Defensive Investments and the Demand
for Air Quality: Evidence from the NOx Budget Program and Ozone Reductions,” Mimeograph, MIT.
Diamond, P. A., and J. A. Hausman (1993): Contingent Valuation: A Critical Assessment chap. On
Contingent Valuation Measurement of Nonuse Values, pp. 3–38. North-Holland.
DiPasquale, D., and C. Somerville (1995): “Do House Price Indices Based on Transacting Units
Represent the Entire Stock? Evidence from the American Housing Survey,” Journal of Housing Eco-
nomics, 4, 195–229.
Earnhart, D. (2004a): “Panel Data Analysis of Regulatory Factors Shaping Environmental Perfor-
mance,” Review of Economics and Statistics, 86(1), 391–401.
Ebenstein, A. (2012): “The Consequences of Industrialization: Evidence from Water Pollution and
Digestive Cancers in China,” Review of Economics and Statistics, 94(1), 186–201.
Faulkner, H., A. Green, K. Pellaumail, and T. Weaver (2001): “Residents’ perceptions of water
quality improvements following remediation work in the Pymme’s Brook catchment, north London,
UK,” Journal of Environmental Management, 62, 239–254.
Freeman, A. M. (2000): Public Policies for Environmental Protection chap. Water Pollution Policy,
pp. 169–213. Resources.
Fullerton, D., and R. D. Mohr (2003): “Suggested Subsidies are Sub-optimal Unless Combined
with an Output Tax,” Contributions to Economic Analysis & Policy, 2(1).
Gianessi, L. P., and H. M. Peskin (1981): “Analysis of National Water Pollution Control Policies:
2. Agricultural Sediment Control,” Water Resources Research, 17(4), 803–821.
Glaeser, E. L., and J. Gyourko (2005): “Urban Decline and Durable Housing,” Journal of Political
Economy, 113(2), 345–375.
Greenstone, M., and J. Gallagher (2008): “Does Hazardous Waste Matter? Evidence from the
Housing Market and the Superfund Program,” Quarterly Journal of Economics, 123(3), 951–1003.
Greenstone, M., and R. Hanna (2014): “Environmental Regulations, Air and Water Pollution, and
Infant Mortality in India,” American Economic Review, 104(10), 3038–72.
Griliches, Z., and J. A. Hausman (1986): “Errors in Variables in Panel Data,” Journal of Econome-
trics, 31, 93–118.
37
Haines, M. R., and ICPSR (2010): “Historical, Demographic, Economic, and Social Data: The United
States, 1790-2002,” .
Hall, B., and M. L. Kerr (1991): 1991-1992 Green Index: A State-by-State Guide to the Nation’s
Environmental Health. Island Press.
Hausman, J. (2012): “Contingent Valuation: From Dubious to Hopeless,” Journal of Economic Per-
spectives, 26(4), 43–56.
Henriques, A. M. (2013): “Are Homeowners in Denial about their House Values? Comparing Owner
Perceptions with Transaction-Based Indexes,” Mimeo, Federal Reserve Board.
Herriges, J., C. L. Kling, and D. Phaneuf (2015): “Does Water Quality Matter? Evidence from
Micro Panel Data,” Mimeo, Michigan State University.
Hines, N. W. (1967): “Nor Any Drop to Drink: Public Regulation of Water Quality. Part I: State
Pollution Control Programs,” Iowa Law Review, 186.
Hitchcock, C. Y., and M. D. Giggey (1975): “Report to National Commission on Water Quality
on assessment of technologies and costs for publicly owned treatment works under public law 92-500,”
Discussion paper, Metcalf & Eddy, Inc.
Houtven, G. V., J. Powers, and S. K. Pattanayak (2007): “Valuing water quality improvements in
the United States using meta-analysis: Is the glass half-full or half-empty for national policy analysis?,”
Resource and Energy Economics, 29, 206–228.
Isaak, D., S. Wollrab, D. Horan, and G. Chandler (2012): “Climate change effects on stream
and river temperatures across the northwest U.S. from 1980-2009 and implications for salmon fishes,”
Climatic Change, 113(2), 499–524.
James R. Follain, J., and S. Malpezzi (1981): “Are Occupants Accurate Appraisers?,” Review of
Public Data Use, 9(1), 47–55.
Jeon, Y., J. A. Herriges, C. L. Kling, and J. Downing (2005): “The Role of Water Quality
Perceptions in Modeling Lake Recreation Demand,” Iowa State Working Paper 05032.
John L. Goodman, J., and J. B. Ittner (1992): “The Accuracy of Home Owners’ Estimates of House
Value,” Journal of Housing Economics, 2, 339–357.
38
Jondrow, J., and R. A. Levy (1984): “The Displacement of Local Spending for Pollution Control by
Federal Construction Grants,” American Economic Review Papers and Proceedings, 74(2), 174–8.
Kahn, M. E., P. Li, and K. Zhao (2015): “Water Pollution Progress at Borders: The Role of Changes
in China’s Political Propotion Incentives,” American Economic Journal: Economic Policy, 7(4), 223–
242.
Keiser, D. A. (2016): “The Missing Benefits of Clean Water and the Role of Mismeasured Pollution
Data,” Mimeo, Iowa State.
Kiel, K. A., and J. E. Zabel (1999): “The Accuracy of Owner-Provided House Values: The 1978-1991
American Housing Survey,” Real Estate Economics, 27(2), 263–298.
Kline, P., and C. Walters (Forthcoming): “Evaluating Public Programs with Close Substitutes: The
Case of Head Start,” Quarterly Journal of Economics.
Kling, C. L., D. J. Phaneuf, and J. Zhao (2012): “From Exxon to BP: Has Some Number Become
Better than No Number,” Journal of Economic Perspectives, 26(4), 3–26.
Knopman, D. S., and R. A. Smith (1993): “Twenty Years of the Clean Water Act: Has U.S. Water
Quality Improved?,” Environment, 35(1), 17–41.
Kohn, R. E. (1992): “When Subsidies for Pollution Abatement Increase Total Emissions,” Southern
Economic Journal, 59(1), 77–87.
Kuwayama, Y., and S. Olmstead (2015): Handbook on the Economics of Natural Resources chap. Wa-
ter quality and economics: willingness to pay, efficiency, cost-effectiveness, and new research frontiers,
pp. 474–501. Robert Halvorsen and David F. Layton.
Kuzmenko, T., and C. Timmins (2011): “Persistence in Housing Wealth Perceptions: Evidence from
the Census Data,” Mimeo, Duke.
Lake, E. E., W. M. Hanneman, and S. M. Oster (1979): Who Pays for Clean Water? The
Distribution of Water Pollution Control Costs. Westview Press.
Leggett, C. G., and N. E. Bockstael (2000): “Evidence of the Effects of Water Quality on Resi-
dential Land Prices,” Journal of Environmental Economics and Management, 39, 121–144.
Lipscomb, M., and A. M. Mobarak (Forthcoming): “Decentralization and Pollution Spillovers: Evi-
dence from the Re-drawing of Country Borders in Brazil,” Review of Economic Studies.
Lung, W.-S. (2001): Water Quality Modeling for Wasteload Allocations and TMDL. Wiley.
Lyon, R. M., and S. Farrow (1995): “An economic analysis of Clean Water Act issues,” Water
Resources Research, 31(1), 213–223.
Mehan III, G. T. (2010): “A Symphonic Approach to Water Management: The Quest for New Models
of Watersed Governance,” Journal of Land Use & Environmental Law, 26(1), 1–33.
39
Mendelsohn, R., D. Hellerstein, M. Huguenin, R. Unsworth, and R. Brazee (1992): “Me-
asuring Hazardous Waste Damages with Panel Models,” Journal of Environmental Economics and
Management, 22, 259–271.
Millock, K., and C. Nauges (2006): “Ex Post Evaluation of an Earmarked Tax on Air Pollution,”
Land Economics, 82(1), 68–84.
Moeltner, K., and R. von Haefen (2011): “Microeconometric Strategies for Dealing with Unob-
servables and Endogenous Variables in Recreation Demand Models,” Annual Reviews of Resource
Economics, 3, 375–96.
Muehlenbachs, L., E. Spiller, and C. Timmins (2015): “The Housing Market Impacts of Shale Gas
Development,” American Economic Review, 105(12), 3633–59.
Muller, N. Z., and R. Mendelsohn (2009): “Efficient Pollution Regulation: Getting the Prices
Right,” American Economic Review, 99(5), 1714–1739.
Muller, N. Z., R. Mendelsohn, and W. Nordhaus (2011): “Environmental Accounting for Pollu-
tion in the United States Economy,” American Economic Review, 101(5), 1649–1675.
Murdock, J. (2006): “Handling unobserved site characteristics in random utility models of recreational
demand,” Journal of Environmental Economics and Management, 51(1), 1–25.
Nixon, R. (1972): “Presidential Veto Message: Nixon Vetoes Water Pollution Act,” CQ Almanac.
Oates, W. E. (1999): “An Essay on Fiscal Federalism,” Journal of Economic Literature, 37, 1120–1149.
Olmstead, S. M. (2010): “The Economics of Water Quality,” Review of Environmental Economics &
Policy, 4(1), 44–61.
Olmstead, S. M., L. A. Muehlenbachs, J.-S. Shih, Z. Chu, and A. J. Krupnick (2013): “Shale
gas development impacts on surface water quality in Pennsylvania,” Proceedings of the National Aca-
demy of Sciences, 110(13), 4962–4967.
Palmquist, R. B., and V. K. Smith (2002): The International Yearbook of Environmental and Re-
source Economics 2002/2003 chap. The use of hedonic property value techniques for policy and liti-
gation, pp. 115–164. Edward Elgar, Cheltenham, UK.
Peiser, R. B., and L. B. Smith (1985): “Homeownership Returns, Tenure Choice and Inflation,” Real
Estate Economics, 13(4), 343–360.
Poor, P. J., K. J. Boyle, L. O. Taylor, and R. Bouchard (2001): “Objective versus Subjective
Measures of Water Clarity in Hedonic Property Value Models,” Land Economics, 77(4), 482–493.
Powell, M. (1995): “Building a National Water Quality Monitoring Program,” Environmental Science
& Technology, 29(10), 458A–463A.
Quarles, J. (1976): Cleaning Up America: An Insider’s View of the Environmental Protection Agency.
Houghton Mifflin Company Boston.
40
Rabotyagov, S. S., T. D. Campbell, M. White, J. G. Arnold, J. Atwood, M. L. Norfleet,
C. L. Kling, P. W. Gassman, A. Valcu, J. Richardson, R. E. Turner, and N. N. Rabalais
(2014): “Cost-effective targeting of conservation investments to reduce the northern Gulf of Mexico
hypoxic zone,” Proceedings of the National Academy of Sciences, 111(52), 18530–18535.
Roback, J. (1982): “Wages, Rents, and the Quality of Life,” Journal of Political Economy, 90(6),
1257–1278.
Rosen, M. R., and W. W. Lapham (2008): “Introduction to the U.S. Geological Survey National
Water-Quality Assessment (NAWQA) of Ground-Water Quality Trends and Comparison to Other
National Programs,” Journal of Environmental Quality, 37, S–190 to S–198.
Shapiro, J. S., and R. Walker (2014): “Why is Pollution from U.S. Manufacturing Declining? The
Roles of Trade, Regulation, Productivity, and Preferences,” Mimeo, Yale University.
Sigman, H. (2002): “International Spillovers and Water Quality in Rivers: Do Countries Free Ride?,”
American Economic Review, 92(4), 1152–1159.
(2003): “Letting States Do the Dirty Work: State Responsibility for Federal Environmental
Regulation,” National Tax Journal, 56(1), 107–122.
Smith, R. A., R. B. Alexander, and G. Wolman (1987): “Water-Quality Trends in the Nation’s
Rivers,” Science, 235(4796), 1607–1615.
Smith, V. K., and W. H. Desvousges (1986): Measuring Water Quality Benefits. Kluwer.
Smith, V. K., and C. V. Wolloh (2012): “Has Surface Water Quality Improved Since the Clean
Water Act?,” NBER Working Paper 18192.
Snow, J. (1855): On the Mode of Communication of Cholera. John Churchill, New Burlington Street.
Steinwender, A., C. Gundacker, and K. J. Wittmann (2008): “Objective versus subjective asses-
sments of environmental quality of standing and running waters in a large city,” Landscape and Urban
Planning, 84, 116–126.
Time (1969): “The Cities: The Price of Optimism,” Time Magazine, August 1.
U.S. Army Corps of Engineers (1994): “Consolidated Performance Report on the Nation’s Public
Works: An Update,” Discussion paper, US Army Corps of Engineers.
41
USEPA (1974a): “Awards Register Volume II. Grants Assistance Programs. Showing Projects Awarded
in Fiscal Year 1974.,” Discussion paper, USEPA.
(1974b): “Awards Register. Volume III. Grants Assistance Programs. Showing Projects Awarded
in Fiscal Year 1974.,” Discussion paper, USEPA.
(1974c): “National Water Quality Inventory. 1974 Report to the Congress. Volume II,” Discus-
sion paper, USEPA.
(1975): “Clean Water Construction Grants Program News,” Discussion paper, USEPA.
(1976): “How to Obtain Federal Grants to Build Municipal Wastewater Treatment Works,”
Discussion paper, USEPA.
(1979): “1978 Needs Survey.. Conveyance and Treatment of Municipal Wastewater. Summaries
of Technical Data,” Discussion paper, USEPA.
(1981): “Facilities Planning 1981: Municipal Wastewater Treatment,” Discussion paper, USEPA.
(1997): “The Benefits and Costs of the Clean Air Act, 1970 to 1990,” Discussion paper, USEPA.
(2000a): “A Benefits Assessment of Water Pollution Control Programs Since 1972: Part 1, The
Benefits of Point Source Controls for Conventional Pollutants in Rivers and Streams: Final Report,”
Discussion paper, USEPA.
(2000b): “Liquid Assets 2000: America’s Water Resources at a Turning Point,” Discussion
paper, Office of Water.
(2000c): “A Retrospective Assessment of the Costs of the Clean Water Act: 1972 to 1997: Final
Report,” Discussion paper, USEPA.
(2001): “The National Costs to Implement TMDLs (Draft Report): Support Document 2,”
Discussion paper, USEPA.
(2002): “The Clean Water and Drinking Water Infrastructure Gap Analysis,” Discussion paper,
USEPA.
(2016): “ATTAINS, National Summary of State Information,” Accessed June 15, 2016.
USGAO (1994): “Water Pollution: Information on the Use of Alternative Wastewater Treatment Sys-
tems,” Discussion paper, USGAO.
USPHS (1961): “National Water Quality Network Annual Compilation of Data, October 1, 1960-
September 30, 1961,” Discussion paper, U.S. Department of Health, Education, and Welfare: Public
Health Service.
42
Vogan, C. R. (1996): “Pollution Abatement and Control Expenditures,” Survey of Current Business,
September, 48–67.
Walsh, P. J., J. W. Milon, and D. O. Scrogin (2011): “The Spatial Extent of Water Quality
Benefits in Urban Housing Markets,” Land Economics, 87(4), 628–644.
Watson, T. (2006): “Public Health Investments and the Infant Mortality Gap: Evidence from Federal
Sanitation Interventions on U.S. Indian Reservations,” Journal of Public Economics, 90, 1537–1560.
Wu, J., R. M. Adams, C. L. Kling, and K. Tanaka (2004): “From Microlevel Decisions to Landscape
Changes: An Assessment of Agricultural Conservation Policies,” American Journal of Agricultural
Economics, 86(1), 26–41.
43
Figure 1. National Maps of Water Pollution Data
Notes: In Panel A, rivers are colored by Stream Level from the National Hydrography Dataset. Streams
that flow into oceans, Great Lakes, Canada or Mexico and are the darkest. Streams that flow into these
are lighter; streams that flow into these are still lighter, etc. Panel C shows monitoring sites appearing
in years 1962-2001.
44
Figure 2. Water Pollution Trends, 1962-2001
Panel A. Dissolved Oxygen Deficit Panel B. Share Not Fishable
35
.4
30
Saturation Deficit (%)
.2
15
.1
1962 1972 1982 1992 2001 1962 1972 1982 1992 2001
Year Year
Notes: Graphs show year fixed effects plus a constant from regressions which also control for monitoring site fixed effects,
a day-of-year cubic polynomial, and an hour-of-day cubic polynomial, corresponding to equation (2) from the text.
Connected dots show yearly values, dashed lines show 95% confidence interval, and 1962 is reference category. Standard
errors are clustered by watershed.
Figure 3. Effects of Clean Water Act Grants on Water Pollution: Event Study Graphs
Panel A. Dissolved Oxygen Deficit Panel B. Share Not Fishable
.02
4 2
0
Share Not Fishable
Saturation Deficit (%)
0
-.02
-2
-.04
-4
-.06
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
-9 to -7
-6 to -4
-3 to -1
0 to 2
3 to 5
6 to 8
<=-10
>=30
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
0 to 2
3 to 5
6 to 8
9 to 11
-9 to -7
-6 to -4
-3 to -1
>=30
<=-10
Notes: Graphs show coefficients on downstream times year-since-grant indicators from regressions which correspond to the
specification of Table 3. These regressions are described in equation (5) from the main text. Data cover years 1962-2001.
Connected dots show yearly values, dashed lines show 95% confidence interval. Standard errors are clustered by
watershed.
Figure 4. Effects of Clean Water Act Grants on Log Mean Home Values: Event Study Graphs
Panel A. Homes Within 0.25 Miles of River Panel B. Homes Within 25 Miles of River
.04
.01
Log Mean Home Values
.005
0
0
-.02
-.005
<=-10
-9 to -7
-6 to -4
-3 to -1
0 to 2
3 to 5
6 to 8
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
>=30
<=-10
-9 to -7
-6 to -4
-3 to -1
0 to 2
3 to 5
6 to 8
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
>=30
Notes: Graphs show coefficients on year-since-grant indicators from regressions corresponding to the specification of Table
6, column (3). Connected dots show yearly values, dashed lines show 95% confidence interval. Standard errors are
clustered by watershed. Panels A and B show different ranges of values on their y-axes. Data cover decennial census years
1970-2000.
45
Table 1. Water Pollution Trends, 1962-2001
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
Panel A. Linear Trend
Year -0.234*** -0.004*** -0.064*** -83.716*** -0.004*** -0.831***
(0.0298) (0.0003) (0.0052) (8.6031) (0.0003) (0.1006)
46
Table 3. Cost Effectiveness of Clean Water Act Grants ($2014 Mn)
(1) (2) (3)
Annual Cost to Increase Dissolved Oxygen 0.57 0.44 0.54
Saturation in a River-Mile by 10% [0.38 , 1.14] [0.29 , 0.88] [0.36 , 1.06]
Annual Cost to Make a River-Mile Fishable 1.82 1.40 1.51
[1.29 , 3.08] [1.00 , 2.37] [1.00 , 3.07]
Notes: All values in $2014 millions. Calculations include grants given in years 1962-2001. Brackets show 95% confidence
intervals. Annual cost assumes benefits acccrue for 30 years. Assumed stream length means stream length is assumed to be 25
miles. "All plants" includes plants with latitude and longitude data.
Table 4. Pass-Through of Clean Water Act Grants to Municipal Sewerage Capital Spending
(1) (2) (3) (4)
Panel A. Federal Grant Funds
Federal Grant Funds 1.53*** 1.27*** . 1.15*** 1.21***
(0.27) (0.21) (0.25) (0.29)
Panel B. Grant Project Costs
Grant Project Costs 1.09*** 0.94*** 0.86*** 0.91***
(0.19) (0.15) (0.18) (0.21)
Notes: Dependent variable is municipal sewerage capital investment. All municipal and grant costs are cumulative since 1970. Sewerage
capital expenditures include federal and non-federal funds. Cumulative grant project costs also include federal grant amount and required
local capital expenditure. Municipal spending data from Annual Survey of Governments and Census of Governments. Data include balanced
panel of cities over years 1970-2001, see text for details. Columns (1)-(2) use nominal dollars; column (3) uses real dollars deflated by the
Engineering News Record Construction Price Index. Column (4) weights by inverse propensity score for appearing in the balanced panel of
cities, which is estimated as a function of log city population, log city total municipal expenditure, city type (municipality or township), and
census division fixed effects, where city population and expenditure are averaged over all years of the data. Standard errors are clustered by
city. Asterisks denote p-value < 0.10 (*), <0.05 (**), or 0.01 (***).
47
Table 5. Effect of Clean Water Act Grants on Housing Demand
(1) (2) (3) (4)
Panel A. Log Mean Home Values
Cumulative Grants -0.00008 0.00090 0.002665** 0.00025
(0.002503) (0.001412) (0.001286) (0.000326)
Panel B. Log Mean Rental Values
Cumulative Grants 0.00002 -0.00077 0.00004 -0.00011
(0.001700) (0.000833) (0.000711) (0.000158)
Panel C. Log Total Housing Units
Cumulative Grants -0.007003** -0.00046 -0.00041 -0.00018
(0.003202) (0.001186) (0.000942) (0.000242)
Panel D. Log Total Value of Housing Stock
Cumulative Grants -0.006306* 0.00005 0.00145 -0.00017
(0.003292) (0.001878) (0.001603) (0.000461)
Notes: Analysis includes homes within a given distance of downstream river segments. Data include decennial census years 1970-2000.
Cumulative grants include grants in all previous years, not only census years. See main text for description of dwelling and baseline
covariates. Standard errors are clustered by watershed. Asterisks denote p-value < 0.10 (*), < 0.05 (**), or < 0.01 (***).
Table 6. Clean Water Act Grants: Costs and Effects on Home Values ($2014 Bn)
(1) (2) (3) (4)
Ratio: Change in Home 0.04 0.25 0.25 0.28
Values / Costs (0.03) (0.33) (0.33) (0.37)
p-value: Ratio = 0 [0.15] [0.44] [0.45] [0.45]
p-Value: Ratio = 1 [0.00] [0.02] [0.02] [0.05]
Notes: All values in billions ($2014). Calculations include grants given in years 1962-2000. Ninety-five percent confidence
regions are in brackets. Estimates come from regression specifications corresponding to Table 5, column (5).
48
A Expenditure on Water Pollution Abatement
This Appendix reviews available data on expenditures for abating water pollution and air pollution.
Measuring such expenditures is difficult. The first attempt by the Bureau of Economic Analysis (BEA)
to measure pollution abatement costs describes five challenges (Cremeans and Segel 1975) including
determining counterfactual pollution abatement; the problem that many abatement technologies also
have valuable byproducts; the proper classification of capital goods used for abatement; the difficulty
in recognizing business decisions as environmental or not; and the separation of pollution abatement
expenditures from expenditures for industrial safety and related purposes. These are only accounting
challenges; an additional challenge is that even correct accounting measures do not equal full economic
costs. Our goal is simply to describe available estimates, recognizing these caveats.
We consider three sets of estimates: BEA annual accounts for the period 1972-1994; Census abatement
cost surveys for manufacturing combined with EPA expenditure records for government; and EPA reports
on the costs of the Clean Water Act and Clean Air Act. All three methods suggest that total expenditure
on water pollution abatement since the Clean Water Act has exceeded $1 trillion ($2014), which is over
$100 per person-year, or equivalently, annual expenditure just over half a percent of GDP. All three
methods also imply that expenditure on water pollution abatement has exceeded expenditure on air
pollution abatement.
The first set of estimates comes from the Bureau of Economic Analysis (BEA) for the years 1972-1994
(Vogan 1996).45 The BEA estimates aggregate expenditure on water pollution abatement in the period
1972-1994 of $1.3 or $1.4 trillion ($2014) when deflated using quantity or price indices, respectively. Pri-
vate business accounts for two-thirds of these expenditures, and government for the remaining one-third.
The BEA data report total air pollution abatement expenditures at $1.0 to $1.4 trillion ($2014) using
quantity or price indices, respectively, including expenditures by private households (e.g., for vehicles).
This indicates that water pollution abatement expenditures exceed air pollution abatement expenditures
by 6 to 27 percent.
The second set of estimates comes from the Census Bureau for private industry and EPA for govern-
ment sources. The Census conducted the Pollution Abatement Costs and Expenditures survey annually
between 1972 and 1994. We sum capital and operating costs from this survey, and linearly interpolate
for the year 1987 (which had no survey). These data indicate total 1973-1994 abatement expenditures of
$315 billion for water pollution abatement and $338 billion for air pollution abatement ($2014). These
numbers include only the manufacturing sector. Our EPA data on the construction grants program
indicate that local governments spent about $190 billion in federal grant funds, supplemented by local
expenditures, and a federal Revolving Loan fund. 46
The third set of estimates is from EPA reports on the costs of the Clean Air and Clean Water Acts
(USEPA 1997, 2000c). In 1990, the compliance cost of the Clean Air Act was about $25 billion ($1990).
In 1994, the cost of water pollution abatement was $44.6 billion ($1997), though $32 billion of this would
have been spent even without the Clean Water Act. These reports provide no evidence on trends in these
numbers. Under the strong assumption that they had been constant over the period 1972-2001, they
imply costs of $2.5 trillion for water pollution and $1.8 trillion for air pollution ($2014).
45
The BEA reports both quantity and price deflators indexed to 1992. We deflate all BEA values to 1992. For comparability
with the rest of the paper, which reports figures in 2014 dollars, we then deflate these values to the year 2014 using the 1992
and 2014 Construction Price Index of Engineering News Records.
46
A study by national Mayor office estimates that local governments spent $1.4 trillion ($2008) on wastewater treatment
between 1956 and 2008. Another estimate of these expenditures is a report by the Congressional Budget Office (CBO 1985)
which reports that total annual wastewater spending by federal, state, and local governments was above $7 billion (in 1983
dollars) in each year between 1972 and 1983. Extrapolating to the entire period 1972-2016 implies total expenditure was above
$753 billion (2014$).
49
B Data Details
For each dataset, this section provides additional details on the data and on various tests we have
undertaken to probe their accuracy.
B.1 Deflator
Published annually since 1908, the Construction Cost Index of the Engineering News Record (ENR)
index reflects the cost of 200 hours of common labor including wages and fringe benefits, the cost of 2,500
pounds of fabricated structural steel, 1.128 tons of bulk portland cement, and 1,088 board feet of 2x4
lumber. To obtain the index, ENR averages the cost across 20 large cities. The closest series published
by the federal government is the Census’ construction price index for single-family homes.
50
metro areas, so they sightly over-represent metro areas. Monitoring is evenly spread through the 1970s,
1980s, and 1990s, though much less common in the 1960s.
Plotting densities of the raw data helps illustrate some of their properties (Appendix Figure 1).
Dissolved oxygen deficits follow a roughly normal distribution, while BOD, fecal coliforms, and TSS are
more skewed. The dissolved oxygen deficit distribution is smoother than the others because its sample
size is bigger. Some reports list pollution only out to two, one, or zero decimal points, which leads to
heaping in the raw data and visible pileups, for example, at BOD of 0=ln(1) or 0.69=ln(2).
We download the Storet Legacy data from U.S. EPA’s Storet Legacy Data Center and Storet and
NWIS data from the Water Quality Portal. Several decisions are required to extract data from the three
repositories of water pollution data and to make them comparable. We describe steps for each issue in
turn, and then steps taken to make the three repositories comparable.
Ambient Monitoring in Rivers and Lakes. Our analysis includes only rivers and lakes. This
excludes estuaries, oceans, wells, pipes, canals, sampling inside industrial plants, and other sites, though
these other types are uncommon in the pollution data. In Storet Legacy, we identify streams and lakes
using the Storet Legacy station type field provided by the station files. We also remove measurements
where the Primary Activity is Effluent Permit Condition, Effluent(Sample), Biological, or Tissue. We also
exclude records where the Secondary Activity Category is Dredge, Core, Ground Water, or others that are
clearly not river or lake samples, such as Standard Deviation or Sum of Squares. Additionally in all three
repositories, we exclude records around dams since they are highly dependent on dam operations and
often for dam monitoring; these are identified from the word, “Dam,” in the station name or description.
In Modern Storet and NWIS, we limit the data to rivers/streams and lakes in a few steps. First,
we restrict the sample media to surface water. We also restrict the media subtype to Surface Water,
which removes typically less than 1 percent of records that are coded as other media subtype such as
Effluent or Groundwater. For Modern Storet, the media subtype field is only populated for approximately
20 to 30 percent of observations. Of those coded, typically less than 1 percent of records are classified
as something other than “Surface Water.” Thus, for Modern Storet, we keep records where the media
subtype is missing to preserve a large amount of data given that nearly all records that are coded are for
surface water. Next, we limit the site type to lake, reservoir, impoundment, or stream. We distinguish
streams and lakes in the NWIS station data using the provided monitoring location type name field in
the station file. Streams are identified as “Stream,” “Stream: Canal,” “Stream: Ditch,” or “Stream:
Tidal stream.” Lakes are identified as “Lake, Reservoir, Impoundment.” For Modern Storet, we also
identify streams and lakes using the monitoring location type name field. For Modern Storet, streams are
identified as “River/Stream,” “River/Stream Ephemeral,” “River/Stream Intermittent,” “River/Stream
Perennial,” “Riverine Impoundment,” “River/stream Effluent-Dominated,” “Canal Drainage,” “Canal
Irrigation,” “Canal Transport,” “Channelized Stream,” “Floodwater,” “Floodwater Urban,” or “Flood-
water non-Urban.” Lakes are identified as “Lake,” “Reservoir,” “Great Lake,” “Pond-Anchialine,” or
“Pond-Stormwater.”
Measures of Water Pollution. Storet Legacy and NWIS classify each measure of water pollution
according to a single parameter code. These parameter codes classify water quality parameters according
to a broadly defined characteristic (e.g., biochemical oxygen demand) and the method for measuring the
pollutant (e.g., the temperature at which the measurement is taken and the incubation time period). For
example, the parameter code 00310 describes biochemical oxygen demand, measured at a temperature of
twenty degrees Celsius, over a five day incubation period. The parameter code 00306 describes biochemical
oxygen demand, also measured at twenty degrees Celsius, but only over a four day incubation period. For
each measure of water pollution that we use, we start by choosing the parameter code which has the most
observations in STORET Legacy. In nearly all cases, this parameter code corresponds to the parameter
code given in the EPA’s first major water pollution report after the Clean Water Act (USEPA 1974c).
51
We also include parameter codes which are comparable to this main code(s) (e.g., measured in different
units or a different device) if they have at least 10,000 observations in Storet Legacy. We use this rule
because Storet Legacy has the largest number of observations among the three repositories used in the
study, and the largest share of observations concentrated around the early 1970s when the Clean Water
Act began. For NWIS data, we use the same parameter codes as Storet Legacy to extract corresponding
measures of water quality from the Water Quality Portal.
Modern Storet does not use a parameter code, but rather identifies water quality parameters according
to characteristic names. We take several steps to match these characteristics in Storet to those pollutants
in Storet Legacy and NWIS repositories. We utilize concordance tables provided by EPA and the Water
Quality Portal that map Storet Legacy and NWIS parameter codes to Modern Storet “search names.”49
A single characteristic name often corresponds to multiple parameter codes. The EPA concordance
provides the meaning of parameter codes, including information on sample preparation (e.g., details re-
garding filter size), whether the measurement was in the field or laboratory, measurement units, result
sample fraction (e.g., total versus dissolved), result temperature basis, result statistical basis (e.g., mean,
median), additional comments, and additional measurement method details. We supplement this infor-
mation with a similar table from the Water Quality Portal website that provides a few additional details
for each parameter code including result time basis, result weight basis and result particle size basis.
Between these two files, we note which aspects distinguish certain parameter codes from others and
use these to restrict and subsequently match Modern Storet pollution records to Storet Legacy and
NWIS records by pollutant. In addition to the characteristic name, the main distinguishing aspect of a
measure of water pollution is the result sample fraction field that often identifies total versus dissolved
measurements. For biochemical oxygen demand and fecal coliforms, we also restrict based on the result
temperature basis (20 degrees Celsius or missing and 45 degrees Celsius or missing respectively) and
result time basis (5 day or missing and 24 hours or missing respectively). For dissolved oxygen, we
convert dissolved oxygen in mg/L to dissolved oxygen saturation (%) using a standard formula (Lung
2001).50
Sample Exclusions. We impose several sample exclusions. We keep observations with non-missing
observation date, latitude, and longitude. We also limit analysis to ambient monitoring. To limit the
influence of outliers, for each reading above the 99th percentile of the distribution of readings, separately
by pollutant, we recode the result to equal the 99th percentile. To ease interpretation, we define all
pollution outcomes so that lower levels of the outcome represent cleaner water.
For NWIS, we exclude records of Spills, Hurricanes, and Storms by limiting to routine hydrologic
events. Modern Storet and Storet Legacy, unlike NWIS, provide no information on hydrologic events.
We convert all temperature readings to degrees Fahrenheit. For other pollutants, we keep all records
with unit data that are easily converted to standard units. For Storet Legacy and NWIS, we keep
observations with missing units since parameter codes are already assigned to specific units. For Modern
Storet, we keep observations with missing units except for dissolved oxygen and temperature. We exclude
observations with missing units for dissolved oxygen and temperature since we are unable to distinguish
49
The U.S. EPA provides several crosswalks to identify measurements in Storet that are comparable to those in the Storet
Legacy and NWIS repositories ([Link] tables/Characteristic\ Parameter\ Code\ Map/).
In particular, we use the crosswalk STORET\ Modern\ vs\ [Link]. The water quality portal table links a parameter code
to characteristic name, measurement unit code, result sample fraction, result temperature basis, result statistical basis, result
time basis, result weight basis, result particle size basis, and medium. ([Link] srsnames/)
50 DOmgl
The formula is DOperc = 468/(31.5+T ) where T is the water temperature in Celsius. We only apply this conversion for
observations which record both dissolved oxygen in mg/L and water temperature, and for station-days which do not already
have dissolved oxygen saturation (%) defined. When applied to stations which do have dissolved oxygen saturation defined,
regressing the reported level of dissolved oxygen saturation on the value obtained from this conversion obtains a coefficient of
0.996 with a standard error of 0.001.
52
between mg/l and percent saturation in the dissolved oxygen data and degrees C and degrees F in the
temperature data. We restrict our data to samples collected in the continental U.S.
Definitions of Geographic Variables. We use a few steps to define geographic variables. Sto-
ret Legacy has separate files describing stations and describing actual pollution readings. Geographic
identifiers like latitude, longitude, watershed, and county appear in both. We prioritize values of these
variables from station files. When those are missing, we supplement them with values from the results
files. For all repositories, when watershed and county are missing from both station and results files, we
define them based on a monitoring site’s latitude and longitude.
Types of Water Pollution. We use a few criteria to choose additional measures of pollution for
sensitivity analysis in Appendix Table 4. We partly take this list of additional pollutants from the EPA’s
(1974a) first major assessment of water pollution after the Clean Water Act was passed. These additional
pollutants include water temperature, ammonia-nitrogen (total as N), nitrates (total as N), total nitrite
plus nitrate, orthophosphate (dissolved as P)51 , phosphate (total as PO4), total phosphorus, total and
dissolved chlorides, color, total phenols, total dissolved solids, total and dissolved sulfate, total coliforms,
and turbidity.52 Finally, we add total nitrogen as a key measure of nutrient pollution, and lead and
mercury as important heavy metals.
Specific Monitor Networks. For NWIS, we identify stations that are a part of several networks spe-
cifically designed by USGS to examine long-term trends in water quality. These networks are NASQAN,
NAWQA, and HBN. Station identifiers were obtained from USGS. We obtained NASQAN and HBN sta-
tion identifiers through a request from USGS. NAWQA site identifiers were downloaded from a USGS web-
site using filters on “stream” and “lake” for site types ([Link] queries public/jsp/[Link]).
Our NASQAN and HBN samples include only the original NASQAN and HBN networks, which spanned
the years 1974-1995. Our NAWQA sample includes both the NAWQA networks focused on streams/rivers
and on lakes. We add “USGS-“ to the stationid field and match these station files to monitoring files
provided by the Water Quality portal. In several cases, monitoring was performed at these stations even
when they were not officially part of their designated networks. We include all monitoring results during
the 1962-2001 time period that we study.
Station Definitions. Some stations change name slightly—for example, the same station may have
similar names in Modern Storet and in Storet Legacy. In regressions that include station fixed effects or
allow station-level autocorrelation, we define a station as a unique latitude and longitude pair.
In our data, the tuple of a station’s name, the name of the agency that manages it, and the repo-
sitory (Storet Legacy, Modern Storet, or NWIS) uniquely identifies a station. When we pool the three
repositories, 5-10 percent of “stations” that appear unique by this definition have the same longitude and
latitude as another “station.” This is typically because a single station appears in both Storet Legacy
and Modern Storet but with slightly different station codes. This motivates our use of longitude and
latitude to define unique stations.
In a few cases, records appear in both Storet Legacy and Modern Storet. We identify and remove
such duplicates based on station latitude and longitude, reading date and time, and reading value.
Monitoring Depth. We do not account for reading depth since many readings have missing units
and our inspection suggests different monitors use different units.
Measurement Limits We capture special cases where the pollutant could not be measured or the
measurement was outside of standard detection limits. For NWIS and Storet, we flag records with non-
51
The 1974 report includes total soluble phosphate to determine reference levels. We choose to use orthophosphate (dissolved
as P) instead. The number of records in Storet Legacy corresponding to total soluble phosphate (38,000) is far fewer than the
870,000 monitoring results for orthophosphate (dissolved as P).
52
We add dissolved chlorides and dissolved sulfate to this list since the unique pollutant parameter codes listed in the NWQI
Report for chlorides and sulfate refer to total chlorides and total sulfate in Storet Legacy, but dissolved chlorides and dissolved
sulfate in the NWIS.
53
missing information in the detection type field. For example, this includes records coded with “Historical
Lower Reporting Limit,” “Upper Reporting Limit,” or “Estimated Detection Level.” For flagged records,
we then let the result value equal the detection limit if the result value is missing and the detection limit
is not missing. We also restrict the detection limit to be greater than zero except for temperature. For
Storet Legacy, we use the “remarks” field to flag similar records. This includes remarks coded as “B,”
“C,” “I,” “J,” “K,” “L,” “M,” “N,” “O,” “P,” “T,” “U,” “W,” “Z,” and “$,” where the key is provided
by the U.S. EPA ([Link] [Link]). We make no changes to the reported
measurement since the remarks suggest that the reported measurement equals the result value. In one
sensitivity analysis, we take readings with the “Below Detection Limit” (BDL) field coded as “Lower
Limit” or “Other,” and replace them as half the listed value.
Measurement Time. For observations with missing information on the measurement time, we
create a missing hour indicator. For Storet Legacy and Modern Storet, we also code this indicator for
hour equal to 0 since there is a pileup of observations at this hour.
Test of Dissolved Oxygen Data Quality. This full dataset has not been used in research before,
and so to ascertain the accuracy of our results it is useful to probe the quality of these data. The paper
does report numerous sensitivity analyses that provide encouraging evidence that these data provide an
accurate picture of U.S. water quality. Here we report one additional test of the data quality. Standard
hydrology textbooks predict that dissolved oxygen deficits should increase with temperature, in summer
(when flows are lower and temperatures higher), and in morning. The time-of-day patterns of dissolved
oxygen are due to photosynthesis adding oxygen during the day and respiration removing oxygen at night.
Appendix Figure 3 plots regressions of dissolved oxygen deficits on binned indicators for each of these
physical factors, while including monitor fixed effects. The patterns closely follow standard chemistry
predictions. We interpret this as one additional piece of evidence that these data provide good quality
measures of water pollution.
B.4 Grants
The average government manages multiple plants, and the average plant receives multiple grants. Go-
vernments include cities, towns, sewage or water districts, and environmental agencies. After 1987, small
grants to a few areas, mainly islands and Washington DC, continued through the year 2000. Two-thirds
of U.S. wastewater treatment plants received at least one grant.
A local government could receive at least three grants for a single project. The first grant was for
creating a facility plan, the second was for detailed engineering plans, and the third was for construction.
Money was disbursed as it was spent and EPA reviewed projects after completion. The grants data used
for analysis exclude the very few grants in the raw data that list either the federal or total (federal+local)
cost as zero.
The microdata we obtain on grants are up to 50 years old, from an era when computers were rare.
The machine which originally housed these data was decommissioned around the year 2000, so we sought
to corroborate the accuracy of the data.
In order to test the accuracy of the microdata on 35,000 grants, we compare the grants against several
published reports describing this program. A USEPA (2000b) report and associated book (Stoddard,
Harcum, Simpson, Pagenkopf, and Bastian 2002) were based on detailed data describing these grants.
Andy Stoddard and Jon Harcum generously shared the microdata underlying these reports. The grants
data they analyze exactly mirror ours, with two exceptions. First, the aggregate nominal figure they report
for grants ($61 billion) reflects both federal and municipal capital spending, though their publications
and the associated EPA website describe it as federal money only. Second, they only obtained records of
10,000 grants. This appears to be because their data apply several exclusion criteria.
54
We also compared individual grants in the microdata we received against published reports we found
that list individual grants given in early years (USEPA 1974a,b). The grants in our microdata also
appear in these printed volumes, with the same plant and government authority listed. Grant dates are
similar in the microdata and 1970s reports, though some differ by several months. The dollar amounts of
individual grants have the same order of magnitude but the exact amounts differ. This may be because
funds requested, approved, and disbursed can differ, and can take over a decade to finalize.
One sensitivity analysis in our paper looks only at grants given for construction rather than engineering
plans. Following Stoddard, Harcum, Simpson, Pagenkopf, and Bastian (2002), we define a grant as for
construction if the grants microdata list the grant “Step” as equal to three or four, and if the grant also
lists the facility number of the plant receiving the grant.
Operating and Maintenance Costs. Clean Water Act grants involve three types of costs: federal
grants for capital, local matching expenditures for capital, and expenditures for operating and mainte-
nance (O&M). Our grants microdata report only the first two costs, so we estimate the third from other
sources.
National data are consistent with the idea that the ratio of lifetime O&M costs to upfront capital
costs increased almost linearly from 130 percent in 1972 to 259 percent in 1996. We linearly extrapolate
these values to years before and after 1972-1996. These values reflect several sources. Two independent
sources provide identical reports that concrete structures of treatment plants have a useful life of 50 years
but mechanical and electrical components have a useful life of 15-25 years (American Society of Civil
Engineers 2011 and USEPA 2002). We assume grants require O&M expenditures for 25.
We combine this 25 year statistic with the estimated ratio of O&M costs to capital stock in a typical
year. This ratio grew almost linearly from 3.7 percent in 1972 to 7.4 percent in 1996 (USEPA 2002).
These values reflect historic census records on O&M expenditures and perpetual inventory estimates of
capital stocks (U.S. Army Corps of Engineers 1994), both for sewerage infrastructure.
These values represent the most accurate estimates of O&M costs that we can discern. Nonetheless,
it is informative to compare these values against other estimates of these costs. One survey of 226 Clean
Water Act projects found a ratio of annual O&M costs to upfront capital costs of 3.76% to 3.96% (Lake,
Hanneman, and Oster 1979).53 The ratio was similar across different community sizes and government
types. These values are similar to the aforementioned numbers that we use, which report a ratio of 3.7%
in 1972. A second source reports the prediction that for a typical city of 25,000 people, the total cost
of building a treatment plant is about $4.6 million, and the expected real annual O&M cost is about
$200,000 (USEPA 1979). The ratio of annual O&M costs to upfront capital costs in this second source
is 4.3%, which is the value for the year 1976 implied by the data we use. A third source is an ex ante
engineering prediction that lifetime O&M costs are 93 percent of upfront capital costs (Hitchcock and
Giggey 1975).54 The reason why the engineering predictions in this third study are smaller than the ex
post realized costs we use is unclear, though engineering predictions have underestimated the costs of
energy and environmental investments in other settings also.
55
housing units. In years 1970 and 1980, these data cover non-condominium housing units only. The
housing data comes from the census “long form,” which is given to 1 in 6 households. The actual census
questionnaire has homeowners estimate the value of their property as falling into one of several bins.
We use a version of these data in which Geolytics has concorded all tract boundaries to the year 2010
boundaries. We use information on resident demographics, total population, physical features of housing
(e.g., the number of rooms), home values, and rents. Some regressions control for housing structure
characteristics. Because each observation is a census tract (which is subsequently aggregated to buffer
a given distance from a river), we measure these structural characteristics as the share of homes with a
given characteristic. The rental data for 1970 are “contract rents,” which report the amount paid from
renter to owner; the rental data for 1980-2000 are “gross rents,” which include the contract rent plus
utilities and fuels, if these are paid by the renter. As with home values, the actual census questionnaire
has renters enter their contract rent as falling into one of several bins.
The census home values data reflect self-reported home values, rather than actual transaction values.
The census data are also top-coded. Many studies find high correlation between self-reported home
values and sales price indices, either in the cross-section, time-series, or panel (James R. Follain and
Malpezzi 1981; Ihlanfeldt and Martinez-Vazquez 1986; John L. Goodman and Ittner 1992; DiPasquale
and Somerville 1995; Kiel and Zabel 1999; Banzhaf and Farooque 2013; Benı́tez-Silva, Eren, Heiland, and
Jiménez-Martı́n 2015), suggesting that self-reports are informative about market valuations. Banzhaf and
Farooque (2013) actually find that community-mean rental values from the U.S. census are more strongly
correlated with local amenities and with income than home sales value are. Because home values are
the dependent variable in hedonic regressions, using self-reported home values in the presence of classical
measurement error may decrease the precision of estimates but not create attenuation bias (Griliches and
Hausman 1986; Bound and Krueger 1991).
Such self-reported home values may also suffer from inertia. Owner-occupants who have not purchased
a home recently may be slow to update their beliefs about a home’s value (Kuzmenko and Timmins
2011; Henriques 2013). This inertia appears to be consistent with a simple Bayesian updating model,
specifically, a Kalman filter (Davis and Quintin 2016). But this means that homeowners may be slow
to reflect changes in local amenities due to investments in surface water quality. Longstanding rental
tenants often receive tenure discounts, though we are not aware of direct evidence on the speed with
which such discounts adjust to changes in amenities. As one way to address these concerns, in analyzing
home values and rents, we report specifications which allow homeowners and renters up to 10 years to
reflect changes in water quality.
We define city centers for all Standard Metropolitan Statistical Areas (SMSAs) as follows. The
definition of central business district locations used in most research comes from the 1982 Census of
Retail Trade. This definition has two downsides in our setting—it is potentially endogenous to the
Clean Water Act, since the definition was constructed ten years after the Act and since cleaning up
rivers might shift the location of businesses; and it includes a limited number of cities. Instead, for each
Standard Metropolitan Statistical Area (SMSA), we use the 1970 Population Census to construct an
original definition of the city center as the latitude and longitude of the census block centroid which has
the greatest population density. In cities with a central business district defined from the 1982 Census of
Retail Trade, this typically ends up defining close but not identical definitions of city centers. For census
tracts within an SMSA, we then define distance to the city center as distance to the city center of that
SMSA. For census tracts outside an SMSA, we define the distance to the closest city center overall.
56
B.6 Municipal Expenditure Data
We impose a few sample restrictions to ensure that we accurately measure the response of municipal
spending to federal grants. We restrict the sample to governments appearing in all years 1970-2001.55
This is important since the data report capital expenditures but not capital stocks, and missing some
years of municipal expenditures data could underestimate the response of municipal spending to federal
grants. We also restrict the sample to municipalities and townships, which we collectively refer to as
cities. This restriction excludes state governments, county governments, special districts, school districts,
and the federal government. Finally, we exclude cities which have other governments with similar names
in the same state, and cities that have sewer districts, counties, or other nearby related governments that
may receive or spend grants. We make these exclusions because they help accurately measure sewerage
capital and grant receipt. We identify such cities both using listings of sewer districts and local counties
in the survey and census of governments, and using grants which list the authority receiving the grant as a
sewer board, county agency, or other local government that is not a city. Many grants go to water boards,
sewer districts, county agencies, or other local governments which have separate financial management
from a city. Such grants would not appear in the city’s financial records, but the grants data do not
always distinguish which local government administered the grant. These restrictions leave a balanced
panel of 199 cities. As noted in the main text, because this sample is relatively small, we report one
specification using inverse propensity score reweighting to match the characteristics of a broader sample
of cities.56
57
not total water pollution emissions, the SWUM survey questions and resulting report both focus on
water pollution,58 and plants with extensive water use also emit large amounts of water pollution. For
example, the industries that consume the most water in the 1978 version of these data (Becker 2015) –
blast furnaces and steel mills, industrial organic chemicals, petroleum refining, and paper mills – are also
the industries that emit the most water pollution.
The EPA does not have plant-level records from the 1970s. The current PCS data do list the first
year a plant received a water pollution emissions permit. These data suffer from incomplete reporting,
since not all states and plants uploaded data to the EPA’s centralized database. They also suffer from
sample selection, since plants which closed may not appear in the data.
Some sensitivity analyses control for county-by-year-by-pollutant nonattainment designations. For
years after 1977, these data come from the EPA Green Book. Data for years 1972-1977 are constructed
from raw monitors based on the reported nonattainment rule. We define ozone nonattainment to include
all ozone or nitrogen oxides designations, and we define particulate matter nonattainment to include
Total Suspended Particulates (TSPs), particulates smaller than 10 micrometers (PM10 ), and particulates
smaller than 2.5 micrometers (PM2.5 ). Our binary measures of nonattainment include all partial, whole-
county, and other types of nonattainment.
58
the handwritten secondary or tertiary entries.
Row 4 of Appendix Table 7 describes grants to plants with baseline pollution above the median.
We measure baseline pollution as the mean pollution level for each watershed as measured in the years
1962-1971. Baseline pollution levels are calculated separately for dissolved oxygen and for the fishable
standard.
Row 5 of Appendix Table 7 considers states that have decentralized authority to implement the Clean
Water Act NPDES program.60
Row 6 of Appendix Table 7 considers counties that have above-median shares of people who report
outdoor fishing in the previous year. We obtain these data from the confidential version of the National
Survey on Recreation and the Environment (NSRE) years 1999-2009. Fishing is defined as coldwater or
warmwater fishing in rivers, lakes, or streams. Swimming includes swimming in streams, lakes, ponds, or
the ocean. (A separate question that we don’t use asks about swimming in pools.) Our sample includes
approximately 85,000 households. Earlier versions of the survey have been conducted intermittently since
1960; however, county and state participation shares are unavailable from earlier years. The NSRE is
a partnership between the USDA Forest Service Southern Research Station, The National Oceanic and
Atmospheric Administration, the University of Georgia, the University of Tennessee and other federal,
state or private sponsors. The survey is a randomized telephone survey of households across the U.S.
Unfortunately, state- or county-level rates of fishing participation for the entire U.S. are not available
from years before the Clean Water Act.
Row 7 of Appendix Table 7 uses data on environmental views from the “Total Green Index” of Hall
and Kerr (1991). States with Pro-Environmental Views are defined as those with above-median values
of the total green index.
Finally, row 8 of Appendix Table 7 uses data on city growth and amenities. To identify declining
urban areas, we follow Glaeser and Gyourko (2005) by taking 1970-2000 city population growth rate as
reported in the 1972 and 2000 city data books (Haines and ICPSR 2010). We define declining urban
areas as cities with population above 25,000 in the year 1970 which had a population decline by the year
2000. High amenity areas are defined as counties in an SMSA with above-median total amenity value, as
reported in Albouy (2016), Appendix Table A1.
Let Ωt denote the set of homes in tract t. Summing across homes within a tract and dividing by the
number of housing units within a tract gives
P P 0 P
i∈Ωt Pit i∈Ωt Xit it
= β + i∈Ωt
Nt Nt Nt
Letting P̃t denote the mean value of variable P of within tract t, we have
This illustrates how the regression with grouped data (e.g., census tract), weighted by the number of
housing units in the group, obtains the same parameter that house-level data would obtain.
60
These data are obtained from [Link] (accessed August 31, 2016).
59
C Spatial and Other Matching Across Datasets
Conducting the analysis of this paper requires linking several datasets. To link monitors and treatment
plants to rivers, we use the fact that rivers in NHD are internally defined as 70 million distinct longitude
and latitude points connected by straight lines. We refer to these points as stream nodes. We identify
the location where each treatment plant discharges waste using longitude and latitude values from the
1984-1996 CWNS. For each monitor and treatment plant, we then find the nearest stream node. Written
string descriptions included for some monitors suggest this link is very accurate. All treatment plants in
the analysis sample are within 0.6 miles of a stream node.61
To measure distances upstream and downstream along rivers, we use files in NHD which list, for each
stream node, the node(s) that are immediately upstream and/or downstream. We recursively construct
a network tree that defines, for each treatment plant, all stream nodes that are upstream or downstream.
We construct this algorithm to follow these flow relationships when one river flows into another, when
rivers cross watersheds, or when the flow network passes through lakes, estuaries, and other types of
water. Finally, we calculate distances between stream nodes and sum them to measure the distance along
a river between a treatment plant and pollution monitor.
We also link treatment plants to upstream and downstream census tracts. For each treatment plant,
we construct concentric buffers of 1 mile radius around river segments upstream and downstream of the
plant. We define one buffer to include all homes within 1 mile of those rivers, a second buffer to include
homes 1 to 2 miles from those rivers, etc. Many census tracts span multiple buffers. For each tract, we
calculate the share of the tract’s area which is in each buffer. For each tract, we measure population
and housing characteristics within a buffer by multiplying the total within the tract by the share of the
tract’s area within the buffer. For example, if a tract has an aggregate home value of $10 million and has
100 homes, and a fourth of the area of this tract falls within a given 1-mile buffer, then we assume the
section of the tract within the buffer has aggregate home value of $2.5 million and 25 homes.
Finally, we link each grant to the exact wastewater treatment plant receiving the grant. The Freedom
of Information Act (FOIA) data we received list an identifier code for the facility receiving the grant.
These same identifier codes appear in the Clean Watershed Needs Survey (CWNS), so let us precisely
identify the wastewater treatment plant receiving the grant. In some cases where the facility identifier
code is missing, CWNS itself lists the grant code which a plant used, and this grant code matches the
grant codes used in the FOIA data. These two links unique identify the facility receiving a grant for
about 76 percent of grants and 88 percent of grant dollars. This is the sample used in the main results.
We experimented with fuzzy matching between the FOIA and CWNS data to link the unmatched grants.
This matching used string fields such as county, city, zip code, and plant name. We do not focus on
results using this fuzzy matching since the plant name and zip code are often missing or incorrect in the
FOIA data and match quality is questionable.
61
We explored estimating the effect of grants on water pollution using on treatment plants within 0.3 miles of a stream node,
and obtained similar results.
60
D Cross-Sectional Water Pollution Around Wastewater
Treatment Plants
Methodology
We use the following equation to estimate how water pollution changes as a river flows past a wastewater
treatment plant:
Qpdy = βdd + µpy + pdy
Each observation in these data represents a plant-downstream-year tuple. Here Qpdy measures pollution
at plant p in year y and downstream location d. Location d = 1 includes areas downstream of the
treatment plant, and location d = 0 includes areas upstream of the treatment plant. The plant-by-year
fixed effects µpy imply that these comparisons are made within a river×year, so they measure how water
pollution changes as the river flows past the wastewater treatment plant. The coefficient β represents the
mean difference in pollution between downstream and upstream waters near a treatment plant.
We also report the following variant in which the upstream/downstream indicator d is broken into
several bins:
δ=20
X
Qpdy = βδ 1[dd = δ] + µpy + pdy (7)
δ=−25
The data are grouped into distance bins of five miles along the river (20 to 25 miles upstream of the
wastewater treatment plant, 15 to 20 miles upstream, ..., 20 to 25 miles downstream). We then regress
pollution Q on indicators 1[·] for the distance bin in which an observation is located. The bin just
upstream of the treatment plant, 0 to 5 miles upstream, is excluded as a reference category.
These comparisons are cross-sectional and do not analyze changes in a river over time. Because
wastewater treatment plants may locate near other pollution sources, such as urban runoff and industrial
plants, these regressions do not identify the effect of wastewater treatment plants on water pollution. Area
characteristics may also differ in the cross-section between upstream and downstream areas. Indeed, the
average upstream and downstream monitoring sites are 20 miles apart. Compared to upstream areas,
downstream areas have similar population density and share of families on welfare, though slightly lower
share of adults with a college degree and slightly greater share population black.62 These cross-sectional
differences are another reason that our research design exploits the timing of grants across treatment
plants. We view this analysis as a test of the accuracy of water pollution and plant data.
Results
As a river passes a wastewater treatment plant, data show large and statistically significant increases
in pollution (Appendix Table 5). Dissolved oxygen deficits rise by 1.2 percent saturation, which is an
increase of ten percent relative to the upstream pollution level. Fecal coliforms increase the most as a
river passes a treatment plant, by about 40 percent. Other pollutants increase by smaller amounts. The
probability that a river is not fishable rises by about 4 percentage points as a river passes a wastewater
treatment plant.
Graphs of these pollutants by distance from the plant, corresponding to equation (7), show these
patterns more flexibly (Appendix Figure 5). As a river approaches a treatment plant, dissolved oxygen
62
The census tracts of downstream monitoring sites have population density of 835 persons per square mile; upstream areas
have density of 862. Downstream areas have 4.88 percent of families on welfare, while upstream areas have 4.81 percent of
families on welfare; downstream areas have 9.2 percent of adults with a college degree while upstream areas have 9.9 percent of
adults with a college degree, and downstream areas have 8.5 percent of population black while upstream areas have 7.7 percent
of population black. These values use 1970 census data.
61
deficits do not change at all, and the probability that it is not fishable increases gradually. The modest
trend in fishability may occur because other pollution sources are located upstream of a treatment plant.
In the miles just downstream of the treatment plant, dissolved oxygen deficits jump by 1-2 percentage
points, and the probability the river violates fishing standards jumps by 3-4 percentage points. These
increases in pollution levels persist for at least 25 miles downstream of the treatment plant. Similar
patterns appear for other pollutants.63 Some of this persistence may reflect other pollution sources near
cities like industry or urban runoff, and later we directly estimate the length of distance downriver at
which grants affect pollution. These patterns suggest the data behave as one might expect since water
pollution concentrations rise as a river passes a pollution source.
E Sensitivity Analyses
E.1 Pollution Trends
This subsection reports sensitivity analyses for pollution trends using the linear trend specification of
equation (2), noting the caveats mentioned earlier.
Rows 2-6 of Appendix Table 3 consider important subsamples. Row 2 only uses long-term stations,
which begin operating by the year 1971 and report data through at least the year 1988. We choose
these years since the Clean Water Act was passed in 1972, and since 1987 was the last year with the full
grants program. Row 3 restricts the sample to the largely metro counties that had some home values
data in all four decennial censuses 1970-2000; as mentioned earlier, the 1970-80 censuses excluded many
non-metro areas. Rows 4-6 separately estimate results for the three pollution data repositories – NWIS,
Storet Legacy, and Modern Storet – since each has different coverage and affiliated organizations which
collect the data.
Rows 7-11 of Appendix Table 3 report sensitivity analyses prompted in part from discussing this
analysis with hydrologists. Row 7 limits the sample to include only stations which have at least 25
readings, since these may have higher-quality data. Row 8 controls for the level of instantaneous stream
flow, as measured at the same station and time as pollution, and so is limited to to “stream gauge”
observations recording both streamflow and pollution. Row 9 uses data from only the months of July and
August, since this is when streamflows are lowest, temperatures are greatest, and pollution concentrations
are highest. Row 10 takes readings which indicate that they are below a monitor’s detection limit
(“BDL”), which are 5 percent of all readings, and replaces them with half the recorded value. (The main
analysis uses the reported value for these BDL readings.) This is a standard sensitivity analyses for such
readings in water quality research. Row 11 specifies the pollutants with a skewed distribution (BOD,
fecal coliforms, and TSS) in logs rather than levels.
Rows 12-13 of Appendix Table 3 report alternative water pollution indices. Row 12 reports results
where each observation describes mean values for a river-year. In this specification, a “river” is defined
as a unique combination of a watershed and river code.64 Row 13 defines the dependent variable as an
indicator for whether more than 50 percent of pollution readings in the river-year are below the fishable
or swimmable standard.
Rows 14-16 of Appendix Table 3 report results separately for three small and well-documented net-
works of high-quality monitoring sites, all managed by USGS. Row 14 shows estimates for the National
63
Most pollutants, including BOD, fecal coliforms, and dissolved oxygen deficits, increase fairly abruptly in the 5 miles just
before and after a treatment plant. TSS, which is the only exception, increases somewhat steadily in the 25 miles downstream of
a treatment plant. These TSS patterns are consistent with the idea that urban runoff accounts for a large share of TSS emissions
(Gianessi and Peskin 1981), and that urban runoff occurs in cities around treatment plants.
64
A river here is defined as a “levelpathi” from NHD.
62
Stream Quality Accounting Network (NASQAN). Row 15 shows estimates for the National Water-Quality
Assessment (NAWQA) (Smith, Alexander, and Wolman 1987; Alexander, Slack, Ludtke, Fitzgerald, and
Schertz 1998; Rosen and Lapham 2008).65 Row 16 shows estimates for the Hydrologic Benchmark Net-
work (HBN), which includes 37 watersheds expected to have “minimal” effects from human activity
(Alexander, Slack, Ludtke, Fitzgerald, and Schertz 1998). HBN shows smaller trends than the main sam-
ple for BOD, fecal coliforms, and TSS, which is consistent with anthropogenic causes of these pollutants
in the national data.
Rows 17-25 of Appendix Table 3 report nine other important sensitivity analyses. Row 17 allows
arbitrary autocorrelation within both watersheds and years. Row 18 limits the sample to lakes. An
important paper by Smith and Wolloh (2012) finds that dissolved oxygen in lakes has not changed since the
Clean Water Act, and Row 18 corroborates that finding. But the lake point estimate for dissolved oxygen
deficits is negative, all other pollutants in lakes show downward trends, and nearly all of the roughly 20
other sensitivity analyses in Appendix Table 3 also show statistically significant downward trends. These
results suggest that broader trends in water pollution differ from patterns evident in dissolved oxygen in
lakes. Row 19 adds controls for temperature and precipitation. These are relevant since climate change
is increasing air temperatures, but hotter temperatures can increase dissolved oxygen deficits. In row
20 each observation is the mean value in the county-year, and regressions are generalized least squares
weighted by the population in the county-year. This may better reflect the trends experienced by the
average person. Row 21 interacts the time-of-day and day-of-year controls with hydrologic region fixed
effects, to capture the idea that seasonality and time patterns may differ across space. Rows 22-25 report
estimates separately for each of the four census regions; all pollutants are declining in all regions, though
declines were more rapid in the Northeast.
63
miles upstream had received, which hardly changes estimates. This control is designed to address the
possible concern that facilities may be located near each other in rivers, and nearby plants may receive
grants at similar times.66 Row 19 includes controls for the number of grant projects of three different
magnitudes (roughly terciles of the size distribution). The smallest grant projects have no clear effects on
pollution, moderate-size projects lead to statistically insignificant decreases in pollution, and the largest
projects produce the clearest decreases in pollution. Row 20 replaces the cumulative number of grants
with a measure of the log of the cumulative real grant dollars provided, and indicates that a one percent
increases in grant size increases the probability that downstream rivers are fishable by about 1 percent.
As discussed earlier, one potential reason these estimates are less precise than estimates for the number
of grants is that the distribution of grants is highly skewed, but also contains many zeros.
Rows 21-27 of Appendix Table 6 present several other important sensitivity analyses. Row 21 shows
a differences-in-differences specification using data only from downstream waters. This specification
includes plant fixed effects and water basin-by-year fixed effects, and reports the coefficient on a variable
measuring the cumulative number of grants a plant has received. This finds slightly larger and more
precise estimates than the main results do, though most of the main estimates and these estimates
have overlapping confidence intervals. Row 22 allows arbitrary autocorrelation of confidence regions
within year and within watershed. Row 23 includes monitoring sites on other rivers than the river
where the wastewater treatment plant is directly located. Row 24 excludes small wastewater treatment
plants that never received a grant. Row 25 shows unweighted OLS estimates. Finally, row 26 adds
several potentially important control variables, each interacted with a downstream indicator: whether
the county of the wastewater treatment plant was in nonattainment under the Clean Air Act, separately
for each air pollutant; the total population in the county-year of the wastewater treatment plant; and two
indicators for the number of polluting industrial plants in the county-year of the wastewater treatment
plant, extracted from the databases SWUM and PCS as described earlier.67
Finally, we estimate the effect of these grants on other pollutants (Appendix Table 4, column 2).
These results provide additional evidence in support of the identifying assumptions. We find no effect of
a grant on any of the industrial pollutants (lead, mercury, or phenols), and perverse signs for two of the
three. It is not impossible for a grant to affect these industrial pollutants, since some industrial waste can
flow through treatment plants, but the lack of substantive effects on any of these three and incorrect signs
are consistent with the idea that these grants are not correlated with unobserved variables like industrial
activity or industrial water pollution regulations. We also detect no effects of grants on most measures
of nutrients or more general water quality measures such as chlorides, stream flow, or temperature.
64
like noise or odor. Row 3 allows two-way clustering of standard errors by watershed and also by year.
The richest specifications of Table 5 include baseline controls interacted with year fixed effects; row
4 removes these baseline controls. Row 5 reports a differences-in-differences-in-differences regression
comparing upstream versus downstream home values.68 Row 6 reports unweighted OLS estimates. Row
7 replaces downstream-by-basin-by-year fixed effects with downstream-by-year fixed effects and basin-by-
year trends. Row 8 reports estimates only for grants given in the year 1972. If communities in later years
knew in advance a plant would receive a grant, then estimates for later years could be confounded by
homeowner expectations. Row 9 reports the change in housing values around 1987 for plants that never
received a grant. If homeowners had accurate expectations about future grants, these plants may have
experienced a decrease in home values once the grants largely ceased. Row 10 restricts the sample to the
100 largest metropolitan areas as of 1970. Row 11 allows grants to affect outcomes after 10 years, which
may be important if local public goods are only gradually incorporated into self-reported housing values.
Appendix Table 8 suggests little evidence that grants changed the composition of local residents
(columns 7-12). All these point estimates are small, and most are statistically indistinguishable from
zero. More broadly, these estimates do not change our qualitative conclusions about how grants affect
housing values (columns 1-6), though point estimates do vary. There is modest evidence that home values
(though not rents) increase within 0.25 or 1.0 miles of affected waters, though point estimates within 25
miles are uniformly small and indistinguishable from zero. The unweighted estimates for housing (though
not rents) are more positive, which may suggest larger effects for less densely populated areas. Below we
explore heterogeneity directly, and find some evidence of larger effects for (generally rural) areas where
outdoor fishing or swimming is common.
65
explains, states were supposed to discuss priority lists in public hearings, which could provide public
knowledge about plants that might soon receive grants. The extent of such public knowledge is unclear,
however, and both priority lists and the national budget of the grants program changed substantially
between years. Available quantitative evidence does not show clear support for this idea. Homeowner
expectations formed in the year(s) before a grant would create a positive pretrend in home values, but
Figure 4 shows a flat pre-trend in the ten years before a grant. If expectations played a large role overall,
then grants given in the first year of the program (1972) might have larger effects since these were largely
unexpected. Row 8 of Appendix Table 8 estimates only the effect of grants given in the year 1972, and
finds similar effects to the overall estimates of Row 1. Finally, we test for a change in home values in
1987, the year the grants largely concluded, for plants that failed to receive a grant. The point estimates
for this are negative but not statistically distinguishable from either zero or the main estimates (Row 9).
66
Appendix Figure 1. Densities of Raw Pollution Readings
.025
.3
.02
.2
.015
Density
Density
.01
.1
.005
0
0
0 10 20 30 40 -200 -100 0 100
mg/L mg/L
.015
.003
.01
Density
Density
.002
.005
.001
0
.2
.15
Density
Density
.1
.1
.05
0
0
-10 -5 0 5 10 -10 -5 0 5 10
ln(MPN/100mL) ln(mg/L)
67
Appendix Figure 2a. Pollution by River Basin, 1992-2001
Panel A. Biochemical Oxygen Demand
Panel D. TSS
Notes: Data are mean by river basin. Darker colors represent worse pollution, white areas have no data.
Categories are as follows. BOD: <2, 2 to 4, 4 to 6, >6. Dissolved oxygen deficit: <6, 6 to 12, 12 to 18,
>18. Fecal coliforms: <250, 250 to 750, 750 to 1750, >1750. TSS: <25, 25 to 50, 50 to 75, >75.
68
Appendix Figure 2b. Fecal Coliforms, by River Basin and Decade, 1962-2001
Panel A. 1962-1971
Panel B. 1972-1981
Panel C. 1982-1991
Panel D. 1992-2001
Notes: Data are mean by river basin. Darker colors represent worse pollution. Color cutoffs are same in
each year. White areas have no data. Colors are as follows, with values in logs: No data (white), <250
(light blue), 250 to 750 (moderate blue), 750 to 1750 (darker blue), >1750 (darkest blue).
69
Appendix Figure 3. Patterns in Dissolved Oxygen Deficits
.5
2
mg/L
mg/L
0
1
-.5
0
1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
Month Hour
Notes: Figures show coefficients from a regression of dissolved oxygen deficit on monitoring station fixed
effects and on dummy variables for the indicated controls. Data use only dissolved oxygen measured in
mg/L. Dissolved oxygen deficit measured as 15 minus the reported level of dissolved oxygen in mg/L.
Data cover years 1962-2001.
70
Appendix Figure 4. Water Pollution Trends, Other Pollution Measures, 1962-2001
5000
6
4000
5
3000
MPN/100mL
mg/L
4
2000
3
1000
2
0
1962 1972 1982 1992 2001 1962 1972 1982 1992 2001
Year Year
100
.7
80
.6
Share Not Swimmable
mg/L
60
.5
40
.4
20
.3
1962 1972 1982 1992 2001 1962 1972 1982 1992 2001
Year Year
Notes: These graphs show year fixed effects plus the constant from regressions which control for station
fixed effects, year fixed effects, day-of-year cubic polynomial, and hour-of-day cubic polynomial,
corresponding to equation (2) in the text. Connected dots show yearly values, dashed lines show 95%
confidence interval, and 1962 is reference category. Standard errors are clustered by watershed.
71
Appendix Figure 5. Water Pollution, by Distance Downstream from Treatment Plant, Other Pollution
Measures
.06
3
.04
2
Saturation Deficit (%)
0
0
-.02
-1
-.04
-25 to -20
-20 to -15
-15 to -10
-25 to -20
-20 to -15
-15 to -10
-10 to -5
10 to 15
15 to 20
20 to 25
-10 to -5
10 to 15
15 to 20
20 to 25
-5 to 0
0 to 5
5 to 10
-5 to 0
0 to 5
5 to 10
Miles Downstream from Treatment Plant Miles Downstream from Treatment Plant
1000
.5
MPN/100mL
500
mg/L
0
0
-500
-.5
-25 to -20
-20 to -15
-15 to -10
-10 to -5
-5 to 0
0 to 5
5 to 10
10 to 15
15 to 20
20 to 25
-5 to 0
-25 to -20
-20 to -15
-15 to -10
-10 to -5
0 to 5
5 to 10
10 to 15
15 to 20
20 to 25
5
.02
Share
mg/L
0
0 -.02
-5
-.04
-10
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
Miles Downstream from Treatment Plant Miles Downstream from Treatment Plant
Notes: Graphs show coefficients on distance-from-plant indicators from regressions which also control for
plant-by-year fixed effects, corresponding to equation (3) from the main text. Connected dots show
yearly values, dashed lines show 95% confidence interval. Data cover years 1962-2001. Standard errors
are clustered by watershed.
72
Appendix Figure 6. Effects of Clean Water Act Grants on Water Pollution, Event Study,
Other Pollution Measures
1000
2
0
MPN/100mL
1
mg/L
-1000
0
-2000
-1
0 to 2
3 to 5
6 to 8
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
>=30
<=-10
-9 to -7
-6 to -4
-3 to -1
<=-10
0 to 2
3 to 5
6 to 8
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
>=30
-9 to -7
-6 to -4
-3 to -1
20
.02
10
Share
mg/L
0
0
-.02
-10
-.04
-20
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
<=-10
0 to 2
3 to 5
6 to 8
9 to 11
>=30
-9 to -7
-6 to -4
-3 to -1
<=-10
9 to 11
12 to 14
15 to 17
18 to 20
21 to 23
24 to 26
27 to 29
>=30
0 to 2
3 to 5
6 to 8
-9 to -7
-6 to -4
-3 to -1
Years Since Treatment Plant Received a Grant Years Since Treatment Plant Received a Grant
Notes: Graphs show coefficients on downstream times year-since-grant indicators from regressions which
correspond to the specification of Table 3. These regressions are described in equation (5) from the main
text. Connected dots show yearly values, dashed lines show 95% confidence interval. Data cover years
1962-2001. Standard errors are clustered by watershed.
73
Appendix Figure 7. Effects of Clean Water Act Grants on Water Pollution
by Distance Downstream from Treatment Plant
.04
2
.02
1
Saturation Deficit (%)
Share
0
0 -.02
-1
-.04
-2
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
Miles Downstream from Treatment Plant Miles Downstream from Treatment Plant
1500
1
1000
.5
MPN/100mL
mg/L
500
0
0
-500
-.5
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
Miles Downstream from Treatment Plant Miles Downstream from Treatment Plant
6
.02
4
.01
2
Share
mg/L
0
0
-.01
-2
-.02
-4
-25 to -20
-20 to -15
-15 to -10
5 to 10
10 to 15
15 to 20
20 to 25
-10 to -5
0 to 5
-5 to 0
10 to 15
15 to 20
20 to 25
-25 to -20
-20 to -15
-15 to -10
5 to 10
-10 to -5
-5 to 0
0 to 5
Miles Downstream from Treatment Plant Miles Downstream from Treatment Plant
Notes: Graphs show distance-from-plant times cumulative grant indicators from regressions which also
control for plant-by-distance, plant-by-year, and distance-by-water basin-by-year fixed effects. These
regressions correspond to equation (6) in the text. Connected dots show yearly values, dashed lines show
95% confidence interval. Data cover years 1962-2001. Standard errors are clustered by watershed.
74
Appendix Figure 8. The Hedonic Model
75
Appendix Figure 9. Effects of Clean Water Act Grants on Log Mean Home Values
10 to 15
15 to 20
20 to 25
0 to 5
5 to 10
10
Notes: Panel A includes homes within 1 mile of river. Data cover decennial census years 1970-2000.
76
Appendix Figure 10. Ratio of Change in Housing Values to Grant Costs, by County
Notes: We divide treatment plants into ten deciles based on the population in the year 2000 which is
within a 25 mile radius in any direction of the river segments that are up to 25 miles downstream of the
plant and on the same river as the plant. For each decile, we calculate the total value of owned homes
and rentals satisfying the same criteria (within a 25 mile radius, etc.). To estimate the change in
housing values, we apply the regression estimates from column (4) of Table 6, and assume improvements
last 30 years. For each decile, we measure costs using the grants data. For each decile, we divide the
total change in housing values by total costs. Finally, we calculate the unweighted average of this ratio
across all plants in a county, and the map plots the result. Counties in white have no treatment plants
or missing data. The share of total real grant project costs going to each decile, ordered from lowest-
population to greatest-population, is as follows: 0.02, 0.02, 0.03, 0.04, 0.05, 0.06, 0.10, 0.13, 0.19, 0.37.
The share of total population in each decile is as follows: 0.00, 0.01, 0.01, 0.02, 0.03, 0.04, 0.06, 0.10,
0.18, 0.54.
77
Appendix Table 1. Water Pollution Descriptive Statistics
Biochemical Total
Oxygen Dissolved Fecal Suspended
Pooled Demand Oxygen Deficit Coliforms Solids
(1) (2) (3) (4) (5)
Mean --- 3.19 19.28 1,688.54 51.32
--- 4.55 28.23 7,279.42 126.87
5th Percentile --- 0.50 -15.90 1.00 1.00
95th Percentile --- 10.00 79.60 6,000.00 211.00
Number of Distinct . . .
Observations 10,581,253 1,258,312 5,582,228 2,032,112 1,708,601
Monitoring Sites 168,200 51,777 145,614 77,269 67,473
River Segments 96,775 35,434 87,222 49,928 44,581
Rivers 46,406 16,932 41,864 24,960 22,163
Mean Years per Monitoring Site 11 11 11 11 10
Mean Readings per Monitoring Site 63 24 38 26 25
78
Appendix Table 2. Descriptive Statistics for Treatment Plants and Grants
River &
Which Plants With Pollution Monitor on
All Data Same River
(1) (2) (3)
Number of Plants 20,956 11,240 6,824
Number of Grants 24,271 14,378 9,483
79
Appendix Table 3. Water Pollution Trends, Sensitivity Analysis
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
1. Main Estimates -0.244*** -0.005*** -0.064*** -85.261*** -0.004*** -0.830***
(0.030) (0.000) (0.005) (8.850) (0.000) (0.101)
Important Subsamples
2. Long-Term Stations -0.246*** -0.005*** -0.054*** -100.509*** -0.006*** -0.903***
(≤1971 to ≥1988) (0.035) (0.000) (0.008) (14.930) (0.000) (0.156)
10. Readings Below Limit ("BDL") -0.244*** -0.005*** -0.068*** -85.244*** -0.005*** -0.833***
Equal Half Listed Value (0.030) (0.000) (0.005) (8.843) (0.000) (0.101)
11. Logs, Not Levels --- --- -0.014*** -0.031*** --- -0.016***
--- --- (0.002) (0.002) --- (0.001)
Other Fishable and Swimmable Definitions
12. River-Year Means --- -0.003*** --- --- -0.003*** ---
--- (0.000) --- --- (0.000) ---
80
Appendix Table 3. Water Pollution Trends, Sensitivity Analysis (Continued)
Standards Constituent Pollutants
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
Well-Documented USGS Networks
14. NASQAN Network -0.242*** -0.004*** -0.035*** -55.170*** -0.007*** -1.558***
(0.027) (0.000) (0.014) (8.267) (0.001) (0.426)
21. Flexible Seasonality and Time -0.246*** -0.001*** -0.065*** -85.139*** -0.002*** -0.842***
(0.030) (0.000) (0.005) (8.603) (0.000) (0.102)
22. Census Region: Northeast -0.473*** -0.006*** -0.067*** -77.046** -0.007*** -0.719***
(0.125) (0.001) (0.012) (33.889) (0.001) (0.176)
23. Census Region: Midwest -0.275*** -0.004*** -0.059*** -82.040*** -0.004*** -0.634***
(0.035) (0.000) (0.008) (14.610) (0.000) (0.201)
24. Census Region: South -0.191*** -0.005*** -0.063*** -94.000*** -0.005*** -0.796***
(0.048) (0.000) (0.007) (13.118) (0.001) (0.078)
25. Census Region: West -0.237*** -0.004*** -0.095*** -58.125*** -0.004*** -1.599***
(0.056) (0.000) (0.022) (11.715) (0.001) (0.278)
Notes: Standard errors are clustered by watershed. Regressions include Station FE and Season, Hour Controls except where
otherwise noted. See text for details. Data cover years 1962-2001. Asterisks denote p-value < 0.10 (*), < 0.05 (**), or <
0.01 (***).
81
Appendix Table 4. Results for Other Measures of Water Pollution
Downstream *
Trend Cumulative # of Grants
(1) (2)
Industrial Pollutants
1. Lead (μg/L) -0.114*** -0.386
(0.004) (0.711)
Dependent Variable Mean 2.348 27.095
N 522,826 19,290
Nutrients
4. Ammonia (mg/L) -0.039*** -0.030**
(0.002) (0.013)
Dependent Variable Mean -2.408 0.456
N 1,667,380 34,354
82
Appendix Table 4. Results for Other Measures of Water Pollution (Continued)
Downstream *
Trend Cumulative # of Grants
(1) (2)
General Water Quality Measures
10. Dissolved Chlorides (mg/L) 0.001 -8.181
(0.001) (7.764)
Dependent Variable Mean 3.037 106.949
N 1,006,375 16,054
83
Appendix Table 5. Water Pollution Upstream Versus Downstream of Treatment Plants
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Not Oxygen Fecal Not Suspended
Deficit Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
Downstream 1.202*** 0.041*** 0.623*** 914.524*** 0.051*** 5.416***
(0.373) (0.004) (0.093) (224.581) (0.005) (1.146)
Notes: Each observation in a regression is a plant-downstream-year tuple. Data cover years 1962-
2001. Dissolved oxygen deficit equals 100 minus dissolved oxygen saturation, measured in
percentage points. Dependent variable mean is for upstream pollution. Standard errors are
clustered by watershed. Asterisks denote p-value < 0.10 (*), < 0.05 (**), or < 0.01 (***).
84
Appendix Table 6. Sensitivity Analysis: Effects of Clean Water Act Grants on Water Pollution
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
1. Main Estimates -0.768*** -0.007*** -0.114*** -196.183** -0.004** -0.404
(0.197) (0.003) (0.042) (95.860) (0.002) (0.591)
Important Subsamples
2. Long-Term Stations -1.276*** -0.030*** -0.211* -326.610 -0.008** -2.194
(≤1971 to ≥1988) (0.334) (0.010) (0.125) (387.461) (0.003) (1.959)
11. Readings Below Limit -0.766*** -0.007*** -0.113*** -203.251** -0.004** -0.411
Equal Half Listed Value (0.198) (0.003) (0.041) (96.067) (0.002) (0.591)
12. Logs, Not Levels --- --- -0.012* -0.013 --- -0.003
--- --- (0.006) (0.024) --- (0.009)
Other Fishable and Swimmable Definitions
13. 50% Fishable-Swimmable --- -0.023*** --- --- 0.003 ---
Definition --- (0.005) --- --- (0.006) ---
85
Appendix Table 6: Sensitivity Analysis: Effects of Clean Water Act Grants on Water Pollution (Continued)
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
Other Distances Upstream and Downstream of Treatment Plant
14. Separate by Downstream Dist.
0 to 25 Miles Downstream -0.655*** -0.009*** -0.094** -240.926*** -0.002 -1.063**
(0.188) (0.002) (0.039) (75.691) (0.002) (0.489)
25 to 50 Miles Downstream -0.096 0.007 -0.070* 143.268 0.004 -0.018
(0.303) (0.005) (0.037) (106.276) (0.004) (1.126)
50 to 75 Miles Downstream 0.027 0.002 -0.039 142.204 0.001 -0.207
(0.251) (0.004) (0.051) (101.642) (0.003) (0.905)
75 to 100 Miles Downstream 0.806 0.000 -0.196 -105.046 0.006 -0.197
(0.647) (0.005) (0.128) (124.609) (0.008) (1.060)
15. Plants with Monitors > 10 Mi. -0.845*** -0.008*** -0.130*** -222.592** -0.005** -0.125
Upstream and Downstream (0.204) (0.003) (0.039) (109.611) (0.002) (0.521)
Other Specifications for Measuring Grants
16. Grants for Construction -1.206*** -0.010*** -0.132** -93.551 -0.006** -0.268
(0.219) (0.004) (0.052) (128.303) (0.003) (0.728)
18. Control for Cumulative -0.857*** -0.008*** -0.109** -276.040*** -0.003 -0.608
Upstream Grants (0.207) (0.003) (0.050) (84.427) (0.003) (0.623)
20. Log Cumulative Real Grant -0.984*** -0.010*** -0.065 -255.290 -0.008* -1.127
Dollars ($Bn) (0.302) (0.004) (0.079) (170.719) (0.005) (1.136)
(Continued next page)
86
Appendix Table 6: Sensitivity Analysis: Effects of Clean Water Act Grants on Water Pollution (Continued)
Main Pollution Measures Other Pollution Measures
Dissolved Biochemical Total
Oxygen Oxygen Fecal Not Suspended
Deficit Not Fishable Demand Coliforms Swimmable Solids
(1) (2) (3) (4) (5) (6)
Other Important Sensitivity Analyses
21. Differences-in-Differences -0.635*** -0.009*** -0.093 -284.175*** -0.003* -0.874**
Downstream Areas Only (0.158) (0.002) (0.058) (71.707) (0.002) (0.401)
24. Exclude Plants with No Grants -0.673*** -0.007** -0.080 -246.457* -0.004 -0.160
(0.244) (0.003) (0.057) (142.479) (0.003) (0.730)
26. Control for Downstream *. . . -0.814*** -0.009*** -0.110*** -219.371*** -0.007*** -0.369
Nonattainment, Industrial (0.180) (0.003) (0.036) (70.975) (0.002) (0.607)
Sources, Population
Notes: "Long Term Stations" includes only stations which begin operating by 1971 and continue through at least 1988.
"Control for Stream Gauge Flow" includes only stations which report instantaneous stream flow at the same time they report
pollution, and it controls for streamflow. "Include Monitors on Other Rivers" includes monitors on rivers different than the
treatment plant, but that eventually flow into or out of the treatment plant's river. Decennial sample includes averaged
periods 1965-1974, 1975-1984, 1985-1994, 1995-2004. "Cumulative Grants" rows use trends sample. Data cover years 1962-
2001. Standard errors are clustered by watershed. Asterisks denote p-value < 0.10 (*), < 0.05 (**), or < 0.01 (***).
87
Appendix Table 7. Heterogeneity of Clean Water Act Grants on Water Pollution and Home Values
Regressions Fitted Values
Change in
Dissolved Log Mean Cost Per Unit Housing
Oxygen Not Home Log Mean Dissolved Cost Per River- Values /
Dependent Variable Deficit Fishable Values Rents Oxygen Mile Fishable Costs
(1) (2) (3) (4) (5) (6) (7)
1. Cumulative Grants -0.770*** -0.024*** 0.00025 -0.00011 0.53 1.70 0.25
(0.197) (0.005) (0.00033) (0.00016) [0.35 , 1.08] [1.21 , 2.87] (0.34)
88
Appendix Table 8. Sensitivity Analysis, Home Values
Mean Families on College Black (%) Population Population
Log Home Values Log Rents Family Public Graduates Under Age Age 65 or
0.25 Mi. 1 Mi. 25 Mi. 0.25 Mi. 1 Mi. 25 Mi. Income Assistance (%) 6 (%) Older (%)
(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12)
1. Main Estimates 0.0009 0.0027** 0.0002 -0.0008 0.0000 -0.0001 -0.0002 0.0000 0.0000 -0.0001 0.0000 0.0000
(0.0014) (0.0013) (0.0003) (0.0008) (0.0007) (0.0002) (0.0003) (0.0000) (0.0001) (0.0001) (0.0000) (0.0000)
2. Exclude 1-Mile Radius --- 0.0025* 0.0002 --- 0.0000 -0.0001 -0.0002 0.0000 0.0000 -0.0001 0.0000 0.0000
Around Treatment Plant --- (0.0013) (0.0003) --- (0.0007) (0.0002) (0.0003) (0.0000) (0.0001) (0.0001) (0.0000) (0.0000)
3. Cluster by Watershed 0.0009 0.0027 0.0002 -0.0008 0.0000 -0.0001 -0.0002 0.0000 0.0000 -0.0001 0.0000 0.0000
and Year (0.0035) (0.0026) (0.0006) (0.0013) (0.0011) (0.0002) (0.0004) (0.0001) (0.0002) (0.0002) (0.0000) (0.0000)
4. No baseline controls -0.0001 0.0018 0.0000 0.0000 0.0004 -0.0003 -0.0013** 0.0001 -0.0002 0.0003 0.0000 0.0000
(0.0025) (0.0023) (0.0006) (0.0017) (0.0013) (0.0003) (0.0005) (0.0001) (0.0002) (0.0002) (0.0000) (0.0001)
5. Triple-Difference 0.0058* 0.0046 0.0010 0.0026* 0.0018 0.0000 -0.0007 0.0000 0.0000 0.0003 0.0000 0.0000
Regression (0.0033) (0.0032) (0.0012) (0.0015) (0.0015) (0.0006) (0.0006) (0.0001) (0.0003) (0.0005) (0.0000) (0.0001)
6. OLS 0.0053*** 0.0045*** 0.0011 0.0016 0.0011 -0.0002 0.0000 -0.0001 0.0001 -0.0002 0.0000 0.0001
(0.0017) (0.0016) (0.0008) (0.0012) (0.0012) (0.0007) (0.0004) (0.0001) (0.0002) (0.0002) (0.0000) (0.0001)
7. Downstream-by-year FE 0.0010 0.0031** -0.0002 -0.0005 0.0005 -0.0002 -0.0004 0.0000 0.0000 0.0000 0.0000 0.0000
and Basin-by-year trends (0.0017) (0.0015) (0.0003) (0.0009) (0.0008) (0.0002) (0.0002) (0.0000) (0.0001) (0.0001) (0.0000) (0.0000)
8. Grants Given in 1972 -0.0106 -0.0026 0.0018 0.0036 0.0042 0.0009 0.0012 -0.0008*** 0.0005 -0.0001 -0.0001 0.0001
(0.0083) (0.0079) (0.0026) (0.0052) (0.0051) (0.0014) (0.0017) (0.0002) (0.0006) (0.0006) (0.0001) (0.0002)
9. Plants Without Grants, -0.0042 -0.0041 -0.0009 0.0011 -0.0004 0.0000 0.0005 -0.0001 -0.0001 -0.0001 0.0000 0.0001
1987 Effect (0.0044) (0.0044) (0.0012) (0.0026) (0.0022) (0.0006) (0.0009) (0.0001) (0.0003) (0.0004) (0.0000) (0.0001)
10. Large Cities 0.0008 0.0024 0.0003 -0.0006 0.0005 0.0000 0.0001 0.0000 0.0001 -0.0002* -0.0000* 0.0000
(0.0016) (0.0016) (0.0003) (0.0007) (0.0007) (0.0001) (0.0002) (0.0000) (0.0001) (0.0001) (0.0000) (0.0000)
11. Effect 10+ Years 0.0018 0.0016 0.0001 0.0002 0.0002 0.0001 0.0001 0.0000 0.0001** 0.0000 0.0000 0.0000
After a Grant (0.0018) (0.0015) (0.0003) (0.0006) (0.0005) (0.0001) (0.0002) (0.0000) (0.0000) (0.0001) (0.0000) (0.0000)
Notes: Unless otherwise noted, all regressions include homes within 25 miles of the river of interest. Regression specification corresponds to column (6) of Table 5. Regressions
weighted by denominator of response variable Row 5 includes treatment plants in the half of counties with above-median shares of people who fish in lakes and rivers, as measured
in 1990 surveys. Row 6 includes treatment plants in the half of counties with above-median shares of people who swim in lakes and rivers, as measured in 1990 surveys. Data
includes decennial census years 1970-2000. Standard errors clustered by watershed. Asterisks denote p-value < 0.10 (*), < 0.05 (**), or < 0.01 (***).
89