www.elsevier.com/locate/rse
Remote Sensing of Environm
Vegetation height estimation from Shuttle Radar Topography Mission and
National Elevation Datasets
Josef Kellndorfera,*, Wayne Walkera, Leland Piercea, Craig Dobsona, Jo Ann Fitesb,
Carolyn Hunsakerc, John Vonad, Michael Cluttere
aRadiation Laboratory, EECS Department, The University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109, United StatesbUSDA Forest Service, Adaptive Management Services, Tahoe National Forest, Nevada City, CA, United States
cUSDA Forest Service, Forestry Sciences Laboratory, Pacific South West Research Station, Fresno, CA, United StatesdPlum Creek Timber Company, Watkinsville, GA, United States
eWarnell School of Forest Resources, The University of Georgia, Athens, GA, United States
Received 16 April 2004; received in revised form 26 July 2004; accepted 27 July 2004
Abstract
A study was conducted to determine the feasibility of obtaining estimates of vegetation canopy height from digital elevation data collected
during the 2000 Shuttle Radar Topography Mission (SRTM). The SRTM sensor mapped 80% of the Earth’s land mass with a C-band
Interferometric Synthetic Aperture Radar (InSAR) instrument, producing the most complete digital surface map of Earth. Due to the
relatively short wavelength (5.6 cm) of the SRTM instrument, the majority of incoming electromagnetic energy is reflected by scatterers
located within the vegetation canopy at heights well above the bbald-EarthQ surface. Interferometric SAR theory provides a basis for properly
identifying and accounting for the dependence of this scattering phase center height on both instrument and target characteristics, including
relative and absolute vertical error and vegetation structural attributes.
An investigation to quantify the magnitude of the vertical error component was conducted using SRTM data from two vegetation-free
areas in Iowa and North Dakota, revealing absolute errors of �4.0 and �1.1 m, respectively. It was also shown that the relative vertical error
due to phase noise can be reduced significantly through sample averaging. The relative error range for the Iowa site was reduced from 13 to 4
m and for the North Dakota site from 7 to 3 m after averaging of 50 samples. Following error reduction, it was demonstrated that the SRTM
elevation data can be successfully correlated via linear regression models with ground-measured canopy heights acquired during the general
mission timeframe from test sites located in Georgia and California. Prior to outlier removal and phase noise reduction, initial adjusted r2
values for the Georgia and California sites were 0.15 and 0.20, respectively. Following outlier analysis and averaging of at least 20 SRTM
pixels per observation, adjusted r2 values for the Georgia and California sites improved to 0.79 (rmse=1.1 m) and 0.75 (rmse=4.5 m),
respectively. An independent validation of a novel bin-based modeling strategy designed for reducing phase noise in sample plot data
confirmed both the robustness of the California model (adjusted r2=0.74) as well as the capacity of the binning strategy to produce stable
models suitable for inversion (validated rmse=4.1 m). The results suggest that a minimum mapping unit of approximately 1.8 ha is
appropriate for SRTM-based vegetation canopy height mapping.
D 2004 Elsevier Inc. All rights reserved.
Keywords: SRTM; InSAR; NED; Vegetation canopy height; Biomass; Carbon; Noise reduction
0034-4257/$ - see front matter D 2004 Elsevier Inc. All rights reserved.
doi:10.1016/j.rse.2004.07.017
* Corresponding author. Tel.: +1 734 763 9442; fax: +1 734 647 2106.
E-mail address: [email protected] (J. Kellndorfer).
1. Introduction
Accurate estimates of aboveground terrestrial biomass
and carbon stocks are dependent on the availability of
biophysical measures that capture both the horizontal and
vertical structural character of vegetation. In general,
ent 93 (2004) 339–358
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358340
optical remote sensing systems are well suited to the
acquisition of structural information in the horizontal
dimension (e.g., canopy cover, community type, etc.)
given their long-established sensitivity to variations in
pigment composition and surface biochemistry. However,
in the vertical dimension, passive optical systems such as
Landsat and SPOT suffer from an inability to penetrate
through layers of vegetation, making difficult the accurate
retrieval of vertical structural metrics such as canopy
height, particularly as canopy density increases. Only in
the last decade have significant advances in the develop-
ment of active sensor technologies made it possible to
obtain consistent and reliable estimates of vertical
structural metrics. These advances include sensors such
as Light Detection and Ranging (LIDAR) (Dubayah &
Drake, 2003; Lefsky et al., 1999) and Synthetic Aperture
Radar (SAR) (Bergen & Dobson, 1999; Dobson, 2000;
Dobson et al., 1995; Kasischke et al., 1997; Kellndorfer
& Ulaby, 2003; Kellndorfer et al., 1998, 2003; Ulaby et
al., 1995). Another technology for estimating forest
vertical structure based on an airborne microwave
scatterometer has been proposed by Hyyppa and Halli-
kainen (1996) and Martinez et al. (2000). While both
airborne LIDAR and microwave scatterometers can
achieve high accuracy and high spatial resolution esti-
mates of vegetation height, neither technology is currently
capable of providing regional-scale to global-scale data-
sets. On the other hand, to date, several spaceborne SAR
Fig. 1. Conceptual representation of a forest stand indicating the relative position
SRTM resolution cell (~30 m).
missions have generated an abundance of global-scale
data, which have proven useful in the estimation of forest
biophysical parameters.
In particular, Interferometric Synthetic Aperture Radar
(InSAR) has proven to be an invaluable tool in the
determination of vegetation canopy height and a number
of studies have had success retrieving canopy height from
InSAR measurements (Brown, 2003; Hagberg et al., 1995;
Kobayashi et al., 2000; Papathanassiou & Cloude, 2001;
Rosen et al., 2000; Sarabandi & Lin, 2000; Treuhaft &
Siqueira, 2000). Papathanassoiu and Cloude (2001) used a
repeat-pass (temporal baselinec10 min) polarimetric (using
both horizontal- and vertical-polarized fields) airborne
instrument at L-band (23 cm wavelength) to acquire
estimates of canopy height. Their results showed a standard
deviation between estimated and observed heights of
approximately 2.5 m. In another work, Hagberg et al.
(1995) used the spaceborne ERS-1 C-band (5.6 cm wave-
length with vertical polarization) platform as part of a
repeat-pass (temporal baselinecseveral weeks) processing
scheme over a boreal forest at Hfkmark in northern Sweden.
Due to the influence of temporal decorrelation, their results
were less accurate with rms errors on the order of 5 m. In a
third study conducted by Treuhaft at al. (1995), the
Topographic Synthetics Aperture Radar (TOPSAR) operat-
ing at C-band (vertical polarization) was flown over
multiple boreal forest stands within the Bonanza Creek
Experimental Forest, Alaska. The estimation error reported
s of mean canopy height and scattering phase center height within a single
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 341
in this case varied between 3 and 6 m, depending on the
forest stand and environmental conditions.
In February 2000, an unprecedented near-global eleva-
tion dataset based on single-pass InSAR technology was
acquired as part of the National Aeronautics and Space
Administration (NASA) Jet Propulsion Laboratory (JPL)
Shuttle Radar Topography Mission (SRTM), resulting in the
most complete digital topographic database of the Earth. In
accordance with a memorandum of understanding between
NASA and the primary mission sponsor, the National
Geospatial Intelligence Agency (NGA; formerly NIMA), a
raster digital elevation model (DEM) dataset was released at
a resolution of 1-arc sec (c30 m) within the United States
and 3 arc sec (c90 m) elsewhere. Given the relatively short
operating wavelength (C-band, 5.6 cm) of the SRTM sensor,
the interferometric height response over vegetated terrain is
expected to reflect the interaction of the InSAR signal with
various scatterers associated with leaves, branches, and
stems. As a result, the height surface [i.e., the scattering
phase center height (hspc)], retrieved via InSAR processing,
will be higher than the underlying bbald-EarthQ surface (Fig.1). Preliminary work by Brown (2003) investigated the use
of SRTM data for estimating the height of vegetation
canopies. Based on both simulated and real data from red
pine stands in Michigan, canopy height estimates were
achieved with an rms error of approximately 4 m.
The objective of this study is to determine the feasibility
of deriving vegetation canopy height from the SRTM digital
elevation data in conjunction with the National Elevation
Dataset (NED), which provides a reference surface for bald-
Earth topography. This effort focuses on two pilot study
areas: one in southeastern Georgia near Jesup and a second
in the northern Sierra Nevada of California near Quincy.
The Jesup site lies in the center of a heavily managed
landscape characterized by large homogeneous forest stands
and moderately undulating topography. Conversely, the
Fig. 2. SRTM global coverage map showing the number of data takes acquired ov
and North Dakota test sites are shown (modified from http://www2.jpl.nasa.gov/s
Sierra Nevada site represents less intensely managed, more
heterogeneous forests and highly variable terrain. Thus, the
biogeophysical characteristics of the two test sites provide a
unique opportunity to evaluate the SRTM data as a source
for canopy height estimates across a range of vegetation
densities and structural classes as well as a variety of
topographic conditions.
2. Shuttle radar topography data
2.1. Mission and instrument characteristics
The SRTM was flown on board the Space Shuttle
Endeavor during mission STS-99, which was in orbit from
February 11 to February 22 of 2000. SRTM was the first
spaceborne fixed-baseline InSAR and marked the largest
rigid structure to be deployed in space with a payload
weight of 13,660 kg. During the mission, a total of 12.3
Tbyte of data were collected. The orbit inclination was 578,which allowed for a targeted coverage of 80% of the total
Earth landmass lying between the latitudes of 608 north and
568 south (Fig. 2). Of the total targeted area, 99.97% was
mapped with at least one data take (i.e., one overpass),
corresponding to 119.51 million km2. Additionally, 94.59%
(113.10 million km2) of the targeted area was covered at
least twice, 49.25% (58.59 million km2) at least three times,
and 24.10% (28.81 million km2) at least four times. Six
locations, totaling 50,000 km2, were not imaged during the
mission, all of which are located in the conterminous United
States (USGS, 2003).
SRTM was flown at an altitude of 233 km where two
InSAR instruments were operated. These included the
United States C-band (5.6 cm, 5.3 GHz) sensor and the
German X-band (3.1 cm, 9.6 GHz) sensor. To enable the
collection of single-pass fixed-baseline interferometric data,
er land and water. Approximate locations of the California, Iowa, Georgia,
rtm/coverage.html).
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358342
a 60-m-long boom was extended from the shuttle cargo bay
with C-band- and X-band-receiving antennas attached to its
end. Dual-purpose transmit/receive antennas were operated
in the cargo bay. The 60-m fixed-baseline configuration of
the InSAR system, in conjunction with a ScanSAR mapping
mode, resulted in a swath width of 225 km for C-band. The
ScanSAR mode was comprised of four subswaths, which
alternated between horizontal and vertical polarizations. The
look angle ranged between 308 and 608 (Hensley et al.,
2000).
2.2. SRTM data characteristics
According to the mission objectives, SRTM data were
expected to have an absolute horizontal circular accuracy of
less than 20 m. Absolute and relative vertical accuracy was
anticipated to be less than 16 and 10 m, respectively.
Performance evaluations by NIMA, the USGS, and the
SRTM project team have shown the absolute vertical error
to be much smaller, with the most reliable estimates being
approximately 5 m (Curkendall et al., 2003; Rosen et al.,
2001a,b; Smith & Sandwell, 2003; Sun et al., 2003). Given
the single-pass interferometric configuration of the SRTM
sensor, the dataset was not subject to the temporal
decorrelation errors that are common in repeat-pass systems.
Following the mission, the SRTM dataset was interfero-
metrically processed by the JPL and made available to the
public. The processor included averaging where multiple
data takes were acquired and filtering of the interferogram to
reduce noise using either a boxcar lowpass filter or power
spectral filtering (Hensley et al., 2000). Within the Unites
States, data were released at a spacing of 1 arc sec (c30 m).
For all other regions, data are being released at a spacing of
3 arc sec (c90 m). The projection parameters are set to
geographic coordinates (unprojected) and the data are
horizontally referenced to the North American Datum
1983 (NAD83). The vertical reference datum is the North
American Vertical Datum 1988 (NAVD88). SRTM data for
the United States are accessible via the National Map
Seamless Data Distribution System provided by the United
States Geological Survey (http://seamless.usgs.gov).
3. NED
The NED is a compilation of various elevation data
sources including 7.5 min, 15 min, 2 arc sec, and 3 arc sec
DEMs dating back as far as 1978. In 1999, and for the
first time, the NED was assembled completely for the
continental United States from 7.5-min DEM source data
(10 and 30 m resolution) (Gesch et al., 2002). Develop-
ment of the NED required the merging of 57,000 different
DEM data files—54,000 within the conterminous United
States alone. The NED is a seamless dataset where
procedures were developed to maintain the database with
periodic updates to insure the integration of higher-
resolution elevation data as these data become available.
The NED is released in geographic coordinates at a
resolution of 1 arc sec. The horizontal and vertical data
are NAD83 and NAVD88, respectively. Thus, the NED is
available in the same resolution and projection parameters
as the SRTM dataset. Given the production history of the
NED, the accuracy varies spatially with the quality of the
individual source data. The USGS is assessing the quality
of the NED by comparing it to the High-Accuracy
Reference Network (HARN). Releases of the NED
currently contain accuracy statistics where available (Gesch
et al., 2002).
4. Study area datasets
4.1. Test site Jesup, Georgia
During a NASA-EOCAP SAR program, an intensive
survey of slash pine (Pinus elliotii) plantations owned by
the Plum Creek Timber was conducted for a test site near
Jesup (31875VN, 82800VW), southeast Georgia (Kellndorfer
et al., 2003). The field campaign was conducted during the
early months of 2000, which coincided well with the
timeframe of the SRTM mission. A total of 22 stands were
biometrically surveyed using a cluster plot design (Vona,
2001). Fifteen of the 22 stands were unthinned and a 0.02-
acre (0.008 ha) cluster-plot survey was performed. The
remaining seven stands were thinned and the cluster plot
size was 0.05 acre (0.02 ha). Each cluster contained a center
plot and four diagonally aligned subplots at a distance of
one chain (c20 m) from the center. Throughout each stand,
cluster plots were sampled on a regular grid of 3�6 chains
(c60�120 m). The tally consisted of recording all stems
greater than 1�1 in. (2.54 cm) diameter at breast height
(DBH) classes. Additionally, a minimum of 8 dominant and
16 codominant trees in each stand were selected at random,
and height measurements were made using a handheld
clinometer.
Published biometric models were used to calculate
several biophysical parameters including basal area, dom-
inant height, stem density, and volume from the plot survey
data. According to the standards for forest measurement in
the United States, the following equations reflect units of the
English system. Using the observed dominant height
measurements (hd), the heights of all remaining slash pine
stems were predicted (hp) after Pienaar et al. (1993) with:
hp ¼ 1:12hd
�1� 1:257e
�2:058�
dDq
��ð1Þ
where hd=height of observed dominants [ft]; d=DBH of
remaining stems [in.]; Dq=quadratic mean diameter of
remaining stems [in.].
For further analysis, the stand biometric parameters were
translated from English to SI units. The mean canopy height
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 343
(hcan) of each stand was computed by averaging the heights
(ht) for all trees (NT) present in all plots:
h¯ can ¼1
NT
XNT
t¼1
ht ð2Þ
where hcan=mean canopy height of stand; NT=number of
observed (hd) and predicted (hp) stems; h t=height of
observed (hd) or predicted (hp) stems.
As a measure of vertical structural variability, the
standard deviation (rcan) of canopy height was computed
as well.
Because the Jesup survey was part of an experiment
designed around acquisition of data by the JPL Airborne
Synthetic Aperture Radar (AirSAR), all stands were located
Fig. 3. (a) NED and (b) SRTM digital elevation models of the Jesup, Georgia test s
Bright areas in both images represent areas of higher relative elevation. Image sc
within a 7-km-wide swath, which spanned the test site from
the southwest to the northeast. The original stand bounda-
ries were supplied by Plum Creek in the form of a vector
polygon layer, which was derived from uncorrected aerial
photography and, hence, many polygons contained minor
location errors. Although the stand boundaries were
accurate in terms of their general shape, their location and
extent were not completely reliable. In some cases,
polygons were moved to the correct location, which was
identified using the georeferenced AirSAR data.
Fig. 3a shows the NED DEM image of the Jesup test site
as well as the locations of the surveyed stands. The test site
is moderately hilly with elevations ranging from 9 to 46 m,
and a drainage network formed by several creeks and
streams is apparent. The test site lies in an intensively
ite. Boundary locations of 22 surveyed stands are shown as white polygons.
ale is approximately 33�24 km (790 km2).
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358344
managed timber-producing region and the majority of the
area consists of slash pine plantations. Lowland vegetation
is also common and forms narrow bands of hardwoods that
buffer the drainage network. Fig. 3b shows the SRTM DEM
image with the stand locations superimposed. The SRTM
image demonstrates the sensitivity of the sensor to topo-
graphic features like drainage networks and undulating
terrain that are also seen in the NED image. Additionally,
the SRTM image reveals the marked sensitivity of the
sensor to features relating to the spatial distribution of
Fig. 4. (a) SRTM minus NED difference image of the Jesup, Georgia, test site. Th
surface that reflects the height of the scattering phase center (hspc). (b) Transects
(upper curve), NED (center curve), and SRTM–NED (lower curve) difference (h
vegetation. For example, taller features like forest stands
appear brighter (higher elevation) relative to adjacent
features like clearcuts and fields, which are shorter and
appear darker (lower elevation). The sensitivity of the
SRTM sensor to features which extend vertically above the
bald-Earth surface is even more clearly observed when the
NED DEM elevations are subtracted from the SRTM DEM
(Fig. 4a). In this difference image, the gray-value range
corresponds to increasing feature height from dark to light.
Discussion of Fig. 4b is deferred to Section 5.3.
e differencing procedure removes the underlying topography, resulting in a
A–C in (a) were used to extract elevation profiles (A–C) from the SRTM
spc) images. The reference (flat) line represents zero elevation.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 345
4.2. Test site Sierra Nevada, California
As part of NASA-funded remote sensing research
focused on the structure of forests in the Sierra Nevada
mountain range, a 10,000-ha test site was established 40 km
southwest of Quincy (39857VN, 120856VW) on the Plumas
National Forest, northern California. The test site is
dominated by mixed coniferous forests characterized by
varying amounts of Douglas-fir (Pseudotsuga menziesii),
red fir (Abies magnifica), white fir (Abies concolor), sugar
pine (Pinus lambertiana), ponderosa pine (Pinus ponder-
osa), and incense cedar (Calocedrus decurrens), with
elevations ranging from 1200 to 1850 m.
Fig. 5. (a) NED and (b) SRTM digital elevation models of the Sierra Nevada, Califo
shown as white circles. Bright areas in both images represent areas of higher rela
During 2000 and 2001, an intensive survey was
conducted to characterize the three-dimensional structure
of forest stands within the test site. A grid of 1.0-ha
circular (56.4 m radius) sample plots representing approx-
imately 3% of the test site was generated and super-
imposed on layers of transportation and hydrology. Roads
and rivers were each buffered by 10 m; plots intersecting
the buffered network were systematically shifted to the
west by 56.4 m. If the western shift was unsuitable, plots
were shifted east, then north, and, finally, south in
sequence. The final coordinates of each plot were then
calculated and used to identify plot locations in the field
via global positioning system (GPS).
rnia, test site. The locations of 227 56.4-m radius (1.0 ha) surveyed plots are
tive elevation. Image scale is approximately 24�18 km (430 km2).
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358346
Each sample plot consisted of two concentric annuli with
radii of 15.0 m (0.07 ha) and 56.4 m (1.0 ha). In the 15-m
inner annulus, all live trees z10 cm in DBH were targeted
for structural measurements. Structural variables measured
and recorded for live trees included: (1) species, (2) DBH,
(3) crown diameter, (4) crown form, (5) height to bottom of
partial crown, (6) partial crown wedge angle, (7) height to
bottom of full crown, (8) height to top of live crown, and (9)
total tree height. In the outer annulus (15–56.4 m), structural
measurements focused on large (late successional/old
growth) live trees (z76 cm DBH). Measurements of live
Fig. 6. SRTM minus NED difference image of the Sierra Nevada, California, test si
in a surface that reflects the height of the scattering phase center (hspc). Transects
curve), NED (center curve), and SRTM–NED (lower curve) difference (hspc) ima
trees included: (1) species, (2) DBH, and (3) and total tree
height. All heights were measured to the nearest 0.01 m with
an Impulse 200 LR Laser Rangefinder.
In total, 227 sample plots were established and measured
during the northern Sierra Nevada field campaign. For each
plot, the mean canopy height (hcan) was computed according
to Eq. (2), where NT is the number of measured stems in
each plot and stem height (ht) corresponds to the measured
height of each stem.
Fig. 5a shows the locations of all sample plots super-
imposed on the NED DEM of the test site. In general, the
te. The differencing procedure removes the underlying topography, resulting
A–C were used to extract elevation profiles (A–C) from the SRTM (upper
ges. The reference (flat) line represents zero elevation.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 347
site is characterized by rolling to steeply sloping terrain. A
number of drainage networks are visible in the NED DEM
including the prominent middle fork of the Feather River in
the southeast corner. Bucks Lake, shown in black, is located
along the west-central edge of the image. The SRTM DEM
of the test site is shown in Fig. 5b. The marked sensitivity to
terrain features that was clearly visible in the Jesup SRTM
image is similarly demonstrated here. However, unlike the
Jesup SRTM image, the Sierra Nevada SRTM image
appears to be nearly identical to its NED complement, and
seemingly devoid of all vegetation and anthropogenic
features. Nevertheless, much of the northern Sierra Nevada
study site is, in fact, heavily forested. Whereas in the Jesup
SRTM image, individual forest stands and other features
stand out as brighter compared to their surroundings, in the
corresponding Sierra Nevada image, brightness differences
reflecting finer-scale vegetation and anthropogenic features
are almost completely masked by the larger-scale brightness
differences that result from the pronounced variation in
topographic elevation. However, with the influence of
topography being effectively removed in the SRTM–NED
difference image (Fig. 6a), tracts of taller trees can be
identified as they appear brighter and are interspersed with
occasional clearcuts, shrub fields, and otherwise unforested
land, which appears darker. Discussion of Fig. 6b is deferred
to Section 5.3.
5. Determination of the scattering phase center heights
5.1. Theory
Given the nature of the interferometric height response,
the SRTM dataset represents the scattering phase center
height (hspc), and only reflects bald-Earth elevations in
vegetation- and structure-free areas. The scattering phase
center height is dependent on target and sensor character-
istics and can be described according to:
hspc ¼ ftarget vs; vm; sr; smÞofsensor k; b; p; h; gp���
ð3Þ
where vs=vegetation structure; vm=vegetation moisture;
sr=soil roughness; sm=soil moisture; o=concatenation oper-
ator; k=wavelength; b=baseline length and attitude;
p=polarization; h=incidence angle; gp=phase noise.
For the SRTM C-band sensor configuration, wavelength
(k) and baseline length/attitude (b) are constant. Therefore,
the dependency of hspc on these parameters is unvarying
across the SRTM dataset. On the other hand, polarization
( p) and incidence angle (h) are variable within each
ScanSAR swath. Simulations conducted by Sarabandi and
Lin (2000) suggest that the C-band interferometric height
response for coniferous vegetation is lower for hh polar-
ization than for vv polarization by approximately 1–2 m. For
deciduous vegetation, the simulation results show no
significant difference in the hh and vv polarization height
response. Simulations of the incidence angle dependency in
the 30–608 range corresponding to the SRTM ScanSAR
swath indicate variations in the scattering phase center
height on the order of 1–2 m for coniferous vegetation
(based on an average stand height of 9 m) and no significant
difference for deciduous vegetation (based on an average
stand height of 17 m). While both simulations were based
on stands with closed canopies, it is understood that in open
canopies at steep incidence angles, the SAR signal will
penetrate deeper, thus lowering the center of the interfero-
metric height response. Nevertheless, in order to reduce the
dependency of the height response on p and h, integration(i.e., averaging of multiple data takes), ranging from 2 to as
many as 10 or more, was performed during the NASA-JPL
product generation phase (USGS, 2003). Since incidence
angle information from the individual data takes cannot be
retrieved from the integrated dataset, the influence of
incidence angle on the interferometric height response
cannot be determined. Thus, a p- and h-dependent residualerror in the estimation of hspc is to be assumed.
The influence of soil roughness and moisture on hspc is
dependent on k, h, and canopy density. Canopy density
directly affects the extent to which an electromagnetic wave
can penetrate through the canopy to the ground. For
moderate to dense canopies, the C-band signal return at
the incidence angle range of SRTM can be largely attributed
to volume scattering from within the upper canopy rather
than from the underlying surfaces (Sarabandi & Lin, 2000).
For low-density canopies, the C-band signal has a greater
likelihood of penetrating to the ground. Ignoring signal
dependencies on soil moisture and roughness for the time
being, as the amount of signal penetrating to the ground
increases, backscatter from the ground surface will increase.
As a result of the higher ground surface response, hspc will
be observed to decrease (i.e., the center of phase scattering
will be closer to the ground). Superimposed on this
observation, varying soil moisture and roughness character-
istics can influence hspc as well. Simulations by Sarabandi
and Lin (2000) suggest that the soil-related influence on the
height response for the SRTM incidence angle range and
wavelength is less than 0.5 m. Since soil roughness and
moisture measurements were not available for the Jesup and
Sierra Nevada test sites, and accounting for these parameters
would prove rather difficult in the context of the SRTM
processing scheme, an additional yet minor residual error
component remains in the estimation of hspc due to these
influences.
A more pronounced source of measurement uncertainty
as compared to the error sources described above relates to
phase noise. For single-pass interferometers like SRTM,
phase noise is mainly caused by thermal and quantization
noise of the radar receivers (Bamler, 1999). The magnitude
of the phase noise error is largely dependent on the number
of radar looks used in signal averaging. Hence, the error is
less where multiple data takes were acquired and multiple
samples are averaged. For Gaussian phase noise, the
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358348
uncertainty decreases with the square root of the number of
looks (e.g., averaging four looks would result in an error
reduction of 50%). For the SRTM mission, a phase noise
error of less than 10 m was targeted (Hensley et al., 2000).
To summarize the potential error sources described thus
far, a total relative error, which is decoupled from the
influence of vegetation on the estimation of hspc can be
defined as:
er ¼ esoil þ ep;h þ epn ð4Þ
where er=total relative error; esoil=soil error due to soil
roughness and moisture uncertainty; ep,h=error due to
polarization and incidence angle uncertainty; epn=error dueto phase noise uncertainty.
Separate from the existence of a relative error, an
absolute error in SRTM-derived elevation is also observed
and is related to (1) error in the attitude (roll) of the
interferometric baseline, and (2) error in the measurement
of the baseline length. Both of these errors result in large-
scale deviations in SRTM elevation from that of the true
surface, but generally, this deviation can be corrected
using ground control points (GCPs) (Bamler, 1999).
Depending on the source of the GCPs (e.g., the NED),
the reference elevation data may have inherent errors as
well. Thus, in order to compare SRTM elevations to
reference elevations, we define a spatial variable that
represents a vertical offset between the SRTM and a given
reference dataset according to:
dv x; yð Þ ¼ hSRTM�b x; yð Þ � href x; yð Þ ð5Þ
where dv=vertical offset; hSRTM-b=noise-free SRTM bald-
Earth elevation; href = reference bald-Earth elevation;
x,y=geographic location.
Given that the absolute error associated with the SRTM
and reference DEMs varies at a relatively broad spatial scale
(thousands of kilometers), dv is expected to result in a
constant (i.e., trendless) offset for relatively small regions
and a spatially variable offset or trend for larger regions.
Following the relative and absolute error formulations
provided in Eqs. (4) and (5), respectively, Eq. (3) can now
be reformulated as:
hspc ¼ fveg vs; vmÞ þ dv þ erð ð6Þ
It follows from Eq. (6) that in order to accurately model
the functional relationship between vegetation character-
istics (vs and vm) and hspc, it is desirable to identify and
remove dv and minimize, to the extent possible, the error
component er, which is primarily dependent on the
reduction of epn.
5.2. Investigation of SRTM phase noise error and vertical
offset
In an attempt to quantify the potential range of dv and epnvalues to be expected within the SRTM coverage of the
conterminous United States, an analysis was carried out
using SRTM and NED elevation data from two large (c1.5
km2) agricultural fields, one in Iowa (IA) and one in North
Dakota (ND) (Fig. 2). Agricultural fields were selected
because they tend to be flat, and during the February 2000
timeframe of the SRTM mission, they would have been
devoid of vegetation. The selected fields were equal in size,
covering an area of approximately 1600 SRTM pixels.
Based on inspection of the SRTM coverage map (Fig. 2),
Field IA (41855VN, 94827VW) was located in a region
where only one SRTM data take was obtained. Conversely,
Field ND (48851VN, 101800VW) was selected from a region
where at least four data takes were acquired. The averaging
of multiple data takes, ranging from 2 to as many as 10 or
more, was performed during the NASA-JPL product
generation phase (USGS, 2003). Given the timeframe of
the SRTM mission and the northerly latitude of Iowa and
North Dakota, both fields are assumed to have been snow-
covered when the SRTM data were acquired. At northerly
latitudes, snow is assumed to be essentially dry in February
and, hence, the presence of snow was not expected to affect
the estimation of dv (Rignot et al., 2001). Given that several
more data takes (N4) were available for Field ND compared
with Field IA (Fig. 1), less phase noise was expected in the
SRTM pixel values for Field ND.
The first step in the analysis was the elimination of
existing topographic variation within the fields by subtract-
ing the NED DEM from the SRTM DEM. Under the
assumption that noise-free SRTM and NED data should
provide for consistent bald-Earth elevation estimates, it
follows that: (1) the residuals of the SRTM–NED difference
image can be attributed to epn, and (2) the mean of the
residuals, should it differ from zero, is an estimate of the
vertical offset (dv) between the SRTM and NED datasets.
From the frequency distribution and summary statistics for
the two difference images, a number of inferences can be
made (Fig. 7, Table 1). First, the phase noise within both
fields is observed to be Gaussian in nature. Second, the
mean value calculated from the difference images (IA=�4.0
m, ND=�1.1 m) is other than zero, reflecting a vertical
offset between the SRTM and NED datasets, which differs
between sites by approximately 2.9 m. Third, the narrower
noise range associated with Field ND (7 m) compared to
that of Field IA (13 m) confirms the hypothesis that greater
noise reduction occurs in locations (i.e., North Dakota)
where multiple data takes have been averaged. In order to
quantify the relationship between sample (i.e., pixel)
averaging and subsequent noise reduction, samples from
each of the difference images were block-averaged with a
window size ranging from 3�3 to 25�25 (i.e., 9–625
samples per block). Fig. 8 shows the results for each field.
In both cases, the results are shown after the removal (via
subtraction) of the respective dv offset. As expected, the
noise reduction is reflected in a decrease in the noise range
with increasing window size. This reduction is more
pronounced up to ca. 7�7 samples averaged, with a sharper
Fig. 7. Histogram of sample values extracted from Fields IA (Iowa) and ND (North Dakota) illustrating the presence of Gaussian phase noise (epn).
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 349
decrease observed in Field IA. At 7�7 samples (~50 pixels)
averaged, the range for Field IA was reduced from 13 m to
approximately 4 m, and the range for Field ND was reduced
from 7 m to approximately 3 m. Beyond the 7�7 threshold,
a more gradual decrease is observed in both fields and the
maximum, minimum, and mean values begin to converge.
Table 1
Summary statistics for phase noise (epn) from Fields IA (Iowa) and ND
(North Dakota)
Field IA Field ND
41.918N, 94.458W 48.858N, 101.008W
SRTM Min 330.0 449.0
Mean 337.5 454.1
Max 346.0 457.0
Range 16.0 8.0
NED Min 337.7 451.4
Mean 341.5 455.1
Max 345.8 456.7
Range 8.1 5.3
SRTM–NED Min �10.1 �4.1
Mean �4.0 �1.1
Max 2.6 2.7
Range 12.7 6.8
At a threshold of 25�25 (=625) samples, the noise range is
reduced to less than 1 m.
Compared to the relatively flat, nonvegetated regions
studied here, the phase noise inherent to SRTM data from
forested terrain should be far less pronounced due to the
higher signal-to-noise ratio associated with InSAR back-
scatter from vegetation canopies. Hence, the noise curves
shown in Fig. 8 should represent worst-case scenarios with
respect to the relative error to be expected in the SRTM data,
particularly for Field IA where only one SRTM data take
was acquired.
5.3. Test site error reduction and vertical offset identification
An effort was made to apply the knowledge of SRTM
phase noise and vertical offsets gained above to the analysis
of the Jesup and Sierra Nevada test sites. The extent to
which these error sources were present in the datasets was
explored graphically via the assessment of elevation profiles
extracted from the SRTM, NED, and SRTM–NED differ-
ence images. A series of elevation profiles (A–C) was
extracted from the Jesup test site along selected transects
(Fig. 4b). In general, Profile A, representing a transect of
Fig. 8. Comparison of phase noise (epn) statistics (minimum, mean, and maximum) for Fields IA (Iowa) and ND (North Dakota) following a block averaging
procedure in which block size was increased linearly from 3�3 (nine samples) to 25�25 (625 samples). All values were adjusted by the mean absolute error
determined from each field.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358350
approximately 15 km, demonstrates the close overall
correspondence between the SRTM and NED elevation
datasets. Further inspection of the profile reveals the
pronounced relative difference in elevation between the
two DEMs, which is best reflected in the SRTM–NED
difference (hspc) curve shown at the bottom of Profile A.
This difference in elevation reflects the presence of, and
variability associated with, vegetation and anthropogenic
features along the transect. Profiles B and C are approx-
imately 2–3 km in length and correspond to transects across
riparian (B) and upland (C) areas (Fig. 4b). In profile B, the
SRTM–NED difference curve reveals quite clearly the
presence of peaks associated with bands of riparian forest.
Similarly, profile C shows the sharp boundaries between
alternating plantations and clearcuts.
Elevation profiles (A–C) extracted from the Sierra
Nevada test site illustrate the greater topographic variability
and elevation range that distinguishes this site from Jesup
(Fig. 6b). Whereas Sierra Nevada Profile (SP) A exhibits an
elevation range of approximately 400 m, Jesup Profile (JP)
A has a range that is closer to 25 m (Fig. 4b). In general, SP
A also suggests the more continuous nature of the forest and
the presence of taller trees. Sierra Nevada Profiles B and C
are approximately 2–3 km in length and are intended to
complement the corresponding Jesup profiles. Profile B
reflects a transect crossing three more or less parallel
riparian zones (Fig. 6b). Again, the SRTM–NED difference
curve (bottom) captures the presence of taller vegetation in
these moister, more hospitable areas. The transect associated
with Profile C crosses a region of alternating plantations and
clearcuts. The SRTM–NED difference curve (bottom) is
consistent with that of JP C, revealing the sharp boundaries
between these features.
In general, the elevation profiles presented in Figs. 4b
and 6b illustrate the marked sensitivity of the SRTM sensor
to vegetation canopy height. At the same time, however, the
profiles also reveal the presence of phase noise error. As a
result, and in order to improve subsequent scattering phase
center height calculations, efforts were undertaken to reduce
the observed noise component. In both test sites, this was
accomplished using a sample (i.e., pixel) averaging
approach.
For the Jesup site, sample averaging was carried out
within the boundaries of each of the 22 stands described in
Section 4.1. The field data (i.e., cluster plots) were
aggregated within these boundaries as well. The interior of
stand boundaries was buffered by 30 m (i.e., one SRTM
pixel) to minimize edge effects resulting from geocoding
errors. Following this procedure, the number of samples per
stand ranged from 2 to 574.
For the Sierra Nevada site, sample averaging was
accomplished by extracting all pixels within a radius of 45
m around the center point of each plot. This zone of
inclusion was selected as a basis for averaging all pixels
having a majority of their area within the boundary of the
56.4-m (1.0 ha) sample plot annulus. Given the relatively
low pixel-to-plot area ratio, the extraction procedure
resulted in the averaging of just 6–10 pixels per plot. Based
on the results presented in Fig. 8, this sample size was
deemed much too small to effectively reduce the phase
noise component. Operating under the assumption that
larger sample plots (e.g., coincident with stand-level
boundaries) would increase the pixel-to-plot area ratio, a
novel strategy was devised based on a linear binning or
partitioning of the observed canopy height variable. The
goal of this strategy was to effectively simulate stand-level
statistical averaging using plot level data. To implement the
approach, the original 227-plot dataset was divided among
20 bins, with the number of plots per bin ranging from 3 to
29. With the exception of the lowermost and two uppermost
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 351
bins, the bin width was set to 1.5 m. Bin width was
determined based on a desire to obtain a minimum of 20
pixels per bin. Under this binning strategy, the number of
pixels per bsimulated standQ or bin ranged from 24 to 253—
a sample size much more suitable for reducing epn.Within each bin, the observed mean canopy height (hcan)
was computed according to Eq. (2), with NT representing
the sum of all stems in all plots within the respective bin,
and ht corresponding to the observed heights of these stems.
In general, the binning strategy can be equated to simulating
mean observed canopy and associated scattering phase
center heights within homogeneous (in terms of observed
height), contiguous stands, with the caveat that individual
sample plots are not co-located within contiguous stands but
are distributed across a larger forested region (see plot
locations in Fig. 5).
To determine if a vertical offset (dv) existed between the
SRTM and NED datasets for the test sites, regions
representing barren, nonvegetated areas were identified
from the National Land Cover Dataset (NLCD) (Vogelmann
et al., 2001) and recent Landsat ETM+ imagery. For the
Jesup site, the analysis revealed a small offset of �0.2 m
(i.e., bald-Earth SRTM elevations were, on average, 20 cm
lower than NED elevations). For the Sierra Nevada test site,
the offset was �1.3 m.
5.4. Test site scattering phase center height calculation
Following the procedures for error accounting described
above, the mean scattering phase center height (hspc) for a
vegetated region can be estimated within relatively narrow
error bounds if a large-enough sample population can be
averaged. Substituting the unknown functional dependency
on vegetation characteristics in Eq. (6) with the SRTM–
NED difference, the mean scattering phase center heights
for the Jesup and Sierra Nevada test sites were calculated
according to:
h¯ spc ¼1
Np
XNP
i¼1
hSRTM � hNEDÞ þ dvð ð7Þ
where hspc=mean scattering phase center height; hSRTM=
SRTM pixel elevation; hNED=NED pixel elevation; NP=
number of pixels in a stand, plot, or bin.
The remaining error component er associated with the
hspc of each stand (Jesup), plot, or bin (Sierra Nevada) is
primarily dependent on the number of samples (NP)
averaged and can be approximated from Fig. 8.
6. Regression models for canopy height determination
The primary objective of this study was to determine
whether a functional relationship exists between the STRM
scattering phase center height and the observed vegetation
canopy height for forested areas in the Jesup and Sierra
Nevada test sites. Given the significant relative error
inherent in the SRTM data, this relationship cannot be
established at the individual pixel scale. Hence, a formula-
tion of this relationship must necessarily be based on mean
estimates of observed canopy (Eq. (2)) and scattering phase
center height (Eq. (7)), and can be expressed in the most
general form as:
h¯ can ¼ f h¯ spc��
ð8Þ
Preliminary inspection of the data suggested that linear
models might be adequate for describing the functional
relationship in Eq. (8). Thus, the following model was tested
for both sites:
h¯ can ¼ b0 þ b1h¯spc ð9Þ
where b0=model intercept; b1=model slope.
Based on the relationship in Eq. (9), a least squares
regression analysis was performed on the data from each test
site to determine the model parameters and associated
coefficients of determination. To quantify the deviation
between observed and predicted mean canopy height values,
the root mean square error was computed as:
rmse ¼
ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiXNi¼1
�h¯ obscan � h¯ predcan
�2
N
vuuuutð10Þ
where rmse=root mean square error; hcanobs=observed mean
canopy height; hcanpred=predicted mean canopy height; N=
number of observations.
7. Results
7.1. Test site Jesup, Georgia
A summary of biophysical parameters for the 22 Jesup
stands is included in Table 2. Included in this table are the
hspc and observed hcan values used in the regression
analyses. Fig. 9 shows the hspc values plotted against the
observed hcan values for the 22 stands. The linear regression
that included all 22 stands in the analysis resulted in an
adjusted r2 value of 0.15. The weak linear relationship
observed was largely due to the presence of several outlier
stands. Two possible explanations for the outliers can be
offered. First, several stands have a relatively small area and
therefore too few samples were available for averaging to
reduce the phase noise component present in the SRTM
data. Three of the outlier stands, indicated by open triangles
in Fig. 9, have fewer than 20 pixels (i.e., b1.8 ha) within
them and significantly fewer pixels than the remaining
stands (31–574). Removal of the three stands resulted in an
improved adjusted r2 value of 0.42 with an rmse of 1.8 m
(Table 4). Second, four of the remaining 19 stands exhibit
Table 2
Summary statistics for 22 stands surveyed within the test site Jesup, Georgia
Stand ID Age Strata Number of
surveyed
plots
Plot size
[m2]
Number
of SRTM
pixels
Mean canopy
height [m]
(hcan)
S.D.
height
[m]
Basal
area
[m2/ha]
Trees per
hectare
SRTM–NED
height [m]
(hspc)
S.D.
SRTM–NED
height [m]
195766 18 Unthinned 24 81 574 17.1 2.0 21.5 788 9.2 2.4
224800 15 Unthinned 29 81 447 16.4 1.5 19.1 1114 7.9 3.0
224305 20 Unthinned 24 81 377 20.1 2.1 21.5 592 10.1 2.4
224542 25 Thinned 25 202 327 15.6 3.2 15.4 458 8.5 2.9
224472 15 Unthinned 30 81 258 14.0 2.2 11.6 724 6.0 2.0
195835 32 Unthinned 24 81 247 21.3 2.8 31.0 766 12.8 2.6
224796 22 Thinned 11 202 236 12.7 2.6 8.9 464 5.8 2.3
224353 14 Unthinned 53 81 210 10.8 2.0 9.1 899 4.7 2.1
224368 18 Unthinned 24 81 187 18.3 1.7 24.0 1083 9.0 2.0
195816 18 Unthinned 22 81 127 18.9 1.5 22.4 675 9.1 2.9
224278 20 Unthinned 28 81 82 15.8 2.6 15.5 932 8.9 2.9
224303 20 Unthinned 27 81 72 18.0 4.4 25.5 692 10.8 2.0
224224 18 Unthinned 24 81 37 16.5 2.0 20.4 762 7.1 2.2
224799 18 Unthinned 24 81 36 16.9 1.8 24.4 997 10.4 2.2
224784 18 Unthinned 28 81 31 16.1 1.5 26.3 1217 8.3 2.7
224302 21 Thinned 31 202 165 19.8 5.0 15.1 333 7.5 2.3
224401 22 Unthinned 43 81 113 18.0 5.5 14.2 386 9.6 2.5
224550 20 Unthinned 36 81 59 16.7 6.0 30.3 700 13.0 2.6
224546 26 Thinned 41 202 128 15.7 7.5 19.9 426 10.6 2.4
224335 17 Unthinned 25 81 13 10.4 2.0 23.5 1034 8.3 1.9
224304 20 Unthinned 26 81 6 12.0 2.5 21.1 556 8.8 2.1
224547 26 Thinned 41 202 2 15.7 7.5 19.9 426 16.2 2.2
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358352
relatively large standard deviations in observed height (on
the order of 5 m and above), which points to the presence of
a dissimilar stand structure characterized by a more
heterogeneous canopy. Whereas this heterogeneity should
also be reflected in the corresponding standard deviations of
the SRTM hspc values, it is not (Table 2). This is due to the
fact that the variability in observed height within the four
stands exists at a scale less than the resolution of the SRTM
data. Visual inspection of the stands both on the ground and
Fig. 9. Plot of SRTM scattering phase center height (hspc) versus observed stand he
key differences in stand variables including stand size [e.g., triangles indicate sma
indicate stands with high height variability (S.D.N5)].
in high-resolution imagery confirms that these stands exhibit
a very heterogeneous structure including windfalls and
small clearings, which contribute to the large variance in
observed height. Thus, these stands were deemed not
representative of the remaining sample and would have to
be treated with an alternative approach (e.g., elimination of
the windfall/clear areas from both the ground and image
sample). Since the ground survey did not record exact plot
locations, this was not possible retrospectively. Removal of
ight (hcan) for all stands (22) in the Jesup, Georgia, test site. Symbols reflect
ll stands with few (b20) pixels per stand] and stand structure [e.g., squares
Fig. 10. Plot of SRTM scattering phase center height (hspc) versus observed stand height (hcan) for 15 stands in the Jesup, Georgia, test site.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 353
the four stands resulted in a significant improvement in the
correlation between SRTM hspc and hcan. The adjusted r2
value of the model with seven outlier stands removed was
0.79 with an rmse of 1.1 m (Fig. 10).
Under the assumption that additional sample averaging
could further reduce the influence of phase noise on model
performance, a minimum threshold of 50 pixels per stand was
explored since (1) according to Fig. 8, the noise range is
reduced significantly for stands of this size, and (2) a sufficient
number of stands (i.e., 12) remained available with which to
develop the regression model (Table 2). The resulting adjusted
r2 value was 0.86 with an rmse of 1.0 m (Fig. 10).
Table 3
Summary statistics for 20 bsimulated standsQ (i.e., bins) based on data from 227
Bin
number
Bin range
[m]
Number
of plots
per bin
Plot
size
[m2]
Number
of SRTM
pixels per bin
Mean canop
height [m]
(hcan)
1 b5.00 3 707 26 3.5
2 5.00–6.49 3 707 25 5.9
3 6.50–7.99 3 707 26 7.4
4 8.00–9.49 11 707 85 8.8
5 9.50–10.99 29 707 253 10.2
6 11.00–12.49 24 707 214 11.8
7 12.50–13.99 20 707 173 13.3
8 14.00–15.49 22 707 191 14.7
9 15.50–16.99 19 707 158 16.1
10 17.00–18.49 21 707 184 17.8
11 18.50–19.99 10 707 87 19.2
12 20.00–21.49 16 707 139 20.9
13 21.50–22.99 7 707 56 22.0
14 23.00–24.49 13 707 114 23.8
15 24.50–25.99 5 707 44 24.9
16 26.00–27.49 4 707 35 27.1
17 27.50–28.99 6 707 47 28.5
18 29.00–30.49 4 707 35 29.4
19 30.50–33.99 4 707 36 33.0
20 34.00–38.00 3 707 24 37.3
The parameters of the linear regression models associated
with the 20- and 50-pixel thresholds indicate that the height
of the scattering phase center increases moderately
(slope=1.2–1.3) with increasing canopy height and lies
approximately 6+ m (intercept=5.8–6.6) below the canopy
surface.
7.2. Test site Sierra Nevada, California
An overview of biophysical parameters, including hspcand observed hcan values, corresponding to the 20 simulated
Sierra Nevada stands (i.e., bins) is provided in Table 3. Prior
plots surveyed within the test site Sierra Nevada, CA
y S.D.
height
[m]
Basal area
per bin
[m2/ha]
Number
of trees
per bin
SRTM–NED
Height [m]
(hspc)
S.D.
SRTM–NED
height [m]
0.9 0.2 17 3.3 3.4
2.8 2.6 91 3.7 2.2
4.0 3.7 122 8.7 1.8
3.9 10.3 220 7.5 1.8
5.0 47.2 990 13.0 2.9
6.3 57.0 805 12.2 2.8
7.5 55.5 663 14.1 2.6
7.5 68.2 688 16.1 2.6
9.3 63.0 483 12.8 2.7
8.1 80.2 684 18.1 2.9
10.5 46.0 289 18.3 1.9
10.3 60.5 331 19.5 3.0
10.4 25.8 172 20.8 2.4
11.1 59.7 321 21.3 3.4
11.0 18.1 78 19.7 4.0
8.2 20.2 100 18.6 2.4
10.7 33.7 139 19.9 3.0
12.4 20.0 69 13.5 3.0
12.8 18.3 52 22.1 2.9
10.1 15.1 38 26.3 2.8
Fig. 11. Plot of SRTM scattering phase center height (hspc) versus observed plot height (hcan) for all plots (227) in the Sierra Nevada, California, test site.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358354
to binning, a simple linear regression analysis was con-
ducted with observed hcan regressed on hspc (Fig. 11). Data
from all 227 plots were included in model development. The
result was an adjusted r2 value of 0.20 with an rmse of 5.8
m (Table 4), indicating the presence of only a slight positive
correlation. This relatively weak relationship was attributed
to the presence of residual phase noise resulting from the
fact that only 6–10 pixels were averaged per plot under the
45-m buffering strategy. This number was observed to be
quite low in light of the findings presented in Fig. 8 as well
as in comparison to the Jesup site where the average stand
size was significantly larger and hence the number of pixels
extracted was markedly greater (i.e., N20 pixels following
outlier removal) . The binning approach described in
Section 5.3 was therefore undertaken in an effort to simulate
stand-level statistics using plot-level data. Under this
strategy, the number of pixels being averaged per observa-
tion was increased from a minimum of 6 to a minimum of
20 (Table 3). Fig. 12 shows the hspc values plotted against
observed hcan values for the resulting 20 height bins. The
regression analysis based on this strategy resulted in a
significantly improved adjusted r2 value of 0.75 with an
rmse of 4.5 m (Fig. 12; Table 4). Based on the Jesup results,
Table 4
Results of simple linear regression analyses performed on the test sites Jesup, Ge
Dataset Observed Adjusted r
Jesup test site: linear least squares regression
Number of pixels N20 19 0.42
Number of pixels N20 and hcan S.D. b5 m 15 0.79
Number of pixels N50 and hcan S.D. b5 m 12 0.86
Sierra Nevada test site: linear least squares regression
No pixels threshold (based on all plots) 227 0.20
Number of pixels N20 (based on all plots) 20 0.75
Number of pixels N50 (based on all plots) 11 0.86
Number of pixels N20 (based on 126 training plots) 20 0.74
an attempt was made to determine whether any remaining
phase noise could be removed. Again, a minimum threshold
of 50 pixels per bin was tested, which resulted in 11 height
bins (Table 3). The least squares regression based on these
data resulted in an adjusted r2 value of 0.86 with an rmse of
1.7 m (Fig. 12; Table 4). Inspection of Fig. 12 reveals the
influence that bins with fewer than 50 samples had on the
20-bin model prior to their removal. In the initial 20-bin
model, the slope and intercept were 1.336 and �1.892,
respectively. The regression based on the remaining eleven
bins resulted in slight decreases in both the slope (1.088)
and intercept values (�0.940).
7.3. Model inversion and validation
In the context of this research, validation efforts were
focused on the bin-based modeling approach developed and
applied as part of the Sierra Nevada pilot study. Validation
was limited to the Sierra Nevada site given (1) the limited
amount of field data available for the Jesup site (22
georeferenced stands) compared to the Sierra Nevada site
(227 georeferenced plots), and (2) a desire to conduct a
formal and independent assessment of the bin-based
orgia, and Sierra Nevada, California
2 F statistic p value Intercept (b0) Slope (b1) rmse [m]
14.30 0.00149 9.785 0.784 1.8
54.93 N0.0001 6.619 1.160 1.1
67.88 N0.0001 5.778 1.261 1.0
58.81 N0.0001 10.883 0.370 5.8
58.15 N0.0001 �1.892 1.336 4.5
63.72 N0.0001 �0.940 1.088 1.7
55.44 N0.0001 �1.775 1.351 4.6
Fig. 12. Plot of SRTM scattering phase center height (hspc) versus observed binned plot height (hcan) for 20 bsimulated standsQ (i.e., bins) in the Sierra Nevada,
California, test site.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 355
modeling strategy employed as part of the Sierra Nevada
analysis.
A validation effort based on independent training and
testing datasets was carried out in order to test the robustness
of the binning strategy summarized in Table 3. Here, a 20-
pixel minimum threshold was established to pool the plot
data into 20 bins (height classes), thereby simulating 20
stands of relatively homogeneous height. To properly
validate the binning approach, sufficient data were needed
within each bin to allow for the 20-pixel minimum threshold
to be maintained in both training and testing datasets. Thus,
Table 5
Summary statistics for bsimulated standsQ (i.e., bins) used in regression model de
Bin
number
Bin range [m] Training (126 plots)
Number
of plots
per bin
Number
of SRTM
pixels per bin
Mean canopy
height [m]
(hcan)
SR
hei
(hs
1 b5.00 3 26 3.5 3.
2 5.00–6.49 3 25 5.9 3.
3 6.50–7.99 3 26 7.4 8.
4 8.00–9.49 6 47 8.8 8.
5 9.50–10.99 14 119 10.3 12.
6 11.00–12.49 12 108 11.9 11.
7 12.50–13.99 10 90 13.3 13.
8 14.00–15.49 11 99 14.6 16.
9 15.50–16.99 10 85 16.1 12.
10 17.00–18.49 10 88 17.8 18.
11 18.50–19.99 5 45 19.0 18.
12 20.00–21.49 8 68 20.8 20.
13 21.50–22.99 4 32 22.2 19.
14 23.00–24.49 6 51 23.8 20.
15 24.50–25.99 3 27 24.9 20.
16 26.00–27.49 4 35 27.1 18.
17 27.50–28.99 3 23 28.6 17.
18 29.00–30.49 4 35 29.4 13.
19 30.50–33.99 4 36 33.0 22.
20 34.00–38.00 3 24 37.3 26.
for all bins containing 40 (i.e., 2�20) pixels or more (see
Table 3), half of the sample plots (102 plots) and associated
pixels were randomly selected for model development (i.e.,
training), while the remaining half (101 plots) were reserved
for inversion and validation (i.e., testing) (Table 5). Bins
containing fewer than 40 pixels (24 additional plots) were
used exclusively for model development. Thus, whereas the
model was trained on the entire range of height classes (0–38
m) represented by the 20 bins (126 plots), validation was
conducted on height classes ranging from 8 to 26 m and from
27.5 to 29 m represented by 13 bins (101 plots) (Table 5).
velopment (20 training bins) and validation (13 testing bins)
Testing (101 plots)
TM–NED
ght [m]
pc)
Number
of plots
per bin
Number
of SRTM
pixels per bin
Mean canopy
height [m]
(hcan)
SRTM–NED
height [m]
(hspc)
3
7
7
3 5 38 8.7 6.5
0 15 134 10.1 13.9
4 12 106 11.8 13.1
5 10 83 13.3 14.8
2 11 92 14.7 16.0
6 9 73 16.1 12.9
2 11 96 17.8 17.9
6 5 42 19.3 17.9
0 8 71 21.0 19.0
9 3 24 21.9 21.9
1 7 63 23.8 22.3
2 2 17 24.8 18.9
6
2 3 24 28.3 22.5
5
1
3
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358356
The regression analysis applied to the training dataset
resulted in an adjusted r2 value of 0.74 with an rmse of 4.6 m
(Table 3). These results are consistent with the adjusted r2
and rmse values (0.75, 4.5 m) reported for the original 20-
bin model developed using data from all 227 plots.
Furthermore, the slope (b0) and intercept (b1) parameters
associated with the original 20-bin model (b0=1.336,
b1=�1.892) and the 20-bin training model (b0=1.351,
b1=�1.775) are virtually identical, indicating that the
original model parameters were appropriate. When the
training model is inverted to predict observed canopy height
values for the independent 13-bin testing dataset, an rmse of
4.1 m is achieved (Table 4), a result consistent with those
reported above for both the original 20-bin dataset (rmse=4.5
m) and the 20-bin validation dataset (rmse=4.6 m).
8. Discussion
In this study, two test sites representing a range of
vegetation structure, moisture, and topographic character-
istics were selected for use in determining the feasibility of
deriving vegetation canopy height from SRTM digital
elevation data. Whereas the Jesup test site is dominated
by heavily managed, relatively homogeneous plantations of
slash pine on moderately rolling terrain, the Sierra Nevada
test site is characterized by less heavily managed, relatively
heterogeneous tracts of coniferous forest of variable
composition and structure on highly variable terrain. All
else being equal, these structural-, topographic-, and
associated moisture-related differences have a direct and
differential influence on the height of the scattering phase
center (i.e., the vertical position within the canopy from
where the majority of backscattered electromagnetic energy
is being returned). As expected, different empirical models
are needed to describe the differing relationships that exist
between the two test sites with respect to observed and
remotely sensed estimates of vegetation canopy height.
Preliminary inspection of the data indicated that simple
univariate linear models would be sufficient for describing
these relationships. It follows that efforts to extend this
research to regional, continental, or global scales will
necessarily require development of a more comprehensive
region-specific and/or multivariate approach. To be most
effective, such an approach would be conducted in the
context of an ecoregional framework where regional
ecological drivers of vegetation composition and structure
including climate and topography can be appropriately
controlled. Within individual ecoregions, remotely sensed
descriptors of horizontal structure (e.g., canopy density,
cover type, etc.) can then be used together with SRTM
scattering phase center height information as part of a
multivariate and structurally hierarchical approach to
regression model development.
Whatever the spatial scale at which the SRTM data are
used, based on the research presented here, it is clear that
accurate estimates of vegetation canopy height cannot be
expected at the level of individual SRTM resolution cells
(c900 m2). Due to the relative error component resulting
largely from phase noise, sample averaging is a critical
prerequisite of any effort to derive canopy height informa-
tion from the SRTM data. The results from the Jesup and
Sierra Nevada test sites suggest that 20 pixels is perhaps the
minimum acceptable threshold for sample averaging (Figs.
10 and 12). The results of the Iowa and North Dakota field
studies support this finding but also indicate that a higher
averaging threshold may be warranted depending on what
level of residual relative error is deemed acceptable (Fig. 8).
Regardless of the final threshold selected, thought needs to
be given to the specific method used to aggregate pixels
prior to averaging. The stand-based approach applied in the
Jesup case is generally ideal assuming that (1) survey data
are collected on a per-stand basis; (2) stand boundaries are
available for use in the analysis, and (3) stands exhibit
relatively homogeneous structural characteristics. Since in
the general case one or more of these conditions are likely
not to be met, an alternative averaging strategy with more
universal application is desirable. Future work in this regard
will focus on the use of knowledge-based segmentation,
which provides for the automatic and optimal delineation of
local homogeneous regions reflecting a partitioning of the
landscape into units of similar height and overall structure.
This approach is superior to kernel-based methods (i.e., the
simple buffer employed in the Sierra Nevada case) for
several reasons. First, image segments can be highly
adaptive to the irregular shapes of homogeneous regions
(e.g., bands of riparian forest). Second, ancillary information
including other high-resolution remote sensing imagery
such as Landsat-derived vegetation indices or land cover
information can be incorporated in the retrieval of image
segments in order to refine the process of boundary
delineation. National-scale data of this sort will soon be
available in the United States through the ongoing release of
the National Land Cover database 2001 (Homer et al.,
2004).
Despite the presence of phase noise and other contrib-
utors to the overall relative and absolute height errors (see
Eqs. (3) and (4)), efforts undertaken as part of this research
to both understand and minimize errors sources have been
quite successful. In most cases, the errors themselves are
relatively minor. Some of these errors such as those related
to soil moisture and roughness are nearly impossible to
quantify given the complex nature of SRTM data acquisition
and processing (e.g., averaging of multiple data takes). As a
result, and given their minimal influence on the height
estimation process, no effort has been made to account for
them explicitly. On the other hand, more significant sources
of error such as those attributed to phase noise and vertical
offsets are reasonably well understood and can be mini-
mized, if not removed, using the techniques (e.g., sample
averaging) described here. Future work in the arena of error
characterization will focus on development of methods for
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358 357
minimizing this suite of errors across broader geographic
extents. For example, regional-scale efforts to estimate
vegetation canopy height require techniques for spatially
modeling the absolute error associated with vertical offsets
or trends between the SRTM and NED datasets. Results
from the Iowa/North Dakota investigation of phase noise
indicated a difference in vertical offset amounting to almost
3 m. In the context of regional-scale analyses, spatially
varying vertical trends such as that observed in the Great
Plains can be identified using trend analysis together with
SRTM–NED difference data from vegetation-free areas
(e.g., snow-covered fields, pastures, meadows, etc.). The
trend can then be modeled using geostatistical techniques
such as kriging and accurately removed from the regional
estimates of scattering phase center height.
A number of additional research areas are currently being
explored in the context of efforts to extend SRTM-derived
canopy height estimation to larger scales. Perhaps most
important among these is the need to acquire accurate and
consistent field survey information for use in larger-scale
applications of the SRTM data. One potential source of such
data is the Forest Inventory and Analysis (FIA) program of
the USDA Forest Service (Gillespie, 1999). This federal
program represents the only nationwide source of timely
and reliable forest inventory and monitoring information.
The inventory consists of some 150,000 permanent plots
nationwide; 20% of these are surveyed annually. Generally
speaking, the FIA sampling design is quite similar to that
employed as part of the Sierra Nevada field survey. The
encouraging results obtained from the Sierra Nevada test
site suggest that FIA survey data may be useful for scaling
the SRTM-based height estimation process up to the
conterminous United States and beyond.
A second key area of research involves developing a
more complete physical understanding of the interferometric
scattering phase center height and determining what this
measure best reflects in the context of traditional field
metrics for describing vegetation canopy height. In this
study, the functional relationship between hspc and the
observed mean canopy height was explored exclusively.
However, it is possible that other alternative height metrics
(e.g., median, maximum, or dominant height) may in fact be
more highly correlated with the hspc. In areas like the Jesup
test site, dominated almost exclusively by even-aged pine
plantations, variability in canopy height is very low,
resulting in a situation where very little difference exists
between values of the mean, median, maximum, or
dominant height. In this case, the mean represents an
entirely appropriate measure of the observed height.
However, in regions like the Sierra Nevada where variability
in vertical structure tends to be significant, mean canopy
height is a very different measure from that of maximum or
dominant height. Thus, in the presence of heterogeneous
structural conditions, it may be that hspc is a much better
predictor of one of these alternative height metrics,
especially in stands of moderate to high density.
9. Conclusions
The research presented here demonstrates that elevation
data from the NASA JPL SRTM holds considerable potential
for developing statistically significant inversion models for
deriving estimates of vegetation canopy height. Various
relative and absolute SRTM error sources were identified as
part of this work and broadly applicable methods have been
proposed for removing or significantly reducing these errors.
Most important among these relates to the minimization of
the SRTM-inherent phase noise. Using elevation profiles and
outlier analysis together with an analysis of noise statistics in
vegetation-free areas, it was established that averaging a
minimum of 20 SRTM pixels yields stable estimates of the
mean scattering phase center height. This corresponds to a
minimum SRTM mapping unit for SRTM-based vegetation
canopy height estimation of approximately 1.8 ha.
The possibility of expanding this work to the scale of the
conterminous United States is now being explored. Com-
plementary field and ancillary data sources including the
national FIA database and the soon-to-be released NLCD
2001 represent unprecedented sources of information for
enabling the production of the first-ever nationwide map of
vegetation canopy height. Such a map would undoubtedly
prove invaluable for a wide range of applications including
biomass estimation and carbon accounting as well as timber
management and habitat characterization.
In summary, SRTM elevation data exhibit a significant
sensitivity to the vertical structure of vegetation. Although
part of the user community views vegetation as amajor source
of noise and nuisance in what would otherwise be highly
accurate SRTM-derived bald-Earth elevation data, it is shown
that for the purposes of vegetation height estimation, the
SRTM dataset is an invaluable resource of near-global scope.
Acknowledgements
The collection of ground data within the Jesup test site
was supported by the NASA EOCAP-SAR program and the
Plum Creek Timber. Analysis of the Jesup data was
supported by a NASA Solid Earth grant titled bVegetationCorrected SRTM-Derived Digital Elevation Models.Q Datacollection within the Sierra Nevada test site was supported
by a NASA Earth Science grant titled bForest Structure fromMultispectral Fusion.Q The analysis was supported, in part,
by the aforementioned grant as well as a NASA Earth
System Science Fellowship awarded to the second author.
References
Bamler, R. (1999). The SRTM Mission: A world-wide 30 m resolution
DEM from SAR Interferometry in 11 days. In D. Fritch, & R. Spiller
(Eds.), Photogrammetric week ’99 (pp. 145–154). Heidelberg7
Wichmann.
J. Kellndorfer et al. / Remote Sensing of Environment 93 (2004) 339–358358
Bergen, K. M., & Dobson, M. C. (1999). Integration of remotely sensed
radar imagery in modelling and mapping of forest biomass and net
primary production. Ecological Modelling, 122, 257–274.
Brown, C. G. (2003). Tree height estimation using Shuttle Radar
Topography Mission and ancillary data. PhD Thesis, Department of
Electrical Engineering and Computer Science, The University of
Michigan, Ann Arbor, MI.
Curkendall, D., Fielding, E., Cheng, T., & Pohl, J. (2003). A computational-
grid based system for continental drainage network extraction using
SRTM digital elevation models. IEEE ICPP Workshops, 181–190.
Dobson, M. C. (2000). Forest information from Synthetic Aperture Radar.
Journal of Forestry, 98, 41–43.
Dobson, M. C., Ulaby, F. T., Pierce, L. E., Sharik, T. L., Bergen, K. M.,
Kellndorfer, J. M., et al. (1995). Estimation of forest biophysical
characteristics in Northern Michigan with SIR-C/X-SAR. IEEE Trans-
actions on Geoscience and Remote Sensing, 33, 877–895.
Dubayah, R. O., & Drake, J. B. (2003). Lidar remote sensing for forestry.
Journal of Forestry, 98, 44–46.
Gesch, D., Oimoen, M., Greenlee, S., Nelson, C., Steuck, M., & Tyler, D.
(2002). The National Elevation Dataset. Photogrammetric Engineering
and Remote Sensing, 68, 5–11.
Gillespie, A. (1999). Rationale for a National Annual Forest Inventory
Program. Journal of Forestry, 97, 16–20.
Hagberg, J. O., Ulander, L. M. H., & Askne, J. (1995). Repeat-pass SAR
interferometry over forested terrain. IEEE Transactions on Geoscience
and Remote Sensing, 33, 331–340.
Hensley, S., Rosen, P., & Gurrola, E. (2000). Topographic map generation
for the Shuttle Radar Topography Mission C-band SCANSAR
interferometry. Proceedings of SPIE, 4152, 179–189.
Homer, C., Huang, C., Yang, L., Wylie, B., & Coan, M. (2004).
Development of a 2001 National Landcover Database for the United
States. Photogrammetric Engineering and Remote Sensing, 70,
829–840.
Hyypp7, J., & Hallikainen, M. (1996). Applicability of airborne profiling
radar to forest inventory. Remote Sensing of Environment, 57, 39–57.
Kasischke, E. S., Melack, J. M., & Dobson, M. C. (1997). The use of
imaging radars for ecological applications—A review. Remote Sensing
of Environment, 59, 141–156.
Kellndorfer, J. M., Dobson, M. C., Pierce, L. E., & Ulaby, F. T. (1998).
Toward consistent regional- to-global-scale vegetation characterization
using orbital SAR systems. IEEE Transactions on Geoscience and
Remote Sensing, 36, 1396–1411.
Kellndorfer, J.M, Dobson, M. C., Vona, J., & Clutter, M. (2003). Towards
precision forestry: Plot-level parameter retrieval for slash pine
plantations with JPL AIRSAR. IEEE Transactions on Geoscience and
Remote Sensing, 41, 1571–1582.
Kellndorfer, J. M., & Ulaby, F. T. (2003). Forest biomass inversion from
SAR using object oriented image analysis techniques. IEEE Geoscience
and Remote Sensing Symposium, Toulouse, 6, 3465–3467.
Kobayashi, Y., Sarabandi, K., & Dobson, M. C. (2000). An evaluation of
the JPL TOPSAR for extracting tree heights. IEEE Transactions on
Geoscience and Remote Sensing, 38, 2446–2454.
Lefsky, M. A., Harding, D., Cohen, W. B., Parker, G., & Shugart, H. H.
(1999). Surface Lidar remote sensing of basal area and biomass in
deciduous forests of Eastern Maryland, USA. Remote Sensing of
Environment, 67, 83–98.
Martinez, J. M., Floury, N., Le Toan, T., Beaudoin, A., Hallikainen, M., &
Makynen, M. (2000). Measurements and modeling of vertical back-
scatter distribution in forest canopy. IEEE Transactions on Geoscience
and Remote Sensing, 38, 710–719.
Papathanassiou, K. P., & Cloude, S. K. (2001). Single-baseline polarimetric
SAR interferometry. IEEE Transactions on Geoscience and Remote
Sensing, 39, 2352–2363.
Pienaar, L. V., Shiver, B. D., & Rheney, J. W. (1993). Yield prediction for
mechanically site-prepared slash pine plantations in the southeastern
Coastal Plain. Southern Journal of Applied Forestry, 17, 163–173.
Rignot, E., Echelmeyer, K., & Krabill, W. B. (2001). Penetration depth of
Interferometric Synthetic Aperture Radar signals in snow and ice.
Geophysical Research Letters, 28, 3501–3504.
Rosen, P., Eineder, M., Rabus, B., Gurrola, E., Hensley, S., Knoepfle, W.,
et al. (2001a). SRTM mission cross comparison of X and C band data
properties. IEEE Geoscience and Remote Sensing Symposium, 2,
751–753.
Rosen, P., Hensley, S., Gurrola, E., Rogez, F., Chan, S., Martin, J., et al.
(2001b). SRTM C-band topographic data: Quality assessments and
calibration activities. IEEE Geoscience and Remote Sensing Sympo-
sium, 2, 739–741.
Rosen, P. A., Hensley, S., Joughin, I. R., Li, F. K., Madsen, S. N.,
Rodriguez, E., et al. (2000). Synthetic Aperture Radar interferometry.
Proceedings of the IEEE, 88, 333–382.
Sarabandi, K., & Lin, Y. C. (2000). Simulation of interferometric SAR
response for characterizing the scattering phase center statistics of forest
canopies. IEEE Transactions on Geoscience and Remote Sensing, 39,
115–125.
Smith, B., & Sandwell, D. (2003). Accuracy and resolution of Shuttle
Radar Topography Mission data. Geophysical Research Letters, 30,
1467–1470.
Sun, G., Ranson, K. J., Kharuk, V. I., & Kovacs, K. (2003). Validation of
surface height from Shuttle Radar Topography Mission using shuttle
laser altimeter. Remote Sensing Environment, 88, 401–411.
Treuhaft, R. N., Madsen, S. N., Moghaddam, M., & van Zyl, J. J. (1995).
Vegetation characteristics and underlying topography from interfero-
metric radar. Radio Science, 31, 1449–1485.
Treuhaft, R. N., & Siqueira, P. R. (2000). Vertical structure of vegetated
land surfaces from interferometric and polarimetric radar. Radio
Science, 35, 141–177.
Ulaby, F. T., Dobson, M. C., Pierce, L. E., Kellndorfer, J. M., & Sarabandi,
K. (1995). Scene classification and biomass estimation using SIR-C/X-
SAR. Proceedings of the international symposium on the retrieval of
biophysical and geophysical parameters from SAR data for land
applications (pp. 53–57). Toulouse7 CNES/IEEE.
United States Geological Survey (2003). Shuttle Radar Topography
Mission documentation: SRTM Topo. http://edcftp.cr.usgs.gov/pub/
data/srtm/Documentation/SRTM_Topo.txt
Vogelmann, J. E., Howard, S. M., Yang, L., Larson, C. R., Wylie, B. K., &
van Driel, J. N. (2001). Completion of the 1990’s National Land Cover
Data Set for the conterminous United States. Photogrammetric
Engineering and Remote Sensing, 67, 650–662.
Vona, J. (2001). Modeling slash pine (Pinus elloitti) parameters using
synthetic aperture radar. MS thesis, The University of Georgia, Athens.