Characterization of ecosystem responses to climatic controls using artificial neural networks ANTJE M. MOFFAT * , CLEMENS BECKSTEIN w , GALINA CHURKINA z, MARTINA MUND * andMARTINHEIMANN * *Max Planck Institute for Biogeochemistry, Hans-Kno ¨ll-Str. 10, 07745 Jena, Germany, wDepartment of Mathematics and Computer Science, Friedrich Schiller University, Ernst-Abbe-Platz 1-4, 07743 Jena, Germany, zLeibniz-Centre for Agricultural Landscape Research (ZALF), Eberswalder Strasse 84, 15374 Mu ¨ ncheberg, Germany Abstract Understanding and modeling ecosystem responses to their climatic controls is one of the major challenges for predicting the effects of global change. Usually, the responses are implemented in models as parameterized functional relationships of a fixed type. In contrast, the inductive approach presented here based on artificial neural networks (ANNs) allows the relationships to be extracted directly from the data. It has been developed to explore large, fragmentary, noisy, and multidimensional datasets, such as the carbon fluxes measured at the ecosystem level with the eddy covariance technique. To illustrate this, our approach has been systematically applied to the daytime carbon flux dataset of the deciduous broadleaf forest Hainich in Germany. The total explainable variability of the half-hourly carbon fluxes from the driving climatic variables was 93.1%, showing the excellent data mining capability of the ANNs. Total photosynthetic photon flux density was identified as the dominant control of the daytime response, followed by the diffuse radiation. The vapor pressure deficit was the most important nonradiative control. From the ANNs, we were also able to deduce and visualize the dependencies and sensitivities of the response to its climatic controls. With respect to diffuse radiation, the daytime carbon response showed no saturation and the light use efficiency was three times greater for diffuse compared with direct radiation. However, with less potential radiation reaching the forest, the overall effect of diffuse radiation was slightly negative. The optimum uptake of carbon occurred at diffuse fractions between 30% and 40%. By identifying the hierarchy of the climatic controls of the ecosystem response as well as their multidimensional functional relationships, our inductive approach offers a direct interface to the data. This provides instant insight in the underlying ecosystem physiology and links the observational relationships to their representation in the modeling world. Keywords: artificial neural networks (ANNs), climatic controls, ecological data mining, ecosystem physiology, eddy covariance carbon flux, FLUXNET, Hainich forest, inductive modeling Received 20 August 2009 and accepted 8 November 2009 Introduction The change of the earth’s climate strongly affects terres- trial biological ecosystems (IPCC, 2007a), but the response of the ecosystems to the changing environmental condi- tions is largely unknown. Even basic phenomena are still under debate: The observed net uptake of CO 2 by the land biosphere implies an unexplained large, increasing land sink, also called missing sink or residual land sink (Burgermeister, 2007; IPCC, 2007b). For the Northern Hemisphere, the average estimate of the land carbon sink from atmospheric inversions is almost a factor of two larger than the bottom-up estimate, and the longitudinal partitioning of the northern sink is subject to large un- certainties (IPCC, 2007b). Furthermore, it is now recog- nized that biological processes influence the climate of the earth system significantly (Heimann & Reichstein, 2008). Therefore, understanding the climatic controls of the ecosystem response is fundamental and essential in the context of global change. To tackle this question, towers equipped with the eddy covariance technique have been established, and these are measuring the carbon flux in a wide range of vegetation types and climate zones all over the world (Baldocchi, 2008). The flux measurements have a high temporal resolution of half-hourly to hourly, but, due to the limitations of the eddy covariance technique, they are fragmentary and noisy (Papale et al., 2006). Main limitations are the theoretical requirement of stationar- ity of the flow, turbulent atmospheric conditions, and no residual vertical wind speed or horizontal advection, Correspondence: Antje M. Moffat, tel. 1 49 3641 576220, fax 1 49 3641 577200, e-mail: [email protected]Global Change Biology (2010) 16, 2737–2749, doi: 10.1111/j.1365-2486.2010.02171.x r 2010 Blackwell Publishing Ltd 2737
13
Embed
Characterization of ecosystem responses to climatic controls using artificial neural networks
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Characterization of ecosystem responses to climaticcontrols using artificial neural networksA N T J E M . M O F FA T *, C L E M E N S B E C K S T E I N w , G A L I N A C H U R K I N A z, M A R T I N A M U N D *
and M A R T I N H E I M A N N *
*Max Planck Institute for Biogeochemistry, Hans-Knoll-Str. 10, 07745 Jena, Germany, wDepartment of Mathematics and Computer
Science, Friedrich Schiller University, Ernst-Abbe-Platz 1-4, 07743 Jena, Germany, zLeibniz-Centre for Agricultural Landscape
Research (ZALF), Eberswalder Strasse 84, 15374 Muncheberg, Germany
Abstract
Understanding and modeling ecosystem responses to their climatic controls is one of the major challenges for
predicting the effects of global change. Usually, the responses are implemented in models as parameterized functional
relationships of a fixed type. In contrast, the inductive approach presented here based on artificial neural networks
(ANNs) allows the relationships to be extracted directly from the data. It has been developed to explore large,
fragmentary, noisy, and multidimensional datasets, such as the carbon fluxes measured at the ecosystem level with the
eddy covariance technique. To illustrate this, our approach has been systematically applied to the daytime carbon flux
dataset of the deciduous broadleaf forest Hainich in Germany. The total explainable variability of the half-hourly
carbon fluxes from the driving climatic variables was 93.1%, showing the excellent data mining capability of the
ANNs. Total photosynthetic photon flux density was identified as the dominant control of the daytime response,
followed by the diffuse radiation. The vapor pressure deficit was the most important nonradiative control. From the
ANNs, we were also able to deduce and visualize the dependencies and sensitivities of the response to its climatic
controls. With respect to diffuse radiation, the daytime carbon response showed no saturation and the light use
efficiency was three times greater for diffuse compared with direct radiation. However, with less potential radiation
reaching the forest, the overall effect of diffuse radiation was slightly negative. The optimum uptake of carbon
occurred at diffuse fractions between 30% and 40%. By identifying the hierarchy of the climatic controls of the
ecosystem response as well as their multidimensional functional relationships, our inductive approach offers a direct
interface to the data. This provides instant insight in the underlying ecosystem physiology and links the observational
relationships to their representation in the modeling world.
r 2010 Blackwell Publishing Ltd, Global Change Biology, 16, 2737–2749
have no direct physiological meaning, the curve pro-
gression of this function and of its derivative can be
used to derive the physiological characteristics: The
derivative starts off almost constant at the onset of light,
corresponding to a linear initial slope. This initial
slope of 0.050 mmol CO2/mmol photons is the initial
quantum yield a, the maximum light use efficiency
of the ecosystem. The offset of NEP at zero light
is the daytime respiration and has a value of
�2.9mmol CO2 m�2 s�1. Towards high PPFD values,
the derivative approaches zero, denoting the saturation
of the NEP response. The optimum (saturated) NEP at
the highest irradiance of 1750 mmol photons m�2 s�1 is
22.5mmol CO2 m�2 s�1.
The obtained properties, the initial linear increase, the
leveling off to saturation, and the magnitude of physio-
logical parameters, meet the behavior expected for the
light response of a deciduous broadleaf forest (e.g.,
Larcher, 2003). This agreement demonstrates that our
inductive approach is able to extract the underlying
functional relationship directly from the data.
Since the relationships were derived solely from the
observations without a priori assumptions, the agree-
ment also provides an independent corroboration of the
light response hypotheses at the ecosystem level.
Although the need for such corroboration might not
be obvious, an assessment of all commonly used semi-
empirical light response curves showed that some
curves (e.g., the rectangular hyperbola) do not reflect
the required physiological characteristics (A. M. Moffat,
2010). Their incorrect behavior at the edges, right where
the physiological parameters are derived, leads to large
differences in the estimates of the physiological para-
meters, despite a comparable overall performance.
What is the effect of diffuse radiation?
The dependency of the daytime NEP response on the
diffuse light was extracted from the dataset by training
the ANN models with the diffuse and direct PPFD as
inputs (Fig. 6). The simplicity of these ANN models (see
Fig. 6), their high R2 of 89.6% and their low SD of
2.9mmol CO2 m�2 s�1 (see section ‘What are the climatic
controls of the measured NEP flux?’) demonstrate, that
the extracted functional relationship NEPANN(PPFDdif,
PPFDdir) is well suited to display and quantitatively
characterize the response. The numerical partial deri-
vatives reveal a significant difference in the functional
relationship to diffuse radiation compared with direct
radiation (Fig. 7, bottom): The initial quantum yield of
PPFDdif is almost three times higher, its light use
efficiency (magnitude of the derivative) is enhanced
throughout the response, and the response shows no
saturation even for high PPFDdif. These results are in
full agreement with Gu et al. (2002), who found simi-
larly enhanced light use efficiencies and weakened
tendencies to cause canopy saturation for the diffuse
radiation. Since the response to PPFDdif does not satu-
rate, this effect is even more pronounced for high values
of total PPFD.
The high input relevance and enhanced light use
efficiency and sensitivity of PPFDdif compared with
PPFDdir stresses the importance of the diffuse radiation
for the ecosystem response. As the dominant secondary
control of the half-hourly daytime NEP response, it
should be included in ecosystem models trying to
predict the carbon flux at half-hourly or hourly time-
scales (see also Roderick et al., 2001). The hypotheses
needed for the implementation can be based on the
functional relationships derived by the ANNs. The
presented study shows the dependencies of the NEP
response to diffuse radiation at the Hainich forest
(Figs 6 and 7). Herein lies the strength of our inductive
approach: in addition to the detection and quantifica-
tion of the impact of diffuse radiation, it provides an
explicit characterization of the functional relationship.
The enhanced light use efficiency of diffuse light
leads to an increase in the NEP response of the Hainich
forest. However, less of the potential radiation Rpot is
received at the surface for high diffuse fractions due to
the absorption and reflection by clouds and aerosols,
and less light leads to a decrease in the NEP response.
Therefore, the question arises whether the overall effect
is positive or negative?
To provide insight into this aspect, the ANN model
was trained with the following three climatic input
PPFDdir
(μmol photons m–2 s–1)
200 400 600 800 100012001400PPFDdif
(μmol photons m –2 s –1)
100200
300400
500600
700800
NE
P (
μmol
CO
2 m
–2 s
–1)
05
101520253035
Fig. 6 Closed symbolic representation of the ANN modeling
the half-hourly daytime NEP response to the climatic controls
diffuse PPFDdif and direct PPFDdir. The simplicity of the ANN
model is well suited to display and characterize the functional
relationship NEPANN(PPFDdif, PPFDdir). For variable descriptions
see Table 1.
C H A R A C T E R I Z I N G E C O S Y S T E M R E S P O N S E S 2745
r 2010 Blackwell Publishing Ltd, Global Change Biology, 16, 2737–2749
drivers: the potential radiation Rpot and the diffuse
fraction fdif, plus the vapor pressure deficit VPD to
include confounding effects with diffuse radiation.
Since the daytime NEP response is now modeled with
three inputs, the analytical function has too many
dimensions to be directly visualized. For these multi-
dimensional relationships, the partial derivatives are of
great value to examine their behavior. Figure 8 shows
the numerical partial derivatives of the modeled re-
sponse with respect to fdif: At first, the NEP response is
enhanced (positive derivative) until it reaches an opti-
mum (zero derivative) and then the NEP response is
reduced (negative derivative). This means that the net
effect of the diffuse radiation is at an optimum for
diffuse fractions from 28% to 44% at the Hainich site.
This range is close to the optimum of 45% found in a
recent study by Knohl & Baldocchi (2008) for NEP fluxes
at Hainich using a biophysical multilayer model of the
canopy. Both approaches thus depict optima where there
is less diffuse than direct light. But, as one can see in Fig. 8,
the majority of the half-hourly measurements are beyond
the optimum range counteracting the increase in the NEP
response. This leads to a slightly negative average numer-
0 200 400 600 800 1000 1200 1400 1600
–5
0
5
10
15
20
25
30
35
PPFDdir (μmol photons m–2 s–1)
0 200 400 600 800 1000 1200 1400 1600–0.01
0
0.01
0.02
0.03
0.04
0.05
0.06 Numerical partial derivatives
0 100 200 300 400 500 600 700 800
–5
0
5
10
15
20
25
30
35(a) (b)
(c) (d)
PPFDdif (μmol photons m–2 s–1)
0 100 200 300 400 500 600 700 800
∂NE
P/∂
PP
FD
dif
∂NE
P/∂
PP
FD
dif
NE
P (
μmol
CO
2 m
–2 s
–1)
NE
P (
μmol
CO
2 m
–2 s
–1)
–0.01
0
0.01
0.02
0.03
0.04
0.05
0.06
MeasuredModeled
MeasuredModeled
Numerical partial derivatives
Fig. 7 ANN model predictions (black circles) and half-hourly measurements (gray circles) of the daytime NEP response plotted vs. the
two climatic drivers: (a) diffuse PPFDdif and (b) direct PPFDdir. The ANN model captures 89.5% of the variability of the half-hourly
measurements. The numerical partial derivatives correspond to the light use efficiency, which is about three times higher for diffuse
compared with direct radiation. For variable descriptions see Table 1.
fdif
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
∂NE
P/∂
f dif
–30
–20
–10
0
10
20Numerical partial derivatives
fdif
NE
P
Fig. 8 Numerical partial derivatives of the daytime NEP re-
sponse to the diffuse fraction fdif for each half-hourly data point.
The small sketch depicts the functional relationship of NEP to fdif.
The net effect of fdif reaches its optimum between 28% and 44%.
For variable descriptions see Table 1.
2746 A . M . M O F F AT et al.
r 2010 Blackwell Publishing Ltd, Global Change Biology, 16, 2737–2749
ical derivative of �0.5mmol CO2 m�2 s�1 per half-hourly
data point, thus a negative but small overall effect from
the diffuse radiation.
How does the VPD affect the daytime NEP response?
After the diffuse proportion, the next most important
secondary driver is a measure of the air humidity, Rh, or
dryness, VPD, respectively. To investigate the effect, the
half-hourly daytime NEP response was modeled with
PPFDdif, PPFDdir, and VPD as the climatic input drivers.
Adding VPD improved the R2 by 1.1 to 90.7%. The
numerical partial derivatives show the characteristics of
the NEP response with respect to VPD (Fig. 9): first, a
slight increase in NEP (positive derivative), an opti-
mum (zero derivative) around 4 hPa, and, then, a strong
down-regulating effect (negative derivative) with in-
creasing dryness of the air.
ANN models trained on individual months can be
used to investigate whether the sensitivity of the NEP
response to VPD varies over the summer period. To
detect primarily the response to VPD, only early after-
noon hours (11:30am–2pm hours) with stable light
conditions but high changes in VPD were extracted
for the analysis. Figure 10 shows that the negative
sensitivity to VPD peaks in August, the hottest and
driest month. In a study by Schulze (1970) on the carbon
gas exchange of single beech trees in Sollingen, 100 km
north-east of Hainich, the strongest effect due to dry
atmospheric conditions occurred also in August. Thus,
the response to air moisture found at the tree level can
be observed in the carbon flux measurements at the
stand level – analogously to the light response hypoth-
eses in section ‘What are the characteristics of the NEP
response to light?’ above.
Discussion
The strength of a fully inductive approach to rely only
on the information present in the data has its own
specific challenges. The following points need to be
taken into consideration to avoid pitfalls in the inter-
pretation of the ANN models:
Adequate response space: To reach the goal of modeling
the overall response, an annual dataset is appropriate. If
the interest is in the light response curve, the dataset
should span time periods where the ecosystem stays in
the same phenological and ecological states with re-
spect to the photosynthesis response. For example,
including months with leaves off would smear out the
photosynthesis response. Taking summer months but
including months with drought conditions might result
in a light response curve where the saturation has a
drop for the highest irradiances. This will look like
photoinhibition, but will actually be caused by the
superposition of the light response curve with a re-
duced optimum NEP under water stress. As an alter-
native to limiting the dataset to the same state, the
entire dataset can also be used but with an additional
input variable describing the changing condition, for
example a proxy for the water stress. With this, the
ANN is able to distinguish between drought and
VPD (hPa)0 2 4 6 8 10 12 14 16
∂NE
P/∂
VP
D
–0.8
–0.6
–0.4
–0.2
0
0.2Numerical partial derivatives
VPD
NE
P
4
Fig. 9 Numerical partial derivatives of the daytime NEP re-
sponse to the VPD for each half-hourly data point. The small
sketch depicts the functional relationship of NEP to VPD. There
is a negative, down-regulating effect on NEP for high values of
VPD. For variable descriptions see Table 1.
–20
–10
0
10
20
30
40
MonthJun Jul Aug Sep
Sen
sitiv
ities
(P
aD)
PPFDdirPPFDdifVPD
Fig. 10 The positive and negative sensitivities of the daytime
NEP response to PPFDdir, PPFDdif, and VPD during early after-
noon hours, modeled separately for each month. The sensitivity
of NEP to VPD is most negative in the hottest and driest month
of August. PPFD, photosynthetic photon flux density. For vari-
able descriptions see Table 1.
C H A R A C T E R I Z I N G E C O S Y S T E M R E S P O N S E S 2747
r 2010 Blackwell Publishing Ltd, Global Change Biology, 16, 2737–2749
nondrought conditions and will map the responses
accordingly.
Artifacts: To avoid modeling artifacts present in a
specific dataset or nonobvious changes in the phenolo-
gical or ecological states, the identified relationships
should prove to be robust for different time periods,
e.g., individual months vs. the whole summer period or
summer months of different years.
Missing relevant driver: The ANN can show a good
model performance though a physiologically relevant
driver was missing. It means that the effect of the
missing driver was mapped onto the included drivers
through cross-correlations. Usually, the found relation-
ships are then not independent, and, therefore, not
robust. The mapped functional relationships will
change as soon as another driver with some cross-
correlation or the actual missing driver is added. If
adding drivers does not change the main properties of
the numerical partial derivatives, this is a good sign for
robustness.
Confounding factors: The hidden biases or indirect
effects caused by confounding phenological, ecological,
or climatic factors are much harder to detect. To rule out
known confounding factors, these can be added to the
data used for training as observed or theoretical drivers.
This way, their impact is included in the modeled
response, provided that the confounding factors are
not correlated to any of the other input drivers, that
they are well defined over the whole range, and that
they do not add too many degrees of freedom to the
network. An alternative solution is to perform marginal
sampling, where the dataset is grouped into subsets for
certain ranges of the confounding factor. The ANN
models are then trained on each of the subgroups.
Robust relationships will hold true for all of the sub-
groups.
Ecophysiological plausibility: Since the ANN models are
constrained solely by the data, some prior knowledge of
ecosystem physiology is required to ensure a proper
choice of the representative dataset and to judge the
plausibility of the results under anticipation of con-
founding factors. Only then does this inductive ap-
proach produce meaningful results.
The systematic approach presented in this paper has
been implemented as a toolbox. Once the dataset is
configured, the setup of different ANN routines is
simple and highly flexible. The ANN training proce-
dure is fully automated and takes only a few minutes
on a typical desktop computer. Although the analysis
tools have been tailored to extract information from
large datasets, the ANNs also appear to work with
small amounts of data. We have tested their ability to
model the light response curve for single days with as
few as 10 data points; the physiological quantities
estimated from the ANNs were consistent with the
estimates of a prescribed semi-empirical equation.
Conclusions and outlook
As demonstrated for the daytime carbon fluxes of the
Hainich forest, the inductive approach presented here
can be used to characterize the functional dependencies
solely from the half-hourly eddy covariance measure-
ments, without prior assumptions about the shape of
the response. The extracted purely empirical light re-
sponse curve provides an independent corroboration of
current plant physiological hypotheses. Estimates of the
random measurement error from the ANN model re-
siduals were lower than previous estimates. This could
be attributed to the inclusion of the proportion
of diffuse radiation, which was the second most im-
portant input variable to explain the daytime carbon
fluxes after total radiation. This key finding stresses the
importance of the diffuse radiation for the short-term
light response.
The functional dependency of the daytime response
to diffuse radiation showed no saturation, and it would
be of great interest to investigate the generality of this
relationship for other types of ecosystems. The net effect
of the diffuse radiation was determined by modeling
with two theoretical drivers – the potential radiation at
the top of the atmosphere and the diffuse fraction. Since
the light conditions at the Hainich forest were mainly
beyond the optimum diffuse fraction of 30–40%, the
overall effect of the diffuse light was on average slightly
negative for the 3 years 2000–2002.
The most important nonradiative drivers in the hier-
archy of the climatic controls were the vapor pressure
deficit, followed by air temperature and wind direction.
Multidimensional relationships in the data were further
characterized using numerical partial derivatives. For
example, the vapor pressure deficit showed a strong
down-regulating effect with increasing dryness of the
air.
Our inductive approach offers the potential to serve as
a new key instrument for the explanation of observa-
tions, for instant testing and independent validation of
hypotheses, and for the detection of new findings. The
worldwide network of eddy flux towers in FLUXNET
offers the opportunity to investigate ecosystems span-
ning from the arctic to the savannah. For managed
ecosystems, the ability to include theoretical variables,
such as a fuzzy variable to describe the harvesting event,
will be of benefit. The theoretical variables also offer the
possibility to include time lag effects and determine their
relevance for the ecosystem response. The approach is
not limited to the net carbon flux, but can be extended to
2748 A . M . M O F F AT et al.
r 2010 Blackwell Publishing Ltd, Global Change Biology, 16, 2737–2749
the partitioned GPP/RE carbon flux, the energy and
momentum flux, or other greenhouse gases.
By supplying the link between the observations and
their representation in the modeling world, the pre-
sented inductive approach is complementary to
the classic hypothetic-deductive approach. This will
further the understanding of the underlying processes
as well as promote their implementation in models,
which, in turn, will help the prediction of the effects
of changing environmental conditions on the terrestrial
biosphere.
Since purely empirical models adapt to the particular
conditions of the ecosystem as present in the training
dataset, they can also be used to identify differences in
the response over time. If changes in the climate lead to
changes in the ecosystem response to its climatic con-
trols, the presented methodology would be able to
detect these directly in the measurements.
Acknowledgements
We would like to thank the following people for their contribu-tions to this paper: Olaf Kolle for introducing Antje Moffat to theHainich flux site, Corinna Rebmann for sharing her broadknowledge about the measurements, Andrew Richardson, JohnGrace, Alessandro Cescatti, and Detlef Schulze for in-depthdiscussions of the results, Petra Werner and Andrew Jarvis fortheir comments on the general scope, Gill McLean for proof-reading of this manuscript, Bryce Moffat for proofreading of thevarious drafts, and the ROOT team at CERN for providing theirextensive C 11 programming framework. Furthermore, wewould like to thank the editor Ivan Janssens and the threeanonymous reviewers for their thorough comments and con-structive criticism, which greatly helped to improve this paper.
The datasets used in this paper were obtained from theCarboEurope-IP database (EU project GOCE-CT-2003-505572).The site PIs Alexander Knohl, Corinna Rebmann, and WernerKutsch are thanked for making the Hainich data available to thedatabase.
References
Abramowitz G (2005) Towards a benchmark for land surface models. Geophysical
Research Letters, 32, L22702.
Baldocchi DD (2008) Breathing of the terrestrial biosphere: lessons learned from a
global network of carbon dioxide flux measurement systems. Australian Journal of
Botany, 56, 1–26.
Bishop CM (1995) Neural Networks for Pattern Recognition. Oxford University Press,