Top Banner
1965 1976 1987 1998 2009 Time (month) 0 40 80 120 160 Monthly avereged streamflow (cfs) 1965 1976 1987 1998 2009 Time (month) -10 0 10 20 Avereged Deficit of Outflow (cfs) Pyeongchang River Basin Probabilistic Assessment of Drought Using Hidden Markov Model in Han River Basin Yeijun Park, Ji-Young Yoo, Hyun-Han Kwon, Tae-Woong Kim Introduction Research Background & Objectives Generally, drought analysis is carried out using various drought indices, especially, precipitation and streamflow are widely used as indicators for detecting meteorological and hydrological drought. Although many studies have proposed various drought indices with pre- defined threshold for assessing drought, but there are some problems which needs further assessment. Small differences in precipitation or other data could have significant effect on determining the condition of drought. Standardized Precipitation Index (SPI) does not consider lack of rainfall duration, therefore, long-term drought can not be represented effectively. Lack of defining some standard criteria for drought condition. Due to variable nature of drought characteristics, it is hard to figure out from the beginning to the end of drought event and to clearly identify the damaged areas which show that the quantitative assessment of drought has many uncertainties. In this study, probabilistic assessment was performed for drought using Hidden Markov Model based drought index (HMDI) instead of using pre- defined threshold to utilize inherent characteristics and consider uncertainties. Contributions Yeijun Park : Dept. of Civil and Environmental Engineering, Hanyang University, Republic of Korea. Ji-Young Yoo : Dept. of Civil Engineering, Chonbuk National University, Republic of Korea Hyun-Han Kwon : Dept. of Civil Engineering, Chonbuk National University, Republic of Korea Tae-Woong Kim : Corresponding Author, Dept. of Civil and Environmental Engineering, Hanyang University, Republic of Korea (E-mail: [email protected]). Acknowledgement R218/EGU2014- 3273 Hidden Markov Model (HMM) A HMM is a statistical Markov model in which the system being modeled is assumed to be a Markov process with unobserved hidden states. In a HMM, the hidden state is not directly visible, but output is dependent on the state which is visible. In this study, hidden states are drought and observable symbols are streamflow. The hidden states of data X t consist of a number of random variables K(2 - ∞), influence of previous state (X t-1 ) and the rest of state are independent. The hidden state follows the first Markov model with transition between states only rely on previous state. Methodology Data and Study Area - continued Results and Discussions Conclusions Drought Analysis Using HMM Before applying HMM, the determination of the number of hidden states and the adjustment of observed data are required so that can be well described by the model. To determine the number of hidden states, we used Log-likelihood (LLH), Akaine Information Criterion (AIC), Bayesian Information Criterion (BIC) to evaluate the degree of suitability. To prevent model from overfitting, the optimal number of hidden states was determined when the log-likelihood converged and the AIC and BIC have the values. Log-Likelihood function was converged when the number of states are 7 to 11. In our case, the AIC and BIC have the smallest values at 8. In this study, to consider the inherent characteristics of drought, the probabilistic assessment of hydrological drought using Hidden Markov Model (HMM) was performed. When using an existing drought index SSI, only one value can be used as a criterion to determine the drought condition. However, the HMDI can classify the drought condition considering inherent characteristics embedded in observations and show probability of each drought condition at a particular point in time. In addition, through the comparison of actual drought events near the basin, the HMDI showed consistent results very close to the actual drought and preserved better hydrological persistence. The proposed method of probabilistic analysis of drought using Hidden Markov Model has better performance as compared to the conventional method using predefined threshold and would be helpful making new criteria of drought. Results and Discussions - continued Pyeongchang State AIC BIC 7 3631.49 3460.94 8 3627.42 3446.98 9 3865.66 3448.22 10 3989.17 3482.28 11 4101.81 3496.95 2 4 6 8 10 12 3 5 7 9 11 Hidden State -2000 -1900 -1800 -1700 -1600 Log-likelihood Pyeongchang River Basin (6 month window Using the determined number 8, the characteristics of hidden states were distinguished based on the amount of excess of deficiency. Naming the hidden state according to characteristics of data, Hidden Markov Drought Index (HMDI) estimated probability density functions of hidden state. For example, HMDI_1 is the first state representing the largest deficit. Displaying HMDI_1 to HMDI_4 shows the have lack of streamflow compared to average. 2 4 6 8 1 3 5 7 Hidden State -40 -20 0 20 40 Average Deficit of Outflow(m 3 /s) Pyeongchang River Basin (6 month window Evaluating Standardized Streamflow Index To verify the method, we compared the HMM by using the estimated posterior probability of each hidden state (HMDI) and Standardized Streamflow Index (SSI) which is used as one of pre-defined drought index. To determine classification criterion of SSI, the criteria of SPI was used which is one of the representative Standardized Index(SI) and then compared with HMDI. Hidden state State Range Condition HMDI_1 SSI_1 -2.0 ≥ Z Extreme drought HMDI_2 SSI_2 -1.5 ≥ Z > - 2.0 Severe drought HMDI_3 SSI_3 -1.0 ≥ Z > - 1.5 Moderate drought HMDI_4 SSI_4 1.0 ≥ Z > - 1.0 Near normal Comparison of HMDI and SSI The estimated result of SSI at each point only shows one drought condition but HMDI shows various condition based on posterior probability of hidden state. To validate our study, the comparison of HMDI and SSI was carried out by using the real drought events in Han River basin. During the comparing period, all HMDIs estimated evenly, but SSI estimated only one state or discontinuously. 1965 1976 1987 1998 2009 Time(Month) 0 0.2 0.4 0.6 0.8 1 Probability of Hidden State -40 -20 0 20 40 Averege Deficit of Outflow(cfs) SSI Pyeongchang River Basin (6 month window) Drought Event (Year) Hidden State Probabil ity State Probabil ity 1967 - 1968 HMDI_2 0.23 SSI_3 0.10 HMDI_3 0.31 1973 HMDI_1 0.39 SSI_2 0.42 HMDI_2 0.10 1976 - 1978 HMDI_1 0.12 SSI_2 0.19 HMDI_2 0.20 HMDI_3 0.28 SSI_3 0.19 1981 - 1982 HMDI_1 0.17 SSI_2 0.25 HMDI_2 0.10 HMDI_3 0.10 2001 HMDI_2 0.34 SSI_3 0.33 HMDI_3 0.13 2008 - 2009 HMDI_1 0.21 SSI_1 0.46 HMDI_2 0.10 SSI_2 0.00 HMDI_3 0.47 SSI_3 0.13 O 1 O 2 O t-1 O t X 1 X 2 X t-1 X t ... O t (t=1, …, N) : Observed state X t (t=1, … , N) : Hidden state Using the probabilistic relationship between observable state and hidden state, the HMM has five components as below: Set of hidden states : a set of states described by the Markov process. Set of observable state : a set of externally visible transition states. Initial probability vector : the probability of hidden state at specific time t=1. Transition probability matrix : the probability from previous hidden state to current state. Observation probability matrix : the probability of each observable state. <Fig. 1 Graphical representation of HMM> <Table1 Classification of drought> <Fig. 2 Study area and monthly time series of streamflow and average deficit> <Fig. 3 LLH, AIC, BIC values and information of Hidden states> Data and Study Area Data Used in the Study The Pyeongchang River basin was selected to analyze the effect of drought independently, which is located on the most upstream of Han River basin. The dataset used in this study is the average monthly streamflow for the record period 1966-2009, provided by Water Management Information System (WAMIS). To consider monthly characteristics, 3, 6 and 12 months were taken with changing the data for differences between the average of each month to distinguish surplus and deficit of streamflow. <Fig.4 Graphical representation of HMDI and SSI for the study period> <Table 2 Comparison of HMDI and SSI> ※ Z is a standardized normal score
1

Pyeongchang River Basin

Feb 22, 2016

Download

Documents

s_now

R218/EGU2014-3273. Probabilistic Assessment of Drought Using Hidden Markov Model in Han River Basin. Pyeongchang River Basin. Yeijun Park, Ji -Young Yoo , Hyun-Han Kwon, Tae- Woong Kim. Introduction. Data and Study Area - continued. Results and Discussions - continued. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Pyeongchang  River Basin

1965 1976 1987 1998 2009Time (month)

0

40

80

120

160

Mon

thly

ave

rege

d st

ream

flow

(cfs

)

1965 1976 1987 1998 2009Time (month)

-10

0

10

20

Aver

eged

Def

icit

of O

utflo

w (c

fs)

Pyeongchang River Basin

Probabilistic Assessment of Drought Using Hidden Markov Model in Han River Basin

Yeijun Park, Ji-Young Yoo, Hyun-Han Kwon, Tae-Woong Kim

IntroductionResearch Background & Objectives

Generally, drought analysis is carried out using various drought indices, especially, precipitation and stream-flow are widely used as indicators for detecting meteorological and hydrological drought.

Although many studies have proposed various drought indices with pre-defined threshold for assessing drought, but there are some problems which needs further assessment.

① Small differences in precipitation or other data could have significant effect on determining the condi-tion of drought.

② Standardized Precipitation Index (SPI) does not consider lack of rainfall duration, therefore, long-term drought can not be represented effectively.

③ Lack of defining some standard criteria for drought condition.④ Due to variable nature of drought characteristics, it is hard to figure out from the beginning to the end

of drought event and to clearly identify the damaged areas which show that the quantitative assess-ment of drought has many uncertainties.

In this study, probabilistic assessment was performed for drought using Hidden Markov Model based drought index (HMDI) instead of using pre-defined threshold to utilize inherent characteristics and consider uncertainties.

Contributions • Yeijun Park: Dept. of Civil and Environmental Engineering, Hanyang University, Republic of Korea.• Ji-Young Yoo : Dept. of Civil Engineering, Chonbuk National University, Republic of Korea• Hyun-Han Kwon : Dept. of Civil Engineering, Chonbuk National University, Republic of Korea• Tae-Woong Kim: Corresponding Author, Dept. of Civil and Environmental Engineering, Hanyang Univer-

sity, Republic of Korea (E-mail: [email protected]).

Acknowledgement • This work was supported by National Research Foundation of Korea - Grant funded by the Korean Govern-

ment (NRF-2013R1A1A2013160)

R218/EGU2014-3273

Hidden Markov Model (HMM)

A HMM is a statistical Markov model in which the system being modeled is assumed to be a Markov process with unobserved hidden states.

In a HMM, the hidden state is not directly visible, but output is dependent on the state which is visible. In this study, hidden states are drought and observable symbols are streamflow.

The hidden states of data Xt consist of a number of random variables K(2 - ∞), influence of previous state (Xt-1) and the rest of state are independent. The hidden state follows the first Markov model with transition between states only rely on previous state.

Methodology

Data and Study Area - continued

Results and Discussions

Conclusions

Drought Analysis Using HMM

Before applying HMM, the determination of the number of hidden states and the adjustment of observed data are required so that can be well described by the model.

To determine the number of hidden states, we used Log-likelihood (LLH), Akaine Information Criterion (AIC), Bayesian Information Criterion (BIC) to evaluate the degree of suitability.

To prevent model from overfitting, the optimal number of hidden states was determined when the log-likeli-hood converged and the AIC and BIC have the values.

Log-Likelihood function was converged when the number of states are 7 to 11. In our case, the AIC and BIC have the smallest values at 8.

In this study, to consider the inherent characteristics of drought, the probabilistic assessment of hydrological drought using Hidden Markov Model (HMM) was performed.

When using an existing drought index SSI, only one value can be used as a criterion to determine the drought condition. However, the HMDI can classify the drought condition considering inherent characteristics em-bedded in observations and show probability of each drought condition at a particular point in time.

In addition, through the comparison of actual drought events near the basin, the HMDI showed consistent re-sults very close to the actual drought and preserved better hydrological persistence.

The proposed method of probabilistic analysis of drought using Hidden Markov Model has better perfor-mance as compared to the conventional method using predefined threshold and would be helpful making new criteria of drought.

Results and Discussions - continued

Pyeongchang

State AIC BIC

7 3631.49 3460.94

8 3627.42 3446.98

9 3865.66 3448.22

10 3989.17 3482.28

11 4101.81 3496.952 4 6 8 10 123 5 7 9 11

Hidden State

-2000

-1900

-1800

-1700

-1600

Log-

likel

ihoo

d

Pyeongchang River Basin (6 month window)

Using the determined number 8, the characteristics of hidden states were distinguished based on the amount of excess of deficiency.

Naming the hidden state according to characteristics of data, Hidden Markov Drought Index (HMDI) esti-mated probability density functions of hidden state. For example, HMDI_1 is the first state representing the largest deficit.

Displaying HMDI_1 to HMDI_4 shows the have lack of streamflow compared to average.

2 4 6 81 3 5 7Hidden State

-40

-20

0

20

40

Aver

age

Defic

it of

Out

flow

(m3 /

s)

Pyeongchang River Basin (6 month window)

Evaluating Standardized Streamflow Index

To verify the method, we compared the HMM by using the estimated posterior probability of each hidden state (HMDI) and Standardized Streamflow Index (SSI) which is used as one of pre-defined drought index.

To determine classification criterion of SSI, the criteria of SPI was used which is one of the representative Standardized Index(SI) and then compared with HMDI.

Hidden state State Range Condition

HMDI_1 SSI_1 -2.0 ≥ Z Extreme drought

HMDI_2 SSI_2 -1.5 ≥ Z > -2.0 Severe drought

HMDI_3 SSI_3 -1.0 ≥ Z > -1.5 Moderate drought

HMDI_4 SSI_4 1.0 ≥ Z > -1.0 Near normal

Comparison of HMDI and SSI

The estimated result of SSI at each point only shows one drought condition but HMDI shows various condi-tion based on posterior probability of hidden state.

To validate our study, the comparison of HMDI and SSI was carried out by using the real drought events in Han River basin.

During the comparing period, all HMDIs estimated evenly, but SSI estimated only one state or discontinu-ously.

1965 1976 1987 1998 2009Time(Month)

0

0.2

0.4

0.6

0.8

1

Prob

abili

ty o

f Hid

den

Stat

e

-40

-20

0

20

40

Aver

ege

Defic

it of

Out

flow

(cfs

)

SSI

Pyeongchang River Basin (6 month window) Drought Event(Year)

Hidden State Probability State Probability

1967 - 1968HMDI_2 0.23

SSI_3 0.10HMDI_3 0.31

1973HMDI_1 0.39

SSI_2 0.42HMDI_2 0.10

1976 - 1978HMDI_1 0.12

SSI_2 0.19HMDI_2 0.20HMDI_3 0.28 SSI_3 0.19

1981 - 1982HMDI_1 0.17

SSI_2 0.25HMDI_2 0.10HMDI_3 0.10

2001HMDI_2 0.34

SSI_3 0.33HMDI_3 0.13

2008 - 2009HMDI_1 0.21 SSI_1 0.46HMDI_2 0.10 SSI_2 0.00HMDI_3 0.47 SSI_3 0.13

O1 O2 Ot-1 Ot

X1 X2 Xt-1 Xt...• Ot (t=1, …, N) : Observed state

• Xt (t=1, … , N) : Hidden state

Using the probabilistic relationship between observable state and hidden state, the HMM has five compo-nents as below:

① Set of hidden states : a set of states described by the Markov process.② Set of observable state : a set of externally visible transition states.③ Initial probability vector : the probability of hidden state at specific time t=1.④ Transition probability matrix : the probability from previous hidden state to current state.⑤ Observation probability matrix : the probability of each observable state.

<Fig. 1 Graphical representation of HMM>

<Table1 Classification of drought>

<Fig. 2 Study area and monthly time series of streamflow and average deficit>

<Fig. 3 LLH, AIC, BIC values and information of Hidden states>

Data and Study AreaData Used in the Study

The Pyeongchang River basin was selected to analyze the effect of drought independently, which is located on the most upstream of Han River basin.

The dataset used in this study is the average monthly streamflow for the record period 1966-2009, provided by Water Management Information System (WAMIS).

To consider monthly characteristics, 3, 6 and 12 months were taken with changing the data for differences between the average of each month to distinguish surplus and deficit of streamflow.

<Fig.4 Graphical representation of HMDI and SSI for the study period>

<Table 2 Comparison of HMDI and SSI>

※ Z is a standardized normal score