A Methodology of Forest Monitoring from Hyperspectral ......A Methodology of Forest Monitoring from Hyperspectral Images with Sparse Regularization A Methodology of Forest Monitoring

Copyright (C) Mitsubishi Research Institute, Inc.

MITSUBISHI RESEARCH INSTITUTE, INC.

A Methodology of Forest Monitoring fromHyperspectral Images with Sparse RegularizationA Methodology of Forest Monitoring fromHyperspectral Images with Sparse Regularization

Jul. 26, 2011

Keigo YOSHIDA, Takashi OHKI, Masahiro TERABE, Hozuma SEKINE (MRI)Tomomi TAKEDA (ERSDAC)

IGARSS 2011, VancouverTU4.T08.1: Hyperspectral Monitoring of the Environment I

2Copyright (C) Mitsubishi Research Institute, Inc.

Forest Monitoringby Remote SensingSolutionSolution

High Cost ofForest Survey& MonitoringProblemProblem

Decision making for Forest Management Disaster prevention planning Finding forest of poor growth GHG Credit estimation Resource management

Accurate Info.of Present ForestNeedsNeeds

Forest conditions change dynamically

Conduct periodical field survey

estimate 600-1200 USD / Km2 / year※ in case of Japan; 1 USD = 81 yen

No need for field survey all over the area

Highly-frequent observation

Introduction：Forest Monitoring by Remote Sensing


Introduction：Highly-developed Sensing Tech. & Challenges

Modeling is not easy for several sensor dataof different physical property

Statistical or Data-driven approach is needed

Sensor fusion

Reflect diverse propertyof targets

Hard to bring out potential of big sensor data

[e.g.] NDVI use just 2 bands, or Red and IR& have to select optimal band combinations

Complexity of prediction model increases,resulting in poor prediction performance

Dimension is high but sample size is smalldue to limitation of field survey

This causes model overfitting

Hyperspectral sensor

provide detailed optical info.on forest physiognomy

growth situation character of tree species

etc.

ChallengesSensing Tech.


Research Outline

Utilize rich data by a machine learning technique (sparse regularization)and achieve accurate, informative, & less costly forest monitoring

Utilize rich data by a machine learning technique (sparse regularization)and achieve accurate, informative, & less costly forest monitoring

Output DataOutput Data

MethodologyMethodology

Remote Sensing Data Fusion（CASI-3 hyperspectral images + SAR signals）

Field Survey ResultsInput DataInput Data

Sparse Regularization

（Sparse Discriminant Analysis、LASSO regression）

Predicted Stand Factors of each subcompartmentsfor Forest Management

（Species, Canopy cover, Timber volume, Tree height）

Prediction Models

Subcompartment： a general spatial unit for forest monitoring


Target Site

Town-owned forest in Shimokawa,Hokkaido, Japan

Approx. 90 % of town is covered by forest Utilize local conifer resources for business Environmental model city for low-carbon society

Shimokawa


Remote Sensing Data

Species and Canopy-cover prediction: CASI-3 hyperspectrumVolume and Height prediction: Data fusion (CASI-3 + PALSAR)

Species and Canopy-cover prediction: CASI-3 hyperspectrumVolume and Height prediction: Data fusion (CASI-3 + PALSAR)

Remote Sensing Hyperspectral sensor (optical property) Airborne hyperspectral imager CASI-3 84 bands from 400 to 1060 nm (wavelenght res. : 8 nm) Original spatial res.: 2.0 m

→ Resolution is decreased to 30m to simulate satellite-based operation

PALSAR (shape or volume property) Microwave backscattering resizedorg. image


Field Survey

During aircraft obs., conduct field survey to collect data for modeling & validationDuring aircraft obs., conduct field survey to collect data for modeling & validation

Field Survey: Place 25-sq-m quadrats Inventory study for trees whose DBH > 5cm & and record tree species Canopy cover measurement with whole-sky camera Height measurement for sampled 10 trees


What is Sparse Regularization ? Why Do I Use it ?

“Sparse” means the model has a low # of nonzero parameters

■ Optimal Band Selection

Ineffective parameters will be removed from prediction modelautomatically by solving convex optimization problem

■ Higher Generalization Capability

simple model with smaller # of bands achieves lessoverfitting; better prediction performance

■ More Interpretable Model


norm（penalty）

Sparse Regularization：Theoretical Overview

Add penalty to loss function to obtain model with small num. of variablesAdd penalty to loss function to obtain model with small num. of variables

LASSO (R. Tibshirani et al., 96)

Optimal Scoring (T. Hastie et al., 94)

Perform Fisher’s linear discriminant analysis as regression by score

convert categorical variables for class membership into quantitative

Optimize and weight vector simultaneously

Loss function (LS)


Intuitive Explanation of Sparse Regularization

To reduce empirical errors,W moves away from 0,then penalty increases

＜penalty＞

L1-norm: attraction force to 0 is const.-> Small values in W tend to be 0

L2-norm: attraction force is small around 0-> Small values in W remain

L1-regularization L2-regularization

Coefficients

<attraction force to 0>


Experimental Flow

2. Prediction for Subcompartments

1. Modeling

LASSO Regression

Forest Pixel Extraction

Subcompartment Prediction

Hyperspectral Reflectance(ave. w/in each quadrat)

Hyperspectral Reflectance(ave. w/in each quadrat)

Hyperspectral Reflectance(30m x 30m pixels)

Hyperspectral Reflectance(30m x 30m pixels) Semisupervised LDA

PredictionPerformancePrediction

Performance

RegressionModel

RegressionModel

Forest PixelsForest Pixels

PredictedForest Condition

PredictedForest Condition

Averaged Reflectancew/in each Subcomp.

Averaged Reflectancew/in each Subcomp. Obtained Model

Sparse LDA ClassificationModel

ClassificationModel

PALSAR SignalsPALSAR Signals


Variety in a Subcompartment

（Subcompartment）

There is a large variety inside a subcompartment

Non-forest area• Deforestation area• Canopy gaps

Invading woods other than planted species• they’re not recorded on forest register

There is a large variety inside a subcompartment

Non-forest area• Deforestation area• Canopy gaps

Invading woods other than planted species• they’re not recorded on forest register


Dataset: Target category: 4 species

Larix kaempferi, Abies sachalinensis, Picea glehnii, other Broadleaf

Source Hyperspectral reflectance by CASI-3

84 bands, 400 – 1060 nm

9 signals given by PALSAR data polarimetries (HH/HV/VV) Three scattering components proposed by Freeman

i.e. surface scattering, double bounce scattering, volume scattering Averaged alpha angle Polarimetric entropy Anisotropy

Quadrats:

Experimental Setting (1/2)


Experimental Setting (2/2)

Validation： 100 times iteration of 5-fold cross valiadtion

Comparison： Methodology Classification Spectral Angular Mapper ; SAM Regularized Discriminant Analysis; RDA （L2-norm regularization） ν-Support Vector Machines; SVM （w/ Linear and RBF kernel）

RegressionPartial Least Squares; PLS

Input data pseudo multi-spectral imageASTER image simulated from CASI-3 data 3 bands 760 - 860Band 3

630 – 690Band 2

520 – 600Band 1

wavelength range (nm)#


■ Prediction accuracy (Mean of 100 times 5-fold CV)

0.0 %6.2 %1.8 %92.0 %Larix

0.8 %4.3 %16.3 %78.8 %0.1 %15.1 %84.4 %0.4 %

Abies0.0 %0.3 %76.4 %23.3 %

PredictedUpper：HyperspecLower： Multi-spec

Actual

BroadleafPiceaAbiesLarix

67.1 %0.0 %0.2 %32.7 %86.4 %6.0 %0.0 %7.6 %

Braodleaf

0.0 %77.6 %0.0 %22.4 %2.8 %70.8 %6.2 %20.2 %

Picea

SDA achieves highest performance （87％ prediction accuracy）Results by using hyperspectral images outperform pseudo-images

SDA achieves highest performance （87％ prediction accuracy）Results by using hyperspectral images outperform pseudo-images

■ Confusion Matrix（SDA, Mean of 100 times 5-fold CV)

84.0

SVM(Lin.)

83.0 %82.2 %69.0 %87.0 %CV-Accuracy

SVM(RBF)RDASAMSDAMethod

Result: Quadrat Species Classification


Result: Quadrat Stand Factor Regression

LASSO with hyperspectral data provides best performance for all stand factorLASSO with hyperspectral data provides best performance for all stand factor

RMSE(10-fold CV)

Canopy Cover Timber Volume Tree Height

poor results

Prediction(10-fold CV)


Grass & Bamboo

Bare ground

Cloud

Clouds

deforestation

Sparse forest

Validation with 39 pixels selected manually Forest vs. non-forest: 100 % Overall Accuracy: 97.4 % （38/39）

※ Dots indicate top-leftpoint of each pixel

Result: Forest Extraction by Semi-supervised LDA


Invading broadleaf trees were found

Abies sachalinensis

Broadleaf

Confirm consistency between actual and estimated species by field surveyConfirm consistency between actual and estimated species by field survey

30 m resized pixels

Original CASI-3

Picea glehnii

Larix kaempferi

Abies sachalinensis

Natural Broadleaf

Legend

▲ Field Survey Points

※ Dots indicate top-leftpoint of each pixel

Result: Tree Species Composition in Subcompartments


Predicted Tree Species Distribution Map

Legend

registry

Predicted

Young Picea glehnii

Invasion of broadleaf

to larch plantation

mixed = below 70% dominancy


Predicted Maps

Canopy Cover Timber Volume Tree Height


Validation and Evaluation – Canopy Cover Map

Line-thinnednon-thinned

Line-thinnednon-thinned

Confirm prediction reflects forest conditions rightly by field surveyConfirm prediction reflects forest conditions rightly by field survey

Canopy Density


Larix trees with relatively higherstand age were observed

Young Picea glehnii& Broadleaf forest of low height

Validation and Evaluation – Tree Height Map


Conclusions

Present forest monitoring method from hyperspectral and SAR image

Integrate diverse data source with different property of targets To overcome high-dimension-small-sample-size problem resulting in over-fitting,

sparse regularization techs (LASSO & Sparse Discriminant Analysis) are adopted

3 advantages of sparse regularization

Generalization, Interpretability, Optimal Band Selection

Experimental simulations of satellite-based operation prove effectiveness

Advantage in prediction accuracy to several supervised methods Advantage of hyperspectral data to multispectral Prediction results reflect existing forest conditions rightly


Many thanks for your kind attention.

Questions ?


Supplementary Slides


How sparse ?

-1.07626.01 nm4-4.20520.95 nm3-1.36441.81 nm26.32418.28 nm1

-21.931055.39 nm8-8.60962.66 nm76.71923.70 nm6

21.45766.86 nm5

75.98Intecept-

Coef.Parameter#

-31.731061.01 nm11

-26.741055.39 nm10

40.041011.70 nm8

-25.78671.51 nm4

22.12640.38 nm3

-22.81560.97 nm2

12.51473.31 nm1

42.921038.75 nm9

27.13877.98 nm7

-65.16835.50 nm6

18.61706.10 nm5

11.68Intercept-

5.14VV14

-1.83HV13

-2.40HH12

Coef.Parameter#

2.87Vol. Scattering Coef.4-18.67513.13 nm3-58.71441.81 nm2-0.59426.17 nm1

26.10VV5198.50Intercept-

Coef.Parameter#

Canopy Cover Model: 8 / 84 params. = 9.5%

Timber Vol. Model: 5 / 93 params. = 5.4%

Tree Height Model: 14/ 93 params. = 15.0%


Frequently Selected Parameters for Species Classification

Distinct bands for confier/broadleaf properties & feature of species are selected.

Around 450nm: absorption peak of G-type lignin richly contained in conifer wood

Around 520nm: absorption peak of S-type lignin richly contained in broadleaf wood

Red edge

0

10

20

30

40

50

60

70

80

90

100

0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80

バンド番号

出現率

[%]

：freq. selectedbandsFrequency used

in prediction model

by 100 bootstrapping

Reflectance

spectrum

Band No.

Picea

Larix

AbiesBroadleafR

efle

ctan

ceA

ppea

rance

Rat

io[%

]

Wavelength [nm]

A Methodology of Forest Monitoring from Hyperspectral ......A Methodology of Forest Monitoring from Hyperspectral Images with Sparse Regularization A Methodology of Forest Monitoring

Documents