Top Banner
113 S N F EURO UZZY 14. Neuro-Fuzzy Systems Building a fuzzy system requires prior knowledge (fuzzy rules, fuzzy sets) manual tuning: time consuming and error-prone Therefore: Support this process by learning learning fuzzy rules (structure learning) learning fuzzy set (parameter learning) Approaches from Neural Networks can be used
46
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: ppt

113 SNFEURO

UZZY

14. Neuro-Fuzzy SystemsBuilding a fuzzy system requires

prior knowledge (fuzzy rules, fuzzy sets)

manual tuning: time consuming and error-prone

Therefore: Support this process by learning

learning fuzzy rules (structure learning)

learning fuzzy set (parameter learning)

Approaches from Neural Networks can be used

Page 2: ppt

114 SNFEURO

UZZY

Learning Fuzzy Sets: Problems in Control

Reinforcement learning must be used to compute an error value(note: the correct output is unknown)

After an error was computed, any fuzzy set learning procedures can be used

Example: GARIC (Berenji/Kedhkar 1992)online approximation to gradient-descent

Example: NEFCON (Nauck/Kruse 1993)online heuristic fuzzy set learning using arule-based fuzzy error measure

Page 3: ppt

115 SNFEURO

UZZY

Page 4: ppt

116 SNFEURO

UZZY

Example: Prognosis of the Daily Proportional Changes of the DAX at the Frankfurter Stock Exchange (Siemens)

Database: time series from 1986 - 1997

DAX Composite DAXGerman 3 month interest rates Return GermanyMorgan Stanley index Germany Dow Jones industrial indexDM / US-$ US treasury bondsGold price Nikkei index JapanMorgan Stanley index Europe Price earning ratio

Page 5: ppt

117 SNFEURO

UZZY

Fuzzy Rules in FinanceTrend RuleIF DAX = decreasing AND US-$ = decreasingTHEN DAX prediction = decreaseWITH high certaintyTurning Point RuleIF DAX = decreasing AND US-$ = increasingTHEN DAX prediction = increaseWITH low certaintyDelay RuleIF DAX = stable AND US-$ = decreasingTHEN DAX prediction = decreaseWITH very high certaintyIn generalIF x1 is 1 AND x2 is 2THEN y =WITH weight k

Page 6: ppt

118 SNFEURO

UZZY

Classical Probabilistic Expert Opinion Pooling Method

DM analyzes each source (human expert, data + forecasting model) in terms of (1) Statistical accuracy,and (2) Informativeness by asking the source to asses quantities (quantile assessment)

DM obtains a “weight” for each source

DM “eliminates” bad sources

DM determines the weighted sum of source outputs

Determination of “Return of Invest”

Page 7: ppt

119 SNFEURO

UZZY

E experts, R quantiles for N quantitieseach expert has to asses R·N values

stat. Accuracy:

information score:

weight for expert e:

outputt=

roi =

E

e

etew

1output

T

ttty

1

DMoutputsign

Ee eeee

eeee

cIccIcw

1 idid

N

i riri

rR

rroiRi vv

ppvvN

I1 1,,

11

11,1, lnln1

R

i

iiR p

sspsIpsINC0

2 ln, ,,21

Page 8: ppt

120 SNFEURO

UZZY

Formal Analysis

Sources of informationR1 rule set given by expert 1R2 rule set given by expert 2D data set (time series)

Operator schemafuse (R1, R2)fuse two rule setsinduce(D) induce a rule set from Drevise(R, D) revise a rule set R by D

Page 9: ppt

121 SNFEURO

UZZY

Formal Analysis

Strategies:fuse(fuse (R1, R2), induce(D))revise(fuse(R1, R2), D)fuse(revise(R1, D), revise(R2, D))

Technique: Neuro-Fuzzy SystemsNauck, Klawonn, Kruse, Foundations of Neuro-Fuzzy Systems, Wiley 97SENN (commercial neural network environment, Siemens)

Page 10: ppt

122 SNFEURO

UZZY

From Rules to Neural Networks

1. Evaluation of membership degrees

2. Evaluation of rules (rule activity)

3. Accumulation of rule inputs and normalization

NF: IRn IR, r

l r

j jj

lll

xkxkwx

11

lD

j ijsc xx

1)(

,l: IRn [0,1]r,

Page 11: ppt

123 SNFEURO

UZZY

Neuro-Fuzzy Architecture

Page 12: ppt

124 SNFEURO

UZZY

The Semantics-Preserving Learning Algorithm

Reduction of the dimension of the weight space1. Membership functions of different inputs share their parameters,

e.g.

2. Membership functions of the same input variable are not allowed to pass each other, they must keep their original order, e.g.

Benefits: the optimized rule base can still be interpretedthe number of free parameters is reduced

stablecdax

stabledax

increasingstabledecreasing

Page 13: ppt

125 SNFEURO

UZZY

Return-on-Investment Curves of the Different Models

Validation data from March 01, 1994 until April 1997

Page 14: ppt

126 SNFEURO

UZZY

Neuro-Fuzzy Systems in Data Analysis

Neuro-Fuzzy System:System of linguistic rules (fuzzy rules).Not rules in a logical sense, but function approximation.Fuzzy rule = vague prototype / sample.

Neuro-Fuzzy-System:Adding a learning algorithm inspired by neural networks.Feature: local adaptation of parameters.

Page 15: ppt

127 SNFEURO

UZZY

A Neuro-Fuzzy System

is a fuzzy system trained by heuristic learning techniques derived from neuralnetworks

can be viewed as a 3-layer neural network with fuzzy weights and specialactivation functions

is always interpretable as a fuzzy system

uses constraint learning procedures

is a function approximator (classifier, controller)

Page 16: ppt

128 SNFEURO

UZZY

Learning Fuzzy Rules

Cluster-oriented approaches=> find clusters in data, each cluster is a rule

Hyperbox-oriented approaches=> find clusters in the form of hyperboxes

Structure-oriented approaches=> used predefined fuzzy sets to structure the

data space, pick rules from grid cells

Page 17: ppt

129 SNFEURO

UZZY

Hyperbox-Oriented Rule Learning

y

x

Search for hyperboxesin the data space

Create fuzzy rules byprojecting thehyperboxes

Fuzzy rules and fuzzysets are created at thesame time

Usually very fast

Page 18: ppt

130 SNFEURO

UZZY

Hyperbox-Oriented Rule Learning

Detect hyperboxes in the data, example: XOR functionAdvantage over fuzzy cluster anlysis:

No loss of information when hyperboxes are represented as fuzzy rulesNot all variables need to be used, don‘t care variables can be discovered

Disadvantage: each fuzzy rules uses individual fuzzy sets, i.e. the rule base iscomplex.

y

x

y

x

y

x

y

x

Page 19: ppt

131 SNFEURO

UZZY

Structure-Oriented Rule Learning

small medium large

x

ysmall

medium

large Provide initial fuzzy sets for

all variables.

The data space is partitionedby a fuzzy grid

Detect all grid cells thatcontain data (approach byWang/Mendel 1992)

Compute best consequentsand select best rules(extension by Nauck/Kruse 1995, NEFCLASS model)

Page 20: ppt

132 SNFEURO

UZZY

Structure-Oriented Rule Learning

Simple: Rule base available after two cycles through the training data1. Cycle: discover all antecedents2. Cycle: determine best consequents

Missing values can be handledNumeric and symbolic attributes can be processed at the same time (mixedfuzzy rules)

Advantage: All rules share the same fuzzy setsDisadvantage: Fuzzy sets must be given

Page 21: ppt

133 SNFEURO

UZZY

Learning Fuzzy Sets

Gradient descent proceduresonly applicable, if differentiation is possible, e.g. for Sugeno-type fuzzysystems.

Special heuristic procedures that do not use gradient information.

The learning algorithms are based on the idea of backpropagation.

Page 22: ppt

134 SNFEURO

UZZY

Learning Fuzzy Sets: Constraints

Mandatory constraints:Fuzzy sets must stay normal and convexFuzzy sets must not exchange their relative positions (they mustnot „pass“ each other)Fuzzy sets must always overlap

Optional constraintsFuzzy sets must stay symmetricDegrees of membership must add up to 1.0

The learning algorithm must enforce these constraints.

Page 23: ppt

135 SNFEURO

UZZY

Example: Medical Diagnosis

Results from patients tested for breast cancer (Wisconsin Breast Cancer Data).

Decision support: Do the data indicate a malignant or a benign case?

A surgeon must be able to check the classification for plausibility.

We are looking for a simple and interpretable classifier: knowledge discovery.

Page 24: ppt

136 SNFEURO

UZZY

Example: WBC Data Set

699 cases (16 cases have missing values).

2 classes: benign (458), malignant (241).

9 attributes with values from {1, ... , 10}(ordinal scale, but usually interpreted as a numerical scale).

Experiment: x3 and x6 are interpreted as nominal attributes.

x3 and x6 are usually seen as „important“ attributes.

Page 25: ppt

137 SNFEURO

UZZY

Applying NEFCLASS-J

Tool for developing Neuro-Fuzzy Classifiers

Written in JAVA

Free version for research available

Project started at Neuro-Fuzzy Group of University of Magdeburg, Germany

Page 26: ppt

138 SNFEURO

UZZY

NEFCLASS: Neuro-Fuzzy Classifier

Input variables (attributes)

Fuzzy rules

Output variables (class labels)

Fuzzy sets (antecedents)

Unweighted connections

Page 27: ppt

139 SNFEURO

UZZY

NEFCLASS: Features

Automatic induction of a fuzzy rule base from data

Training of several forms of fuzzy sets

Processing of numeric and symbolic attributes

Treatment of missing values (no imputation)

Automatic pruning strategies

Fusion of expert knowledge and knowledge obtained

from data

Page 28: ppt

140 SNFEURO

UZZY

Representation of Fuzzy Rules

x y

R1 R2

c1 c2

large

smalllarge

Example: 2 Rules

R1: if x is large and y is small, then class is c1.

R2: if x is large and y is large, then class is c2.

The connections x R1 and x R2

are linked.

The fuzzy set large is a shared weight.

That means the term large has always thesame meaning in both rules.

Page 29: ppt

141 SNFEURO

UZZY

1. Training Step: Initialisation

small medium large

x

y

small

medium

large

Specify initial fuzzy partitions for all input variables

x y

c1 c2

Page 30: ppt

142 SNFEURO

UZZY

2. Training Step: Rule Base

Algorithm:

for (all patterns p) dofind antecedent A,such that A( p) is maximal;if (A L) then add A to L;

end;

for (all antecedents A L) dofind best consequent C for A;create rule base candidate R = (A,C);Determine the performance of R;Add R to B;

end;Select a rule base from B;

Variations:

Fuzzy rule bases can also be created by using prior knowledge, fuzzy cluster analysis, fuzzy decision trees, genetic algorithms, ...

Page 31: ppt

143 SNFEURO

UZZY

Selection of a Rule Base

• Order rules by performance.

• Either selectthe best r rules orthe best r/m rules per class.

• r is either given or is determined automatically such that all patterns are covered.otherwise.

conclass if

with

1

),()(0

,111

rp

N

ppr

cr

Rc

RN

P

x

x

:Rule a of ePerformanc

Page 32: ppt

144 SNFEURO

UZZY

Rule Base Induction

NEFCLASS uses a modified Wang-Mendel procedure

x y

c1 c2

R1 R3R2

small medium large

x

y

small

medium

large

Page 33: ppt

145 SNFEURO

UZZY

Computing the Error Signal

10,1 )( withcon rRrrr EE

Rule Error:

output) actual : output, correct :( and

with

oted

otdddE

dda

jjj2

max)(,1,0:

,)(1)sgn(

Fuzzy Error ( jth output):

x y

c1 c2

R1 R3R2

Error Signal

Page 34: ppt

146 SNFEURO

UZZY

3. Training Step: Fuzzy Sets

bbcfcbabfa

bxacfbx

xf

EE

E

E

)sgn()(1

0)(otherwise if

Example:triangularmembershipfunction.

Parameterupdates for anantecedentfuzzy set.

otherwise

if

if

0

],[

),[

)(],1,0[: ,,,, cbxbcxc

baxabax

xcbacba

Page 35: ppt

147 SNFEURO

UZZY

Training of Fuzzy Sets

small medium large

x

y

small

medium

large

0.85enlarge

0.30

reduce

x

(x)

0.55

initial fuzzy set

Heuristics: a fuzzy set is moved away from x (towards x)and its support is reduced (enlarged), in order toreduce (enlarge) the degree of membership of x.

Page 36: ppt

148 SNFEURO

UZZY

Training of Fuzzy Sets

Variations:

• Adaptive learning rate

• Online-/BatchLearning

• optimistic learning(n step look ahead)

Observing the error on a validation set

Algorithm:

repeatfor (all patterns) do

accumulate parameter updates;accumulate error;

end;modify parameters;

until (no change in error);

localminimum

Page 37: ppt

149 SNFEURO

UZZY

Constraints for Training Fuzzy Sets

• Valid parameter values

• Non-empty intersection of adjacent fuzzy sets

• Keep relative positions

• Maintain symmetry

• Complete coverage (degrees of membership add up to 1 for each element)

1

2

3

Correcting a partition after modifying the parameters

Page 38: ppt

150 SNFEURO

UZZY

4. Training Step: PruningGoal: remove variables, rules and fuzzy sets, in order toimprove interpretability and generalisation.

Page 39: ppt

151 SNFEURO

UZZY

PruningAlgorithm:

repeatselect pruning method;

repeatexecute pruning step;train fuzzy sets;

if (no improvement)then undo step;

until (no improvement);

until (no further method);

Pruning Methods:

1. Remove variables(use correlations, information gain etc.)

2. Remove rules(use rule performance)

3. Remove terms(use degree of fulfilment)

4. Remove fuzzy sets(use fuzziness)

Page 40: ppt

152 SNFEURO

UZZY

WBC Learning Result: Fuzzy Rules

R1: if uniformity of cell size is small and bare nuclei is fuzzy0 then benign

R2: if uniformity of cell size is large then malignant

Page 41: ppt

153 SNFEURO

UZZY

WBC Learning Result: Classification Performance

Predicted Classmalign benign not

classifiedsum

malign 228 (32.62%) 13 (1.86%) 0 (0%) 241 (34.99%)benign 15 (2.15%) 443 (63.38%) 0 (0%) 458 (65.01%)sum 243 (34.76%) 456 (65.24%) 0 (0%) 699 (100.00%)

NEFCLASS-J: 95.42%

Discriminant Analysis: 96.05%

C 4.5: 95.10%

NEFCLASS-J (numeric): 94.14%

Multilayer Perceptron: 94.82%

C 4.5 Rules: 95.40%

Estimated Performance on Unseen Data (Cross Validation)

Page 42: ppt

154 SNFEURO

UZZY

WBC Learning Result: Fuzzy Sets

0.01.0 2.8 4.6 6.4 8.2 10.0

0.5

1.0

0.01.0 2.8 4.6 6.4 8.2 10.0

0.5

1.0

sm lguniformity of cell size

bare nuclei

Page 43: ppt

155 SNFEURO

UZZY

NEFCLASS-J

Page 44: ppt

156 SNFEURO

UZZY

Resources

Detlef Nauck, Frank Klawonn & Rudolf Kruse:

Foundations of Neuro-Fuzzy SystemsWiley, Chichester, 1997, ISBN: 0-471-97151-0

Neuro-Fuzzy Software (NEFCLASS, NEFCON, NEFPROX):

http://www.neuro-fuzzy.de

Beta-Version of NEFCLASS-J:

http://www.neuro-fuzzy.de/nefclass/nefclassj

Page 45: ppt

157 SNFEURO

UZZY

Download NEFCLASS-J

Download the free version of NEFCLASS-J athttp://fuzzy.cs.uni-magdeburg.de

Page 46: ppt

158 SNFEURO

UZZY

ConclusionsNeuro-Fuzzy-Systems can be useful for knowledge discovery.

Interpretability enables plausibility checks and improves acceptance.

(Neuro-)Fuzzy systems exploit tolerance for sub-optimal solutions.

Neuro-fuzzy learning algorithms must observe constraints in order not to jeopardise the semantics of the model.

Not an automatic model creator, the user must work with the tool.

Simple learning techniques support explorative data analysis.