Top Banner
[email protected] Ensemble methods for algorithmic trading
22

Ensemble methods for modeling financial data

Aug 08, 2015

Download

Engineering

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Ensemble methods for modeling financial data

[email protected]

Ensemble methods for algorithmic trading

Page 2: Ensemble methods for modeling financial data

Background

● What is algorithmic trading?

● What is the relevance of machine learning?

● Where does the current topic fit in ?

Page 3: Ensemble methods for modeling financial data

Trading

Traders trade via open outcrying

Close to the conventional notion of “trading”

Slow and inefficient

Manual Algorithmic

People like you and I design algorithms to predict like human traders

Computer algorithms trade with each other

Blazingly fast with high trade volumes

Page 4: Ensemble methods for modeling financial data

Machine learning

How does an algorithm make money?

Page 5: Ensemble methods for modeling financial data

Let’s make it more interesting !

Page 6: Ensemble methods for modeling financial data

Linear regression!

(Why is the relationship linear ?)

(Any more problems ?)

Standard ML technique

Page 7: Ensemble methods for modeling financial data

How about this graph ?

Page 8: Ensemble methods for modeling financial data

Trees to the rescue !

Decision trees are very popular in classification

Can do regression as well !

Simple and efficient

Very intuitive

Page 9: Ensemble methods for modeling financial data

Walk through

Page 10: Ensemble methods for modeling financial data

Hang on

p

Phew!

Page 11: Ensemble methods for modeling financial data

Which brings us to the discussion of the day

What is an ensemble method?

How is it relevant to finance?

Two very common ( but remarkably powerful) ensemble methods

Page 12: Ensemble methods for modeling financial data

Ensemble

Wikipedia says:

“In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from any of the constituent learning algorithms”

Begin with a weak learner ( Tree in our case )

Train several of them

Combine their output ( Bagging and Boosting )

Page 13: Ensemble methods for modeling financial data
Page 14: Ensemble methods for modeling financial data

BaggingHow do you naturally expand the idea of a tree? ( Hint : think real world )

Page 15: Ensemble methods for modeling financial data

Random forest

● Training○ Sample a subset of the input

( Bootstrapping )○ Build a regression tree on top of

it○ Repeat till “convergence”

● Prediction○ Pass the input to each tree in

the forest○ Take a weighted combination

Page 16: Ensemble methods for modeling financial data
Page 17: Ensemble methods for modeling financial data

In random forests, the trees are built independently

Possibility of redundancy

Is there a way to not isolate our training subsets?

Potential issues?

Page 18: Ensemble methods for modeling financial data
Page 19: Ensemble methods for modeling financial data

Boosting

● Training○ Sample a subset of the input○ Build a tree on top of it○ Obtain an error statistic on the WHOLE

input○ Use this statistic to generate the next

input subset

Median heavy training instead of mean heavy training

Page 20: Ensemble methods for modeling financial data
Page 21: Ensemble methods for modeling financial data

Why use this in finance ?

i.i.d assumption goes for a toss

Noise filtering is a challenge

Sophisticated methods often fail ( and are miserably slow)

We need to rely on simple methods and yet guarantee high accuracy