Reinforcement Learning for Railway Scheduling · Reinforcement Learning for Railway Scheduling Overcoming Data Sparseness through Simulations Dr. Erik Nygren Research and Innovation

Reinforcement Learning

for Railway Scheduling

Overcoming Data Sparseness through Simulations

Dr. Erik Nygren Research and Innovation Lab Swiss Federal Railway

© SBB • Solution Center Infrastructure • Research & Innovation • October 2017 2

Swiss Railway Network. A Complex Dynamical System.

Influencing Factors Facts

1

10,000

1,210,000

Weather People

Infrastructure Events

12,997

Most dense network 33,000

210,000 t

1t

31,266

3,230 km

KM

Energy


Train Dispatching and Scheduling. Challenges in the Worlds Densest Train Network.

RCS

Train runs

Production

Timetable

Evolution of Dispatching. Towards Full Automation.

Today

Future

Past



Automated Train Dispatching. Current Challenges.

Big Data

Big Data: Not enough relevant information

Automated

Dispatching

Learning

Measure-

ments Action


Reinforcement Learning for Railway Dispatching. Overcoming Data Sparseness through Simulations.

WIP

Measure- ments Action

Validation

Data generation

Learning

Learning

Action

Artificial Data

Big Data

High Performance Simulation


High Performance Simulations. Unleashing the Power of Parallel Computing.

DGX-1 High Performance Simulations

Time speedup Scenario variations Influencing factor analysis


Preliminary Results. Visualization of Simulation Results.

2h realtime

500x

5000x Simulation speed

Visualization speed


Reinforcement Learning. Playing the Dispatcher Game.

Action

Reward


Artificial Data

DGX-1 Automated Dispatcher


Machine Learning on Artificial Data. Generating, Evaluating and Optimizing Train Dispatching.

Automated Dispatcher

Reinforcement Learning

Tree Search

Evolutionary Strategies

Building Blocks Variable Topologies

1

2

3

Mixed Integer Linear Programming

Genetic Algorithm


Current State... And Future Expected Reward.


DGX-1 Automated Dispatcher

Fully Automated Process

Train runs

Production

Timetable

Take Home.

Big Data Big Information


Take Home.

AI

Model

Big Data Big Information

Dr. Erik Nygren

[email protected]

AI Researcher

Research Team

mailto:[email protected]

mailto:[email protected]


Reward Function. How to Reward an Artificial Dispatcher.

Reward

Reinforcement Learning for Railway Scheduling · Reinforcement Learning for Railway Scheduling Overcoming Data Sparseness through Simulations Dr. Erik Nygren Research and Innovation

Documents