Top Banner
Models of Architecture Nokia Bell Labs 2018 Maxime Pelcat INSA Rennes, IETR, Institut Pascal This work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 732105: CERBERO.
53

Models of Architecture

Feb 19, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Models of Architecture

Models of Architecture

Nokia Bell Labs 2018

Maxime Pelcat INSA Rennes, IETR, Institut Pascal

This work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 732105: CERBERO.

Page 2: Models of Architecture

INSA Rennes – IETR VAADER • INSA Rennes

• IETR VAADER

• Institut Pascal

2

Page 3: Models of Architecture

• Abstracting computational architecture to –Predict performance

–Support current hardware evolutions

Models of Architecture

Page 4: Models of Architecture

• Hardware Architectures are becoming –More complex

–More heterogeneous

–More High Performance embedded Computing (HPeC) • Embedded deep learning, near-sensor computing, fog

computing, edge computing, many-cores, etc.

• Real-time constraints, stream processing applications

Motivation: architecture evolution

Page 5: Models of Architecture

• Let’s look at ARM-based HPeC –Let us consider 4 heterogeneous solutions

• ARM = control path + some of the data path

• in red: data path

Motivation: HPeC architectures

Multi-ARM FPGA Multi-ARM

GPGPU Multi-ARM DSP Multi-ARM

Page 6: Models of Architecture

• Let’s look at ARM-based HPeC

Motivation: HPeC architectures

Multi-ARM FPGA Multi-ARM

GPGPU Multi-ARM DSP Multi-ARM

Page 7: Models of Architecture

• ARM big.LITTLE: Samsung Exynos 5422

Motivation: HPeC architectures

A15 A15

A15 A15

SCU ACE

A7 A7

A7 A7

SCU

2MB 0.5MB

2GB DDR (PoP)

Easy to program Linux SMP Thread migration 12Gflops <10W

Low energy cores

High Performance

cores

2GHz 1.4GHz

Page 8: Models of Architecture

• Let’s look at ARM-based HPeC

Motivation: HPeC architectures

Multi-ARM FPGA Multi-ARM

GPGPU Multi-ARM DSP Multi-ARM

Page 9: Models of Architecture

• Multi-ARM + GPGPU: Nvidia Jetson TX1 module

Motivation: HPeC architectures

A57 A57

A57 A57

SCU 4GB

external DDR on module

Less easy to program Linux SMP + CUDA/OpenCL

32 cores /warp

Control path

1.6GHz

256-core Maxwell GPGPU

Data path

H.264 4K 60Hz

Page 10: Models of Architecture

• Let’s look at ARM-based HPeC

Motivation: HPeC architectures

Multi-ARM FPGA Multi-ARM

GPGPU Multi-ARM DSP Multi-ARM

Page 11: Models of Architecture

• Multi-ARM + DSP: Texas Instruments Keystone II TCI6638K2K Motivation: HPeC architectures

A15 A15

A15 A15

SCU 4MB

Difficult to program (well) Linux SMP + Open Event Machine 160 Gflops <15W

Control path

1.4GHz

Data path

Teranet

C66

1MB

C66

1MB

C66

1MB

C66

1MB

C66

1MB

C66

1MB

C66

1MB

C66

1MB

FFTC

6MB MSMC

1.2GHz

Page 12: Models of Architecture

• Let’s look at ARM-based HPeC

Motivation: HPeC architectures

Multi-ARM FPGA Multi-ARM

GPGPU Multi-ARM DSP Multi-ARM

Page 13: Models of Architecture

• Multi-ARM + FPGA: Xilinx Zynq Ultrascale + Motivation: HPeC architectures

A53 A53

A53 A53

SCU 1MB

More difficult to program (well) Linux SMP + HLS or HDL

Control path

1.5GHz

Data path GPU

FPGA

R5 R5

Switch fabric

Not GPGPU

Up to 4MB 1MFF 0.5MLUT

600MHz

Page 14: Models of Architecture

Motivation: HPeC architectures

• Current trends – FPGAs are gaining importance: what about flops?

– Adding video/image accelerators • Video Compression: H.264/AVC, H.265/HEVC, etc.

• AI: For tensor applications reach 1Tops/W

–RISC-V as an open HW competitor to ARM

Page 15: Models of Architecture

• Towards more complexity –More cores, hierarchies of clusters

–Heteronegeneity, Interconnect complexity

• Reminds intra-core modifications in XXth

Motivation: architecture evolution

ALU clk clk clk + + + + Ld × + Str

SIMD VLIW

Page 16: Models of Architecture

• But there are some differences between intra-core and inter-core parallelism –At coarse grain, PEs communicate asynchronously – There is no (or less) centralized processing decision – There is no performance portability (nothing equivalent

to C-to-VLIW compilers)

• How can/should we manage this HW complexity? –Can we predict performance at design time? How?

Motivation: architecture evolution

Page 17: Models of Architecture

System Objectives

Maxime Pelcat 17

T°C

Energy

Reliability

Memory

Unit Cost $

Security Maintenance Cost

$

Performance

Peak Power

Page 18: Models of Architecture

System Prototype

System Design: Y-Chart

Maxime Pelcat 18

Architecture

Design

Algorithm Application

Redesign

Red

esign

Page 19: Models of Architecture

Model of Architecture (MoA)

conform to

Model-Based Design

19

KPI

Architecture Model

KPI Evaluation

Algorithm Algorithm Model

Red

esign

Maxime Pelcat

Model of Computation(MoC)

conforms to

Red

esign

Page 20: Models of Architecture

On MoC Side: Many Results • #EdwardALee, #ProgrammingParadigms • Discrete Event MoCs • Finite State Machines imperative languages • Functional MoCs • Petri Nets • Dataflow MoCs SDF, CSDF, IDF, IBSDF, PSDF,

SPDF, PiSDF, etc.

20 Maxime Pelcat

PREESM

Page 21: Models of Architecture

And they are not all here… Dataflow MoCs Case

Feature SDF ADF IBSDF DSSF PSDF PiSDF SADF SPDF DPN KPN

Expressivity Low Med. Turing complete

Hierarchical X X X X

Compositional X X X

Reconfigurable X X X X X X

Statically schedulable X X X X

Decidable X X X X (X) (X) X (X)

Variable rates X X X X X X X

Non-determinism X X X

SDF: Synchronous Dataflow ADF: Affine Dataflow IBSDF: Interface-Based Dataflow DSSF: Deterministic SDF with Shared Fifos PSDF: Parameterized SDF

PiSDF Parameterized and Interfaced SDF SADF: Scenario-Aware Dataflow SPDF: Schedulable Parametric Dataflow DPN: Dataflow Process Network KPN: Kahn Process Network

Page 22: Models of Architecture

But Still a Lot to Do • on Real-Time Multicore systems especially

• Usually, RT application specification = –Multiple tasks sharing resources

–Activation periods or triggering events

• Objective = keeping resources busy

22 Maxime Pelcat

T1 T2

T3

Page 23: Models of Architecture

MoCs are not sufficient

23

Energy

Energy Evaluation

Algorithm Algorithm Model

Maxime Pelcat

Model of Computation(MoC)

conforms to

Page 24: Models of Architecture

Models of Architecture

Maxime Pelcat 24

Model of Architecture (MoA)

conform to

KPI

Architecture Model

KPI Evaluation

Algorithm Algorithm Model

Red

esign

Red

esign

Page 25: Models of Architecture

Problem: Predict System Quality • How to predict a system « quality » ? –Efficiently (simple procedure)

–Early (from abstract models)

–Accurately (with a good fidelity)

–With reproducibility (same models = same prediction)

25 Maxime Pelcat

Page 26: Models of Architecture

Model of Architecture • Definition –Model of a system Non-Functional Property

–Application-independent

–Abstract

–Reproducible

26 Maxime Pelcat

Pelcat, M; Mercat, A; Desnos, K; Maggiani, L; Liu, Y; Heulot, J; Nezan, J-F; Hamidouche, W; Ménard, D; Bhattacharyya, S (2017) "Reproducible Evaluation of System Efficiency with a Model of Architecture: From Theory to Practice", IEEE TCAD.

Page 27: Models of Architecture

Model of Architecture

Maxime Pelcat 27

Model Reproducible

Application-independent

Abstract

AADL

MCA SHIM

UML MARTE /

AAA

CHARMED

S-LAM

MAPS

LSLA

Page 28: Models of Architecture

NFP = MoA( ) activity( )

MoA depends on MoC

Model of Architecture

28

One and always the same quality evaluation

Model H conforms to MoA Model G conforms to MoC

Activity

MoC( )

Maxime Pelcat

application Performance

Power

Energy Memory T°C

Reliability

Security Cost

Page 29: Models of Architecture

Model of Architecture

29

KPI

MoA MoC

Act

Maxime Pelcat

Page 30: Models of Architecture

LSLA: First MoA • LSLA = Linear System-Level Architecture

Model

• Motivated by the additive nature of energy consumption

Maxime Pelcat 30

Page 31: Models of Architecture

System Objectives

Maxime Pelcat 31

T°C

Energy

Reliability

Memory

Unit Cost $

Security Maintenance Cost

$

Performance

Peak Power

Page 32: Models of Architecture

Energy/Power Define Architecture

0 20W 20kW 20MW

Need a dissipator

2W 7W

Need a fan

Embedded system Dedicated system

or conventional system

HPC

HPeC

influence

Page 33: Models of Architecture

LSLA Model of Architecture

33

Task1 signal

signal Task2 Task3

Task4

Task5

1

1

1

1

1

1

1

PE1 PE2 CN

10x+1

2x+0 3x+0

16+12+22=50

Maxime Pelcat

token

quantum

Compositional

Page 34: Models of Architecture

LSLA Model of Architecture

34

Task1 signal

signal Task2 Task3

Task4

Task5

1

1

1

1

1

1

1

PE1 PE2 CN

10x+1

2x+0 3x+0

16+12+22=50

Maxime Pelcat

SDF: Model of Computation

Activity

LSLA: Model of Architecture

Page 35: Models of Architecture

LSLA MoA for Energy Prediction • 86% of fidelity on octo-core ARM

35 Maxime Pelcat

Page 36: Models of Architecture

LSLA MoA for Energy Prediction • The model is learnt from energy

measurements

36

PE PE

CN

PE PE

PE PE

CN

PE PE

CN

Maxime Pelcat

Page 37: Models of Architecture

LSLA MoA for Energy Prediction • The model is learnt from energy

measurements

37

PE PE

CN

α 1.5W 1.5W

PE PE 1.5W 1.5W

PE PE

CN

γ

0.3W 0.3W

PE PE 0.3W 0.3w

CN

β

Maxime Pelcat

Page 38: Models of Architecture

LSLA: MoA, not MoHW • LSLA models HW + communication

libraries + scheduler + Oss +…

• LSLA models the service the platform offers to the applications

• Top-down approach –Learning parameters from experiments

Maxime Pelcat 38

Page 39: Models of Architecture

System Objectives

Maxime Pelcat 39

T°C

Energy

Reliability

Memory

Unit Cost $

Security Maintenance Cost

$

Latency

Peak Power

Page 40: Models of Architecture

MoAs: Limits of LSLA • Energy Linear model OK • Latency

• Latency does not have an additive nature

40

Maxime Pelcat

Task1 Task2 1

1

1

Task1

Task2

1

1

1

1

Latency = sum

Latency = max

!

Page 41: Models of Architecture

Activity & MoA for Latency

41

Task1 signal

signal Task2 Task3

Task4

Task5

1

1

1

1

1

1

1

SDF

a)

b)

Maxime Pelcat

c)

Page 42: Models of Architecture

Activity & MoA for Latency

42

PE1 PE2 CN

10x+1

2x+0 3x+0

Σ 12+12+11=35

Σ 8+6+11=25

max(35,25)=35

a)

b)

Maxime Pelcat MaxPlus

Page 43: Models of Architecture

c)

Activity & MoA for Latency

43

PE1 PE2 CN

10x+1

2x+0 3x+0

Σ 24

Maxime Pelcat

Page 44: Models of Architecture

System Prototype

Accuracy? No, Fidelity!

Maxime Pelcat 44

Architecture

Design

Algorithm Application

Redesign

Red

esign

Page 45: Models of Architecture

Current Activities

Maxime Pelcat 45

Page 46: Models of Architecture

• Cross-layer Design of Reconfigurable Cyber-Physical Systems

H2020 CERBERO

Maxime Pelcat 46

Page 47: Models of Architecture

CERBERO System Adaptation

Maxime Pelcat 47

Page 48: Models of Architecture

H2020 Cerbero Toolchain

Maxime Pelcat 48

VT

AOW DynAA

PAPI

SPIDER JADE

ARTICO3 MDC

Intermediate Representation

C++

System Model

Application / Architecture

Runtime Support

Low-Level Implementation (Hardware Abstraction)

PREESM

MECA End-user interaction

Page 49: Models of Architecture

GdR SOC2 • Groupement de recherche SOC2 – Systems on a Chip, Connected Systems

– Industrial partner club

Maxime Pelcat 49

Page 50: Models of Architecture

GdR SOC2

Maxime Pelcat 50

Page 51: Models of Architecture

SAMOS XIX • 19th edition of SAMOS Conference

• July 7-11, submission in March

Maxime Pelcat 51

Page 52: Models of Architecture

Takeaway Message • MoAs can early predict performance/quality –Especially for HPeC systems

• MoAs are not HW Models – They model HW + protocols + OS + …

• MoAs are built/learnt top-down – They can and should be simple

• The need for MoAs may rise –Due to Fog/Edge Computing complexity

Maxime Pelcat 52

KPI

MoA MoC

Act

Page 53: Models of Architecture

Questions?

Maxime Pelcat 53

Pelcat, M; Mercat, A; Desnos, K; Maggiani, L; Liu, Y; Heulot, J; Nezan, J-F; Hamidouche, W; Ménard, D; Bhattacharyya, S (2017) "Reproducible Evaluation of System Efficiency with a Model of Architecture: From Theory to Practice", IEEE TCAD.

www.cerbero-h2020.eu

http://preesm.org