Top Banner
Asymmetry Aware Scheduling Algorithms for Asymmetric Processors Nagesh Lakshminarayana Sushma Rao Hyesoon Kim Computer Science Georgia Institute of Technology
30

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Jan 06, 2016

Download

Documents

yul

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors. Nagesh Lakshminarayana Sushma Rao Hyesoon Kim Computer Science Georgia Institute of Technology. Outline. Background and Problem Application characteristics on AMP/SMP LJFPF Policy CJFPF Policy Conclusion. PE B. PE B. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Nagesh Lakshminarayana Sushma Rao Hyesoon Kim

Computer Science Georgia Institute of Technology

Page 2: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Outline

• Background and Problem

• Application characteristics on AMP/SMP

• LJFPF Policy

• CJFPF Policy

• Conclusion

Page 3: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Heterogeneous Architectures

• A particularly interesting class of parallel machines is Heterogeneous Architecture:– Multiple types of Processing Elements (PEs)

available on the same machine

PEA

PEB

PEB

PEB

PEB

Inte

rcon

nect

Page 4: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Heterogeneous Architectures

• Heterogeneous architectures are becoming very common:

Multicore CPU + GPU

IBM Cell processor

Special accelerator

Fast core

Slow core

Slow core

Slow core

Slow core

Focus of this talk

Asymmetric Processors

Fast core

Page 5: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Scheduling Problem: Multiple applications

Fast core

Slow core

Slow core

Slow core

Slow core

Scalable applications

Non-scalable applications

Fast core

Fast Core

Slow Core

Page 6: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Scheduling Problem: Multi-threaded application

Fast core

Slow core

Slow core

Slow core

Slow core

Fast core

Page 7: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Problem

How to schedule multi-threaded applications on Asymmetric Multiprocessors (AMP)?

Page 8: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Outline

• Background and Problem

• Application characteristics on AMP/SMP

• LJFPF Policy

• CJFPF Policy

• Conclusion

Page 9: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Experimental Methodology

• Use a 1.87GHz two-socket Quad-core machine to measure the performance

• Use SpeedStep technology to emulate an AMP

All-slow (SMP) All 8 processors are running at 1.6 GHz

One-fast (AMP) 1 processors are running at 1.87 GHz

7 processors are running at 1.6GHz

Half-half (AMP) 4 processors are running at 1.87GHz

4 processors are running at 1.6GHz

All-fast (SMP) All processors are running at 1.87GHz

Page 10: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Performance Results on AMP/SMP

0.8

0.85

0.9

0.95

1

1.05

No

rma

lize

d e

xe

cu

tio

n t

ime

All-slow

One-fast

Half-half

All-fast

Page 11: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Fast core

Slow core

Slow core

Slow core

Slow core

Fast core

Slow-Limited Applications

barrier

Page 12: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Middle-perf Benchmarks

barrier

Similar to a slow-limited benchmark but sequential section is much longer

Page 13: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Unstable Benchmarks

barrier

barrier

Lots of barriers Asymmetric workloads

Page 14: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

PARSEC Benchmarks

Application Locks Barriers Cond. Variables

AMP performance category

BlackSholes 39 8 0.000 slow-limited

Bodytrack 6824702 111160 0.003 unstable

Canneal 34 0 0.003 middle-perf

dedup 10002625 0 0.009 unstable

ferret 1422579 0 0.014 slow-limited

facesim 7384488 0 0.03 middle-perf

Fluidanimate 1153407308 31998 0.02 unstable

Freqmine 39 0 0.12 middle-perf

streamcluster 1379 633174 0.013 middle-perf

swaptions 9 0 0.00 slow-limited

vips 11 0 0.0049 unstable

x264 207692 0 13793 middle-perf

Page 15: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Outline

• Background and Problem

• Applications on AMP/SMP

• LJFPF Policy

• CJFPF Policy

• Conclusion

Page 16: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

LJFPF Policy

• Longest Job to a Fast Processor First

barrier

Fast core

Fast core Slow core

Slow core

Page 17: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

How Does the Scheduler Know

• Length of work?

• Current mechanism: application sends the information

• On-going work: Prediction mechanism

Page 18: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Evaluation

• Matrix Multiplication

Sequential version

Parallel versionSymmetric workload

Parallel versionAsymmetric workload

Page 19: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Asymmetric Workload (Matrix Multiplication)

0.9

0.95

1

1.05

1.1

1.15

1.2

300-300

310-290

320-280

330-270

340-260

350-250

360-240

No

rma

lize

d e

xecu

tion

tim

e

All-fast

Half-half(LJFPF)

Half-half (RR)

All-slow

Page 20: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Real Application

• ITK (Medical image processing tool kit)– Open source but a real application

Page 21: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Evaluation: MultiRegistration

• Kernel loop has 50 iterations

50 % 8 ≠0

• Divide 50 iterations into 7, 7, 7, 7, 6, 6, 5, 5

Page 22: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

0.92

0.94

0.96

0.98

1

1.02

1.04

All-f

ast

Ha

lf-h

alf

(LJF

PF

)

Ha

lf-h

alf

(RR

)

All-s

low

No

rma

lize

d e

xe

cu

tio

n t

imeResults: ITK Benchmark

2.3%

Page 23: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Outline

• Background and Problem

• Application characteristics on AMP/SMP

• LJFPF Policy

• CJFPF Policy

• Conclusion

Page 24: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Critical Section

Lock

Lock

Page 25: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Critical Section Limited Workloads

Critical section

Useful workwaiting

Case (a)

Case (b)

Page 26: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Critical Section Effects

0

1

2

3

4

5

6

7

8

9

10%CS 15%CS 20%CS

sp

eed

up

All-fast

Half-half

All-slow

Half-half performs similar to all-fast

Page 27: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

CJFPF Policy

• Critical Job to a Fast Processor First Policy

Fast core

Slow core

Slow core

Slow core

Page 28: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

0

1

2

3

4

5

6

7

8-12 16-24 40-60

sp

eed

up

CJFPF

RR

CJFPF Results

Longer critical sectionThe benefit of the CJFPF policy decreases

Page 29: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Conclusion

• We evaluated the characteristics of multi-threaded applications on AMPs.

• Barriers and critical sections are important factors.• Propose two new scheduling policies: Longest job

to fast core first (LJFPF), critical job to fast core first (CJFPF)– Scheduling polices improve performance for asymmetric

workloads.• Future work

– Develop a prediction mechanism– Evaluate symmetric workloads on AMPs– Other kinds of heterogeneous architectures

Page 30: Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Thank you!