An Evolutionary Approach for the Task Mapping Problem Filipo Novo Mór Supervisor: Dr. César Augusto Missio Marcon Co-supervisor: Dr. Andrew Rau-Chaplin www.filipomor.com master thesis defense
PowerPoint Presentation
An Evolutionary Approach for the Task Mapping Problem
Filipo Novo MrSupervisor: Dr. Csar Augusto Missio MarconCo-supervisor: Dr. Andrew Rau-Chaplin
www.filipomor.commaster thesis defense
Presentation Outline1IntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
Filipo Novo Mr
Introduction Some concepts NoC - Network on Chip Tasks
2Filipo Novo Mr
t0t1t3t4t2t5
Introduction The Task Mapping Problem
3t0t1t3t4t2TASKSt5
NP-Hard problem!
power consumption communication profile execution timeFilipo Novo Mr
IntroductionBrute-force algorithms are not feasible for solving NP-Hard problemsAlternative: to use heuristic methodsBest solution possible, although there is no guarantee the best global solution will be foundEvolutionary AlgorithmsDifferential Evolution (DE)4
Filipo Novo Mr
IntroductionMotivation Previous works Considering the DE features of: Optimization of non-linear problems Simplicity and flexibility of its code Try finding a more efficient task mapping solver using DE5
Filipo Novo Mr
IntroductionObjective Implement a new elitist strategy on Single Objective DE to efficiently solve the Task Mapping onto NoC Problem6
Filipo Novo Mr
Theoretical Background7Filipo Novo MrIntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
Theoretical Background Task Mapping onto NoC ProblemTraditional approaches:PartitioningMappingFilipo Novo Mr8
C.A.M.Marcon, 2005
Theoretical Background Task Mapping AlgorithmsFilipo Novo Mr9
Network Flow Algorithm Shortest Tree Algorithm A* Algorithm Mathematical Inequalities Linear programming Evolutionary Algorithms Genetic programming Simulated Annealing
Theoretical Background Evolutionary AlgorithmsFilipo Novo Mr10
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr11
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr12
vector initialization Population is randomly initialized Uniform probabilistic distribution If a preliminary solution is available, must add distributed random deviations to it Each individual on the population represent a solution candidate
Theoretical BackgroundFilipo Novo Mr13
NPPopulation
(solution candidate)1
(solution candidate)2
(solution candidate)3
(solution candidate)n
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr14
mutation generate a new mutate vector a new parameter vector is generated by the DE by adding the weighted difference between two population vectors to a third vector
Theoretical BackgroundFilipo Novo Mr15
D. Bingham, 2015
Theoretical BackgroundFilipo Novo Mr16
X2X1
target vector
mutation factor
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr17
mutation generate a new mutate vector a new parameter vector is generated by the DE by adding the weighted difference between two population vectors to a third vector the resulting vector will be used as a donor on the next step keeps pacing throughout the solution space
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr18
recombination enhance the Population diversity keep track of good candidate solutions from previous generations
Theoretical BackgroundFilipo Novo Mr19
D
Theoretical BackgroundDifferential Evolution (DE)Filipo Novo Mr20
selection only the best individuals will be kept in the Population
Theoretical BackgroundFilipo Novo Mr21
Population
Xi,G
(solution candidate)2
(solution candidate)3
(solution candidate)n
Uj,i,G+1
Theoretical BackgroundFilipo Novo Mr22
DE complete steps
Theoretical BackgroundPopulation Evaluation on DEFilipo Novo Mr23
Filipo Novo Mr24Theoretical BackgroundTested algorithms:Brute Force Nave: N2 two independent nested loops.Brute Force Smart: N2 two dependent nested loops.Mishra & Sandeep: heapsort + 1 outer loop with a dynamic variant linked list.
Tested in a I5 CPU, 8GB RAM, running Kubuntu 14.04. All tests performed using nice -20 prioritization.To generate the data set:
Filipo Novo Mr25Theoretical BackgroundManaging the DE archive
truncate the archive using the Crowding Distance metricKumar and Kesavan, 2015
Theoretical BackgroundSimulated Annealing (SA)Filipo Novo Mr26
FCE Frankfurt Consulting Engineers GmbH, 2015
Theoretical BackgroundNASA Numerical Aerodynamic Simulation (NAS)Filipo Novo Mr27
CG - Conjugate Gradient, irregular memory access and communicationFT - discrete 3D fast Fourier Transform, all-to-all communicationIS - Integer Sort, random memory accessLU - Lower-Upper Gauss-Seidel solver. Large number of short messagesMG - Multi-Grid on a sequence of meshes, long- and short-distance communication, memory intensiveThese applications were selected because they have task communication based profiles. Therefore they are ideal for the purposes of this work.
Related Work28Filipo Novo MrIntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
Related WorkJ. R. Ku and S. G. Ku [34]Two phases: clustered high communicating tasks into partitionsUsed NSGA-II algorithmMapped these partitions onto NoC processors.Tried to keep high communicating partitions close to each otherUsed a second version of the NSGA-II algorithm15% more efficient then Physical Mapping AlgorithmC. Deng et al. [41]Changed the classical DEIncluded a sorting step before chromosomes recombinationFor high-level task graphs, free of a target hardware architectureFilipo Novo Mr29
Related WorkSen Zhao et al. [45]Proposed a MODE using an adaptative mutation operator.The strategy is changed during runtime to try achieving better solutions on the flyThe resulting vector is now compared with the whole population, not only with your fatherTested using benchmark ZDT functions onlyD. Das, M. Verma and A. Das [58]Hardware/software partitioning problem using DEObjective functions: execution time, area cost and communication costDE ran 16% faster than PSOQuality of acieved solutions were not describedZhuo Qingqi et al. [51]Solving Task Mapping problem combining two evolutionary algorithms (not DE)Parallel approach for searching the solution spaceMPEG-4 and VOPD (Video Objective Plane Decoder) benchmark applicationsSaves 13% on energy and is 3% more efficient in communication latencyFilipo Novo Mr30
Project Methodology31Filipo Novo MrIntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
Project MethodologyFilipo Novo Mr32
resulting task mapEACFBD012345678chromosomes
individual
task mapping step
Project MethodologyFilipo Novo Mr33
Project Methodology Data Structures ModellingFilipo Novo Mr34
00342132444221034011t0t1t2t3t4
Population size (NP)
Population Dimension (D)
D = number of existing tasksAdherent to SODE and MODE
Project Methodology Communication Volume MetricFilipo Novo Mr35
fo3(solution 1) = 10+0+25+20+15 = 70
fo3(solution 2) = 10+10+10+10+10+10+10+25+20+15 = 130
Project Methodology Load Balance MetricFilipo Novo Mr36
fo2(solution 1) = 29.45fo2(solution 2) = 54.11
Project Methodology Modifying DE: rewarding good individuals Identify most communicating tasks proposal 1:Reward individuals keeping most communicating tasks near to each other Proposal 2: Try generate good individuals during mutation or recombination operationsFilipo Novo Mr37
Project Methodology Identifying most communicating tasksFilipo Novo Mr38
ACEBDF55325341A, B: 5A, C: 5B, D: 3D, F: 1F, D: 4C, E: 5E, A: 3E, B: 2
A, B: 5A, C: 5B, D: 3D, F: 1+4C, E: 5E, A: 3E, B: 2
A, B, C,E: 5+5+3B, D, A, E: 3+5+2D, F, B: 5+3C, E, A: 5+5E, A, C, B: 3+5+2
A, B, C,E: 13B, D, A, E: 10D, F, B: 8C, E, A: 10E, A, C, B: 10
tA, tB, tC and tE
Project Methodology Proposal 1Filipo Novo Mr39
Ideal bonus value is 10%Different bonus values tend to stuck the evolution (no more convergence is reach)On average, 14% of solutions at the final Generation had been rewarded
Project Methodology Proposal 2Filipo Novo Mr40
Proposal 2 was halted:No more convergence after 4 generations on averageToo few tasks? Too small NoC?
Project Methodology Validating the DE (Single Objective)Filipo Novo Mr41
SO_Proc36_T36_CR0_50_F0_40_Gen1000_Noc6_6_1_Pop20_Test2016061716017308_ft32x1_v2ap01
Project Methodology Validating the DE (Multiple Objective)Filipo Novo Mr42
Function ZDT1
ETH Zrich, 2008
Project Methodology Validating the DE (Multiple Objective)Filipo Novo Mr43
Function ZDT2ETH Zrich, 2008
Project Methodology Validating the DE (Multiple Objective) Hypervolume metricFilipo Novo Mr44
Kian Sheng Lim et al, 2013
Experimental Results45Filipo Novo MrIntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
Experimental ResultsParametersRangeNP10 and 20G100, 300, 500, 100, 5000 and 10000CR0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 06, 0.7, 0.8 and 0.9F0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 06, 0.7, 0.8 and 0.9
Filipo Novo Mr46
Single Objective DENASA NAS applications: IS, CG, FT, MG, LUEach test case was executed at least 30 timesGoal: reduce communication volume
Experimental ResultsFilipo Novo Mr47
Single Objective DE NASA NAS benchmark
Experimental ResultsFilipo Novo Mr48
Single Objective DE NASA NAS benchmark
Experimental ResultsFilipo Novo Mr49
Single Objective DE NASA NAS benchmark
FT e IS COM MUITAS TROCAS DE MENSAGENS49
Experimental ResultsFilipo Novo Mr50
SODE vs CAFES NASA NAS benchmarkNASA NAS applications: IS, CG, FT, MG, LUEach test case was executed at least 30 timesCAFES was set to the best execution parameters found during preparation tests.The same formula was used by CAFES and SODE to calculate the fitness valueThe comparison focused on the quality of the best candidate solutionsThe comparison considered the five best candidate solutions of each test case for both tested algorithms
FT e IS COM MUITAS TROCAS DE MENSAGENS50
Experimental ResultsFilipo Novo Mr51
SODE vs CAFES NASA NAS benchmarkMean Values: absolute scalar value for the communication volumeStandard Deviation: how close are the best solutions from each other
FT e IS COM MUITAS TROCAS DE MENSAGENS51
Conclusions52Filipo Novo MrIntroductionTheoretical BackgroundRelated WorkProject MethodologyExperimental Results
Conclusions
ConclusionsA new adaptation for the SODE was proposed, rewarding individuals who kept related communicating tasks close to each otherTestes were executed using the NASA NAS benchmark, showing our implementation was able to generate feasible solutions.Our algorithm was compared to the SA implementation existing on the CAFES Framework.Our implementation reached better solutions on two of five benchmark applications; achieve similar results on one application. CAFES achieved better solution on other two tested applicationsOur implementation has proved to be important on solving the Task Mapping onto NoC problem, specially for applications with similar NASA NAS message exchange profiles Filipo Novo Mr53
An Evolutionary Approach for the Task Mapping Problem
Filipo Novo MrSupervisor: Dr. Csar Augusto Missio MarconCo-supervisor: Dr. Andrew Rau-Chaplin2016, August 18th
www.filipomor.commaster thesis defenseThank you!