Switching to High Gear Switching to High Gear Opportunities for Grand-scale Opportunities for Grand-scale Real-time Parallel Simulations Real-time Parallel Simulations Kalyan S. Kalyan S. Perumalla, Ph.D. Perumalla, Ph.D. Senior Research Senior Research Staff Member Staff Member Oak Ridge National Oak Ridge National Laboratory Laboratory Adjunct Professor Adjunct Professor Georgia Institute of Georgia Institute of Technology Technology IEEE DS-RT, Singapore Oct 26, 2009
54
Embed
Switching to High Gear Opportunities for Grand-scale Real-time Parallel Simulations
IEEE DS-RT, Singapore Oct 26, 2009. Switching to High Gear Opportunities for Grand-scale Real-time Parallel Simulations. Kalyan S. Perumalla, Ph.D. Senior Research Staff Member Oak Ridge National Laboratory Adjunct Professor Georgia Institute of Technology. Main Theme. - PowerPoint PPT Presentation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Switching to High Gear Switching to High Gear Opportunities for Grand-scale Real-Opportunities for Grand-scale Real-time Parallel Simulationstime Parallel Simulations
Kalyan S. Perumalla, Ph.D.Kalyan S. Perumalla, Ph.D.
Senior Research Staff MemberSenior Research Staff MemberOak Ridge National LaboratoryOak Ridge National Laboratory
Adjunct ProfessorAdjunct ProfessorGeorgia Institute of TechnologyGeorgia Institute of Technology
IEEE DS-RT, Singapore
Oct 26, 2009
2 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
• End systems– Processor traits, disk traits, OS instances, daemons, services, S/W bugs, …
• “Heavy” applications and traffic– Video (YouTube, …), VOIP, live streams; foreground, background
• Behavioral infusion– Social nets (topologies, dynamics, agencies, advertisers), peer-to-peer
12
13 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Example: Epidemiology or Computer Worm PropagationExample: Epidemiology or Computer Worm Propagation
13
• Typical dynamics model– Multiple variants exist, but
qualitatively similar
• Excellent fit, but post-facto (!)– Plot collected data
• Difficult as predictive model– Great amount of detail buried in α
• Gory detail needed for better predictive power– Interaction topology
– Resource limitations
• Typical dynamics model– Multiple variants exist, but
qualitatively similar
• Excellent fit, but post-facto (!)– Plot collected data
• Difficult as predictive model– Great amount of detail buried in α
• Gory detail needed for better predictive power– Interaction topology
– Resource limitations
( )dI
I S Idt
( )dI
I S Idt
14 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Slippery Slope: Cost and TimeSlippery Slope: Cost and Time
14
Cost to realize experimentation capability
Time to reach experimentation capability
15 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Our Research Organization in Discrete Our Research Organization in Discrete Event Runtimes and ApplicationsEvent Runtimes and ApplicationsOur Research Organization in Discrete Our Research Organization in Discrete Event Runtimes and ApplicationsEvent Runtimes and Applications
– 3-D 103-D 1077-cells simulated on 10-cells simulated on 1044 cores cores
• µµΠΠ
– Performance prediction of 10Performance prediction of 1066-core -core MPI programs on 10MPI programs on 1044 cores cores
• NetWarpNetWarp
– Hi-fi Internet test-bedHi-fi Internet test-bed
17 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Scalable Experimentation for Cyber SecurityScalable Experimentation for Cyber SecurityNetWarp NetWarp is our novel test-bed is our novel test-bed technology for highly scalable, technology for highly scalable, detailed, rapid experimentation of detailed, rapid experimentation of cyber security and cyber cyber security and cyber infrastructuresinfrastructures
18 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
19 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
NetWarp ArchitectureNetWarp Architecture
Business sensitive
19
20 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
DOE-Sponsored Institute for Advanced DOE-Sponsored Institute for Advanced Architectures and AlgorithmsArchitectures and Algorithms
“…catalyst for the co-design and development of architectures, algorithms, and applications to create synergy in their respective evolutions…”
“…catalyst for the co-design and development of architectures, algorithms, and applications to create synergy in their respective evolutions…”
Need highly scalable simulation methods and Need highly scalable simulation methods and methodologies to simulate next generation architectures methodologies to simulate next generation architectures and algorithms on future supercomputing platforms…and algorithms on future supercomputing platforms…
21 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
– Actual application code executed on the real hardware
– Platform is simulated at large virtual scale
– Timing customized by user-defined machine
• Scale is key differentiator– Target: 150,000 virtual cores
– E.g., 150,000 virtual MPI ranks in simulated scenario
• Based on µsik (micro simulator kernel)– Scalable PDES engine
– TCP- or MPI-connectedsimulation kernels
• Scale is key differentiator– Target: 150,000 virtual cores
– E.g., 150,000 virtual MPI ranks in simulated scenario
• Based on µsik (micro simulator kernel)– Scalable PDES engine
– TCP- or MPI-connected simulation kernels
22 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Example: MPI application over Example: MPI application over μπμπ
Modify MPI include and recompile– Change #include <mpi.h> to#include <mupi.h>
Relink to mupi library– Instead of –lmpi, use -lmupi
Run the modified MPI application(a μπ simulation)– mpirun –np 4 test -nvp 32
runs test with 32 virtual MPI rankssimulation uses 4 real cores
μπ itself uses multiple real cores torun in parallel
Modify MPI include and recompile– Change #include <mpi.h> to#include <mupi.h>
Relink to mupi library– Instead of –lmpi, use -lmupi
Run the modified MPI application(a μπ simulation)– mpirun –np 4 test -nvp 32
runs test with 32 virtual MPI rankssimulation uses 4 real cores
μπ itself uses multiple real cores to run in parallel
23 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Epidemic Disease PropagationEpidemic Disease Propagation• Can be an extremely challenging simulation problemCan be an extremely challenging simulation problem
• Asymptotic behaviors are relatively well understoodAsymptotic behaviors are relatively well understood
• Transients are poorly understood, hard to predict wellTransients are poorly understood, hard to predict well
• Defined and characterized by many interlinked processesDefined and characterized by many interlinked processes
• ““Gory Detail” necessaryGory Detail” necessary
• Can be an extremely challenging simulation problemCan be an extremely challenging simulation problem
• Asymptotic behaviors are relatively well understoodAsymptotic behaviors are relatively well understood
• Transients are poorly understood, hard to predict wellTransients are poorly understood, hard to predict well
• Defined and characterized by many interlinked processesDefined and characterized by many interlinked processes
• ““Gory Detail” necessaryGory Detail” necessary
24 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
25 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
PDES Scaling NeeedsPDES Scaling Neeeds
• Anticipate Anticipate impending impending opportunitiesopportunities in multiple in multiple application areas of grand-application areas of grand-scale PDES scenariosscale PDES scenarios
• Prepare to capitalize on Prepare to capitalize on increasing increasing computational computational powerpower (300K+ cores) (300K+ cores)
• Aim to achieve computational Aim to achieve computational capability to enable capability to enable new new PDES-based scientific PDES-based scientific solutionssolutions
• Anticipate Anticipate impending impending opportunitiesopportunities in multiple in multiple application areas of grand-application areas of grand-scale PDES scenariosscale PDES scenarios
• Prepare to capitalize on Prepare to capitalize on increasing increasing computational computational powerpower (300K+ cores) (300K+ cores)
• Aim to achieve computational Aim to achieve computational capability to enable capability to enable new new PDES-based scientific PDES-based scientific solutionssolutions
26 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Jaguar Petascale System [Cray XT5]Jaguar Petascale System [Cray XT5]
27 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Jaguar: NCCS’ Cray XT5*Jaguar: NCCS’ Cray XT5*
* Data and images from http://nccs.gov
28 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
To realize scale with any of the PDES To realize scale with any of the PDES models and applications, we need the models and applications, we need the
core frameworkscore frameworks to scale to scale
To realize scale with any of the PDES To realize scale with any of the PDES models and applications, we need the models and applications, we need the
core frameworkscore frameworks to scale to scale
29 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Recent Attempts at 10Recent Attempts at 1055-Core PDES -Core PDES FrameworksFrameworks
Bauer Bauer et alet al (Jun’09) on Blue Gene P (Argonne) (Jun’09) on Blue Gene P (Argonne) Perumalla & Tipparaju (Jan’09) on Cray XT5 (ORNL)Perumalla & Tipparaju (Jan’09) on Cray XT5 (ORNL)
Business Sensitive
Degradation beyond 64K cores observed by us as well as othersDegradation beyond 64K cores observed by us as well as others
Degradation observed in more than one metric (rollback efficiency, speedup)Degradation observed in more than one metric (rollback efficiency, speedup)
30 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Implications to Discrete Event Execution on Implications to Discrete Event Execution on High Performance Computing PlatformsHigh Performance Computing Platforms
Business Sensitive
31 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Some of our ObjectivesSome of our Objectives
• Scale from 10Scale from 1044 cores (current) to 10 cores (current) to 1055-10-1066 cores (new) cores (new)
• Realize very large-scale scenarios (multi-billion entity)Realize very large-scale scenarios (multi-billion entity)• Cyber infrastructures, social computing, epidemiology, logisticsCyber infrastructures, social computing, epidemiology, logistics• Aid projects in simulation-based design of future generation supercomputersAid projects in simulation-based design of future generation supercomputers
• Scale from 10Scale from 1044 cores (current) to 10 cores (current) to 1055-10-1066 cores (new) cores (new)
• Realize very large-scale scenarios (multi-billion entity)Realize very large-scale scenarios (multi-billion entity)• Cyber infrastructures, social computing, epidemiology, logisticsCyber infrastructures, social computing, epidemiology, logistics• Aid projects in simulation-based design of future generation supercomputersAid projects in simulation-based design of future generation supercomputers
Fill technological gap by achieving the highest scaling Fill technological gap by achieving the highest scaling capabilities of parallel discrete event simulationscapabilities of parallel discrete event simulations
Ultimately, enable formulation of grand-scale solutions with non-Ultimately, enable formulation of grand-scale solutions with non-traditional supercomputing simulationstraditional supercomputing simulationsUltimately, enable formulation of grand-scale solutions with non-Ultimately, enable formulation of grand-scale solutions with non-traditional supercomputing simulationstraditional supercomputing simulations
32 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
• Account for reflectivity, transmitivity, multi-path effects
• Power level (voltage) modeled per face of grid cell
33 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
PHOLD BenchmarkPHOLD Benchmark
• Relatively fine grained– ~5 microseconds computation per event
• 10 “juggler” entities per processor core– Analogous to grid cells, road intersections or such
• Total of 1000 “juggling balls” per core– Analogous to state updates exchanged among cells
• Upon receipt of a ball event, a juggler throws it back random (exponential) time into the future to a random juggler– 1 every 1000 juggling exchanges are constrained to be intra-core, rest
inter-core
34 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Radio Propagation: Speedup on Cray XT4Radio Propagation: Speedup on Cray XT4
35 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Radio Propagation: Speedup on Cray XT4Radio Propagation: Speedup on Cray XT4
36 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Radio Propagation: Runtime Costs on Cray XT4Radio Propagation: Runtime Costs on Cray XT4
37 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Epidemic Propagation: Performance on Cray Epidemic Propagation: Performance on Cray XT5XT5
38 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Epidemic Propagation – Parallel Run time Epidemic Propagation – Parallel Run time on Cray XT5on Cray XT5
500
550
600
650
700
750
0 16384 32768 49152 65536
No. of Cores
Ru
nti
me
(sec
on
ds)
Optimistic Conservative
39 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
PHOLD: Performance on Cray XT5PHOLD: Performance on Cray XT5
40 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
• Scalability problems with current approaches not evident previously– Fine until 104 cores, but poor thereafter
• Even with discrete event, implementation is key– Semi-asynchronous execution scales poorly
– Fully asynchronous execution needed
• Scalability problems with current approaches not evident previously– Fine until 104 cores, but poor thereafter
• Even with discrete event, implementation is key– Semi-asynchronous execution scales poorly
– Fully asynchronous execution needed
41 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Trial 0 Trial r-1 Trial r
d+1,0
d,r d,r+1 Δ>0
Δ==0
Band d+1 Band d+2Band d
…
New band started
Δ>0 Ends when Δ==0
LBTS computation for band d
Algorithm Design and Development for Algorithm Design and Development for Scalable Discrete Event ExecutionScalable Discrete Event Execution
Design algorithms optimized for Cray XT5, Blue Gene P/QDesign algorithms optimized for Cray XT5, Blue Gene P/Q
• Design new virtual-time synchronization algorithmDesign new virtual-time synchronization algorithm
• Design novel rollback control schemesDesign novel rollback control schemes
• Design discrete event-specific flow controlDesign discrete event-specific flow control
Design algorithms optimized for Cray XT5, Blue Gene P/QDesign algorithms optimized for Cray XT5, Blue Gene P/Q
• Design new virtual-time synchronization algorithmDesign new virtual-time synchronization algorithm
• Design novel rollback control schemesDesign novel rollback control schemes
• Design discrete event-specific flow controlDesign discrete event-specific flow control
Current synchronization algorithm
42 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Additional Important Algorithmic AspectsAdditional Important Algorithmic Aspects
• Novel separation of event Novel separation of event communication from synchronizationcommunication from synchronization– Prioritization support in our Prioritization support in our
communication layercommunication layer
– ““QoS” support for fast synchronizationQoS” support for fast synchronization
• Novel timestamp-aware bufferingNovel timestamp-aware buffering– Exploit near Exploit near vsvs. far timestamps. far timestamps
– Coordinated with virtual-time Coordinated with virtual-time synchronizationsynchronization
• Novel separation of event Novel separation of event communication from synchronizationcommunication from synchronization– Prioritization support in our Prioritization support in our
communication layercommunication layer
– ““QoS” support for fast synchronizationQoS” support for fast synchronization
• Novel timestamp-aware bufferingNovel timestamp-aware buffering– Exploit near Exploit near vsvs. far timestamps. far timestamps
– Coordinated with virtual-time Coordinated with virtual-time synchronizationsynchronization
• Optimized Optimized rollback dynamicsrollback dynamics– Stability and throttling Stability and throttling
mechanismsmechanisms
– Cancel back protocolsCancel back protocols
Example of the “transient event” problem
transientmessage
CoreD
wallclock time
Past
Future
CoreC
CoreB
CoreA
transientmessage
CoreD
wallclock time
Past
Future
CoreC
CoreB
CoreA
43 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Data Integration Interface DevelopmentData Integration Interface Development
Application Programming Interface (API) toApplication Programming Interface (API) to– Incorporate streaming input into discrete Incorporate streaming input into discrete
event executionevent execution
– Achieve runtime efficiency as an important Achieve runtime efficiency as an important considerationconsideration
Application Programming Interface (API) toApplication Programming Interface (API) to– Incorporate streaming input into discrete Incorporate streaming input into discrete
event executionevent execution
– Achieve runtime efficiency as an important Achieve runtime efficiency as an important considerationconsideration
Novel concepts supporting latency-hidingNovel concepts supporting latency-hiding– To permit maximal concurrency without violating time-To permit maximal concurrency without violating time-
ordering between live simulation and real-time inputsordering between live simulation and real-time inputs
– Reuse optimistic synchronization for latency-hiding for Reuse optimistic synchronization for latency-hiding for unpredictable data input from external sourcesunpredictable data input from external sources
Novel concepts supporting latency-hidingNovel concepts supporting latency-hiding– To permit maximal concurrency without violating time-To permit maximal concurrency without violating time-
ordering between live simulation and real-time inputsordering between live simulation and real-time inputs
– Reuse optimistic synchronization for latency-hiding for Reuse optimistic synchronization for latency-hiding for unpredictable data input from external sourcesunpredictable data input from external sources
Interconnection Network(s)
…Machine
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
…Machine
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
SimulatorProcess
LPLP
LPLP
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
SimulatorProcess
LPLP
LPLP
LPLP
LPLP
…Machine
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
…Machine
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
SimulatorProcess
LPLP
LPLP
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
ProcessorCore
SimulatorProcess
LPLP
LPLP
SimulatorProcess
LPLP
LPLP
LPLP
LPLP
…
LP=Logical Process with its own timeline
44 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Software ImplementationSoftware Implementation
Runtime algorithms and data integration Runtime algorithms and data integration interfaces realized in softwareinterfaces realized in software
– Primarily in C/C++Primarily in C/C++
– Building on current software (scales to 10Building on current software (scales to 1044))
– Optimized for performance on Cray XT5 and Optimized for performance on Cray XT5 and Blue Gene PBlue Gene P
Runtime algorithms and data integration Runtime algorithms and data integration interfaces realized in softwareinterfaces realized in software
– Primarily in C/C++Primarily in C/C++
– Building on current software (scales to 10Building on current software (scales to 1044))
– Optimized for performance on Cray XT5 and Optimized for performance on Cray XT5 and Blue Gene PBlue Gene P
Communication to be structured flexiblyCommunication to be structured flexibly– Use MPI or Portals or combinationUse MPI or Portals or combination
– Will explore potentially new layersWill explore potentially new layers
46 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Entire runtime and data integration Entire runtime and data integration frameworks to be exercisedframeworks to be exercised
– Instantiate scenarios scaled up from Instantiate scenarios scaled up from smaller-scale scenarios in literaturesmaller-scale scenarios in literature
– Experiment with strong-scaling as Experiment with strong-scaling as well as weak-scaling, as appropriate well as weak-scaling, as appropriate for each application areafor each application area
Entire runtime and data integration Entire runtime and data integration frameworks to be exercisedframeworks to be exercised
– Instantiate scenarios scaled up from Instantiate scenarios scaled up from smaller-scale scenarios in literaturesmaller-scale scenarios in literature
– Experiment with strong-scaling as Experiment with strong-scaling as well as weak-scaling, as appropriate well as weak-scaling, as appropriate for each application areafor each application area
Application Benchmarking and Application Benchmarking and DemonstrationDemonstration
At-scale simulation from each At-scale simulation from each areaarea
Example: Probability of infection in epidemiological model
Example inter-entity networks
47 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
StatusStatus
Showed preliminary evidence that PDES isShowed preliminary evidence that PDES is– Feasible even at the largest core-countsFeasible even at the largest core-counts
– Adequately scalable to over 100,000 coresAdequately scalable to over 100,000 cores
– But should be improved much, much moreBut should be improved much, much more
Applications can now move beyond “if” and begin to contemplate Applications can now move beyond “if” and begin to contemplate on “how” to use petascale discrete event executionon “how” to use petascale discrete event execution
Showed preliminary evidence that PDES isShowed preliminary evidence that PDES is– Feasible even at the largest core-countsFeasible even at the largest core-counts
– Adequately scalable to over 100,000 coresAdequately scalable to over 100,000 cores
– But should be improved much, much moreBut should be improved much, much more
Applications can now move beyond “if” and begin to contemplate Applications can now move beyond “if” and begin to contemplate on “how” to use petascale discrete event executionon “how” to use petascale discrete event execution
48 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
51 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
MarketingMarketing
Simulation community’s responsibilitySimulation community’s responsibility– Identify potential, benefitsIdentify potential, benefits– Invent new methods, methodologies, capabilitiesInvent new methods, methodologies, capabilities– Educate about need, potential, benefitEducate about need, potential, benefit
Simulation community’s responsibilitySimulation community’s responsibility– Identify potential, benefitsIdentify potential, benefits– Invent new methods, methodologies, capabilitiesInvent new methods, methodologies, capabilities– Educate about need, potential, benefitEducate about need, potential, benefit
Text book definition of Text book definition of marketingmarketing
““Creating the needCreating the need””
Text book definition of Text book definition of marketingmarketing
““Creating the needCreating the need””
52 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Lighter Vein or Reality?Lighter Vein or Reality?
• David Nicol once noted
“PADS research tends to scratch where it doesn’t itch”
• Now, probably time to ponder
“Have we been tolerating some (very bothersome) itches for lack of a long scratching stick?”
• David Nicol once noted
“PADS research tends to scratch where it doesn’t itch”
• Now, probably time to ponder
“Have we been tolerating some (very bothersome) itches for lack of a long scratching stick?”
PADS=Parallel and Distributed Simulation
53 Managed by UT-Battellefor the U.S. Department of Energy IEEE DS-RT'09, Singapore
Perspective and ActionPerspective and Action
• Assume immense computing powerAssume immense computing power
• Conceive large simulation-enabled solutionsConceive large simulation-enabled solutions
• Assume immense computing powerAssume immense computing power
• Conceive large simulation-enabled solutionsConceive large simulation-enabled solutions
““Perfect opportunity to expand our outlook in Perfect opportunity to expand our outlook in simulation-based methods and methodologies”simulation-based methods and methodologies”
• 101055-10-1066 cores nearly a reality cores nearly a realityMillion-core computers impending Million-core computers impending
www.exascale.orgwww.exascale.org
• Nation-scale, world-scale Nation-scale, world-scale questions of increasing interestquestions of increasing interestCompositional dynamics of millions to billion Compositional dynamics of millions to billion
processes, individualsprocesses, individuals
• 101055-10-1066 cores nearly a reality cores nearly a realityMillion-core computers impending Million-core computers impending
www.exascale.orgwww.exascale.org
• Nation-scale, world-scale Nation-scale, world-scale questions of increasing interestquestions of increasing interestCompositional dynamics of millions to billion Compositional dynamics of millions to billion