Evaluation of Agent Evaluation of Agent Teamwork Teamwork A High Performance Distributed A High Performance Distributed Computing Middleware Computing Middleware Solomon Lane Solomon Lane Agent Teamwork Research Agent Teamwork Research Assistant Assistant October 2006 – March 2007 October 2006 – March 2007
19
Embed
Evaluation of Agent Teamwork A High Performance Distributed Computing Middleware
Evaluation of Agent Teamwork A High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007. What is Agent Teamwork?. HPDC Middleware Job Dispatch & Termination Programming Framework Under Ongoing Development. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Evaluation of Agent Evaluation of Agent TeamworkTeamwork
A High Performance Distributed A High Performance Distributed Computing MiddlewareComputing Middleware
Solomon LaneSolomon Lane
Agent Teamwork Research Agent Teamwork Research AssistantAssistant
October 2006 – March 2007October 2006 – March 2007
Reference Platform Reference Platform ChallengesChallenges• Administrator Access to MachinesAdministrator Access to Machines
• Host Config & Cryptic Error MessagesHost Config & Cryptic Error Messages– DNS vs hosts files DNS vs hosts files – Inconsistent hosts filesInconsistent hosts files– Inconsistent ptr recordsInconsistent ptr records– Inconsistent port acls Inconsistent port acls – : globus_init: failed: globus_init: failed
• GTK AuthenticationGTK Authentication
Debugging
• Strace
• TcpDump
• GDB
Job Dispatching and Job Dispatching and Termination Function Termination Function EvaluationEvaluation• Not evaluating the job execution Not evaluating the job execution
performanceperformance
• MethodologyMethodology– Ported available test program to the MPICH-G2 Ported available test program to the MPICH-G2
frameworkframework– measure how long it takes a job submission to be measure how long it takes a job submission to be
deployed, executed and cleaned updeployed, executed and cleaned up– Run with 2-64 nodes across the two clusters in a Run with 2-64 nodes across the two clusters in a
depth-first node distribution series and a depth-first node distribution series and a breadth-first node distribution seriesbreadth-first node distribution series
ResultsResults
ResultsResults
ResultsResults
Framework Function Framework Function EvaluationEvaluation• Framework Issues Framework Issues
• Goal to write 3 benchmark programs that Goal to write 3 benchmark programs that have communication intensive algorithms.have communication intensive algorithms.
Benchmark ProgramsBenchmark Programs
• MD - a molecular dynamics MD - a molecular dynamics simulationsimulation
• Wave2D - a wave dissemination Wave2D - a wave dissemination simulationsimulation
• Mandelbrot - a Mandelbrot generatorMandelbrot - a Mandelbrot generator
• Develop a pre-processor to convert Develop a pre-processor to convert conventionally programmed code conventionally programmed code into the snapshot-able func_n modelinto the snapshot-able func_n model
Skills Developed During Skills Developed During ProjectProject• Significant experience with globus, Significant experience with globus,
openPBS and the mpiopenPBS and the mpi
• Extensive debugging with tcpdump, Extensive debugging with tcpdump, strace, and gdbstrace, and gdb
• experience with performance experience with performance analysis and writing MPI programsanalysis and writing MPI programs
• new insights and understanding of new insights and understanding of HPDCHPDC