This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Intel Cluster Checker 3.0 is a systems tool for Linux high performance compute clusters
• Detects issues
• Provides diagnoses
• Suggests remedies
2
Introduction
The third generation of Intel® Cluster Checker adds significant capabilities over previous versions and will be available as part of Intel® Parallel Studio XE 2016 Cluster Edition for Linux*
As a developer targeting a cluster, I want to write code that runs and performs its tasks with the best performance I can achieve -but the complexities and possible issues of clusters challenge both me, as a developer, and my users.
Symptoms are subjective indications of health Signs are objective indications of health detected by direct observation Diagnoses are the identification of the root cause of an issue Remedies are methods to resolve an issue
8
Intel® Cluster Checker 3.0 – Concept
Concept Human Cluster
Symptom I am nauseous and fatigued My job is running slow
Signs Dehydrated, Fever,Nauseous
DGEMM performance on nodeX is 25% of peakZombie process on nodeX is using 100% cpu
Diagnosis Flu Zombie process is stealing cycles
Remedy Drink plenty of clear fluids, take 2 aspirin, and bed rest
Symptoms are subjective indications of health Signs are objective indications of health detected by direct observation Diagnoses are the identification of the root cause of an issue Remedies are methods to resolve an issue
21
Intel® Cluster Checker 3.0
Concept Human Cluster
Symptom I am nauseous and fatigued My job is running slow
Signs Dehydrated, Fever,Nauseous
DGEMM performance on nodeX is 25% of peakZombie process on nodeX is using 100% cpu
Diagnosis Flu Zombie process is stealing cycles
Remedy Drink plenty of clear fluids, take 2 aspirin, and bed rest
How to program the API – e.g. C++ sample code snippets
23
Intel® Cluster Checker 3.0
// INPUT AND CONFIGURATION
// set up databaseauto database = std::make_shared<clck::SQLite>();
// set up node list: names, roles, groupsstd::vector<clck::Node> nodes;
// set up configuration: database, nodes, extensions, etc.clck::Layer::Config config;
// set up presentation layerclck::Layer layer(config);
// set up suppressions: confidence, severity, nodes, etc.std::vector<clck::Layer::Suppression> suppressions;
INPUT AND CONFIGURATION ANALYSIS RESULTS PROCESSING
// ANALYSIS
// start analysislayer.analyze(suppressions);
// loop in another thread{
// number of rules remaining to be fired and number of rules already runint remaining, completed;layer.progress(remaining, completed);
}
// loop in another thread{
// wait for messageslayer.message.wait();// internal messages of various severity that can be displayedstd::vector<clck::Layer::Message> = layer.get_messages();
}
INPUT AND CONFIGURATION ANALYSIS RESULTS PROCESSING
// RESULTS PROCESSING
// set up filters: confidence, severity, nodes, types, etc.clck::Layer::Filter filter;
// set up sorting orderstd::vector<clck::Layer::Sorting> sorting;
// signs and diagnoses (filtered and sorted)std::vector<std::shared_ptr<clck::Fault>> faults = layer.get_faults(filter, sorting);
// process signs and diagnosesfor (auto &fault : faults) {}
INPUT AND CONFIGURATION ANALYSIS RESULTS PROCESSING
Legal Disclaimer & Optimization NoticeINFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance ofthat product when combined with other products.
Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice.