Optimizing HPC Applications with Intel® Cluster Tools takes the reader on a tour of the fast-growing area of high performance computing and the optimization of hybrid programs. These programs typically combine distributed memory and shared memory programming models and use the Message Passing Interface (MPI) and OpenMP for multi-threading to achieve the ultimate goal of high performance at low power consumption on enterprise-class workstations and compute clusters.
The book focuses on optimization for clusters consisting of the Intel® Xeon processor, but the optimization methodologies also apply to the Intel® Xeon Phi™ coprocessor and heterogeneous clusters mixing both architectures. Besides the tutorial and reference content, the authors address and refute many myths and misconceptions surrounding the topic. The text is augmented and enriched by descriptions of real-life situations.
Software professionals will use this book to design, develop, and optimize their parallel programs on Intel platforms. Students of computer science and engineering will value the book as a comprehensive reader, suitable to many optimization courses offered around the world. The novice reader will enjoy a thorough grounding in the exciting world of parallel computing.
Chapter 1: No Time to Read this Book?
Chapter 2: Overview of Platform Architectures
Chapter 3: Top-Down Software Optimization
Chapter 4: Addressing System Bottlenecks
Chapter 5: Addressing Application Bottlenecks: Distributed Memory
Chapter 6: Addressing Application Bottlenecks: Shared Memory
Chapter 7: Addressing Application Bottlenecks: Microarchitecture
Chapter 8: Application Design Considerations