Analyzing and Tuning SPECAccel Performance for GPU Workloads on Lenovo ThinkSystem ServersPlanning / Implementation

Author
Published
24 May 2019
Form Number
LP1146
PDF size
24 pages, 1.0 MB

Abstract

The use of GPU high-performance accelerators are especially important in High Performance Computing (HPC) workloads. As a result, it is critical to ensure that the GPUs provide the most processing power possible by adjusting key performance and measuring the results.

The SPECAccel benchmark uses the OpenCL and OpenACC programing paradigm to provide a comparative measure of parallel computing performance among systems equipped with an accelerator. Through runtime characteristic analysis, this paper provides a best practices recipe for Lenovo® ThinkSystem™ servers to obtain the best performance for GPU workloads, as well as other applications that have characteristics similar to the SPECAccel benchmark.

This paper is intend for ThinkSystem end users and technical sales representatives who want to understand how to tune GPU performance. The paper assumes readers are familiar with Linux and have basic experience with programming languages such as C/C++.

Table of Contents

Introduction
Accelerator Programing Paradigm
SPECAccel benchmark
ThinkSystem SR650
Analysis of CPU performance
Analysis of memory performance
Analysis of PCIe performance
Analysis of NVIDIA V100 GPU performance
Performance tuning
Performance world records
Conclusion

To view the document, click the Download PDF button.

Related product families

Product families related to this document are the following: