Performance modeling for architectural and program analysis

Update Item Information
Publication Type thesis
School or College College of Engineering
Department Computing
Author Lo, Yu jung
Title Performance modeling for architectural and program analysis
Date 2015-05
Description To address the need of understanding and optimizing the performance of complex applications and achieving sustained application performance across different architectures, we need performance models and tools that could quantify the theoretical performance and the resultant gap between theoretical and observed performance. This thesis proposes a benchmark-driven Roofline Model Toolkit to provide theoretical and achievable performance, and their resultant gap for multicore, manycore, and accelerated architectures. Roofline micro benchmarks are specialized to quantify the behavior of different architectural features. Compared to previous work on performance characterization, these micro benchmarks focus on capturing the performance of each level of the memory hierarchy, along with thread-level parallelism(TLP), instruction-level parallelism(ILP), and explicit Single Instruction, Multiple Data(SIMD) parallelism, measured in the context of the compilers and runtime environment on the target architecture. We also developed benchmarks to explore detailed memory subsystems behaviors and evaluate parallelization overhead. Beyond on-chip performance, we measure sustained Peripheral Component Interconnect Express(PCIe) throughput with four Graphics Processing Unit(GPU) memory managed mechanisms. By combining results from the architecture characterization with the Roofline Model based solely on architectural specification, this work offers insights for performance prediction of current and future architectures and their software systems. To that end, we instrument three applications and plot their resultant performance on the corresponding Roofline Model when run on a Blue Gene/Q architecture.
Type Text
Publisher University of Utah
Subject CUDA unified memory; Memory Bandwidth; Roofline
Dissertation Institution University of Utah
Dissertation Name Master of Science
Language eng
Rights Management Copyright © Yu jung Lo 2015
Format Medium application/pdf
Format Extent 1,447,868 Bytes
Identifier etd3/id/3721
ARK ark:/87278/s65t6tt9
Setname ir_etd
ID 197272
Reference URL https://collections.lib.utah.edu/ark:/87278/s65t6tt9