Designing a parallel dataflow architecture for streaming large-scale visualization on heterogeneous platforms

Publication Type dissertation
School or College College of Engineering
Department Computing
Author Vo, Huy T.
Title Designing a parallel dataflow architecture for streaming large-scale visualization on heterogeneous platforms
Date 2011-08
Description Dataflow pipeline models are widely used in visualization systems. Despite recent advancements in parallel architecture, most systems still support only a single CPU or a small collection of CPUs, such as an SMP workstation. Even systems specifically tuned for parallel visualization provide execution models that support only data-parallelism, ignoring task-parallelism and pipeline-parallelism. With the recent popularization of machines equipped with multicore CPUs and multiple GPUs, these visualization systems fall further behind in reaching maximum efficiency. On the other hand, several libraries exist that can schedule program execution on multiple CPUs and/or multiple GPUs. However, because executing a task graph differs from executing a pipeline, and because these libraries' APIs are considerably low-level, it remains a challenge to integrate such run-time libraries into current visualization systems. Thus, there is a need for a redesigned dataflow architecture that fully supports and exploits the power of highly parallel machines in large-scale visualization. The new design must be able to schedule execution on heterogeneous platforms while supporting arbitrarily large datasets through the use of streaming data structures. The primary goal of this dissertation is to develop a parallel dataflow architecture for streaming large-scale visualization. The framework includes support for platforms ranging from multicore processors to clusters consisting of thousands of CPUs and GPUs. We achieve this by introducing the notion of Virtual Processing Elements and Task-Oriented Modules, along with a highly customizable scheduler that dynamically controls the assignment of tasks to elements. This creates an intuitive way to maintain multiple CPU/GPU kernels while still providing coherency and synchronization across module executions. We have implemented these techniques in HyperFlow, which comprises an API providing all the basic dataflow constructs described in the dissertation and a distributed run-time library that can deploy those pipelines on multicore, multi-GPU, and cluster-based platforms.
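The abstract's central idea, task-oriented modules dynamically assigned to virtual processing elements by a scheduler, can be illustrated with a minimal C++ sketch. This is not the HyperFlow API; the names (Scheduler, Task, the worker-thread "PEs") are hypothetical, and a real implementation would additionally track CPU/GPU affinity, streaming data chunks, and inter-module synchronization.

    // Hypothetical sketch of the scheduling idea, not HyperFlow itself:
    // "task-oriented modules" (Task) are pulled dynamically by a pool of
    // "virtual processing elements" (worker threads sharing one queue).
    #include <condition_variable>
    #include <functional>
    #include <iostream>
    #include <mutex>
    #include <queue>
    #include <thread>
    #include <vector>

    using Task = std::function<void()>;  // one unit of pipeline work

    class Scheduler {
    public:
        explicit Scheduler(unsigned numPEs) {
            for (unsigned i = 0; i < numPEs; ++i)
                pes_.emplace_back([this] { run(); });
        }
        ~Scheduler() {  // drain remaining tasks, then join all PEs
            {
                std::lock_guard<std::mutex> lock(mutex_);
                done_ = true;
            }
            cv_.notify_all();
            for (auto& pe : pes_) pe.join();
        }
        void submit(Task t) {
            {
                std::lock_guard<std::mutex> lock(mutex_);
                tasks_.push(std::move(t));
            }
            cv_.notify_one();
        }
    private:
        void run() {  // each virtual PE pulls tasks until shutdown
            for (;;) {
                Task t;
                {
                    std::unique_lock<std::mutex> lock(mutex_);
                    cv_.wait(lock, [this] { return done_ || !tasks_.empty(); });
                    if (done_ && tasks_.empty()) return;
                    t = std::move(tasks_.front());
                    tasks_.pop();
                }
                t();  // execute the module's work on this PE
            }
        }
        std::vector<std::thread> pes_;
        std::queue<Task> tasks_;
        std::mutex mutex_;
        std::condition_variable cv_;
        bool done_ = false;
    };

    int main() {
        Scheduler sched(4);  // four virtual processing elements
        // Stream four data chunks through a filter stage; chunks execute
        // concurrently across PEs, combining data- and task-parallelism.
        for (int chunk = 0; chunk < 4; ++chunk)
            sched.submit([chunk] {
                std::cout << "filter stage on chunk " << chunk << "\n";
            });
    }  // destructor runs any queued tasks before shutting the PEs down

Because the scheduler, rather than the pipeline topology, decides where each task runs, the same pattern extends to heterogeneous elements: a PE could wrap a GPU context instead of a CPU thread, which is the flexibility the dissertation's design aims for.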
Type Text
Publisher University of Utah
Subject Dataflow architecture; Heterogeneous platforms; Multi-CPU; Multi-GPU; Parallel execution
Dissertation Institution University of Utah
Dissertation Name Doctor of Philosophy
Language eng
Rights Management Copyright © Huy T. Vo 2011
Format Medium application/pdf
Format Extent 35,190,222 bytes
Identifier us-etd3,38255
Source Original housed in Marriott Library Special Collections, QA3.5 2011 .V6
ARK ark:/87278/s6gh9zqf
Setname ir_etd
ID 194543
Reference URL https://collections.lib.utah.edu/ark:/87278/s6gh9zqf