• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 96
Next 10 →

Using the SkelCL Library for High-Level GPU Programming of 2D Applications

by Michel Steuwer, Sergei Gorlatch, Matthias Buß, Stefan Breuer
"... Abstract. Application programming for GPUs (Graphics Processing Units) is complex and error-prone, because the popular approaches — CUDA and OpenCL — are intrinsically low-level and offer no special support for systems consisting of multiple GPUs. The SkelCL library offers pre-implemented recurring ..."
Abstract - Add to MetaCart
processing applications on two-dimensional data. We show that both from a productivity and a performance point of view it is beneficial to use the high-level abstractions of SkelCL. 1

SkelCL - A Portable Skeleton Library for High-Level GPU Programming

by Michel Steuwer, Sergei Gorlatch - In IPDPSW-25: International Symposium on Parallel and Distributed Processing Workshops and PhD Forum. IEEE , 2011
"... c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to s ..."
Abstract - Cited by 22 (8 self) - Add to MetaCart
high-level abstractions. The especially challenging systems with multiple GPU are not addressed at all by these low-level programming models. We propose SkelCL – a library providing so-called algorithmic skeletons that capture recurring patterns of parallel compu-tation and communication, together

Towards highlevel programming of multi-GPU systems using the SkelCL library

by Michel Steuwer, Sergei Gorlatch - in 2012 IEEE International Symposium on Parallel and Distributed Processing Workshops (IPDPSW , 2012
"... c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to s ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
provides an abstract vector data type and a high-level data (re)distribution mechanism to shield the programmer from the low-level data transfers between the system’s main memory and multiple GPUs. In this paper, we focus on the specific support in SkelCL for systems with multiple GPUs and use a real

High-level programming for medical imaging on multi-GPU systems using the SkelCL library

by Michel Steuwer, Sergei Gorlatch - In ICCS , 2013
"... Application development for modern high-performance systems with Graphics Processing Units (GPUs) relies on low-level programming approaches like CUDA and OpenCL, which leads to complex, lengthy and error-prone programs. In this paper, we present SkelCL – a high-level programming model for systems w ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Application development for modern high-performance systems with Graphics Processing Units (GPUs) relies on low-level programming approaches like CUDA and OpenCL, which leads to complex, lengthy and error-prone programs. In this paper, we present SkelCL – a high-level programming model for systems

Extending the SkelCL Skeleton Library for Stencil Computations on Multi-GPU Systems

by Stefan Breuer, Michel Steuwer, Sergei Gorlatch
"... The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and other accelerators currently relies on manually-tuned coding using low-level approaches like OpenCL and CUDA, which makes it a com-plex, time-consuming, and error-prone task. We describe how stencil compu ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
computations can be programmed in our SkelCL approach that combines high level of programming abstrac-tion with competitive performance on multi-GPU systems. SkelCL extends the OpenCL standard by three high-level features: 1) pre-implemented parallel patterns (a.k.a. skele-tons); 2) container data types

SkelCL – A Portable Multi-GPU Skeleton Library

by Michel Steuwer, Sergei Gorlatch , 2010
"... Modern Graphics Processing Units (GPU) are increasingly used as general-purpose processors. While the two currently most widely used programming models for GPUs, CUDA and OpenCL, are a step ahead as compared to ex-tremely laborious shader programming, they still remain effort-demanding and error-pro ..."
Abstract - Add to MetaCart
present SkelCL – a novel library for simplifying OpenCL programming of single and multiple GPUs. SkelCL provides for the application programmer two main abstractions: Firstly, a unified memory management that implicitly transfers data transfers between the system’s and the GPU’s memory. Lazy copying

Parallel Processing Letters c © World Scientific Publishing Company HIGH-LEVEL PROGRAMMING OF STENCIL COMPUTATIONS ON MULTI-GPU SYSTEMS USING THE SKELCL LIBRARY

by Michel Steuwer, Michael Haidl, Stefan Breuer, Sergei Gorlatch , 2014
"... Communicated by Guest Editors The implementation of stencil computations on modern, massively parallel systems with GPUs and other accelerators currently relies on manually-tuned coding using low-level approaches like OpenCL and CUDA. This makes development of stencil applications a complex, time-co ..."
Abstract - Add to MetaCart
-consuming, and error-prone task. We describe how stencil computations can be programmed in our SkelCL approach that combines high-level programming ab-stractions with competitive performance on multi-GPU systems. SkelCL extends the OpenCL standard by three high-level features: 1) pre-implemented parallel patterns (a

Accelerator: using data parallelism to program GPUs for general-purpose uses

by David Tarditi, Sidd Puri, Jose Oglesby - in Proceedings of the 12th international conference on Architectural , 2006
"... GPUs are difficult to program for general-purpose uses. Programmers can either learn graphics APIs and convert their applications to use graphics pipeline operations or they can use stream programming abstractions of GPUs. We describe Accelerator, a system that uses data parallelism to program GPUs ..."
Abstract - Cited by 117 (0 self) - Add to MetaCart
for general-purpose uses instead. Programmers use a conventional imperative programming language and a library that provides only high-level data-parallel operations. No aspects of GPUs are exposed to programmers. The library implementation compiles the data-parallel operations on the fly to optimized GPU

The PICASSO Framework: A High-Level Tool for Interface Design

by Lawrence A. Rowe, Joseph A. Konstan, Brian C. Smith
"... PICASSO is a graphical user interface development system that includes an interface toolkit and an application framework. The application framework provides high-level abstractions including modal dialog boxes and non-modal frames and panels that simplify the development of GUI applications. These a ..."
Abstract - Add to MetaCart
PICASSO is a graphical user interface development system that includes an interface toolkit and an application framework. The application framework provides high-level abstractions including modal dialog boxes and non-modal frames and panels that simplify the development of GUI applications

Hydrodynamic Computation with Hybrid Programming on CPU-GPU Clusters

by Tingxing Dong, Veselin Dobrev, Tzanio Kolev, Robert Rieben, Stanimire Tomov, Jack Dongarra
"... The explosion of parallelism and heterogeneity in today’s computer architectures has created opportunities as well as challenges for redesigning legacy numerical software to harness the power of new hardware. In this paper we address the main challenges in redesigning BLAST – a numerical library tha ..."
Abstract - Add to MetaCart
that solves the equations of compressible hydrodynamics using high order finite element methods (FEM) in a moving Lagrangian frame – to support CPU-GPU clusters. We use a hybrid MPI + OpenMP + CUDA programming model that includes two layers: domain decomposed MPI parallelization and OpenMP + CUDA acceleration
Next 10 →
Results 1 - 10 of 96
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University