Results 1 - 10
of
286
Data-parallel string-manipulating programs
, 2012
"... Applications ranging from malware detection to graphics to Web security sanitization depend on string transformations, but writing such transformations is a challenge. Making these transformations run in parallel on a cluster of machines or special hardware is an even greater challenge. We answer th ..."
Abstract
-
Cited by 8 (5 self)
- Add to MetaCart
, or JavaScript. Next, we show how the resulting transducers, post-exploration, fit into a recent advance in data-parallel compilation of finite state machines. Finally, we describe a concrete implementation built on the Windows High Perfor-mance Computing framework in a cluster. We have implemented our
Augur: Data-Parallel Probabilistic Modeling
"... Implementing inference procedures for each new probabilistic model is time-consuming and error-prone. Probabilistic programming addresses this problem by allowing a user to specify the model and then automatically generating the inference procedure. To make this practical it is important to generate ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
to generate high per-formance inference code. In turn, on modern architectures, high performance re-quires parallel execution. In this paper we present Augur, a probabilistic modeling language and compiler for Bayesian networks designed to make effective use of data-parallel architectures such as GPUs. We
Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform
"... Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model of execution ..."
Abstract
- Add to MetaCart
Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model
Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform
"... Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model of execution ..."
Abstract
- Add to MetaCart
Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model
Fast Address Sequence Generation for Data-Parallel Programs Using Integer Lattices
- IN LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, C.-H. HUANG ET AL. (EDITORS), LECTURE NOTES IN COMPUTER SCIENCE
, 1996
"... In data-parallel languages such as High Performance Fortran and Fortran D, arrays are mapped to processors through a two step process involving alignment followed by distribution. A compiler that generates code for each processor has to compute the sequence of local memory addresses accessed by each ..."
Abstract
-
Cited by 20 (7 self)
- Add to MetaCart
In data-parallel languages such as High Performance Fortran and Fortran D, arrays are mapped to processors through a two step process involving alignment followed by distribution. A compiler that generates code for each processor has to compute the sequence of local memory addresses accessed
High-Performance GISAXS Code for Polymer Science*
"... Grazing Incidence Small-Angle Scattering (GISAXS) is a valuable experimental technique in probing nanostructures of relevance to polymer science [1]. Experimentalists are expressing a growing need for efficient and scalable simulation tools to dissect the massive volume of GISAXS data gathered at st ..."
Abstract
- Add to MetaCart
at stateof-the-art beamlines. New high-performance computing algorithms, codes, and software tools have been developed to analyze GISAXS images generated at synchrotron light sources. We have implemented a flexible GISAXS simulation code based on the Distorted Wave Born Approximation (DWBA) written in C
PIPS Is not (just) Polyhedral Software Adding GPU Code Generation in PIPS
"... Parallel and heterogeneous computing are growing in audience thanks to the increased performance brought by ubiquitous manycores and GPUs. However, available programming models, like OPENCL or CUDA, are far from being straightforward to use. As a consequence, several automated or semi-automated appr ..."
Abstract
- Add to MetaCart
-automated approaches have been proposed to automatically generate hardware-level codes from high-level sequential sources. Polyhedral models are becoming more popular because of their combination of expressiveness, compactness, and accurate abstraction of the data-parallel behaviour of programs. These models provide
Optimising Purely Functional GPU Programs
"... Purely functional, embedded array programs are a good match for SIMD hardware, such as GPUs. However, the naive compilation of such programs quickly leads to both code explosion and an excessive use of intermediate data structures. The resulting slowdown is not acceptable on target hardware that is ..."
Abstract
-
Cited by 12 (3 self)
- Add to MetaCart
Purely functional, embedded array programs are a good match for SIMD hardware, such as GPUs. However, the naive compilation of such programs quickly leads to both code explosion and an excessive use of intermediate data structures. The resulting slowdown is not acceptable on target hardware
Meta-programming and Auto-tuning in the Search for High Performance GPU Code
"... Writing high performance GPGPU code is often difficult and time-consuming, potentially requiring laborious manual tuning of low-level details. Despite these challenges, the cost in ignoring GPUs in high performance computing is increasingly large. Auto-tuning is a potential solution to the problem o ..."
Abstract
- Add to MetaCart
Writing high performance GPGPU code is often difficult and time-consuming, potentially requiring laborious manual tuning of low-level details. Despite these challenges, the cost in ignoring GPUs in high performance computing is increasingly large. Auto-tuning is a potential solution to the problem
Architecture Synthesis of High-Performance Application-Specific Processors
, 1991
"... Abstract The key principles of the Application-Specific Processor Design (ASPD) methodology include: a semi-custom compilation-driven design/implementation approach, the exploitation of fine-grained parallelism for high performance. and the adaptation of datapath topology to the data transfers requ ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Abstract The key principles of the Application-Specific Processor Design (ASPD) methodology include: a semi-custom compilation-driven design/implementation approach, the exploitation of fine-grained parallelism for high performance. and the adaptation of datapath topology to the data transfers
Results 1 - 10
of
286