• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 286
Next 10 →

Data-parallel string-manipulating programs

by Margus Veanes, David Molnar, Todd Mytkowicz, Benjamin Livshits , 2012
"... Applications ranging from malware detection to graphics to Web security sanitization depend on string transformations, but writing such transformations is a challenge. Making these transformations run in parallel on a cluster of machines or special hardware is an even greater challenge. We answer th ..."
Abstract - Cited by 8 (5 self) - Add to MetaCart
, or JavaScript. Next, we show how the resulting transducers, post-exploration, fit into a recent advance in data-parallel compilation of finite state machines. Finally, we describe a concrete implementation built on the Windows High Perfor-mance Computing framework in a cluster. We have implemented our

Augur: Data-Parallel Probabilistic Modeling

by Jean-baptiste Tristan, Daniel Huang, Joseph Tassarotti, Adam Pocock, Stephen J. Green, Guy L. Steele
"... Implementing inference procedures for each new probabilistic model is time-consuming and error-prone. Probabilistic programming addresses this problem by allowing a user to specify the model and then automatically generating the inference procedure. To make this practical it is important to generate ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
to generate high per-formance inference code. In turn, on modern architectures, high performance re-quires parallel execution. In this paper we present Augur, a probabilistic modeling language and compiler for Bayesian networks designed to make effective use of data-parallel architectures such as GPUs. We

Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform

by unknown authors
"... Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model of execution ..."
Abstract - Add to MetaCart
Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model

Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform

by unknown authors
"... Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model of execution ..."
Abstract - Add to MetaCart
Abstract – The Cell BE processor is capable of achieving very high levels of performance via parallel computation. The processors in video accelerators, known as GPUs, are also high performance parallel processors. The RapidMind Development Platform provides a simple data-parallel model

Fast Address Sequence Generation for Data-Parallel Programs Using Integer Lattices

by Ashwath Thirumalai, J. Ramanujam - IN LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, C.-H. HUANG ET AL. (EDITORS), LECTURE NOTES IN COMPUTER SCIENCE , 1996
"... In data-parallel languages such as High Performance Fortran and Fortran D, arrays are mapped to processors through a two step process involving alignment followed by distribution. A compiler that generates code for each processor has to compute the sequence of local memory addresses accessed by each ..."
Abstract - Cited by 20 (7 self) - Add to MetaCart
In data-parallel languages such as High Performance Fortran and Fortran D, arrays are mapped to processors through a two step process involving alignment followed by distribution. A compiler that generates code for each processor has to compute the sequence of local memory addresses accessed

High-Performance GISAXS Code for Polymer Science*

by Slim Chourou, Abhinav Sarje, Sherry Li, Elaine Chan, Er Hexemer
"... Grazing Incidence Small-Angle Scattering (GISAXS) is a valuable experimental technique in probing nanostructures of relevance to polymer science [1]. Experimentalists are expressing a growing need for efficient and scalable simulation tools to dissect the massive volume of GISAXS data gathered at st ..."
Abstract - Add to MetaCart
at stateof-the-art beamlines. New high-performance computing algorithms, codes, and software tools have been developed to analyze GISAXS images generated at synchrotron light sources. We have implemented a flexible GISAXS simulation code based on the Distorted Wave Born Approximation (DWBA) written in C

PIPS Is not (just) Polyhedral Software Adding GPU Code Generation in PIPS

by Mehdi Amini
"... Parallel and heterogeneous computing are growing in audience thanks to the increased performance brought by ubiquitous manycores and GPUs. However, available programming models, like OPENCL or CUDA, are far from being straightforward to use. As a consequence, several automated or semi-automated appr ..."
Abstract - Add to MetaCart
-automated approaches have been proposed to automatically generate hardware-level codes from high-level sequential sources. Polyhedral models are becoming more popular because of their combination of expressiveness, compactness, and accurate abstraction of the data-parallel behaviour of programs. These models provide

Optimising Purely Functional GPU Programs

by Trevor L. Mcdonell, Manuel M. T. Chakravarty, Gabriele Keller
"... Purely functional, embedded array programs are a good match for SIMD hardware, such as GPUs. However, the naive compilation of such programs quickly leads to both code explosion and an excessive use of intermediate data structures. The resulting slowdown is not acceptable on target hardware that is ..."
Abstract - Cited by 12 (3 self) - Add to MetaCart
Purely functional, embedded array programs are a good match for SIMD hardware, such as GPUs. However, the naive compilation of such programs quickly leads to both code explosion and an excessive use of intermediate data structures. The resulting slowdown is not acceptable on target hardware

Meta-programming and Auto-tuning in the Search for High Performance GPU Code

by Michael Vollmer, Bo Joel, Svensson Eric, Holk Ryan, R. Newton
"... Writing high performance GPGPU code is often difficult and time-consuming, potentially requiring laborious manual tuning of low-level details. Despite these challenges, the cost in ignoring GPUs in high performance computing is increasingly large. Auto-tuning is a potential solution to the problem o ..."
Abstract - Add to MetaCart
Writing high performance GPGPU code is often difficult and time-consuming, potentially requiring laborious manual tuning of low-level details. Despite these challenges, the cost in ignoring GPUs in high performance computing is increasingly large. Auto-tuning is a potential solution to the problem

Architecture Synthesis of High-Performance Application-Specific Processors

by Mauricio Breternitz Jr , John Paul Shen , 1991
"... Abstract The key principles of the Application-Specific Processor Design (ASPD) methodology include: a semi-custom compilation-driven design/implementation approach, the exploitation of fine-grained parallelism for high performance. and the adaptation of datapath topology to the data transfers requ ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
Abstract The key principles of the Application-Specific Processor Design (ASPD) methodology include: a semi-custom compilation-driven design/implementation approach, the exploitation of fine-grained parallelism for high performance. and the adaptation of datapath topology to the data transfers
Next 10 →
Results 1 - 10 of 286
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University