Results 1 - 10
of
16
Characterization of Intel Xeon Phi for Linear Algebra
, 2014
"... This study focuses on applicability of Intel Xeon Phi coprocessor for some of the Basic Linear Algebra Subprograms (BLAS) subroutines. Based on Many Integrated Core (MIC) architecture, the vector processing unit (VPU) in Xeon Phi coprocessor provides data parallelism at a very fine grain, working on ..."
Abstract
- Add to MetaCart
This study focuses on applicability of Intel Xeon Phi coprocessor for some of the Basic Linear Algebra Subprograms (BLAS) subroutines. Based on Many Integrated Core (MIC) architecture, the vector processing unit (VPU) in Xeon Phi coprocessor provides data parallelism at a very fine grain, working
Energy Characterization and Instruction-Level Energy Model of Intel’s Xeon Phi Processor
"... Abstract—Intel’s Xeon Phi is the first commercial manycore/multi-thread x86-based processor. Xeon Phi belongs to a new breed of high performance computing processors that seek high compute density as well as energy efficiency. However, no highlevel energy model is available for Xeon Phi software dev ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
Abstract—Intel’s Xeon Phi is the first commercial manycore/multi-thread x86-based processor. Xeon Phi belongs to a new breed of high performance computing processors that seek high compute density as well as energy efficiency. However, no highlevel energy model is available for Xeon Phi software
Parallel Audio Quick Search on Shared-Memory Multiprocessor Systems
"... Audio search plays an important role in analyzing audio data and retrieving useful audio information. In this paper, a Partially Overlapping Block-Parallel Active Search method (POBPAS) is proposed to perform audio quick search on shared-memory multiprocessor systems (SMPs). This method uses a prope ..."
Abstract
- Add to MetaCart
performance characterization analysis of the parallel implementation of the POBPAS for three data sets on two Intel Xeon SMPs. Experimental results indicate that there are no obvious parallel limiting factors in the implementation except memory bandwidth. As a result, it can achieve 11.3X speedup for a larger
Positive-Definite Matrices on Heterogeneous GPU-Based Systems
"... Abstract—The goal of this paper is to implement an efficient matrix inversion of symmetric positive-definite matrices on heterogeneous GPU-based systems. The matrix inversion pro-cedure can be split into three stages: computing the Cholesky factorization, inverting the Cholesky factor and calculatin ..."
Abstract
- Add to MetaCart
and calculating the product of the inverted Cholesky factor with its transpose to get the final inverted matrix. Using high performance data layout, which represents the matrix in the system memory with an optimized cache-aware format, the computation of the three stages is decomposed into fine
unknown title
"... considered in performance predictions. Most existing approaches use established prediction models [3, 18] to estimate the performance of already existing complex software systems. Their main focus lies on the questions: i) ”How can we automatically derive or extract the models we need? ” and ii) ”Ho ..."
Abstract
- Add to MetaCart
considered in performance predictions. Most existing approaches use established prediction models [3, 18] to estimate the performance of already existing complex software systems. Their main focus lies on the questions: i) ”How can we automatically derive or extract the models we need? ” and ii
Software and Systems Modeling The final publication is available at www.springerlink.com Performance Modeling and Analysis of Message-oriented Event-driven Systems
"... Abstract Message-oriented event-driven systems are becoming increasingly ubiquitous in many industry do-mains including telecommunications, transportation and supply chain management. Applications in these areas typically have stringent requirements for performance and scalability. To guarantee adeq ..."
Abstract
- Add to MetaCart
Abstract Message-oriented event-driven systems are becoming increasingly ubiquitous in many industry do-mains including telecommunications, transportation and supply chain management. Applications in these areas typically have stringent requirements for performance and scalability. To guarantee
1 MOSES: a Framework for QoS Driven Runtime Adaptation of Service-oriented Systems
"... Abstract—Architecting software systems according to the serviceoriented paradigm, and designing runtime self-adaptable systems are two relevant research areas in today’s software engineering. In this paper we address issues that lie at the intersection of these two important fields. First, we presen ..."
Abstract
- Add to MetaCart
present a characterization of the problem space of self-adaptation for service-oriented systems, thus providing a frame of reference where our and other approaches can be classified. Then, we present MOSES, a methodology and a software tool implementing it to support QoS-driven adaptation of a service
Databases and Distributed Systems Group, TU Darmstadt, Germany,
"... Message-oriented middleware (MOM) is at the core of a vast number of financial services and telco applications, and is gaining increasing traction in other industries, such as manufacturing, transportation, health-care and supply chain management. Novel messaging applications, however, pose some ser ..."
Abstract
- Add to MetaCart
workload characterization of SPECjms2007 with the goal to help users understand the internal components of the workload and the way they are scaled, ii) we show how the workload can be customized to exercise and evaluate selected aspects of MOM performance, iii) we present a case study of a leading JMS
Saskatoon By
"... In presenting this thesis in partial fulfilment of the requirements for a Postgraduate degree ..."
Abstract
- Add to MetaCart
In presenting this thesis in partial fulfilment of the requirements for a Postgraduate degree
Results 1 - 10
of
16