• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Predicting coherence communication by tracking synchronization points at run time.” (2012)

by S Demetriades, S Cho
Venue:in MICRO,
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

Modeling Cache Coherence Misses on Multicores

by Xiaoyue Pan , Bengt Jonsson
"... Abstract-While maintaining the coherency of private caches, invalidation-based cache coherence protocols introduce cache coherence misses. We address the problem of predicting the number of cache coherence misses in the private cache of a parallel application when running on a multicore system with ..."
Abstract - Add to MetaCart
Abstract-While maintaining the coherency of private caches, invalidation-based cache coherence protocols introduce cache coherence misses. We address the problem of predicting the number of cache coherence misses in the private cache of a parallel application when running on a multicore system with an invalidation-based cache coherence protocol. We propose three new performance models (uniform, phased and symmetric) for estimating the number of coherence misses from information about inter-core data sharing patterns and the individual core's data reuse patterns. The inputs to the uniform and phased models are the write frequency and reuse distance distribution of shared data from different cores. This input can be obtained either from profiling the target application on a single core or by analyzing the data access pattern statically, and does not need a detailed simulation of the pattern of interleaving accesses to shared data. The output of the models is an estimated number of coherence misses of the target application. The output can be combined with the number of other kinds of misses to estimate the total number of misses in each core's private cache. This output can also be used to guide program optimization to improve cache performance. We evaluate our models with a set of benchmarks from the PARSEC benchmark suite on real hardware.
(Show Context)

Citation Context

... which makes our profiling process insensitive to interference. Another advantage of analytical-based approaches including ours is the ability to evaluate performance in another system’s settings. For example, our model can be used to predict the cache misses with a different cache size. Optimization with inter-core data sharing: Zhang et al [23] point out that the inter-core data reuse is not fully exploited by the current on-chip cache hierarchy or the state-of-the-art optimizations. An optimization scheme that balances the inter-core and intra-core data reuse is proposed. Demetriades et al [24] propose a run-time coherence miss prediction scheme. The scheme is based on the observation that the coherence misses and synchronization points in a program are usually correlated. IX. CONCLUSION In this paper, we proposed three new analytical models to analyze cache coherence misses for a multi-threaded application on multicore. The model builds on the observation that the occurence of a coherence miss is caused by a foreign write interleaving with the reuse of a shared cache line. The model quantifies the cache coherence misses of a core with the reuse distance distribution and the frequen...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University