Results 1 - 10
of
625
Hitting the Memory Wall: Implications of the Obvious
- Computer Architecture News
, 1995
"... This brief note points out something obvious--- something the authors "knew" without really understanding. With apologies to those who did understand, we offer it to those others who, like us, missed the point. We all know that the rate of improvement in microprocessor speed exceeds the ra ..."
Abstract
-
Cited by 393 (1 self)
- Add to MetaCart
the rate of improvement in DRAM memory speed--- each is improving exponentially, but the exponent for microprocessors is substantially larger than that for DRAMs. The difference between diverging exponentials also grows exponentially; so, although the disparity between processor and memory speed is already
Approaches to Addressing the Memory Wall
"... The memory wall is the predicted situation where improvements to processor speed will be masked by the much slower improvement in dynamic random access (DRAM) memory speed. Since the prediction was made in 1995, considerable progress has been made in addressing the memory wall. There have been advan ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
The memory wall is the predicted situation where improvements to processor speed will be masked by the much slower improvement in dynamic random access (DRAM) memory speed. Since the prediction was made in 1995, considerable progress has been made in addressing the memory wall. There have been
How Multithreading Addresses the Memory Wall
"... The memory wall is the predicted situation where improvements to processor speed will be masked by the much slower improvement in dynamic random access (DRAM) memory speed. Since the prediction was made in 1995, considerable progress has been made in addressing the memory wall. There have been advan ..."
Abstract
- Add to MetaCart
The memory wall is the predicted situation where improvements to processor speed will be masked by the much slower improvement in dynamic random access (DRAM) memory speed. Since the prediction was made in 1995, considerable progress has been made in addressing the memory wall. There have been
Overcoming the memory wall in packet processing: hammers or ladders
- In ANCS ’05
, 2005
"... Overhead of memory accesses limits the performance of packet processing applications. To overcome this bottleneck, today’s network processors can utilize a wide-range of mechanisms—such as multi-level memory hierarchy, wide-word accesses, special-purpose result-caches, asynchronous memory, and hardw ..."
Abstract
-
Cited by 14 (0 self)
- Add to MetaCart
Overhead of memory accesses limits the performance of packet processing applications. To overcome this bottleneck, today’s network processors can utilize a wide-range of mechanisms—such as multi-level memory hierarchy, wide-word accesses, special-purpose result-caches, asynchronous memory
Organizing the Last Line of Defense before Hitting the Memory Wall for CMPs
- In Proceedings of the Tenth IEEE Symposium on High-Performance Computer Architecture
, 2004
"... The last line of defense in the cache hierarchy before going to off-chip memory is very critical in chip multiprocessors (CMPs) from both the performance and power perspectives. This paper investigates different organizations for this last line of defense (assumed to be L2 in this paper) towards red ..."
Abstract
-
Cited by 77 (1 self)
- Add to MetaCart
The last line of defense in the cache hierarchy before going to off-chip memory is very critical in chip multiprocessors (CMPs) from both the performance and power perspectives. This paper investigates different organizations for this last line of defense (assumed to be L2 in this paper) towards
BREAKING THE MEMORY WALL FOR HIGHLY MULTI-THREADED CORES
, 2010
"... Emerging applications such as scientific computation, media processing, machine learning and data mining are commonly computation- and data- intensive [1], and they usually exhibit abundant parallelism. These applications motivate the design of throughputoriented many- and multi-core architectures t ..."
Abstract
- Add to MetaCart
is not limited by the computation resources, but by the overhead in data movement. In fact, adding more cores or threads is likely to harm performance due to contention in the memory system. It is therefore important to improve data management to either reduce or tolerate data movement and associated latencies
Named Memorial Wall 6 Order Form 8
"... Not many 19-year-olds set off on a national campaign to save lives during their freshman year in college. But Dieter Schmitz is everything but a normal 19 year-old. Schmitz, a San Diego State hospitality and tourism management freshman, who many know from the hit MTV reality show, “Laguna Beach: Th ..."
Abstract
- Add to MetaCart
Not many 19-year-olds set off on a national campaign to save lives during their freshman year in college. But Dieter Schmitz is everything but a normal 19 year-old. Schmitz, a San Diego State hospitality and tourism management freshman, who many know from the hit MTV reality show, “Laguna Beach: The Real Orange County,” has used his fame to found and create his own non-profit, Running Home 4 Teens. RH4T has caught the nation’s attention and support, all while benefiting SAVE.
Overcoming the Memory Wall in Symbolic Algebra: A Faster Permutation Multiplication
- SIGSAM Bulletin
, 2003
"... The traditional permutation multiplication algorithm is now limited by memory latency and not by CPU speed. A new cache-aware permutation algorithm speeds up permutation multiplication by a factor of 3.4 on current CPUs. The new algorithm is limited by memory bandwidth, but not by memory latency. ..."
Abstract
-
Cited by 7 (3 self)
- Add to MetaCart
. Current trends indicate improving memory bandwidth and stagnant memory latency. This makes the new algorithm especially important for future computer architectures. In addition, we believe this "memory wall" will soon force a redesign of other common algorithms of symbolic algebra.
MANAGING THE FPGA MEMORY WALL: CUSTOM COMPUTING OR VECTOR PROCESSING?
"... Managing the memory wall is critical for massively par-allel FPGA applications where data-sets are large and exter-nal memory must be used. We demonstrate that a soft vector processor can efficiently stream data from external memory whilst running computation in parallel. A non-trivial neu-ral compu ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Managing the memory wall is critical for massively par-allel FPGA applications where data-sets are large and exter-nal memory must be used. We demonstrate that a soft vector processor can efficiently stream data from external memory whilst running computation in parallel. A non-trivial neu
Scalable Parallel Coset Enumeration: Bulk Definition and the Memory Wall
"... Coset enumeration, like Gröbner bases, is a notoriously difficult algorithm to parallelize. We demonstrate a successful shared memory parallelization achieving a 7 times speedup on an Origin 2000 CC-NUMA computer using 16 CPUs. We take as a testbed, an enumeration of Lyons's group (8,835,15 ..."
Abstract
-
Cited by 8 (4 self)
- Add to MetaCart
Coset enumeration, like Gröbner bases, is a notoriously difficult algorithm to parallelize. We demonstrate a successful shared memory parallelization achieving a 7 times speedup on an Origin 2000 CC-NUMA computer using 16 CPUs. We take as a testbed, an enumeration of Lyons's group (8
Results 1 - 10
of
625