|
28
|
Merge: A Programming Model for Heterogeneous Multi-core Systems Abstract
– Michael D. Linderman, Jamison D. Collins, Hong Wang, Teresa H. Meng
|
|
35
|
Chip Multiprocessing and the Cell Broadband Engine
– M Gschwind
|
|
82
|
Nvidia tesla: A unified graphics and computing architecture
– E Lindholm, J Nickolls, S Oberman, J Montrym
- 2008
|
|
25
|
Carbon: architectural support for fine-grained parallelism on chip multiprocessors
– Sanjeev Kumar, Christopher J. Hughes, Anthony Nguyen
- 2007
|
|
3637
|
D.A.Patterson, “Computer Architecture: A quantitative Approach”, Fourth edition
– J L Hennessy
- 2007
|
|
84
|
A class of parallel tiled linear algebra algorithms for multicore architectures
– Alfredo Buttari, Julien Langou, Jakub Kurzak, Jack Dongarra
- 2007
|
|
112
|
Photon Mapping on Programmable Graphics Hardware
– Timothy J. Purcell, Craig Donner, Mike Cammarano, Henrik Wann Jensen, Pat Hanrahan
- 2003
|
|
101
|
The design and implementation of a firstgeneration cell processor
– D Pham, S Asano, M Bolliger, M N Day, H P Hofstee, C Johns, J Kahle, A Kameyama, J Keaty, Y Masubuchi, M Riley, D Shippy, D Stasiak, M Suzuoki, M Wang, J Warnock, S Weitzel, D Wendel, T Yamazaki, K Yazawa
- 2005
|
|
57
|
Accelerator: using data parallelism to program GPUs for general-purpose uses
– David Tarditi, Sidd Puri, Jose Oglesby
- 2006
|
|
73
|
Skadron K: Scalable Parallel Programming with CUDA. Queue 2008
– J Nickolls, I Buck, M Garland
|
|
32
|
W.: MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
– J A Stratton, S S Stone, mei W Hwu
- 2008
|
|
114
|
Brook for GPUs: Stream Computing on Graphics Hardware
– Ian Buck, Tim Foley, Daniel Horn, Jeremy Sugerman, Kayvon Fatahalian, Mike Houston, Pat Hanrahan
- 2004
|
|
93
|
Interactive Global Illumination using Fast Ray Tracing
– Ingo Wald, Thomas Kollig, Carsten Benthin, Alexander Keller, Philipp Slusallek
- 2002
|
|
33
|
Theoretical modeling of superscalar processor performance
– D B Noonburg, J P Shen
- 1994
|
|
24
|
Packet-based whitted and distribution ray tracing
– S Boulos, D Edwards, J D Lacewell, J Kniss, J Kautz, P Shirley, I Wald
- 2007
|
|
9
|
A first-order fine-grained multithreaded throughput model
– X E Chen, T M Aamodt
- 2009
|
|
231
|
A survey of general-purpose computation on graphics hardware
– John D. Owens, David Luebke, Naga Govindaraju, Mark Harris, Jens Krüger, Aaron E. Lefohn, Tim Purcell
- 2007
|
|
99
|
Programmable stream processors
– Ujval J. Kapasi, William J. Dally, Scott Rixner, John D. Owens, Brucek Khailany
- 2003
|
|
16
|
QR and Cholesky factorizations using vector capabilities of GPUs
– V Volkov, J LU Demmel
- 2008
|