@MISC{Vouzis_agpu, author = {Panagiotis Vouzis and Mark Arnold}, title = {A GPU Implementation of the Complex Logarithmic Number System}, year = {} }
Abstract
In this paper we present a technique to implement the Complex Logarithmic Number System (CLNS) on a Graphics Processing Unit (GPU). Although CLNS multiplication is a simple floating-point (FP) addition, CLNS addition involves the evaluation of transcendental functions, which can be carried out in several ways using the GPU hardware resources: the special function units, the floating-point units, or the texture memory. We propose implementing CLNS with the linear interpolation capabilities of the otherwise unused GPU texture memory. In an algorithm that involves complex arithmetic, the CLNS additions can be offloaded to the texture memory and overlapped with other computations, saving clock cycles. The proposed technique has been implemented on an Nvidia GT200 GPU with CUDA, and its clock-cycle requirements and error behavior are presented and analyzed.
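The arithmetic described in the abstract can be illustrated with a small CPU-side sketch. It is not the authors' CUDA code. It assumes a natural-logarithm encoding, log|z| + i·arg(z), whereas the paper may use a different base or fixed-point format. The names `to_clns`, `clns_mul`, `clns_add`, `s_exact`, and `s_interp`, as well as the table range and size, are hypothetical. The bilinear table lookup only mimics, in plain Python, the kind of linear interpolation a GPU texture unit performs.

```python
import cmath
import math

def to_clns(z: complex) -> complex:
    """Encode a nonzero complex number as log|z| + i*arg(z) (assumed encoding)."""
    return cmath.log(z)

def from_clns(w: complex) -> complex:
    """Decode a CLNS value back to an ordinary complex number."""
    return cmath.exp(w)

def wrap_angle(w: complex) -> complex:
    """Keep the imaginary (angle) part within [-pi, pi]."""
    return complex(w.real, math.remainder(w.imag, 2 * math.pi))

def clns_mul(a: complex, b: complex) -> complex:
    """Multiplication is an addition of the logarithmic representations."""
    return wrap_angle(a + b)

def s_exact(Z: complex) -> complex:
    """Transcendental function needed by addition: log(1 + e^Z)."""
    return cmath.log(1 + cmath.exp(Z))

def clns_add(a: complex, b: complex, s=s_exact) -> complex:
    """Addition via z1 + z2 = z1 * (1 + z2/z1); fails when z1 == -z2."""
    if b.real > a.real:          # make `a` the larger magnitude, so Re(Z) <= 0
        a, b = b, a
    Z = wrap_angle(b - a)
    return wrap_angle(a + s(Z))

# Hypothetical lookup table over Re(Z) in [-8, 0] and Im(Z) in [-3, 3].
# The range stops short of Im(Z) = +/-pi to avoid the singularity at Z = i*pi.
RE_MIN, RE_MAX, IM_MIN, IM_MAX, N = -8.0, 0.0, -3.0, 3.0, 257

def _grid(lo: float, hi: float, k: int) -> float:
    return lo + (hi - lo) * k / (N - 1)

TABLE = [[s_exact(complex(_grid(RE_MIN, RE_MAX, i), _grid(IM_MIN, IM_MAX, j)))
          for j in range(N)] for i in range(N)]

def s_interp(Z: complex) -> complex:
    """Bilinear interpolation in TABLE; falls back to s_exact outside the table."""
    u = (Z.real - RE_MIN) / (RE_MAX - RE_MIN) * (N - 1)
    v = (Z.imag - IM_MIN) / (IM_MAX - IM_MIN) * (N - 1)
    if not (0 <= u <= N - 1 and 0 <= v <= N - 1):
        return s_exact(Z)
    i, j = min(int(u), N - 2), min(int(v), N - 2)
    fu, fv = u - i, v - j
    low = TABLE[i][j] * (1 - fv) + TABLE[i][j + 1] * fv
    high = TABLE[i + 1][j] * (1 - fv) + TABLE[i + 1][j + 1] * fv
    return low * (1 - fu) + high * fu

a, b = to_clns(3 + 4j), to_clns(1 - 2j)
product = from_clns(clns_mul(a, b))            # close to 11-2j
exact_sum = from_clns(clns_add(a, b))          # close to 4+2j
approx_sum = from_clns(clns_add(a, b, s_interp))  # 4+2j up to interpolation error
```

The interpolated result differs from the exact one by an amount that depends on the table spacing. Zero has no logarithm, so a real implementation needs a separate convention for it.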