GSWO: a programming model for GPU-enabled parallelization of sliding window operations in image processing.

Po Yang, Gordon Clapworthy, Feng Dong, Valeriu Codreanu, David Williams, Baoquan Liu, Jos B. T. M. Roerdink, Zhikun Deng

Research output: Contribution to journalArticle

5 Citations (Scopus)
1 Downloads (Pure)

Abstract

Sliding Window Operations (SWOs) are widely used in image processing applications. They often have to be performed repeatedly across the target image, which can demand significant computing resources when processing large images with large windows. In applications in which real-time performance is essential, running these filters on a CPU often fails to deliver results within an acceptable timeframe. The emergence of sophisticated graphic processing units (GPUs) presents an opportunity to address this challenge. However, GPU programming requires a steep learning curve and is error-prone for novices, so the availability of a tool that can produce a GPU implementation automatically from the original CPU source code can provide an attractive means by which the GPU power can be harnessed effectively. This paper presents a GPU-enabled programming model, called GSWO, which can assist GPU novices by converting their SWO-based image processing applications from the original C/C++ source code to CUDA code in a highly automated manner. This model includes a new set of simple SWO pragmas to generate GPU kernels and to support effective GPU memory management. We have implemented this programming model based on a CPU-to-GPU translator (C2GPU). Evaluations have been performed on a number of typical SWO image filters and applications. The experimental results show that the GSWO model is capable of efficiently accelerating these applications, with improved applicability and a speed-up of performance compared to several leading CPU-to-GPU source-to-source translators.
Original languageEnglish
Pages (from-to)332-345
Number of pages14
JournalSignal Processing: Image Communication
Volume47
DOIs
Publication statusPublished - 2 Jul 2016

    Fingerprint

Keywords

  • parallel computing
  • sliding window operation
  • CUDA
  • automatic translation

Cite this