Region of interest scalable image compression using semantic communications

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution book

Abstract

Growing consumer demand for media content over a wide range of devices has made scalable image compression vital in today’s media landscape. Image compression is conventionally achieved by means of statistical signal processing, but more recently deep learning techniques have come to be widely used as well. The capabilities of such systems also enable accurate identification of regions of interest in images, leading to optimised performance in most applications. This paper proposes a region-of-interest scalable image compression system using semantic communications, in which an autoencoder-based semantic encoder performs the base-level compression, while a Semantic Mask Extracting Transformer (SeMExT) identifies regions of interest so that enhancement layers with different quality levels can be created using a scalable JPEG encoder. When benchmarked against scalable JPEG across a variety of images, the proposed system demonstrates significantly improved compressive performance. The base layer achieved 61.4 times more compression on average, along with better rate-distortion performance at any given quality level.
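
The layered structure described in the abstract can be illustrated with a deliberately simplified Python sketch. It is not the paper's implementation: block averaging stands in for the autoencoder-based semantic encoder, a hand-written boolean mask stands in for the SeMExT output, and uniform quantisation of the residual stands in for the scalable JPEG enhancement layers. All function names and parameters here are hypothetical.

```python
# Toy sketch of region-of-interest (ROI) scalable coding on a greyscale image
# stored as a list of rows of ints. Every component is a simple stand-in
# (assumption), chosen only to show base layer + ROI enhancement layers.

def encode_base(img, block):
    """Base layer: one rounded mean per block x block tile (coarse, cheap)."""
    h, w = len(img), len(img[0])
    base = []
    for by in range(0, h, block):
        row = []
        for bx in range(0, w, block):
            tile = [img[y][x]
                    for y in range(by, min(by + block, h))
                    for x in range(bx, min(bx + block, w))]
            row.append(round(sum(tile) / len(tile)))
        base.append(row)
    return base

def decode_base(base, block, h, w):
    """Expand each tile mean back to full resolution."""
    return [[base[y // block][x // block] for x in range(w)] for y in range(h)]

def encode_enhancement(img, recon, mask, step):
    """Enhancement layer: quantised residual inside the ROI, None elsewhere."""
    return [[round((img[y][x] - recon[y][x]) / step) if mask[y][x] else None
             for x in range(len(img[0]))] for y in range(len(img))]

def apply_enhancement(recon, layer, step):
    """Add the dequantised residual back wherever the layer carries data."""
    return [[recon[y][x] if layer[y][x] is None else recon[y][x] + layer[y][x] * step
             for x in range(len(recon[0]))] for y in range(len(recon))]

def roi_max_error(img, recon, mask):
    """Largest absolute pixel error inside the ROI."""
    return max((abs(img[y][x] - recon[y][x])
                for y in range(len(img)) for x in range(len(img[0])) if mask[y][x]),
               default=0)

def progressive_decode(img, mask, block, steps):
    """Decode the base layer, then refine the ROI with one layer per step size."""
    h, w = len(img), len(img[0])
    recon = decode_base(encode_base(img, block), block, h, w)
    errors = [roi_max_error(img, recon, mask)]
    for step in steps:  # coarser steps first, finer steps later
        layer = encode_enhancement(img, recon, mask, step)
        recon = apply_enhancement(recon, layer, step)
        errors.append(roi_max_error(img, recon, mask))
    return recon, errors

if __name__ == "__main__":
    img = [[10, 12, 200, 210],
           [11, 13, 205, 215],
           [50, 52, 90, 95],
           [51, 53, 92, 99]]
    mask = [[False, False, True, True],
            [False, False, True, True],
            [False, False, False, False],
            [False, False, False, False]]
    recon, errors = progressive_decode(img, mask, block=2, steps=[8, 2])
    print("ROI max error per quality level:", errors)
```

In this toy setting each enhancement layer bounds the ROI error by half its quantisation step, while pixels outside the ROI stay at base-layer quality. That is the sense in which quality scales by region; the actual rate-distortion behaviour reported in the abstract depends on the learned encoder and the JPEG layers, which this sketch does not model.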
Original language: English
Title of host publication: IEEE 42nd International Conference on Consumer Electronics
Place of Publication: Piscataway, NJ
Publisher: IEEE
Number of pages: 4
Publication status: Accepted/In press - 1 Nov 2023
Event: IEEE 42nd International Conference on Consumer Electronics - Las Vegas, United States
Duration: 5 Jan 2024 to 8 Jan 2024
https://icce.org/2024/

Conference

Conference: IEEE 42nd International Conference on Consumer Electronics
Abbreviated title: IEEE ICCE 2024
Country/Territory: United States
City: Las Vegas
Period: 5/01/24 to 8/01/24

Keywords

  • deep neural networks
  • image compression
  • region of interest
  • scalable image compression
  • communications
