A multi-task framework for 2D scene parsing and 3D reconstruction in indoor space documentation

Serdar Erişen, Mansour Mehranfar, André Borrmann

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

Abstract

Smart indoor environments with recognition technologies that prioritise human comfort and well-being necessitate accurate spatial information. Laser scanners and photogrammetry technologies have become essential for creating accurate as-built digital building models by capturing high-quality point clouds with precise geometry, yet they often lack proper semantic annotations. This limitation necessitates developing methods to improve 2D and 3D scene understanding. To address this, a multi-task learning framework is proposed combining depth estimation and semantic segmentation techniques to enhance 2D scene parsing for effective 3D reconstruction from single images by generating and learning 3D surface masks. The results on the selected TUM CMS Indoor Point Clouds dataset demonstrate the effectiveness of the proposed framework in 3D reconstruction, achieving 81.55% mIoU accuracy, which supports multiple applications for indoor digital twinning, automated 3D object recognition, and simulation tasks in indoor spaces.
Original languageEnglish
Title of host publicationEG-ICE 2025
Subtitle of host publicationAI-Driven Collaboration for Sustainable and Resilient Built Environments Conference Proceedings
EditorsAlejandro Moreno-Rangel, Bimal Kumar
Place of PublicationGlasgow
Number of pages10
DOIs
Publication statusPublished - 1 Jul 2025
EventEG-ICE 2025: International Workshop on Intelligent Computing in Engineering - The Technology and Innovation Centre, Glasgow, United Kingdom
Duration: 1 Jul 20253 Jul 2025
https://egice2025.co.uk/

Conference

ConferenceEG-ICE 2025: International Workshop on Intelligent Computing in Engineering
Country/TerritoryUnited Kingdom
CityGlasgow
Period1/07/253/07/25
Internet address

Keywords

  • indoor environments
  • semanic segmentation
  • point clouds
  • digital twinning
  • 3D reconstruction

Fingerprint

Dive into the research topics of 'A multi-task framework for 2D scene parsing and 3D reconstruction in indoor space documentation'. Together they form a unique fingerprint.

Cite this