Content-based classification of construction drawings: a comparative study of vision transformers and graph attention networks

Andrea Carrara, Stavros Nousias, André Borrmann

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

Abstract

Automated classification of construction drawings is essential for improving efficiency and reducing errors in Architecture, Engineering, and Construction (AEC) workflows. While Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have shown success in image-based tasks, they often fall short in capturing the relational and symbolic structures inherent in technical drawings. This paper presents a comparative study of Vision Transformers and Graph Attention Networks (GAT) using a real-world dataset of 450 professional construction drawings, each labeled in four standardized categories: Project Phase, Discipline, Representation, and Level. Drawings are represented in two formats: rasterized images and structured vector-based graphs and processed through dedicated deep learning pipelines. Experimental results reveal that GNNs outperform ViTs in overall accuracy, particularly in structure-sensitive categories like Level, by leveraging spatial and topological relationships. While pretrained ViTs demonstrate strong performance, particularly in visually distinct categories, and offer faster training throughput, GNNs provide superior generalization and interpretability. The study highlights the trade-offs between accuracy, computational efficiency, and practical deployment, offering valuable insights for integrating deep learning into real-world AEC classification systems.
Original languageEnglish
Title of host publicationEG-ICE 2025
Subtitle of host publicationAI-Driven Collaboration for Sustainable and Resilient Built Environments Conference Proceedings
EditorsAlejandro Moreno-Rangel, Bimal Kumar
Place of PublicationGlasgow
Number of pages9
DOIs
Publication statusPublished - 1 Jul 2025
EventEG-ICE 2025: International Workshop on Intelligent Computing in Engineering - The Technology and Innovation Centre, Glasgow, United Kingdom
Duration: 1 Jul 20253 Jul 2025
https://egice2025.co.uk/

Conference

ConferenceEG-ICE 2025: International Workshop on Intelligent Computing in Engineering
Country/TerritoryUnited Kingdom
CityGlasgow
Period1/07/253/07/25
Internet address

Keywords

  • construction drawing classification
  • graph neural network
  • vision transformer
  • transformer based models
  • comparative analysis

Fingerprint

Dive into the research topics of 'Content-based classification of construction drawings: a comparative study of vision transformers and graph attention networks'. Together they form a unique fingerprint.

Cite this