TY - GEN
T1 - Segment Anything Model for Scan-to-Structural Analysis in Cultural Heritage
AU - Galanakis, Demitrios
AU - Lucho, Stuardo
AU - Maravelakis, Emmanuel
AU - Bolanakis, Nikolaos
AU - Konstantaras, Antonios
AU - Vidakis, Nectarios
AU - Petousis, Markos
AU - Treuillet, Sylvie
AU - Desquesnes, Xavier
AU - Brunetaud, Xavier
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Segment anything model (SAM) proposed by META Artificial Intelligence (AI) has disrupted the space of Deep Learning (DL) and Machine Learning (ML) and claims performance that outrivals conventional Convolutional Neural Networks (CNNs). The well recognized success of foundation models paved the way towards Natural Language Processing (NLP), which in computer vision domain, rests upon the concept of vision transformers (ViT). The claimed scalability and zero-shot predictability of the proposed mode has spurred extensive research among different fields. In this sense, this paper aims to investigate Segment Anything Model (SAM) potential in cultural heritage (CH) inclined scenarios. This is part of an ongoing project that seeks to implement scan-to-structural analysis at stone level. Based on the findings of this research, SAM zero-shot segmentation ability is quite promising for complex scene understanding. In addition, its tendency to favor small objects may provide a unique opportunity for damage detection and defects classification. Finally, even without any pre-training, SAM exhibits remarkable performance in edge detection and masking, without sacrificing generalizability and its accuracy in segmentation correlates to spatial resolution and background separation.
AB - Segment anything model (SAM) proposed by META Artificial Intelligence (AI) has disrupted the space of Deep Learning (DL) and Machine Learning (ML) and claims performance that outrivals conventional Convolutional Neural Networks (CNNs). The well recognized success of foundation models paved the way towards Natural Language Processing (NLP), which in computer vision domain, rests upon the concept of vision transformers (ViT). The claimed scalability and zero-shot predictability of the proposed mode has spurred extensive research among different fields. In this sense, this paper aims to investigate Segment Anything Model (SAM) potential in cultural heritage (CH) inclined scenarios. This is part of an ongoing project that seeks to implement scan-to-structural analysis at stone level. Based on the findings of this research, SAM zero-shot segmentation ability is quite promising for complex scene understanding. In addition, its tendency to favor small objects may provide a unique opportunity for damage detection and defects classification. Finally, even without any pre-training, SAM exhibits remarkable performance in edge detection and masking, without sacrificing generalizability and its accuracy in segmentation correlates to spatial resolution and background separation.
KW - cultural heritage
KW - FEM
KW - Finite Element Modelling
KW - HBIM
KW - Heritage BIM
KW - segment anything model
UR - http://www.scopus.com/inward/record.url?scp=85204549761&partnerID=8YFLogxK
U2 - 10.1109/EEITE61750.2024.10654401
DO - 10.1109/EEITE61750.2024.10654401
M3 - Conference contribution
AN - SCOPUS:85204549761
T3 - EEITE 2024 - Proceedings of 2024 5th International Conference in Electronic Engineering, Information Technology and Education
BT - EEITE 2024 - Proceedings of 2024 5th International Conference in Electronic Engineering, Information Technology and Education
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference in Electronic Engineering, Information Technology and Education, EEITE 2024
Y2 - 29 May 2024 through 31 May 2024
ER -