Visual Instance Retrieval for Cultural Heritage Artifacts using Feature Pyramid Network
Abstract
Digitized photographs are commonly employed by archaeologists to assist in uncovering ancient artefacts. However, locating a specific image within a vast collection remains a significant obstacle. The metadata associated with images is often sparse, making keyword-based searches difficult. In this paper, we propose a new visual search method that improves retrieval performance by using visual descriptors generated from a feature pyramid network. This network is a convolutional neural network (CNN) model that incorporates two additional modules for feature extraction and enhancement. The first module encodes an image into regional features through spatial pyramid pooling, while the second module emphasizes distinctive spatial features. Additionally, we introduce a two-stage feature attention mechanism to enhance feature quality; the attended features are then aggregated into a compact descriptor that is used to search for the image. We tested the proposed method on benchmark datasets and on a large public collection of Thailand’s ancient artefacts. Our experiments show that the proposed method achieves a mean average precision of 77.9%, outperforming existing CNN-based visual descriptors.
Keywords: Landmark retrieval, Image retrieval, Deep Learning
DOI: 10.54941/ahfe1002933
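The pipeline outlined in the abstract (regional pooling, attention weighting, aggregation into a compact descriptor, and evaluation by average precision) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the pyramid levels (1, 2, 3), the use of max pooling, and the single-stage norm-based softmax attention are all assumptions chosen for illustration, standing in for the paper's CNN backbone and two-stage attention.

```python
import numpy as np


def pyramid_regions(fmap, levels=(1, 2, 3)):
    """Max-pool a (C, H, W) feature map over an l x l grid per pyramid level.

    Hypothetical stand-in for the paper's spatial pyramid pooling module.
    Assumes H and W are at least as large as the largest level.
    """
    _, h, w = fmap.shape
    regions = []
    for level in levels:
        ys = np.linspace(0, h, level + 1).astype(int)
        xs = np.linspace(0, w, level + 1).astype(int)
        for i in range(level):
            for j in range(level):
                cell = fmap[:, ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
                regions.append(cell.max(axis=(1, 2)))
    return np.stack(regions)  # shape (R, C)


def attend_and_aggregate(regions):
    """Weight regions by a softmax over their L2 norms, then sum and normalize.

    Assumption: a simple one-stage attention, not the paper's two-stage one.
    """
    norms = np.linalg.norm(regions, axis=1)
    weights = np.exp(norms - norms.max())  # subtract max for numerical stability
    weights /= weights.sum()
    desc = (weights[:, None] * regions).sum(axis=0)
    return desc / (np.linalg.norm(desc) + 1e-12)


def describe(fmap):
    """Map a feature map to a compact, L2-normalized global descriptor."""
    return attend_and_aggregate(pyramid_regions(fmap))


def rank(query_desc, db_descs):
    """Return database indices sorted by descending cosine similarity."""
    return np.argsort(-(db_descs @ query_desc))


def average_precision(ranked_relevance):
    """Average precision for one query, given 0/1 relevance in ranked order."""
    rel = np.asarray(ranked_relevance, dtype=float)
    if rel.sum() == 0:
        return 0.0
    precision_at_k = np.cumsum(rel) / (np.arange(len(rel)) + 1)
    return float((precision_at_k * rel).sum() / rel.sum())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Random arrays stand in for CNN feature maps of five database images.
    database = np.stack([describe(rng.random((8, 7, 7))) for _ in range(5)])
    query = database[2]  # querying with a database image ranks it first
    print(rank(query, database))
```

Mean average precision, the metric reported above, is the mean of `average_precision` over all queries. Because the descriptors are L2-normalized, the dot product in `rank` equals cosine similarity.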
AHFE Open Access