Common Ground: the Swedish Archaeology of Athens, Istanbul, and Rome – Arches Data Integration Project

Project Context:

The Swedish Institutes’ archaeological datasets were fragmented across multiple databases, file formats, and institutions, limiting research interoperability and increasing administrative burdens. The project aimed to create a unified, semantically modelled research environment using Arches and CIDOC CRM, enabling sustainable, cross-institutional access to excavation, survey, archival, spatial, and image data.

Scope & Datasets:

Integration of seven primary archaeological databases (Kalaureia, Asea, Pragmata, Etruria, Labraunda, Asine archives) totaling ~115,000 records and ~145 fields, alongside GIS files and image collections. The datasets cover excavation finds, contextual information, spatial data, and archival documentation.

Key Activities:

  • System Setup & Customization: Arches instance deployment and interface customization
  • Data Analysis & Wrangling: Structural review, cleaning, and standardization
  • Semantic Modelling: CIDOC CRM–based conceptual models, harmonized across datasets
  • Data Transformation & Loading: Conversion to Arches-compatible formats, ingestion, and validation
  • Testing & Documentation: Quality control, feedback cycles, and creation of workflow and technical documentation

Outcome:

A unified, semantically integrated research environment enabling cross-institutional access, long-term sustainability, and a foundation for future Linked Open Data integration, reducing data silos and supporting collaborative archaeological research.