Knowledge Graphs for Digital Twins in AECO
Knowledge Graphs for Digital Twins in AECO
Automation in Construction
journal homepage: [Link]/locate/autcon
A R T I C L E I N F O A B S T R A C T
Keywords: The emergence of digital twin technologies offers a promising avenue for improving decision-making through the
Digital twin integrated use of up-to-date physical or synthetically simulated data. Nevertheless, the practical implementation
BIM of digital twins in the built environment remains a significant challenge. This paper describes a system that
Data integration
seamlessly integrates data into digital twins of built assets. The system uses a knowledge graph to achieve data
Knowledge graph
Microservice architecture
integration, which is designed to be modular, flexible, and interoperable. The graph includes BIM models,
metadata from an external IoT platform, and process-related information. The system is microservice-based and
revolves around a graph database housing the knowledge graph. It employs dynamic operations to update the
knowledge graph and is tested using civil engineering infrastructure examples. Results from this work can be
used to create pipelines that extract and operate with data connecting computational agents integrated into the
system as microservices or connected through the system API.
1. Introduction exchange. Cloud computing enables remote data and application stor
age, enhancing software management for various purposes. Advanced
The built environment is a dynamic system of systems that inter simulation (AS), Data Analytics (DA), and Artificial Intelligence (AI)
connect economic infrastructure, social infrastructure, and the natural unlock the value of data, transforming it into a crucial business asset.
environment. The built environment serves as a vital network of services These advancements lead to intricate data structures, linking physical
that knit the proper functioning of societies. It continuously evolves and assets to their Digital Twins (DTs): virtual replicas integrating physical
as a result, the Architecture, Engineering, Construction, and Operation data, virtual models, simulations, AI, and other layers. These DTs
players navigate the intricate process of designing, constructing, oper become unexpected asset interfaces for end-users, potentially serving as
ating, maintaining, and decommissioning the constituent assets which comprehensive decision-making hubs. While Building Information
are unique, but whose digital forms tend to be different for all involved Modelling (BIM) offers dependable virtual representations and digital
agents. Efficiency in pre-digital ecosystems was maximized with collaboration in the built environment, there remains a gap in inte
decentralized organizational and data structures. Efficiency on such grating DTs with BIM, IoT systems, cloud data, simulations, and AI into
ambitious information constructs such as digital twins can only be an adaptable solution capable of addressing stakeholder diversity and
maximized with more adaptive data structures. The successful execution specific built asset requirements.
of projects throughout the lifespan of an asset relies on the collaboration European efforts are directed towards bridging this gap. Exemplary
of multidisciplinary stakeholders who simultaneously manage multiple Research and Innovation Actions (RIA) such as Ashvin [1], BIMprove
projects across various locations, with independent collaborators, and [2], COGITO [3], or BIM2TWIN [4] altogether explore the development
with diverse contexts. In this sense, presently, AECO is a complex of digital twin solutions from a vast perspective. This particular research
network that generates, consumes, and exchanges an extensive volume paper is framed within one of these projects, Ashvin, whose vision for
of data of diverse natures, in multiple formats, and through different demonstrating digital twin applications at design, construction, and
communication channels. operation was based on implementing a DT cloud platform and a series
The Internet of Things (IoT) is increasing data sharing among a wide of cohesive tools on ten very varied demonstrators. Ranging from
range of physical and virtual devices, facilitating real-time information buildings to bridges, from industrial to residential buildings, or from
* Corresponding author.
E-mail address: [Link]@[Link] (C. Ramonell).
[Link]
Received 31 May 2023; Received in revised form 20 September 2023; Accepted 25 September 2023
Available online 5 October 2023
0926-5805/© 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC license ([Link]
nc/4.0/).
C. Ramonell et al. Automation in Construction 156 (2023) 105109
airport runways to quay walls in ports, these demonstrators shaped the The research focused on developing digital twin applications is
need for embedding such variety. Aggregated information from diverse relatively mature in other disciplines such as the aerospace industry [5],
assets reveals strengths, weaknesses, and system needs. Flexible archi healthcare [6], and manufacturing [7]. More recently, the term shows a
tectures must accommodate varied tools, use cases, and data. digitization path in the construction industry [8] whose ultimate goal is
This paper outlines a system for integrating data from various efficiency maximization. The construction sector is very varied. Thus,
sources to create comprehensive digital twins of built assets. The system the design and implementation of Digital Twins are not yet consensually
utilizes a knowledge graph as the primary integration mechanism of defined within the sector. The level of development is still patchy
varied pieces of information that comprise a digital twin. This proposal ranging from models that cannot be updated easily with physical data to
relies on the present nature of BIM and existing Standards. The knowl hyper-specific yet accurate single-function DTs that serve one particular
edge graph allows for efficient querying and exploration of contextual solution. Those actions are presently driven by the industry needs and by
ized asset data. It also establishes a base for generating information use-case-specific outcomes.
pipelines that operate with data and that provide services. On a service- In the realm of the built environment, the digital twin concept
based platform, end-users can retrieve information for timely assistance initially drew inspiration from the aerospace and manufacturing sectors
in decision-making processes. In the remainder of this paper, section 2 [10]. These industries utilize tailored, prolonged monitoring systems in
introduces the concept of vast-scoped digital twins and elaborates on all controlled industrial settings, offering highly accurate predefined sim
needs and challenges for implementation in the built environment. ulations for enhancing product productivity and life-cycle management.
Section 3 elaborates on the objectives of the study and introduces the Nevertheless, the diverse and complex nature of the built environment
methodology, based on a set of civil engineering infrastructures in which introduces unique challenges, which hinder the direct application of
the system is tested. Section 4 elaborates on the BIM-to-Twin transition strategies employed in other industries.
(or evolution) whereas Section 5 describes more comprehensively, the
use of Knowledge Graphs in Digital Twins (KGDT) and finally section 7 2.1. The virtual representation of built assets
describes the knowledge graph-based proposed system.
To create a comprehensive virtual representation of a given asset of
2. Digital twins of built assets in the AEC industry: vision and the built environment, multiple knowledge domains must be described
challenges and integrated within a Unified Virtual Model (UVM). For instance, a
UVM of a building should encompass domains such as architecture,
The term “Digital Twin” still deserves lines of presentation. structural engineering, HVAC, MEP or building management. This UVM
Consensus about a unique definition is far from reached. In an attempt to model constitutes the core of an ambitious, comprehensive digital twin:
identify a general definition of the term, Vanderhorn and Mahadevan a single source of truth in which geometric representations, built
[9] reviewed 65 different definitions published between 2010 and 2020 product descriptions, process representations, multi-physics and mana
from which they abstracted the following: gerial information are seamlessly intertwined.
A Digital Twin is “a virtual representation of a physical system (and its The built environment also encompasses different asset scales. These
associated environment and processes) that is updated through the exchange scales span from individual elements within an asset to, for instance, a
of information between the physical and the virtual systems” [9]. large network of assets in a city [11]. At the element level, virtual
Noticeably, this definition is generic, and specific questions arise models must capture detailed physical properties and behaviour of in
regarding many topics such as i) the nature of all virtual representations dividual asset components. At the asset level, the models must account
of physical systems, ii) the type of data that can be obtained from the for the components’ spatial arrangement, interactions, and asset systems
physical system and how it is transported and integrated within the behaviour. The models must describe the broader context at the infra
virtual system or, iii) the level of development of the Digital Twin. structure and city levels, including the natural and urban environment,
Understanding industry specifics, context, and use case needs shapes a transportation systems, and utilities.
digital twin’s functionality and technology components. Moreover, built assets are unique. Built assets are bespoke, with
2
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 2. Traversing the built environment scale through digital twin intercommunication.
3
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Digital twins are expected to contextualize this data within a single showcasing how this system is structured and how it enables semantic
UVM to provide meaningful and useful access to such varied collected integration and interaction with the information it contains, regardless
data. This contextualization of data enables different computational of its nature.
agents to establish pipelines that extract, analyse and transform this BIM models represent the Standard virtual representation for all
data. This is a more comprehensive way to provide insights for decision- demonstrators. To build their digital twin system, creating a BIM model
making support services. Thus, to improve usefulness, maintainability represents a necessary first step since it provides the semantic basis. This
and adaptability, digital twins must support technologies that allow basis is thus extended for integrating the various types of data collected
adding agents modularly via Application Programming Interfaces (API). in each demonstrator.
4
C. Ramonell et al. Automation in Construction 156 (2023) 105109
5
C. Ramonell et al. Automation in Construction 156 (2023) 105109
6
C. Ramonell et al. Automation in Construction 156 (2023) 105109
new knowledge from data through semantic axioms and inference rules. stack, the concept is not limited to these technologies. Companies are
Ontologies can be published openly on the web to promote interopera increasingly using property graph models and graph databases to store
bility among graph-based applications or can be created as private in and manage their knowledge graphs, where transactional efficiency and
formation assets for cross-enterprise data management. As such, they data privacy are critical concerns [38]. Examples of open-access
have become essential data models that allow for the creation of knowledge graphs built on RDF-related formats include DBpedia [39],
knowledge graphs, interrelating concepts from different domains [37]. Wikidata [40], BabelNet [41], and YAGO [42].
5.1.3. Knowledge graphs 5.2. Knowledge graphs as DT enablers for the built environment
A knowledge graph is a graph-based data structure that emphasizes
contextual understanding of data by interlinking metadata. This is ideal Akroyd et al. [43,44] suggest that a dynamic general-purpose
for application in scenarios that require integrating, managing, and knowledge graph is ideally suited for digital twins. This knowledge
extracting value from diverse sources at a large scale [38]. Knowledge graph should include a combination of ontologies and autonomous
graphs provide several advantages over traditional data models for agents that continually operate on it. By using ontologies, knowledge
modelling, structuring, managing, and analyzing heterogeneous and graphs promote the established use of data, facilitating reuse and
complex data with dynamic relationships [37]. They allow representing interoperability. Their multi-domain nature allows for the addition of
complex abstractions of knowledge in specific domains or across do new ontologies and the establishment of relationships between related
mains. Unlike relational or NoSQL models, knowledge graphs allow data terms to enhance interconnectivity. Additionally, their distributed na
to evolve flexibly, as they do not require pre-defined schemas. Moreover, ture permits hosting diverse data types in different locations while
these constructs can be made interoperable by mapping their entities to providing links to their original repositories. Thus, data owners retain
existing ontologies, enabling transactions of data and its context be control over hosting and access depending on data sensitivity. Further
tween software applications. Modern graph analytics provide further more, the hierarchical and extensible ontological structure allows rep
insights from the domains described in the graphs. While knowledge resentation at various scales (product, building, city, and national
graphs are often associated with RDF and the semantic web technology systems). The interconnectedness of concepts and instances in
7
C. Ramonell et al. Automation in Construction 156 (2023) 105109
knowledge graphs, combined with dynamic updates from computational ETH Centre (SEC) to improve city management and planning. Within
agents and live data feeds, enables multiple interactions between players this frame, Chadzynski et al. [46] integrated city information models in
within a given digital twin. These properties suggest high suitability for the knowledge graph based on OntoCityGML ontology [47], and sub
implementing larger-scale digital twins. sequently implemented a set of intelligent software agents to perform
For instance, at a city scale, City Knowledge Graphs (CKG) [45] are specific data processing and analytical tasks based on interactions with
being developed by the Cambridge Centre for Advanced Research and such graph. A knowledge graph-based system was also used within the
Education in Singapore (CARES) in collaboration with the Singapore- CReDo project (Climate Resilience Demonstrator) as part of the digital
8
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 9. Example of the effect of the conversion in the structure of IFC relations. Relation entities are bridged using the corresponding inverse attributes of the
IFC entities.
The potential lies in the IFC schema, which is an ontology [51] that
Table 4 can be utilized for constructing the knowledge graph. The translation of
Import results for the IFC models of the ASHVIN demonstrators. the IFC schema to a more common ontology language is proposed by
File Demo IFC Schema Nodes Relations Pauwels and Terkaj [52]. These authors generated an OWL version,
[Link] 9 IFC4 44 122 named the ifcOWL ontology. Presently, ifcOWL graphs result in rather
[Link] 10 IFC4 93 199 large models due to the built-in complexity of the IFC schema (mainly its
Valdelinares_IFC4.ifc 1 IFC4 157 517 geometrical definitions). Alternatively, the Linked Building Data (LBD)
[Link] 1 IFC4 205 979 group has developed a set of more granular ontologies conceived to
[Link] 4 IFC2X3 405 1320
StegDortmund-R2021_Neu.ifc 8 IFC2X3 688 2618
contribute modularly and thus, they can be extended with third-party
Kineum_CM.ifc 5 IFC4 721 2249 contributions. These ontologies help generate a more flexible descrip
[Link] 5 IFC4 820 3301 tion of the built environment that adjusts to each use case’s needs [53].
[Link] 5 IFC4 846 3376 Some of these ontologies are listed below:
MILE_Avila.ifc 6 IFC4 993 4897
GDY_ZB_30_before_renovation.ifc 2 IFC2X3 1148 4984
zadar_airport.ifc 3 IFC4 1714 7244 • Building Topology Ontology [54]
Llobregat_Viaduct.ifc 7 IFC4 2083 8895 • Building Element Ontology [55]
MILE_PE_STR_U1.ifc 6 IFC2X3 6373 42,104 • Ontology for distribution elements [56]
Kineum_plan16_NCC.ifc 5 IFC2X3 12,599 74,595 • Damage monitoring Ontology [57]
A-40-V-4_space.ifc 5 IFC2X3 28,664 219,895
• Bridge topology Ontology [58]
• Building Product Ontology [59]
twin programme in the UK [48], where the knowledge graph combined • Ontology for Managing Properties [60]
the description of assets from energy, water and telecom networks with • Ontology for Managing Geometry [61]
data from flood simulation and with models that describe the effect of • File ontology for geometry formats [62]
the flood on the assets [49,50]. Narrowing the scale to the building level,
knowledge graph-based systems are not yet found in academic Although those ontologies still do not cover the whole semantic
literature. richness of the IFC schema, they are paving the way to more efficient
9
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Table 5
Conversion result of [Link].
Table 6
Conversion result of Kineum_CM.ifc.
methods for modelling, sharing and operating with data in the built 6. A system for data integration in digital twins of built assets
environment. The graph data structure enables the unified management
of federated BIM models, and graph databases enable granular access In this section, a system that dynamically updates a knowledge graph
and management of information through API queries [63]. IFC models is proposed. It represents a unified interface for multiple data sources
can be expressed in graph format and can be mapped to ifcOWL or that contribute to a digital twin of a built asset. The system is built based
existing LBD ontologies and extended with on-demand knowledge and on the varied requirements and challenges posed by ten different varied-
metadata from external data sources. Therefore, a system capable of in-nature demo sites described in Sections 2 and 3.
dynamically maintaining a knowledge graph represents a promising From the BIM perspective, each demonstrator is initially described
candidate to enable multi-scale and cross-domain digital twins in the with an IFC model. These models are processed to transform the infor
built environment. mation in its original STEP format into two separate parts that are
interlinked: a geometry file and a property-graph-based semantic model.
This graph-based semantic representation of the demonstrators can be
10
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Table 7
Conversion result of GDY_ZB_30_before_renovation.ifc.
extended on demand and, if needed, mapped to existing ontologies to the asset semantic description. The ontologies describe the organizing
make the content interoperable. The property graph is the semantic principles of such data sources and relate them with concepts described
backbone of the system. Section 6.1 describes the system architecture. in the IFC schema.
The transformation from IFC files to a property graph is explained in
Section 6.2.
Data integration is performed considering two scenarios described in 6.1. System architecture
Section 6.2 and Section 6.3. The former illustrates the integration of data
from a local database. It contains digitized, machine-readable repre The system is developed based on a microservices architecture. This
sentations of asset-related processes. The latter shows the integration of design approach breaks the system into small, independent services.
sensor data from an external IoT platform. The integration of both data Each micro-service represents a specific functionality within the system
sources is driven by ontologies that are explicitly developed to extend that can be developed, deployed and scaled independently. Through
the central knowledge graph and, thus, semantically link the data with lightweight information exchanges, these micro-services collaborate to
form a cohesive application. This architecture offers numerous
11
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 11. IFC task simplified ontology. For more detailed information about their definition and properties, refer to [28].
advantages, including enhanced software resiliency, flexibility, and series of systematic steps.
scalability [15]. Micro-services are increasingly prevalent in the devel As a first step, the end-user can employ a graphical user interface
opment of cloud-based applications. (GUI) to transmit the IFC file to the server, where it is securely stored.
One of the key enablers of micro-services development is contain Subsequently, two services help: IfcConvert and Ifc2Graph are employed
erization. Containerization is a technology that enables developers to in parallel to handle the processing of geometry and semantics,
package applications and their dependencies into self-contained units respectively. This simultaneous processing ensures the conversion of
called containers. Each container includes codes, runtime environments, both aspects of the IFC model.
libraries and configurations. It represents a lightweight and portable Subsequently, geometrical data is stored in the server in GLB format.
package. In this study, Docker [64] is selected as the containerization Likewise, semantic information is stored in the Neo4j database and
platform. It is a widely used platform and it also provides Docker Hub structured as a property graph, following the IFC schema [28]. The IFC
[65], a cloud-based repository that allows developers to publish, share schema can be mapped to existing ontologies using an open-source
and discover containers. plugin called Neosemantics [70], which extends the capabilities of
Table 3 provides a concise description of the functionality of the Neo4j to handle RDF data. The process is visually depicted in Fig. 5. This
containers that are used to compose the proposed system. All containers, figure zooms within a specific sub-system representation of this part
the ones developed by the authors and those used by third parties are which belongs to the vaster system.
available on Docker Hub. From Fig. 5, two comments are worth pointing out:
The knowledge graph is persistently stored in the Neo4j graph
database, adhering to the principles of the labelled property graph • IfcConvert is built upon an open-source command-line application
model. The graph provides a semantic index of the data available in the provided by IfcOpenshell. This application facilitates the conversion
system, which is contextualized with the asset information model. The of IFC files into geometry file formats that offer greater interopera
system also provides mechanisms to keep the graph dynamically upda bility, such as OBJ, DAE, GLB, STP, IGS, XML or SVG. The binary GLB
ted as new information is added, as described in the following sections. format is selected in this particular research since it enables more
The developed micro-services communicate using HTTP (HyperText efficient storage and retrieval of geometric information.
Transfer Protocol) requests via REST APIs. Additionally, AMQP • IFC2Graph has been developed with Python and IfcOpenShell Python
(Advanced Messaging Queuing Protocol) communication is used to library to parse IFC data and then communicate with Neo4j [71].
enable event-driven functionalities to keep the graph updated based on
events that are triggered by specific system actions. Fig. 4 shows the Fig. 6 displays the fundamental structure within the graph database
overall system architecture schema. after importing IFC files. The database encompasses assets (represented
It is worth emphasizing that the successful usefulness of the system by blue nodes). Each asset can be associated with multiple IFC files
relies upon designing adequate end-user tools with adapted graphical (yellow nodes). Encapsulated information within each file is linked to
interfaces that enable interaction between users and data. the corresponding file node and it is represented following the IFC-to-
graph conversion algorithm.
6.2. Built asset semantic model: IFC to property-graph
6.2.1. IFC to labelled property graph
IFC2Graph is a developed algorithm for converting IFC files in STEP
The generation of DT for the abovementioned demonstrators re
format to a central Neo4j database. It is specifically designed for
quires a sufficiently comprehensive IFC model that encompasses both
importing the non-geometrical information of IFC models. Similar al
geometric and semantic information. The process of transferring the
gorithms are found in [72], in which researchers develop ways for
data from the IFC file to the digital twin system is carried out through a
12
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 12. Example of task structure in the knowledge graph for a construction process on Demo 6: Column concrete vibration monitoring. Labels and properties are set
according to the IFC schema. [28].
13
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 14. Simplified taxonomy for maintenance tasks linked to IFC-based ontologies (processes).
14
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 15. Example of task data model for a load test according to [75].
15
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 16. Load test tasks in the knowledge graph. Classes for each node are highlighted and framed with a red rectangle. (For interpretation of the references to colour
in this figure legend, the reader is referred to the web version of this article.)
Secondly, the abstract version of those activities needs to be trans according to the IFC schema.
formed into a graph and thus, be linked to the central knowledge graph. This ontology is used as a basis for transformation from JSON to its
This is the way to enable the semantic search of information along with graph format. Fig. 12 exemplifies how tasks are represented in this way:
the rest of the asset model. The amount of information retrieved in this Although tasks described according to the IFC schema provide
activity summary is a decision that needs to be made by the designer comprehensive temporal information, they lack a categorization that
depending on the application of the Digital Twin. facilitates the differentiation between task types for construction and
The IFC schema provides a generic data model for representing maintenance purposes. Hence, to enhance the information detail within
processes (tasks). The use of classes such as IfcWorkPlan, IfcWork the digital twin and enable the systematic storage of task information it
Schedule, IfcTask, and IfcTaskTime allow managing and grouping tasks to is imperative to develop taxonomies that describe these task categories.
be performed in the asset. Fig. 11 depicts a simplified ontology defined This study is not focused on the development of comprehensive
16
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 17. Mainflux Ontology (blue). Map to IFC (orange). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version
of this article.)
17
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 19. Mainflux IoT platform setup generated after importing an IFC file.
Fig. 20. Connectivity between IfcSensors and Mainflux things in the graph.
18
C. Ramonell et al. Automation in Construction 156 (2023) 105109
Fig. 21. Enabled retrieval of contextualized data using integrated IFC models, processes and IoT platform metadata.
The platform also accepts files related to point clouds, images or videos. digital twins. In this study, those requirements have been identified and
Fig. 17 presents the ontology used to integrate the data available in dealt with throughout the development of a Knowledge Graph. Among
Mainflux into the knowledge graph. those requirements, it is understood that digital twin data models need
Mainflux is coupled with the digital twin system using a service to be modular, extendable and interoperable. Multiple domains and
(Mainfluxsync). When a new IFC model is imported, an event is trig different scales need to be encompassed. Likewise, different types of
gered, and that service automatically sets up the Platform to accom data formats with various storage locations need to be contextualized
modate data produced on-site (see Fig. 18). and integrated. It is concluded that a given digital twin base software
Two main channels are created: the “Status” and “RawData” chan system needs to be modular, cloud-based, and operable through APIs.
nels. The former is dedicated to communicating and modifying the To develop a software system that can cope with such requirements,
configuration of connected devices deployed on-site. The latter is a knowledge graph-centred system is proposed. Knowledge graphs allow
dedicated to transmitting live measurements. IfcSensors imported into coupling the semantics of the IFC schema with other ontologies,
the system are mapped as things into the IoT platform and subsequently enabling wider data contextualization. The inherent flexibility of the
connected to both channels. Equivalent entities in the IoT platform and graph model allows dynamic model updates, and the use of modern
the BIM knowledge graph share Global Unique Identifiers (GUIDs), a key graph databases provides granular and efficient data exploration and
aspect for integration. Finally, IoT entities are imported into the graph querying.
according to the ontology. Fig. 19 displays the Mainflux IoT platform The system is developed using a micro-service architecture, which
GUI with the initial setup. facilitates Cloud deployment and enables modularly defined function
Mainflux “Things” and IfcSensors sharing GUID are explicitly con alities. Those functionalities include (1) importing IFC models, which
nected using an IS_A relation within the graph, indicating the equiva are transformed into a graph model, (2) linking process data stored in a
lency of both entities (see Fig. 20). This enables bridging the gap local database and (3) linking sensor data stored in an external IoT
between BIM and IoT allowing contextualized interaction with IoT data platform. The knowledge graph acts as the contextual interface that
through semantic queries. Fig. 21 shows an example of a human- provides a comprehensive view of all data and all models forming a
readable query subsequently written using Cypher. The result shows Digital Twin. The knowledge graph allows us to perform complex
BIM-IoT-Process linked data. queries across data. The system can be thus extended with additional on-
demand data sources following the ontology-based integration pre
7. Conclusions sented throughout this paper.
The Knowledge Graph has been tested for practical implementation
Digital twins are being adopted across industries and are continu in a set of demonstrators. It is observed that this framework represents a
ously evolving to adapt to industry needs. Wide-scoped and fragmented versatile technological compound for the creation of digital twins for
characteristics of the built environment impose specific requirements on built assets. The framework bridges the gap of how DTs can integrate
19
C. Ramonell et al. Automation in Construction 156 (2023) 105109
BIM with IoT systems and other data silos, regardless of the type of in Technologies and Skills, Springer international publishing, 2021, pp. 131–155.
ISBN: 9783030824303.
formation therein contained.
[16] E. Halmetoja, The role of digital twins and their application for the built
The research group is presently developing Knowledge-Graph-based environment, in: Industry 4.0 for the Built Environment: Methodologies,
technologies for many demo cases depicted in this paper. This will result Technologies and Skills, Springer International Publishing, 2022, pp. 415–442.
in a set of information pipelines that extract and operate with data ISBN: 9783030824303.
[17] The official Portal for European Data. [Link]. [Link]
connecting computational agents both internally (integrated into the (accessed May 16, 2023).
system as microservices) and externally (connected through the system [18] A. Lukaszewska, M. Gilun, F. Johansson, R. Jongeling, R. Tomar, C. Claeson-
API). Authentication management, data security, and improvement of Jonsson, M. Jungmann, J. Campos, J. Gonçalves, V.K. Papanikolaou, R. Chacón, D.
G. Carrera, C. Ramonell, H. Posada, L. Ungureanu, S. Skaric, I. Stipanovič, D7.1
interaction workflows with the system are subsequent steps for ASHVIN Technology Demonstration Plan, 2021.
development. [19] BIM Dictionary. [Link] (accessed Feb. 10, 2023).
[20] International Organization for Standardization, ISO 19650 BIM Building
Information Modelling | BSI. [Link]
Declaration of Competing Interest (accessed Feb. 16, 2023).
[21] X.I.N.G. Xuejiao, Z. Botao, L. Hanbin, G.C. Hongliang, G. Chen, Automatic code
The authors declare the following financial interests/personal re compliance checking for design drawings of architecture major and its key
technologies based on BIM, J. Civ. Eng. Manag. 36 (05) (2019) 129–136, https://
lationships which may be considered as potential competing interests: [Link]/10.13579/[Link].2095-0985.2019.05.019.
Rolando Chacon reports financial support was provided by Horizon [22] J. Teizer, J. Melzner, BIM for construction safety and health, in: Building
Europe. Information Modeling: Technology Foundations and Industry Practice, 2018,
pp. 349–365. ISBN: 9783319928616.
[23] T. Fink, BIM for structural engineering, in: Building Information Modeling:
Data availability Technology Foundations and Industry Practice, 2018, pp. 329–336. ISBN:
9783319928616.
[24] A. Braun, S. Tuttas, U. Stilla, A. Borrmann, Bim-based progress monitoring, in:
The data that has been used is confidential.
Building Information Modeling: Technology Foundations and Industry Practice,
2018, pp. 463–476. ISBN: 9783319928616.
Acknowledgements [25] C.V. Treeck, R. Wimmer, T. Maile, BIM for energy analysis, in: Building
Information Modeling: Technology Foundations and Industry Practice, 2018,
pp. 337–347. ISBN: 9783319928616.
This paper was prepared in the context of project ASHVIN. ASHVIN [26] A. Seyrfar, I. Osman, H. Ataei, BIM and building emergency response management:
has received funding from the European Union’s Horizon 2020 research review of applications, For. Eng. (2022) 613–619, [Link]
and innovation programme under grant agreement No 958161. This 9780784484548.063.
[27] H. Alavi, R. Bortolini, N. Forcada, BIM-based decision support for building
publication reflects only author’s view and that the European Com condition assessment, Autom. Constr. 135 (2022) 104117, [Link]
mission is not responsible for any uses that may be made of the infor 10.1016/[Link].2021.104117.
mation it contains. The first author acknowledges the funding of the FI- [28] IFC4.3.1.0 Documentation. [Link]
(accessed May 16, 2023).
2021 AGAUR PhD grant. [29] P. Pauwels, D. Shelden, J. Brouwer, D. Sparks, S. Nirvik, T.P. McGinley, Open data
standards and BIM on the cloud, in: Buildings and Semantics, CRC press, 2022,
References pp. 101–136. ISBN: 9781032023120.
[30] W. Solihin, C. Eastman, Y.C. Lee, D.H. Yang, A simplified relational database
Schema for transformation of BIM data into a query-efficient and spatially enabled
[1] ASHVIN – Digitising and Transforming the European Construction Industry. htt
database, Autom. Constr. 84 (2017) 367–383, [Link]
ps://[Link]/ (accessed Jan. 17, 2023).
autcon.2017.10.002.
[2] Home - Cogito. [Link] (accessed May 22, 2023).
[31] S. Das, J. Srinivasan, M. Perry, E.I. Chong, J. Banerjee, A tale of two graphs:
[3] BIMprove H2020 - Build Smarter, Cleaner, Cheaper. - Build Smarter, Cleaner,
property graphs as RDF in Oracle, EDBT (2014, March) 762–773, [Link]
Cheaper. [Link] (accessed May 22, 2023).
10.5441/002/edbt.2014.82.
[4] HOME - BIM2TWIN. [Link] (accessed May 22, 2023).
[32] Neo4j, Neo4j Graph Data Platform | Graph Database Management System.
[5] L. Li, S. Aslam, A. Wileman, S. Perinpanayagam, Digital twin in aerospace industry:
[Link] (accessed Feb. 10, 2023).
a gentle introduction, IEEE Access 10 (2021) 9543–9562, [Link]
[33] R. Angles, H. Thakkar, D. Tomaszuk, Mapping RDF databases to property graph
10.1109/ACCESS.2021.3136458.
databases, IEEE Access 8 (2020) 86091–86110, [Link]
[6] M. Alazab, L.U. Khan, S. Koppu, S.P. Ramu, M. Iyapparaja, P. Boobalan, T. Baker, P.
ACCESS.2020.2993117.
K.R. Maddikunta, T.R. Gadekallu, A. Aljuhani, Digital twins for healthcare 4.0-
[34] J. Barrassa, A. Cowley, Neosemantics (n10s): Neo4j RDF & Semantics toolkit -
recent advances, architecture, and open challenges, in: IEEE Consumer Electronics
Neo4j Labs. [Link] accessed Feb. 10, 2023.
Magazine, 2022, [Link]
[35] R. Stuger, V.R. Benjamins, D. Fensel, Knowledge engineering: principles and
[7] I. Onaji, D. Tiwari, P. Soulatiantork, B. Song, A. Tiwari, Digital twin in
methods, Data Knowl. Eng. 25 (2) (1998) 161–197, [Link]
manufacturing: conceptual framework and case studies, Int. J. Comput. Integr.
S0169-023X(97)00056-6.
Manuf. 35 (8) (2022) 831–858, [Link]
[36] OWL, OWL - Semantic Web Standards. [Link] (accessed Feb.
0951192X.2022.2027014.
10, 2023).
[8] M. Pregnolato, S. Gunner, E. Voyagaki, R. De Risi, N. Carhart, G. Gavriel, P. Tully,
[37] A. Hogan, E. Blomqvist, M. Cochez, C. d’Amato, G.D. Melo, C. Gutierrez, J.E. Labra
T. Tryfonas, J. Macdonald, C. Taylor, Towards civil engineering 4.0: concept,
Gayo, S. Kirrane, S. Neumaier, A. Polleres, R. Navigli, A.N. Ngonga, S.M. Rashid,
workflow and application of digital twins for existing infrastructure, Autom.
A. Rula, L. Schmelzeisen, J. Sequeda, S. Staab, A. Zimmermann, Knowledge graphs,
Constr. 141 (2022) 104421, [Link]
ACM Comput Surveys (CSUR) 54 (4) (2021) 1–37, [Link]
[9] E. VanDerHorn, S. Mahadevan, Digital twin: generalization, characterization and
arXiv.2003.02320.
implementation, Decis. Support. Syst. 145 (2021) 113524, [Link]
[38] J. Barrasa, A.E. Hodler, J. Webber, Knowledge Graphs. Data in Context for
10.1016/[Link].2021.113524.
Responsive Businesses, O’Reilly Media, Inc, 2021. ISBN: 9781098104856.
[10] J.M.D. Delgado, L. Oyedele, Digital twins for the built environment: learning from
[39] DBpedia Association. [Link] (accessed Feb. 10, 2023).
conceptual and process models in manufacturing, Adv. Eng. Inform. 49 (2021)
[40] Wikidata. [Link] (accessed Feb. 10,
101332, [Link]
2023).
[11] Q. Lu, A.K. Parlikad, P. Woodall, G. Don Ranasinghe, X. Xie, Z. Liang,
[41] BabelNet. [Link] (accessed Feb. 10, 2023).
E. Konstantinou, J. Heaton, J. Schooling, Developing a digital twin at building and
[42] J. Hoffart, F.M. Suchanek, K. Berberich, G. Weikum, YAGO2: a spatially and
City levels: case study of West Cambridge campus, J. Manag. Eng. 36 (3) (2020),
temporally enhanced Knowledge Base from Wikipedia, Artif. Intell. 194 (2013)
05020004, [Link]
28–61, [Link]
[12] C. Boje, A. Guerriero, S. Kubicki, Y. Rezgui, Towards a semantic construction
[43] J. Akroyd, Z. Harper, D. Soutar, F. Farazi, A. Bhave, S. Mosbach, M. Kraft,
digital twin: directions for future research, Autom. Constr. 114 (2020) 103179,
Universal digital twin: land use, Data-Centric Eng 3 (2022), [Link]
[Link]
10.1017/dce.2021.21.
[13] Council, G, K. Lamb, Gemini Papers: Summary, Apollo - University of Cambridge
[44] J. Akroyd, S. Mosbach, A. Bhave, M. Kraft, Universal digital twin-a dynamic
Repository, 2022, [Link]
knowledge graph, Data-Centric Eng 2 (2021), [Link]
[14] R. Chacón, J.R. Casas, C. Ramonell, H. Posada, I. Stipanovic, S. Skaric,
dce.2021.10.
Requirements and challenges for infusion of SHM systems within digital twins
[45] Cambridge CARES. [Link] (accessed July
platforms, Struct. Infrastruct. Eng. (2023), [Link]
22, 2023).
15732479.2023.2225486.
[46] A. Chadzynsky, S. Li, A. Grisiute, F. Farazi, C. Lindberg, S. Mosbach, P. Herthogs,
[15] M. Kosicki, M. Tsiliakos, K. ElAshry, M. Tsigkari, Big data and cloud computing for
M. Kraft, Semantic 3D city agents—an intelligent automation for dynamic
the built environment, in: Industry 4.0 for the Built Environment: Methodologies,
20
C. Ramonell et al. Automation in Construction 156 (2023) 105109
geospatial knowledge graphs, Energy and AI 8 (2022) 100137, [Link] [59] A. Wagner, L.K. Moeller, C. Leifgen, C. Eller, Building Product Ontology. https
10.1016/[Link].2022.100137. ://[Link]/ontologies/bpo/, Nov. 04, 2019.
[47] A. Chadzynski, N. Krdzavac, F. Farazi, M.Q. Lim, S. Li, A. Grisiute, P. Herthogs, [60] M.H. Ramussen, M. Lefrançois, M. Bonduel, Ontology for Property Management.
A. von Richthofen, S. Cairns, M. Kraft, Semantic 3D city database — an enabler for [Link] 2018, November 21.
a dynamic geospatial knowledge graph, Energy and AI 6 (2021) 100106, https:// [61] A. Wagner, M. Bonduel, P. Pauwels, OMG: Ontology for Managing Geometry. https
[Link]/10.1016/[Link].2021.100106. ://[Link]/ontologies/omg/, 2019, November 12.
[48] National Digital Twin Programme, Centre for Digital Built Britain completed its [62] A. Wagner, M. Bonduel, P. Pauwels, R. Uwe, Relating geometry descriptions to its
five-year mission and closed its doors at the end of September 2022. [Link] derivatives on the web, in: 2019 European Conference on Computing in
[Link]/what-we-did/national-digital-twin-programme (accessed May 16, Construction, European Council on Computing in Construction (EC3), 2019,
2023). pp. 304–313, [Link]
[49] S. Hayes, C. Dent, B. Mawdsley, R. Judson, T. Collingwood, CReDo: overview [63] N. Sahlab, S. Kamm, T. Müller, N. Jazdi, M. Weyrich, Knowledge graphs as
report, Apollo - University of Cambridge Repository, 2022, [Link] enhancers of intelligent digital twins, in: 2021 4th IEEE International Conference
10.17863/CAM.82908. on Industrial Cyber-Physical Systems (ICPS), IEEE, 2021, May, pp. 19–24, https://
[50] J. Akroyd, CReDo Methodology Papers: Implementation, Apollo - University of [Link]/10.1109/ICPS49255.2021.9468219.
Cambridge Repository, 2022, [Link] [64] Docker. [Link] (accessed July 22, 2023).
[51] P. Pauwels, A. Costin, M.H. Rasmussen, Knowledge graphs and linked data for the [65] Docker Hub. [Link] (accessed July 22, 2023).
built environment, in: Industry 4.0 for the Built Environment: Methodologies, [66] Neo4j Graph Database & Analytics | Graph Database Management System.
Technologies and Skills, Springer international publishing, 2021, pp. 157–183. [Link] (accessed July 22, 2023).
ISBN: 9783030824303. [67] MongoDB. [Link] (accessed July 22, 2023).
[52] P. Pauwels, W. Terkaj, EXPRESS to OWL for construction industry: towards a [68] Messaging that Just Works — RabbitMQ. [Link] (accessed
recommendable and usable IfcOWL ontology, Autom. Constr. 63 (2016) 100–133, May 16, 2023).
[Link] [69] IfcOpenShell - The Open Source IFC Toolkit and Geometry Engine. https:
[53] D. Mavrokapnidis, K. Katsigarakis, P. Pauwels, E. Petrova, I. Korolija, D. Rovas, //[Link]/ (accessed Feb. 12, 2023).
A linked-data paradigm for the integration of static and dynamic building data in [70] neosemantics (n10s): Neo4j RDF & Semantics toolkit - Neo4j Labs. [Link]
digital twins, in: Proceedings of the 8th ACM International Conference on Systems com/labs/neosemantics/ (accessed July 22, 2023).
for Energy-Efficient Buildings, Cities, and Transportation, 2021, pp. 369–372. [71] Neo4j Python Driver 5.8 — Neo4j Python Driver 5.8. [Link]
ISBN: 9781450391146. cs/api/python-driver/current/ (accessed May 15, 2023).
[54] M.H. Rasmussen, M. Lefrançois, G.F. Schneider, P. Pauwels, BOT: the building [72] J. Zhu, P. Wu, X. Lei, IFC-graph for facilitating building information access and
topology ontology of the W3C linked building data group, Semantic Web 12 (1) query, Autom. Constr. 148 (2023) 104778, [Link]
(2021) 143–161, [Link] autcon.2023.104778.
[55] P. Pauwels, Building Element Ontology. [Link] [73] Pydantic. [Link] (accessed May 19, 2023).
nt/[Link], 2020 accessed Feb. 12, 2023. [74] Cypher. [Link] (accessed May 19, 2023).
[56] P. Pauwels, Distribution Element Ontology. [Link] [75] NAP 2–4-2.0 PRUEBAS DE CARGA. [Link]
ionelement/[Link], 2019 accessed Feb. 12, 2023. [Link]/v0/AB0851B0FE894FBAC1258668003D4617/$FILE/NAP%
[57] A.-H. Hamdan, M. Bonduel, R.J. Scherer, An Ontological Model for the 202-4-2.0_Pruebas%20de%[Link]?OpenElement (accessed July 22, 2023).
Representation of Damage to Constructions, Accessed: Feb. 12, 2023. Available: htt [76] A. Keränen, C. Bormann. Sensor Measurement Lists (SenML) Fields for Indicating
p://[Link]/1999/02/22-rdf-syntax-ns. Data Value Content-Format RFC 9193 2023, 2020 doi:10.17487/rfc9193.
[58] A.-H. Hamdan, R.J. Scherer, Integration of BIM-related Bridge Information in an
Ontological Knowledgebase, Accessed: Feb. 12, 2023. [Online]. Available: htt
ps://[Link]/Alhakam/bridgeOntology.
21