Open accessResearch articleFirst published online 2025-5
On the Formal Alignment of Foundational Ontologies: Building Mappings Between Basic Formal Ontology and Descriptive Ontology for Linguistic and Cognitive Engineering
The development of an ontology-based ecosystem suitable for interoperability and for the reliable exchange of data and information is a longstanding goal of the applied ontology community. The construction of such a system relies on the alignment of a variety of ontologies. The alignment can be vertical, that is, across systems in different ontological layers (foundational, reference and domain) or horizontal, that is, across systems within a single layer. While techniques for vertical and horizontal alignments among reference and domain ontologies have been continuously studied for more than 15 years; the development of techniques for the alignment of foundational ontologies and across foundational and reference ontologies has been scattered or hardly addressed. This paper aims to bring attention to this latter problem in several ways: Highlighting the motivations for the alignment across foundational ontologies; presenting a methodology to develop such alignments; implementing the methodology to the case of the ontologies Basic Formal Ontology and Descriptive Ontology for Linguistic and Cognitive Engineering; and showing how an alignment deepens our understanding of such systems. The paper discusses also some alternative types of alignments, and briefly compares advantages and difficulties.
In the 1980s the continuous improvement of storage and computational capacities made evident that interoperability across an increasing number of data and systems was becoming a fundamental problem in information science. In the early 1990s the research community started to focus on the hidden assumptions behind the generation and collection of data and the construction of information systems at large and, in a few years, this turned into a visionary research line, soon called applied ontology, that quickly became ‘the’ approach to solve interoperability problems. The roots of interoperability barriers were not yet clear but the growing availability and potentiality of large datasets, varying in both organisation and quality, drove attention to what one means by interoperability and how it could be achieved (Gardner, 2005; Ouksel & Sheth, 1999). A first source of problems, as it became clear later, was the lack of conceptual clarity and coherence in the internal organisation of datastores. Relational databases are powerful and successful tools but have important limitations (in particular when dealing with heterogeneous data) due to their rigidity and structural complexity. A second source of problems was the lack of a consistent methodology to merge databases and, more generally, to cope with the different structures of information systems. The fact that data were collected from different sources (each following its own purposes) using different approaches (relying on often implicit assumptions) hampered any attempt to develop such a methodology, and information sharing across data storage remained a problem.
The applied ontology vision relied on two tenets: (i) exploitation of formal semantics, and (ii) transparency of the assumptions. It followed that the process of data integration should be analysed at different levels: Syntactic, semantic, conceptual and structural. The focus on the semantics of data and on the conceptualisation of data organisation has brought a sharp change of perspective, and led to studying how to make explicit the viewpoint one takes on reality. In the following ten years, the idea that one has to make explicit the assumptions behind data collection and data management became popular, even though how to do it remained unclear (Ouksel & Sheth, 1999).
By focussing on semantics and hidden assumptions, applied ontology made possible to develop methodologies for data integration and systems’ interoperability. The idea was to study interoperability problems by separating interoperability levels, and to use formal logic to uncover semantic inconsistencies, and philosophical analysis to uncover conceptual inconsistencies. These distinctions allowed the clarification of the implicit choices on which a system or dataset relies, and to use logical and philosophical analyses to identify possible mismatches across systems. The next step was to harvest logical approaches to develop an alignment methodology among different semantic frameworks, and similarly to harvest philosophical approaches for an alignment methodology among different conceptual frameworks. This procedure gives, as one would expect, a general methodology whose outcome is a mapping across the systems at stake; mapping which is reliable both at the semantic and conceptual levels, and that explicitly shows the compatibility extension of those systems. This approach suggested that datasets and information systems should be shared together with their semantic and conceptual frameworks, now known as their ontology. Each system, augmented with its (explicitly documented) ontology, was then considered transparent and suitable for interoperability, if not for full integration. Admittedly, systems may rely on incompatible semantic or conceptual choices, the alignments one can find in such cases may be only partial. This, one would have argued, is the cost of the pluralistic approach in modelling. The community, with only limited exceptions, accepted this cost as needed to practically verify which modelling approaches were optimal and for which purposes.
Today we have a set of general methodologies (Euzenat & Shvaiko, 2013; Fernández-López et al., 1997; Jarrar & Meersman, 2009; Suárez-Figueroa et al., 2012) that help to develop and align datasets and information systems. While these approaches need time to be consolidated, they leave open the problem of aligning the most general ontologies, called foundational or top-level, aimed to be broadly applicable. How interoperability may work at this level is still unclear, mainly because of the complexity of these systems. Early works include a comparative study of Basic Formal Ontology (bfo), Descriptive Ontology for Linguistic and Cognitive Engineering (dolce) and Object-Centered High-level Reference Ontology (ochre), with an alignment of ochre to dolce (Masolo et al., 2003). Unfortunately, this study uses an ad hoc approach and does not provide a systemic view. So far no general methodology has been investigated and no systematic study has been carried out.
The present paper is part of a larger effort to build an ontology ecosystem as described by the visionary OntoCommons European project.1 Motivated by this line of research, the paper proposes an approach to align top-level formal ontologies and presents problems, advantages and limitations as elicited in the study of the alignment of two well-known systems, namely dolce (Borgo et al., 2022a) and bfo (Arp et al., 2015; Otte et al., 2022; Smith et al., 2020). The main purpose is to identify and investigate the kind of problems, and possible solutions, that emerge in the alignment of foundational ontologies. The hope is that the collection and study of these cases can highlight the specificity of these issues and their relevance to understand ontology interoperability at this level. Furthermore, it is hoped that practical experiences like this can lead to developinging a shared and general methodology for the alignment of top-level ontologies.
The view underlying the work in this paper is known as ‘ontological pluralism’. It claims that science needs to remain open to different views, that is, to ontologies representing different conceptualisations of the world, and thus should allow the coexistence of possibly mutually inconsistent models.2 Inconsistency can be at the conceptual as well as at the formal level. Interoperability across mutually inconsistent ontologies is obtained by (possibly partial) alignments via formal mappings. The OntoCommons ontology ecosystem is essentially a network of interconnected domain, reference and foundational ontologies, which is being built starting from an initial set of ontologies and a number of guidelines. The alignment presented in this paper is one of the results achieved by the project. The alignment (and underpinning conceptual and technical choices) is based on the first-order logic (FOL) versions of these ontologies. The - mapping applies to the most recent versions of these two ontologies in the CLIF language of Common Logic (CL), an ISO standard3, as submitted by the groups that developed these ontologies, see Section 2 for more details. The choice to use CL is the result of several considerations including that the use of CL is fairly consolidated in the research literature and in applications, and that the visibility and stability of standards can help to increase users’ trust in an ontology ecosystem and to foster its adoption and extension.
The aims of this paper are:
to report the theoretical and practical challenges one faces in aligning foundational large ontologies (these systems have more than 100 axioms);
to highlight the most difficult and intertwined choices one might be confronted with during alignment processes;
to discuss possible alignment strategies and their consequences on the alignment; and
to provide an analysis of the main differences/similarities between and according to the obtained alignment.
The paper is structured as follows: Section 2 introduces the two ontologies and the notation. Section 3 describes the adopted methodology and the software used to support and check the alignment. Section 4 presents the alignment strategy and discusses the main categories that form the core of the two ontologies. The two following sections, Section 5 and Section 6, form the technical core of the paper: They present the formal alignment as two distinct mappings: From to and from to . Since the motivation of this work is the construction of an ontology ecosystem that includes also reference and domain ontologies, Section 7 adds some considerations on the possible use (and consequences) of the given alignments to support mappings across languages of the OWL family (W3C OWL Working Group, 2012). The last session, Section 8, summarises the results, discusses other open questions, and lists the next steps in this research line.
bfo and dolce
bfo and dolce are among the most well-known and used foundational ontologies. Several other foundational ontologies can be found in the literature, all developed within the last 20 years and each investigating different ontological choices (Borgo et al., 2022b). The decision to investigate the alignment between bfo and dolce is due to the choice made in the OntoCommons project which, in turn, has selected the two ontologies that have been developed earlier in this research area and that have influenced how applied ontologies are presented and discussed today. Note that, leaving aside terminological and other lightweight alignments, a formal comparison of bfo and dolce was partially investigated in 2003 (Masolo et al., 2003) and, more recently, a conceptual analysis has been proposed (Guarino, 2017). A brief historical view of the bfo and dolce ontologies is given next, an introduction of their formal approaches follows. For more information on the ontologies see (Borgo et al., 2022b; ISO 21838-2, 2021; ISO 21838-3, 2023).
is a top-level ontology, motivated by a realist philosophical position, intended to represent entities as described by science and designed to support information integration, retrieval, and analysis across all domains of scientific investigation. has been under development since around 2002 (Grenon et al., 2004) and serves as the top-level ontology of the Open Biological and Biomedical Ontology Foundry (OBO)4 and the Industrial Ontology Foundry5. has changed considerably over the years and today there are several versions available. The one considered in this paper is the presented in 2020 as part of the standard ISO 21838 (part 2) (ISO 21838-2, 2021). This version is axiomatised in CL (CLIF) (ISO 24707, 2007) and in the Web Ontology Language (OWL), as required by the standard.
is a top-level ontology, motivated by a cognitive view of reality, and designed to support sharing of data, information and knowledge across all domains. was developed in early 2000 as part of the European project ‘WonderWeb’6, published in the WonderWeb Deliverable 18 (Masolo et al., 2003), and is axiomatised in FOL extended with modalities, namely, the standard necessity and possibility operators. has been applied in a number of areas, in several European and international projects, and is also provided in application-oriented versions in OWL, in particular: -lite, -ultralite, and -zero. The ontology is consolidated as it has been only minimally changed over the years making it the most stable top-level ontology worldwide. The one considered in this paper is the presented in 2020 as part of the standard ISO 21838 (part 3) (ISO 21838-3, 2023). This version is axiomatised in CL (CLIF) (ISO 24707, 2007) and in the Web Ontology Language (OWL), as required by the standard.
bfo: CL Version (as of November 12, 2021)
We start from the logical theory bfo-cl consisting of all the axioms of BFO 2020 in CLIF, released on 12 November 2021 and accessible in GitHub7. The version we actually use in the alignment is labelled , is given in the syntax of Prover9 and presents four differences with respect to bfo-cl: (i) Inverse relations that can be found in bfo-cl (e.g., hasContinuantPart is the inverse of continuantPartOf) are not present in , and instead they are replaced by the original relation with arguments in reverse order (the temporal argument maintains its position); (ii) the ‘AtSomeTime’ and ‘AtAllTimes’ relations present in bfo-cl are never used in other axioms of bfo-cl and, for this reason, are not considered in 8; (iii) relations introduced in bfo-cl with ‘if and only if’ clauses are present in as syntactic definitions, that is, while in bfo-cl all these relations may be considered as ‘genuine’ additional predicates that are related with other predicates by a bi-conditional axiom, we consider these relations as ‘parametric macros’ that can be replaced by their definient, and have no impact on the interpretations of the theory (cfr. Section 3.1); and (iv) further syntactic definitions are added to improve the readability of some formulas.
To have more compact formulas, the bfo predicates have been shortened as shown in Table 1 (where the third column gives the original predicates in bfo-cl) and in Table 2. To avoid confusion, we indicate the bfo-predicates in while the dolce-predicates in . The classes of particulars and universals are represented in bfo by unary predicates (and not by individual constants): We use for particulars and for universals. The taxonomy of the particulars is depicted in Figure 1 (vertical lines represent ISA relationships, solid lines indicate a partition). All the universals in the taxonomy in Figure 1 are ‘rigid’ (in the sense that their instances cannot migrate over time to another universal) except for , , and .
Taxonomy of Basic Formal Ontology (bfo) (Vertical Lines Stand for ISA Relationships, Solid Lines Indicate a Partition).
Primitive Relations of bfo.
is an instance of at time
instanceOf
exists at time
existsAt
is a continuant part of at time
continuantPartOf
is an occurrent part of
occurentPartOf
is a member part of
memberPartOf
is a temporal part of
temporalPartOf
occupies spatiotemporal region
occupiesSpatiotemporalRegion
occupies spatial region at time
occupiesSpatialRegion
occupies temporal region
occupiesTemporalRegion
temporally projects onto
temporallyProjectsOnto
spatially projects onto at time
spatiallyProjectsOnto
occurs in
occursIn
is located in at time
locatedIn
specifically depends on
specificallyDependsOn
concretises
concretises
generically depends on at time
genericallyDependsOn
participates in at time
participatesIn
realises
realises
is the history of
historyOf
precedes
precedes
is the first instant of
firstInstantOf
is the last instant of
lastInstantOf
is the material basis of at time
materialBasisOf
Categories of bfo.
Continuant Fiat Boundary
Quality
Continuant
Relational Quality
Disposition
Realisable Entity
Fiat Line
Role
Fiat Object Part
Specifically Dependent Continuant
Fiat Point
Site
Fiat Surface
Spatial Region
Function
0d Spatial Region
Generically Dependent Continuant
1d Spatial Region
History
2d Spatial Region
Independent Continuant
3d Spatial Region
Immaterial Entity
Temporal Region
Material Entity
0d Temporal Region
Object
1d Temporal Region
Object Aggregate
Spatiotemporal Region
Occurrent
Temporal Instant
Process Boundary
Temporal Interval
Process
The axioms considered in – with the exception of the ones forcing the rigidity of the -categories and the taxonomical constraints depicted in Figure 1 – are reported in Appendix where the third column indicates the label originally adopted in to refer to the axiom.9 The syntactic definitions (d1)-(d8) will be used along the paper (notation: ‘’ stands for ‘ is syntactically defined by ’ and ‘’ for ‘ at time , is an instance of ’, see Section 4.1).
dolce: CL Version
We write for the logical theory consisting of all the axioms in the CL-version of dolce (see Appendix).10 With respect to the original version of dolce introduced in Masolo et al. (2003), from now on called dolce-d18, presents two main differences following the dolce-iso version:
Modality operators are not used as they are not part of the language of CL (since dolce-d18 is a first order modal theory relying on the modal logic QS5 with constant domain, the lack of modal operators in changes the intended models of dolce);
The mereological fusion operator (operator in dolce-d18) is not available in the language of CL (for each property expressible in the system, dolce-d18 enforces the existence of the mereological sum of all the entities that satisfy that property; this cannot be ensured in due to the lack of the fusion operator.)
The first simplification weakens several notions of dependence which are strongly grounded on modality. To partially overcome this difference, has the additional primitive of temporary existential dependence () which, however, compensates only in part the lack of modal operators.
Concerning the second simplification, is based on extensional mereologies, rather than general extensional mereologies assumed in . Note that does not enforce the closure of the domain under (binary) sum and product. It is left to the user to add sum and product operators depending on modelling needs. Additionally, some notions that in dolce-d18 are defined via the mereological fusion operator, are introduced in as primitives. In particular, in we have the temporal relation , read as ‘ is (exactly) located at time ’, and the temporary spatial relation , ‘at time , is (exactly) located at space ’.
Two additional simplifications, with respect to , are adopted in (again following dolce-iso):
only direct qualities are considered ();
the one-to-one link between quality types (e.g., temporal location, colour location) and quality spaces (e.g., time interval, colour region) assumed in dolce-d1811 is not enforced anymore. The same quality kind can be linked to several quality spaces, a given colour location can be located in an RGB colour-space, a colour spindle space, etc.
Table 3 lists the primitive relations of , Table 4 lists the categories of , and Figure 2 shows the taxonomy of (vertical lines stand for ISA relationships, solid lines indicate a partition).
Primitive Relations of dolce.
is a direct quality of
is existentially dependent on at time
constitutes at time
is part of
participates in at time
is the immediate quale of
is (exactly) located at space at time
is (exactly) located at time
is part of at time
is the temporary quale of at time
Taxonomy of Descriptive Ontology for Linguistic and Cognitive Engineering (dolce) (Vertical Lines Stand for ISA Relationships, Solid Lines Indicate a Partition).
Categories of dolce.
Abstract
Physical Endurant
Accomplishment
Physical Object
Achievement
Physical Quality
Agentive Physical Object
Physical Region
Abstract Quality
Process
Abstract Region
Quality
Arbitrary Sum
Region
Agentive Social Object
Space Region
Endurant
Social Agent
Event
Society
Feature
Social Object
Amount Of Matter
Spatial Location
Mental Object
State
Non-agentive Physical Object
Stative
Non-agentive Social Object
Time Interval
Non-physical Endurant
Temporal Quality
Non-physical Object
Temporal Location
Perdurant
Temporal Region
The axioms considered in – with the exception of the ones concerning the taxonomical constraints depicted in Figure 2 – are reported in Appendix. The syntactic definitions (d1)-(d7) will be used along the paper. When the context is clear, we will use and interchangeably.
Notation
In the paper we adopt the following notation to referencing formulas: (a), (t) and (d), followed by an integer, identify axioms, theorems, and syntactic definitions, respectively, in the order in which they are listed in the theories. More specifically,
(a), (t), (d) are used for axioms, theorems, and definitions of (index stands for );
(a), (t), (d) are used for axioms, theorems, and definitions of (index stands for );
(d) are used for the mappings from to ;
(d) are used for the mappings from to ;
(t) are used for theorems of the theory (an extension of that adds the mappings to , see Section 3.1 for an explanation of what is exactly meant with ‘extension’) that include some predicate of the mappings;
(t) are used for theorems of the theory (an extension of that adds the mappings to , see Section 6.1 for a definition) that include some predicate of the mappings.
General Strategy and Role of Methodological Assumptions in Aligning TLOs
Before introducing the mappings and the difficulties we encountered, we provide the assumptions, strategy, and further considerations behind this work. These observations motivate the choices made in building the mappings and determine in which sense the result is correct.
The approach is centred on the CL-versions of the two ontologies. These, with their explanatory documentation, are taken as the primary sources of information on the ontologies and are considered in our strategy from the start. Here are the steps that characterise an alignment strategy12:
analyse the CL-axioms together with the available documentation13 to ensure proper understanding of the overall ontologies and the intended interpretations of the primitives in each;
establish and document the methodological choices;
introduce formal mappings from one ontology to the other (and vice versa) with a description of what is covered;
test (manual and with theorem provers) that the set of mappings is coherent and aligns what is intended;
evaluate if the obtained set of mappings can be further generalised.
The strategy characterised by steps (S1)–(S5) is clearly underspecified. To become operative, one has to give the methodological choices which determine the kind of alignment sought (and how it can be verified). For instance, one may want to align the terminology, the primitives, the concepts, the inferences, the (intended) domains or even the whole models of the ontologies. This kind of choice leads to paying attention to some features of these systems, leaving aside others. Generally speaking, steps (S1)–(S5) may generate incompatible mappings when applied to preserve distinct features. In particular, one has to state what happens in case of inconsistencies across the available documentation. One may decide that the description in natural language is the most reliable of the authors’ intentions. Others may insist that the formal theory has priority since it is the only one that can be tested for consistency. Furthermore, the documentation might not be precise or detailed on aspects relevant for the alignment since the latter depends also on how the other ontology is developed. For instance, during the construction of the bfo and dolce alignment that we present in the next sections, the intended interpretation of some concepts was unclear to the alignment developers and clarifying questions were sent to the ontology authors to collect further information. This approach might not always be possible and, even so, it might not solve all cases (sometimes the authors of an ontology might not have a ready answer especially if the question raises foundational problems). In this paper, we keep bfo and dolce as is, that is, no extensions or modifications of the original ontologies have been considered to improve the mappings (even though some suggestions are proposed in Section 5.2 and Section 6.2). This is an important requirement in the context of the OntoCommons ontology ecosystem that contains several mid- and domain-levels ontologies already aligned with bfo and dolce.
Here are the assumptions adopted to align bfo and dolce.
Given an analysis of the documentation and the examples in the ontologies, any elicited informal correspondence across the notions of the two ontologies is assumed to provide a basis for a mapping of categories and relations of one ontology into the other. For instance, occurrents are described and used, in the bfo formal system and accompanying documentation, in a way that is conceptually similar to that of perdurants in the dolce formal system and documentation. This similarity is taken as an informal correspondence and leads to the consideration of the alignment of the bfo occurrent category to the dolce perdurant category. (Note that such a correspondence might be suggested by common examples in the two ontologies, or even be based on the comparison of relevant relations which are, perhaps for different reasons, considered similar like the bfo relation continuantPartOf and the dolce relation temporary parthood).
A mapping aims to embed all (or most of) the members of a category of the source ontology (e.g., in the case of Section 5) in the domain of quantification of the target ontology ( in Section 5). In other terms, the purpose is to maximise the coverage of the entities of one ontology by the domain of the other ontology. Given this scenario, the study of which axioms of the target ontology hold or fail after the mappings from point (M1) are formalised, highlights genuine differences between the ontological commitments of the two ontologies (at least relatively to the CL language), and which entities of the source ontology are problematic in, or even incompatible with, the target ontology.
Only direct mappings that can be formalised in FOL as syntactic definitions are considered, that is, a mapping is a FOL syntactic definition of a primitive of the target ontology in terms of the primitives of the source ontology. This choice allows us to use theorem provers to verify which axioms of the target ontology are preserved by the mappings. Meta-modelling techniques, like the application of abstraction processes or set-theoretical (second-order) constructions to enrich the domain of the source ontology with additional entities, are not considered. Analogously for embeddings of the target and source ontologies into richer ontologies that would serve as bridges between them, see (Chui & Grüninger, 2014). The analysis presented in this paper can be seen as a prerequisite to the application of these other techniques.
Note that the assumptions (M2) and (M3) are interrelated and are seen from the perspective of an interactive development of mappings that starts with the analytical step (M1). In particular, (M2) emphasises that the alignment is domain focussed in the sense that the goal is to maximise coverage of the entities of the source ontology by the domain of the target ontology.
General Observations From the Alignment of bfo and dolce
The alignment between large ontologies presents a formal and a conceptual challenge: The formal challenge consists of the identification and the verification of the mappings; the conceptual challenge refers to the choice among alternative mapping options. For instance, in Sections 5.2 and 6.2 we discuss some alternative mappings which have not been adopted in our alignment. These mappings modify (by restricting or expanding the mappings proposed in, respectively, Sections 5.1 and 6.1) the way some notions of the target ontology are modelled in terms of the ones of the source ontology. In particular, these alternative mappings present possible solutions (or improvements) to the partiality of the alignments developed in Sections 5 and 6. Furthermore, formal constructions (definitely more complex) can be implemented to extend the domain of the source ontology to resemble more closely the domain of the target ontology, facilitating a richer mapping across these systems. This result can be achieved also by allowing for the definition of primitives of the target ontology in the source ontology, via reference to newly introduced entities. This case is not specific to the alignment of bfo and dolce. It is a choice to consider when specialising an alignment strategy.
Assumption (M3) may be better understood if we compare two approaches to extend a theory via definitions. Recall that the goal is to have formulas in the source ontology that capture (or best approximate) the concepts in the target ontology. In this example, we take as the source ontology and as the target ontology. (Analogous considerations hold for the other direction.)
The first approach, which we adopt in this paper, is to extend theory with a set of syntactic definitions – the mappings in Section 5.1 –, which inject primitives into , that is, these definitions are introduced as a way to ‘understand’ the primitives in the dolce language. We use the symbol for these syntactic definitions and treat them as ‘parametric macros’. That is, one can simply substitute at the syntactic level the definiens for the definendum. Note that these syntactic definitions have parameters which must be suitably matched. For instance, suppose we aim to define the primitive of parthood on occurrents in via formula . As we know, the lefthand side (namely, ) is a primitive in . Let us assume that is an expression in the language of with and as the only free variables. In this case, and are treated as parameters and the formula in the language of becomes formula in the language of . Similarly, -formula becomes -formula .
The second approach consists in adding these definitions as new formulas in the source ontology, obtaining a definitional extension of the source theory. It suffices to turn our syntactic definitions into biconditionals rather than syntactic definitions as described above. Syntactically, this means to use the symbol (if and only if) instead of the symbol, and to close the expression by adding universal quantifiers for all the free variables. For example, in the previous example one would add the entire formula directly into the source theory. The advantage of using syntactic definitions instead of extensions by definitions is that the vocabulary of the theory in which we introduce the syntactic definition does not change. Using the ‘if and only if’ clause automatically adds the defined predicates to the ontology language (e.g., we would add the -predicate into the vocabulary when using the biconditional in the previous example). In Section 4.1 we will see a case where the latter approach adds new ontological commitments into the theory with important consequences on the view of universals.
Once has been extended with these syntactic definitions (and possibly further syntactic definitions already present in ) – we dub such extension – it is possible to check how much of the theory is preserved: Each axiom in expressible14 in is proved or disproved in . The analysis in Section 5.2 of the preserved/non-preserved axioms allows, modulo the introduced mapping, to highlight and understand similarities and divergences between the two ontologies. The mapping in the other direction, that is, the extension with the syntactic definitions of -primitives in terms of -primitives, is formalised in Section 6. The combined analysis of the mappings dolce-into-bfo and bfo-into-dolce points out genuine ontological differences as well as problematic aspects of the adopted technique.
These observations hold beyond the bfo and dolce cases. Also, the general strategy can be iterated to further refine the mappings and possibly solve false cases of agreement and disagreement.
Use of Theorems Provers and Model Builders
The material presented in the next sections is tested using theorem provers, namely, Prover915 and Vampire16. For the generation of models and countermodels, the software Mace417 was used. According to the literature (Sutcliffe, 2016) and in our experience, Vampire is usually, but not always, faster than Prover9. The proofs produced by Prover9 are generally shorter and easier to read (not least because Prover9 offers the possibility of removing universal quantification from the start of formulas – Prover9 assumes by convention that any free variable in a formula is implicitly universally quantified). To take advantage of these differences, we used both theorem provers.
Our methodology is as follows. We rewrite all axioms and definitions of and in the syntax accepted by the software (Vampire accepts tptp (Sutcliffe, 2016), Prover9 uses its own syntax). To prove a theorem regarding the mappings from dolce to bfo, we add to the tptp or Prover9-syntax version of dolce the encoding, in the appropriate syntax, of both the mappings in Section 5.1 and the formula to be proven (i.e., we rewrite the whole plus the formula to be proven). Then we submit this large theory to the prover and ask if the formula is a theorem in that theory, that is, if it follows from . We proceed similarly in the case of the mappings from bfo to dolce. The documentation provided in Appendix of Masolo et al. (2023) refers to these automatically generated proofs. When the provers do not find a proof of a statement, and this happens in several cases, we provide a manual proof. Some of the manually generated proofs are verified by at least one prover by adding intermediate lemmas (which are themselves automatically verified).
To formally verify counterexamples of relevant sentences, we produce a counterexample by hand and then proceed in the following way: We write axioms (in the prover syntax) enforcing the existence of the exact number of individual constants required by the counterexample, as well as each and every relation holding among them. For the sake of brevity, some of the relations are skipped by exploiting taxonomical axioms. That is, once we add the axioms of the (or ) taxonomy, it suffices to specify to which leaf-class each constant belongs: the constant will automatically belong to each superclass and not belong to disjoint classes, thanks to the ISA relationship axioms holding between the classes of the taxonomy. Additionally, the disjunction axioms between the taxonomical classes spare us from stating that individuals belonging to disjoint classes are different (non-identity among different individual constants belonging to the same class must be still forced through appropriate axioms). In this way, we obtain a theory that syntactically describes the domain and the relationships at the core of the counterexample, which is complete by construction. At this point, Mace4 is used with these axioms as input to verify that the given (small) set of axioms is consistent and to generate an explicit model. Then, for each axiom of (or ), we verify that it holds in the model, by asking the provers if the axiom follows from the axioms that specify the model, thus proving that the model is a model of (). Lastly, we check that in this model the sentence that we want to disprove is actually false.
Note that the standard way to generate counterexamples with Mace4, that is, by asking Mace4 to find a model of all the axioms of () with the addition of the negation of the sentence to be disproved, does not work in most of the cases we encountered perhaps due to the complexity of these ontologies.
All the proofs and models relative to our alignments are collected in Masolo et al. (2023).
Preliminary Considerations
This section focuses on points specific to the alignment of bfo and dolce. This allows us to introduce two problems related to the technical choices made in these systems, and to discuss the different levels at which these theories investigate some ontological classes.
Representation of Categories
In the categories in Table 4 are represented by means of FOL unary predicates. These categories are disjoint and assumed to correspond to rigid properties: if an entity belongs to one of these categories, it must belong to that category from the time it starts to exist to the time it ceases to exist (if any). To state that an entity is an endurant, one formally writes . Similarly for the other categories.
In , all the universals in Table 2 are in the domain of quantification and a temporary instance-of primitive relation (written ) is introduced to represent when an individual is an instance of a universal at a given time. For instance, the fact that ‘at time , is a continuant’ is formally written , where corresponds to the universal being a continuant. bfo universals are non-empty and rigid through time with the exception of , , and . For instance, a material entity could belong to at some time, and later to (or ) and, perhaps, change again later, provided at each point in time it is classified by one and only one of these three universals.
These different representational choices raise an initial problem because, according to (M3), one should individuate to which kind of dolce entities the bfo universals correspond (Section 5.2 adds further considerations on this point). We consider only the -universals present in the taxonomy, and start from the subset that can be defined in . For each universal , a syntactic definition of form is introduced. Note that here the ‘parameters’ are and , while is a constant. This means that, first, we do not define the notion of instance-of but define only the instantiation of a given . That is, in general, the definitions of and could be different. Second, as discussed earlier, will not appear in and thus will not become an individual in the domain of . If we were to use the biconditional conditions, writing in the theory, both the predicate and the individual constant would be added to the vocabulary of , and this would imply having some universal in the domain of .
A way to avoid this problem is to substitute for , that is, to map the needed bfo categories to dolce predicates, and the instance-of relation to (logical) predication. We adopt the first strategy because it is closer to the formalisation in , and it avoids including universals in the domain of as well as the relation instance-of among the primitives of .
In this perspective, axioms quantifying over universals are disregarded by the mappings. Weaker versions of these axioms are introduced by considering the (finite) set of those universals explicitly used to generate the mappings. For instance, can replace the axiom (a6).
Different Ontological Focus and Resolution
The informal presentations of bfo occurrents and dolce perdurants are very close. This observation is further supported by the fact that both the ontologies introduce a primitive of parthood simpliciter for these entities (relation in and in ). Analogously, we notice a similarity between bfo continuants and dolce endurants, given that both ontologies conceive the temporary parthood relation as a primitive ( in and in ). Things are different for temporal/spatial regions and qualities. At first sight, these categories appear to be similar. Formally, they are classified in different ways in the two ontologies. classifies qualities (and, more generally, specific dependent continuants) and spatial regions under continuants while temporal regions are classified under occurrents. In , temporal regions, spatial regions, and qualities are neither endurants nor perdurants.18 In particular, temporal regions and spatial regions are abstract entities in . This difference – mainly grounded on the way the distinction between endurants/continuants and perdurants/occurents is characterised – complicates the comparison and needs to be taken into account in defining the mappings (this can be seen, for instance, in the case of the definition (d1) in relative to the primitive of ).
How the most general categories (endurants/continuants on the one hand, perdurants/occurrents on the other hand) are specialised diverges considerably in the two ontologies:
On perdurants vs. occurrents
In dolce perdurants are specialised mainly on the basis of two notions, both extensively discussed in the linguistic and philosophical literature: Homeomericity (roughly, any temporal part of a perdurant of a certain kind is a perdurant of the same kind) and cumulativity (roughly, the mereological sum of perdurants of a certain kind is a perdurant also of the same kind); in bfo processes are distinguished from process boundaries mainly based on the dimensionality of their temporal locations and on the fact that they are temporal proper parts of other occurrents.
On endurants vs. continuants
The spatial and material dimensions of these kinds of entities play a central role in both dolce and bfo (see Section 5.2 for their differences). dolce presents a finer taxonomy mainly aimed to cover the notions of agentivity and sociality. bfo, which is driven by the distinction between fiat vs. bona fide entities, explicitly introduces the notion of aggregate, and relies on the dimensionality of the spatial (rather than temporal) location to distinguish sites vs. continuant fiat boundaries.
On qualities
The branch of the taxonomy for qualities in dolce is detached from that of endurants and continuants, differently from bfo, and is motivated by the comparability principle: It makes sense to compare the colour of a rose and the colour of a vase, while it does not make sense to compare the colour of a rose and the weight of a vase. Qualities are clustered in terms of maximal comparability, for example, colours, weights, lengths, shapes, etc. form different quality types. Spaces of regions have a structure and are associated accordingly, for example, for colours, the space has regions for red, blue, green, etc. Instead, bfo distinguishes classes of specifically dependent continuants according to the existence, the nature of, and the participants in their realisation processes. These different ways of classifying entities are not incompatible per se. For instance, one could consider cumulativity, agentivity, comparability in bfo, and dimensionality of temporal/spatial regions and realisation processes in dolce. But, clearly, this approach requires extending bfo and dolce, a choice that should be pondered carefully. The alignment strategy based on ontology extension is not exploited in the work we present here. The consequence is that some primitive notions and categories of the target ontology cannot be defined in terms of those in the source ontology.
The Mapping From dolce to bfo
This section presents the technical results of the establishment of a mapping from dolce to bfo, that is, it shows how to define in the categories and primitive relations of . As anticipated, the mapping does not cover all categories and relations.19 In other terms, the starting point is the domain of dolce, and the goal is to classify and relate the entities of this domain in terms of the categories and relations of bfo.
The first part of the section, Section 5.1, reports the elements covered by the mapping and the conceptual motivations for its limitations. This part makes technically clear the theoretical and formal barriers to completing the coverage of the ontology in the kind of mapping that our general strategy and related assumptions can generate. At the end, we list the primitive relations and the categories of which are covered by the mappings with the corresponding syntactic definitions in . Next, we verify whether the axioms of hold in when extended with the given syntactic definitions.
The second part, Section 5.2, provides an analysis of the achieved results, the limitations and possible alternative strategies.
The Definitions of bfo Primitives in the Language of dolce
According to our previous discussion and point (M3) of Section 3, our goal is to introduce syntactic definitions of primitives in terms of primitives.
First of all, lacks the language to refer to the dimensionality of the instances of a given category. This implies that these distinctions cannot be modelled in this theory, thus:
the subcategories of and of cannot be distinguished;
the distinction between and cannot be introduced (sites are three-dimensional while continuant fiat boundaries are two-, one-, or zero-dimensional);
the distinction between and cannot be fully characterised.
Concerning (ii) we introduce a definition only for the disjunction of and (written ), see (d12). is clearly not among the categories in the bfo taxonomy and it might not be acceptable as a bfo universal. It is used in the mapping with a technical role: is a shortcut for . Concerning (iii) we identify temporal instants (intervals) with atomic (non-atomic) time intervals, see (d9) and (d10). Admittedly, this is a very rough characterisation. First, temporal atoms can have the same dimensionality of the times they are part of. Second, in bfo the finite sum of (distinct) zero-dimensional temporal regions is still zero-dimensional while the sum of (distinct) atomic times is always not atomic, that is, according to (d9) and (d10), it is an interval. Third, bfo time intervals are convex while dolce non-atomic time intervals are not necessarily convex. Thus, (d10) seems to approximate the category better than . At the same time, if all the non-atomic entities are instances of , then and collapse. For these reasons, we omit and and consider only the rough definitions of and . Note that, in bfo, the convexity of intervals is characterised by means of the primitive of precedence (). To define in dolce one should extend the theory with an order relation defined on time intervals. Similarly for (firstInstantOf) and (lastInstantOf). It follows that the mapping does not cover the relations , , and .
As anticipated earlier, see also assumption (M1) in Section 3, the category of specifically dependent continuants () seems to correspond to the dolce category of qualities. However, as observed in Section 4.2, bfo and dolce distinguish the subclasses of these categories based on different criteria. In bfo, the subcategories of are mainly characterised by means of the primitives (realises) and (materialBasisOf) for which, however, a complete characterisation is missing.20 As a consequence, , , and the subcategories of are not covered by the mapping.
A similar situation holds for the subcategories of material entities (). In particular, object aggregates are intimately connected to the (memberPartOf) primitive. None of these notions can be defined without an extension of dolce. It follows that also and the subcategories of are not covered by the mapping.
Other cases are more subtle. The primitive (historyOf) seems to be naturally defined in by:
However the effectiveness of this definition depends on the existential assumptions on perdurants. For instance, assume that participates only in some perdurant and only for some time at the beginning of , and, moreover, that there is no perdurant which is proper part of , in particular there is no ‘beginning’ of (this can happen because dolce does not commit to the existence of all the temporal slices of a perdurant). Then would not have a history, since the only candidate to be the in the definition would be , for which the definient is false (if , then there is a time, say the end of , where is present but does not participate in ). These technical difficulties, and the fact that histories seem to have a marginal role in the bfo ontology, suggest leaving out also and .
Spatiotemporal regions are compatible but not enforced in dolce. One could introduce them among the regions together with the corresponding qualities. The projections regulating the relationships with spatial and temporal regions (corresponding to the bfo and ) would also be necessary. Alternatively, violating (M3), spatiotemporal regions could be constructed (at the semantic level, for instance) as pairs time interval, space region. As we do not exploit constructive extensions, we do not consider them here even though, from the expressive power viewpoint, spatiotemporal regions seem not to change the language. This suggests that their introduction might be, at least technically, less controversial. More details on spatiotemporal regions can be found in Masolo et al. (2023).
The definition (d17) of (and the ones of and which are grounded on it) is just an approximation that introduces a clear link between concretisation and . In particular, while the first argument of the original concretisation can also be a process (see (a105) in the Appendix), (d17) contemplates only specifically dependent continuants. It is not clear to us (also because the documentation contains very few examples of generically dependent continuants concretised by processes) how processes can be taken into account starting from .
Finally, following the discussion in Section 4.1, instance-of () is not directly defined. Instead, we define in the instantiation of the needed universals. We also define the primitive predicate and of , introduced to distinguish particulars from universals, even though turns out to be empty in (when at least a temporal interval exists, see (t2) in Section 5.2).
Summing up, among the bfo notions, in the following we introduce:
syntactic definitions for the primitive predicates: , , , , , , , , , , , , , ; and
syntactic definitions of form for each category , , , , , , , , , , , , , .
These syntactic definitions are given below together with a short informal description. (Once a definition has been introduced, it may be used only in definitions given later to avoid circularity.) A deeper analysis about these definitions and their impact on the preservation of the axioms of bfo is in Section 5.2.
coincides with on s, s, and s ( is defined in (d6)). Time intervals -exist at every subinterval of themselves while the other abstract entities -exist at every time. This extension matches the fact that is defined on both temporal regions and spatial regions (which seem very close to dolce time intervals and space regions, respectively, see (d7) and (d13)).
is a restriction of (the latter applies also to s). The definition preserves the fact that in bfo is defined only on occurrents (that include temporal regions but not spatial regions).
This is the classical definition of temporal part or slice, that is, is the maximal part of during the temporal extension of .
is the restriction of to s (in dolce is defined for all the entities present in time, that is, also for s and s).
A process boundary is a temporally atomic perdurant ( is defined in (d2)), that is a temporal proper part of at least another perdurant. As said, the atomicity of the temporal location of a perdurant is an approximation of its instantaneity.
Processes are perdurants that are not process boundaries (all the perdurants are present at some time (t1)).
Temporal regions coincide with dolce time intervals.
As said, spatiotemporal regions are ruled out. An occurrent is any entity among process boundaries, processes and temporal regions (whose definitions are given above).
Temporal instants are atomic time intervals.
Temporal intervals are non-atomic time intervals (in bfo they are self-connected but this property cannot be defined in dolce).
Material entities are non instantaneous (to match (a8) in Appendix) endurants that during their whole life are (at least partially) spatially co-localised with an amount of matter.
Both sites and continuant fiat boundaries are dolce features localised in space that during their whole life are never (partially) spatially co-localised with an amount of matter.
Spatial regions coincide with dolce space regions.
Specifically dependent continuants are dolce qualities inhering (in the sense of ) in an independent continuant (as defined in (d15)) that is not a spatial region.
Concretisation is a form of dolce between a non-physical endurant that does not have a spatial localisation () and a specifically dependent continuant (as defined in (d16)) .
Generic dependent continuants are non-physical endurants (from (d17)) that are concretised during their whole life.
According to (d15), (d16), (d18), and (d19), continuants include s, s, and s. For s, coincides with . For s, coincides with . dolce does not define a parthood relation on s but it is defined on endurants. One can use the latter to infer a parthood relation on qualities (according to dolce each quality inheres in a single entity, ).22
is the restriction of the to independent continuants (as defined in (d15)) that are not spatial regions.
(the participation) is the restriction of (the participation) to continuants that are not spatial regions and to processes (as defined above).
holds when the spatial location of the perdurant is always included in the one of the material entity/site/continuant fiat boundary (this definition closely corresponds to the formal one provided in the documentation of bfo).
At time , an independent continuant (that is not a space region) is located in another independent continuant (that is not a space region) when the spatial location (at ) of the first continuant is included in the spatial location (at ) of the second continuant (this definition closely corresponds to the formal one provided in the documentation of ).
is the restriction of the direct quality to specifically dependent continuants (as defined in (d16)).
At time , the generic dependent continuant generically depends on the independent continuant (that is not a spatial region) , when it is concretised (during a part of ) by a specifically dependent continuant inhering in .
Particulars are entities that -exist in time (to match (a4) and (a7) in Appendix).
Universals are non-particulars (to match (a1) and (a2) in Appendix).
(d1),…,(d28) indicates the theory extended with the definitions listed above.
As said, some bfo-axioms involve relations or categories that have not been defined in the mappings. When possible and relevant we considered approximations of these axioms as expressible in the dolce language, otherwise we set them aside. The first column in the tables in Appendix shows which axioms (or their approximations) are preserved, which are not, and which are disregarded (because some of the notions involved have not been mapped). Additional details as well as the proofs of all the theorems can be found in Masolo et al. (2023). In the following analysis we mainly focus on the results that point out some ontological differences between and .
Analysis
The list of the bfo axioms, including the information on whether and how they are preserved by the mapping, is presented in Appendix. Note that the axioms forcing the rigidity of the -categories and the taxonomical constraints depicted in Figure 1 are also preserved (relatively to the universals considered in the mappings) but are not listed in Appendix. We note that the percentage of preserved axioms is not a truly informative evaluation of the mapping since this number would not account for: (i) the reasons for the (non-)preservation of the axioms; (ii) the ontological correctness of the mappings; and (iii) the main differences between the two ontologies. Here we provide a ‘qualitative’ evaluation which in our view is ontologically more informative, even though it remains partial due to the complexity of the ontologies and of the mapping.
We begin our analysis by considering how the entities in the domain of are classified by the categories of bfo as encoded by our definitions. Theorem (t1) shows that endurants, perdurants, and qualities are all bfo particulars (in the sense of the bfo notion of particular, defined in (d27)). The case of abstracts is more complex. (t2) shows that if there exists at least a time interval then all the abstracts are bfo particulars too. This is due to the definition of (d1) that assures that abstracts exist at every time. However, while in dolce the existence of endurants, perdurants, and qualities requires them to have a temporal location (and then to be present in time), see (a19) and (t1), this does not hold for abstracts, that is, it is possible to have models of dolce that contain only abstracts and no time intervals. In this particular case we would not have bfo particulars, only universals (t3).
t1
t1
t2
t3
(t1) seems to conform to both (M1) and (M2), by corroborating the idea that dolce particulars roughly correspond to bfo particulars, and that all the particulars of dolce are ‘imported’ into the ones of bfo. However, (t3) introduces some doubts about the nature of abstracts. The definition of (d1), starting from the fact that temporal and spatial regions are particulars in bfo, extended this idea to all regions (and to all abstracts), that is, there is a presupposition that all the regions have a uniform nature, they are all particulars in this case (even though, in bfo, temporal and spatiotemporal regions are occurrents while spatial regions are continuants). However, other options are possible. One could consider, for instance, the following two variants of (d1):
In the first case time intervals and space regions become bfo particulars as before (matching the fact that temporal and spatial regions are particulars in bfo) but, according to (d27) and (d28), the rest of the abstracts would be included as universals. This possibility is interesting because the examples for dolce (spaces of) regions are colours, weights, shapes, etc., while, in bfo, being red, being blue, being round, etc., are universals. In this perspective, one can also presuppose that the dolce parthood relation holding between regions (in a given space) represents a sort of (intensional) ISA relation, while – or, maybe better, the composition of and – represents instantiation (at a time). This option deserves a detailed analysis even though it introduces differentiation between time intervals and space regions on the one side, and the remaining regions on the other side.
The second option excludes all abstracts, including time intervals and space regions, from particulars. Even in this case one could see the dolce relation between regions as a sort of ISA relation and and as sorts of instantiation relations (here note that the (a23) and (a79) go in the direction of considering and as compositions of and as suggested in the previous case also).
In both these cases, the fact that (some) regions become universals is grounded on (d27) and (d28). The latter is motivated by (a1) and (a2), that is, by the fact that the entities in the domain of bfo are partitioned into universals and particulars. However, one could decide to reject (a1) and assume that some regions are neither bfo particulars nor bfo universals. They would form a kind of entity that bfo does not consider. Also this option deserves a deeper analysis.
There are also more subtle correspondences to consider. Theorem (t4) shows that perdurants are the union of processes and process boundaries. However, in , the bfo axioms (a43), (a45), (a47), (a60), and (a62) are not preserved making evident that the distinction between processes and process boundaries behaves quite differently from the one in .
t4
bfo presupposes that processes and process boundaries have a different temporal nature, and constrains their relationship, which is quite complex, via a set of axioms. On the other hand, (d9) and (d10) – together with (d5) and (d6) – reduce this difference to atomicity and to the existence of strictly larger (relatively to temporary parthood) perdurants. Looking at the counterexamples that cause several BFO axioms not to be preserved, this different behaviour about processes and process boundaries seems mainly due to the poor approximation of the notions of temporal instant and temporal interval provided by (d9) and (d10) together with the low commitment of dolce on the existence of perdurants.
For instance, in dolce it is possible to have three temporally co-localised perdurants , , and all with atomic temporal locations and such that is the mereological sum of and . To fix the idea with an example, consider ‘Yesterday John saw the first ray of sunshine coming in over the mountain at dawn’, ‘Yesterday Mary saw the first ray of sunshine coming in over the mountain at dawn’, and ‘Yesterday John and Mary saw the first ray of sunshine coming in over the mountain at dawn’. There can also be two additional perdurants and such that the temporal location of does not overlap the one of and is the mereological sum of and (e.g., ‘Today John saw again the first ray of sunshine’). In this scenario, according to (d6) and (d7), is a process because it is not a process boundary, and it is not a process boundary because, though it has an atomic temporal location, it is not a temporal proper part of anything (because is part of , but it is a non-maximal part, since exists); is a process boundary (against (a43) and (a47)) because it has an atomic temporal location and it is a proper temporal part of . For another example, in dolce it is possible to have perdurants that are not proper temporal parts of any other perdurant and such that all their parts are temporally co-localised with them (possibly with an atomic temporal location) and in their turn are not proper temporal part of any other perdurant (e.g., consider and in the previous example). According to (d5) and (d6), these perdurants as well as all their parts are classified as processes against (a45), (a60), and (a62). These examples may seem peculiar, yet they prevent the preservation of several axioms, and are not easy to rule out. One could get rid of some of these cases by requiring that processes have a non-atomic temporal location. Then, one needs to force the removal of (t4), otherwise it would be possible to have perdurants with an atomic temporal location that are not temporal proper part of any other perdurant.
There is a further difference concerning the ‘temporal behavior’ of dolce perdurants vs. bfo occurrents. In bfo, occurrent parthood between processes or process boundaries is equivalent to occurrent parthood between the corresponding spatiotemporal regions, therefore different processes or process boundaries cannot be spatiotemporally co-localised. dolce makes the opposite choice and allows for spatiotemporally co-localised perdurants. A similar situation arises between dolce endurants and bfo continuants as highlighted by the fact that neither (a78) nor (a20) can be proved in . In dolce, for instance, an amount of matter constituting a statue at a given time is different from the statue and it is not a -part of the statue at (and vice versa) even though the two endurants are spatially co-localised at (and possibly during their whole life), see (a88) and (t2).
Similarly, in dolce, a sum of bricks is different from a wall even though they can mereologically coincide at a given time. This view allows for the co-existence of distinct entities in the same spatial and temporal locations. Such entities are further distinguished on the basis of their ‘modal’ properties. In comparison, bfo embraces a ‘non-multiplicativist’ perspective by assuming that a given spatial or spatiotemporal region cannot be the location of different continuants or occurrents (see in particular the treatment of the example of the statue and the clay in Otte et al. (2022). In other words, space and time locations are not fundamental for the identity of entities in dolce. Thus, at first sight endurants and occurrents on the one hand, and and on the other, seem to be different ways to capture the ‘same’ ontological category. Nonetheless, a conceptual and formal analysis shows that , highlighting that the views embraced by these systems, while apparently similar, remain incompatible.
These are not the only differences, of course. From we see that the (as defined in (d20)) does not satisfy the supplementation principle, and from that the existence of the product is not guaranteed. Note that the existence of products cannot be inferred for either since .
According to (d20) and the definitions of the subclasses of , for independent continuants that are not spatial regions, reduces to . However, for independent continuants that are not object aggregates, the original is antisymmetric, but is not so the relation defined by (d20), that in this case is equivalent to . As we have seen, this fact has an important impact on the entities that are classified as endurant or as continuant in the two theories. One could recover the antisymmetry by changing the definition of . This amounts to breaking the correspondence between and for independent continuants by ‘injecting’ antisymmetry into (d20), for example,
This alternative definition has some advantages: (i) it preserves antisymmetry as in the original bfo; and (ii) it states explicitly one difference between and . However, some differences remain. We can have endurants that in dolce satisfy and are mapped to but do not satisfy the newly defined . Take our previous example, the (sum of the) bricks and the wall do not satisfy the new . In all these cases, (a78) fails.
Another approach is to specialise the definition of to take into account the different cases, for example,
Note that the antisymmetry of is again lost: The bricks and the wall -coincide at even though they are different. One could further change the definition of stating that the bricks (or the wall) have no spatial location at , or that at they have different spatial locations. One could proceed similarly in the case of the statue and the clay. However, this option seems to go against the natural interpretation of in .23
These examples suggest to follow a different approach, possibly closer to the original commitments of the two ontologies. Consider again the example of the statue and the clay, or the bricks and the wall. There are two different endurants that are spatially coincident at least at a given time. One can think that (a78) and (a20) aim at ruling out these kinds of examples: It is not a matter of avoiding spatial coincidence or parthood between the bricks and the wall, their goal, one could argue, is to rule out one of these two entities, that is, to rule out from the domain of quantification of , either the bricks or the wall (both of them are in the domain of ).24 One can then adopt a strategy that ‘filters out’ some of the dolce endurants. Setting this filter is not trivial. For the case of the statue and the clay, one can rely on the fact that constitution is a sort of order relation in dolce. One can maintain the substratum (the clay) as an element in the domain, and discard the statue due to the fact that it is the substratum that constitutes the statue. However, in the case of the bricks and the wall we cannot use to do the same, because is not asymmetric while constitution is. One needs to find a different relation or some other principle to rule out one of these. But this is not all. Let us assume that we do not import the statue into bfo. Does it mean that bfo cannot talk about statues or that it relies on a different representation? After all, it classifies the clay under amount of matter as well as under statue. One would need to translate the dolce claims about the statue into bfo claims about the amount of clay (since it is the only element left in the domain of bfo). The consequences of this approach – in particular how to track statues through the change of their substratum – deserve more investigation. It is however important to note that this first analytical step allows us to understand the differences and to individuate the main problems that a mapping needs to address.
The analysis of more refined correspondences, like (t5), (t6), and (t7), confirms that temporally extended amounts of matters and physical objects are material entities. The other direction does not hold since non-physical endurants could very well be material entities. However, (t4), (t5), together with (t8), (t9), and (t10), show that there are endurants and qualities, all of which are particulars (see (t1)), that are neither continuants nor occurrents. This means that the partition of particulars into continuants and occurrents, assumed by bfo, is not preserved by the mapping: some endurants and qualities end up in an ontological ‘limbo’.
t5
t6
t7
t8
t9
t10
On the other hand, the imported entities do not necessarily cover all the categories of bfo. There are at least two reasons for that, one factual and one structural. The factual reason is that some of the dolce categories may be empty. For instance, if there are no features, then would be empty. Consequently, axiom (a6), which states that all universals are non-empty, is not preserved. The structural reason is that, according to the mapping, some categories (as defined in the definitions) are necessarily empty. This is the case of universals (assuming that there is at least a time in the domain of quantification, see (t2)). One solution is to consider some types of entities, for example, those that earlier we said are in an ontological ‘limbo’, as bfo universals. We already saw that some regions (e.g., abstracts) could be seen as universals. Similarly, one could assume that specific subclasses of non-physical endurant () are actually concepts behaving like bfo universals. Note that this approach has an important impact on the theorems one can prove.
One way to choose among the alternative approaches in developing the mapping, is to look at what the mapping optimises. For instance, strict and restrictive mappings, carefully set to not allow exchanges of entities that would be in the ontological ‘limbo’ of the target ontology, might be preferred in the context of data transfer, they are safer in this context. Mappings focussing on ontological significance and the maximisation of the number of imported entities are instead arguably better for the goals of comparing ontologies, highlighting core differences, and establishing general interoperability results.
The Mapping From bfo to dolce
The Definitions of dolce Primitives in the Language of bfo
In this section we introduce syntactic definitions of dolce notions in terms of bfo primitives. The starting point is the domain of bfo. We aim to classify and relate the entities in this domain in terms of the categories and relations of dolce. Analogously to the dolce to bfo direction in Section 5 and with similar motivations (see the analysis of Section 6.2), we obtain only a partial mapping covering a subset of the categories and primitives of dolce.
As discussed in Section 4.2, in dolce the agentive and social dimensions play an important role in characterising the subcategories of physical object () and non-physical endurant () but those dimensions are not considered in bfo. Similarly, the subcategories of perdurant () rely on the notions of homeomericity and cumulativity, but these notions are beyond the scope of bfo. This leads to excluding these subcategories from the mapping. Furthermore, while it is not clear how the distinction between amounts of matter and physical objects can be made, regarding features we can rely on the bfo categories of site, continuant fiat boundary and fiat object part. Thus, within the category of physical endurants () we include in the mapping only the category of features ().
We will see that physical endurants correspond to independent continuants that are not regions, see (d3), while the only entities that can be mapped to non-physical endurants are generically dependent continuants, see (d5). In dolce, arbitrary sums () have necessarily a physical and a non-physical part, see (a41). To match this axiom, in bfo we should find continuants that are not spatial regions and are neither independent nor generically dependent continuants. The only option is that of specifically dependent continuants. However, following our guidelines in Section 3, these are mapped to dolce physical qualities, as formalised in (d6). As a result, the category of arbitrary sums would be empty. For this reason, is not covered in the mapping.
In Section 5.2 we have seen that one of the main differences between dolce and bfo concerns the reduction of the parthood relations to spatial and spatiotemporal inclusions. In particular, in bfo there are no spatially co-localised distinct independent continuants (that are not object aggregates) and there are no spatiotemporally co-localised distinct occurrents. We then lack one of the main basis to build a constitution relation among endurants and perdurants. In principle, one could see the relation between a generically dependent continuant and the mereological sum of its carriers at a given time as a form of constitution. This solution is debatable and would model only a very limited notion of constitution, quite different from the general one in dolce. Thus, we refrain from introducing in the mapping.
Concerning qualities, bfo accepts only qualities inhering in independent continuants. From the point of view of dolce, this restricts the mapping to physical qualities only, see (d7).
Regarding regions, there are time intervals and space regions. To locate entities in time and space we need to rely on and only because dolce’s temporal and spatial locations have no correspondent entities in bfo. In particular, the and relations do not make sense in this setting. Alternatively, as previously discussed, one could import some bfo universals into dolce regions and assume that instance-of corresponds to ( composed with) . In this case, the ISA relation would correspond to the primitive , that in dolce is defined on regions. This alternative, which we do not investigate here, seems worth developing.
Finally, the categories for facts and for sets, that in the original taxonomy of dolce (Masolo et al., 2003) appear under abstract (), have not been taken into account in dolce-cl, thus we ignore them here.
Summing up, in the following we discuss these dolce notions:
syntactic definitions for the primitive relations: , , , , , , ; and
syntactic definitions for the categories: , , , , , , , , , , , , and .
Below we list these syntactic definitions providing just short informal descriptions, a deeper analysis about these definitions and their impact on the preservation of the axioms of dolce is done in Section 6.2.
Perdurants coincide with the disjunction of processes and process boundaries (see (d2) for the definition of ).
We rule out from endurants spatial regions and specifically dependent continuant that are mapped, respectively, to space regions and physical qualities, see (d10) and (d6).
Physical endurants are independent continuants that are not spatial regions.
Features are the union of sites, continuant fiat boundaries, and fiat object parts.
Non-physical endurants coincide with generically dependent continuants.
Physical qualities coincide with specifically dependent continuants. Note that relational qualities and realisable entities, like roles and dispositions, are imported into physical qualities.
Only physical qualities exist (bfo has only qualities of independent continuants).
Time intervals coincide with temporal regions. One could consider a stronger definition, that is, assume that corresponds to . However, following (M2) we aim to import all the temporal regions.
Among temporal regions there are only time intervals.
Space regions coincide with spatial regions.
Among physical regions there are only space regions.
Regions are temporal regions or spatial regions.
Among abstracts there are only time intervals and space regions.
The temporal location of is the maximal time at which exists. (In bfo, is defined only on processes and process boundaries).
For perdurant and time intervals coincides with while for space regions it reduces to constant parthood, that is, is a continuant part of during its whole existence (note that in bfo, all the spatial regions exist at least at a time).
For endurants coincides with .
is not defined on process boundaries while dolce does not impose any constraint on the kind of the process involved in . The existential quantification on aims to mitigate this difference.
The relation of direct quality coincides with inheritance (see (d8)).
is the generalisation of , , and ( counts as a sort of mutual dependence).
For independent continuants that are not spatial regions, coincides with (first disjunct). The spatial location, at , of a generically dependent continuant is the spatial region of the maximal entity (if it exists) on which generically depends on at (second disjunct). The spatial location of a quality at is the spatial region of the maximal (at ) entity (if it exists) in which inheres (third disjunct). The spatial location of a perdurant is the spatial projection of its spatiotemporal location (fourth disjunct). Note that in bfo, no axiom guarantees that a specifically dependent continuant inheres in a unique continuant, therefore for qualities we define the spatial location only when there is a ‘maximal continuant’ in which inheres in. A similar argument applies for generic dependent continuants.
One may think that the mapping given by these definitions is too restrictive: In dolce physical and non-physical endurants are not limited to, respectively, independent continuants and generically dependent continuants (the latter are a quite special kind of entities), and so on. This may suggest to include as well as , but not the converse formulas. However, we need to remember that we are considering the mapping from bfo to dolce. The starting point is the domain of bfo, that is, we need first of all to try to classify the entities that belong to the domain of bfo in terms of the categories of dolce. By looking at the mappings it is easy to see that all the particulars25 of bfo are classified in terms of dolce categories except for spatiotemporal regions:26 (i) independent continuants which are not spatial regions are mapped to physical endurants; (ii) spatial regions to space regions; (iii) generically dependent continuants to non-physical endurants; (iv) specifically dependent continuants to dolce qualities; (v) process and process boundaries to perdurants; and (vi) temporal regions to time intervals. The fact that dolce intuitively accepts, for instance, additional physical or non-physical endurants suggests that, in general, dolce has a domain of particulars larger than the domain of particulars of bfo. Our methodological choice (M3) does not allow the mappings to ‘enrich’ the domain of bfo with new entities. However, by introducing the mapping at the semantic level, that is, in this case, as an operator translating any bfo-structure into a dolce-structure, one could enrich the original domain of bfo by set-theoretically building new entities. This is the approach followed in the literature to map theories of time based on points to theories based on intervals, see (Van Benthem, 1983). The theory based on intervals does not admit points, but points can be built as sets of intervals satisfying a certain relation definable in the theory of intervals. Analogously, one can think of building some entities, for example, statues as opposed to amounts of clay, starting from the amounts of clay and the specific kinds of universals they instantiate. Like in Section 5, the investigation of this kind of ‘semantic mapping’ is not considered in this work.
Furthermore, notice that (a6) states that all the universals are non-empty. From an applicative perspective, this is a strong constraint because it forces the user to model even entities which might not be relevant to the application (and perhaps time consuming to analyse). Note that, once (a6) is removed, the existential constraints in are compatible with the instantiation of only some categories. From a technical perspective, (a6) requires very large models that make the identification and management of counterexamples very difficult even with the support of dedicated software. For these reasons, we do not consider (a6) in the comparison.
In the following, is the set (a6) with the definitions of Section 6.1, that is, (a6)(d1),…,(d20). Some axioms of dolce use relations or categories that have not been defined in the mappings. When possible and relevant we considered approximations of these axioms as expressible in the dolce language, otherwise we set them aside. As already said, the first column in the tables in Appendix shows which axioms (or their approximations) have been preserved, non preserved, or disregarded. Additional details as well as the proofs of all the theorems are in Masolo et al. (2023). Again, in the following analysis we mainly focus on the results that point out some ontological differences between and .
Analysis
The list of dolce axioms, including the information on whether and how they are preserved by the mapping, is presented in Appendix. Note that the axioms concerning the taxonomical constraints depicted in Figure 2 are also preserved (relatively to the categories considered in the mappings) but are not listed in Appendix. Following what done in Section 5.2, we start our qualitative analysis by considering where bfo entities end up in dolce via the mapping. With the exception of the spatiotemporal regions, all the particulars of bfo find their place in dolce, in the sense that each instance of a bfo universal is mapped to be an instance of some dolce category. Spatiotemporal regions are not mapped to some dolce category even though, as already anticipated, one could extend dolce with a category for spatiotemporal regions. Furthermore, it is not clear to us, in terms of expressive power, what spatiotemporal regions add to temporal and spatial regions. On the other hand, the universals of bfo are not mapped into dolce. However, bfo universals could be mapped in dolce using alternative mapping strategies, some of which have been suggested already: for example mapping universals to regions or to non-physical endurants. Both alternatives would require to reconsidering the mappings (d5) and (d12), and perhaps (d9) and (d11).
From the opposite point of view, that is, analysing what happens to dolce categories by following the mapping from bfo, one notes that some of these are empty (i.e. no entity of bfo can instantiate them). This is the case of , , , and . Other categories are drastically reduced or expanded: physical qualities/regions collapse to spatial locations/space regions, temporal intervals are mapped to time regions. In agreement with earlier observations, the reduction or emptiness of some categories together with the multiplicativist approach adopted by dolce27 suggest that dolce accepts a larger domain of particulars. Whether this larger variety of entities translates into a higher expressive power needs to be further investigated (but see (Otte et al., 2022) and (Borgo et al., 2022a) for an analysis of how classical representation examples are treated in and ).
Concerning the preservation of the axioms of dolce, our results show that few, and quite technical, differences between and prevent the preservation of several axioms. In dolce, is defined only for the entities that have a temporal extension, see (d6), while in bfo is a primitive relation.28 In bfo an entity may exist at two non overlapping temporal regions without existing at any bigger temporal region. In these situations the condition in the definition of , see (d14), is never satisfied, and consequently the relation is not satisfied. This difference hampers the preservation of several axioms of dolce because is heavily used in the characterisation of the primitives of dolce (in particular ). At the same time, the difference seems to have a technical, more than ontological, nature; after all, in both ontologies the entity exists at both times. A similar problem can be seen in axioms like (a35) and (a55) where the holding of a relation at two times does not suffice to infer that the relation holds also at the sum of those times (provided such sum exists). However, the fact that an entity exists at two times but not at their sum prevents, for the majority of relations, the possibility for the relation to hold at the sum. The bfo relation is one of these relations: A continuant can occupy spatial regions (even the same spatial region) at two given times without having a spatial location at the sum of these times. This fact, together with the definition of (d20), clarifies why (a84) and several other dolce theorems involving are non preserved.
These difficulties may suggest that a different mapping technique could be preferred. Rather than encapsulating the relation between and into the mapping (d14), one could take a more articulated procedure: first introduce a direct mapping between and , then define by using (as defined in the mapping), for example,
In this way, entities that, for instance, exist at two times but not at their mereological sum would still lack a temporal location but this would not prevent from holding when applied to them (due to the first mapping). We can then try to recover some theorems that do not require at the price of possibly losing the connection between and originally present in . The applicability of this mapping technique to other primitives and its actual consequences are not further studied in this paper.
The problems highlighted up to this point were discussed because of their impact on the mapping but are primarily of technical nature. The mapping highlights also genuine ontological differences. First, the defined , like , does not satisfy the supplementation axiom (a7) assumed in dolce. Second, the relation is not equivalent to the original , in particular it does not satisfy (a9), that is, the fact that (direct) qualities inhere in a unique bearer. For relational qualities this can be problematic if, for instance, one wants to distinguish ‘Mary’s love for John’ from ‘John’s love for Mary’: in both cases we obtain a relational quality inhering in both John and Mary. Third, the reflexivity of the defined , like the one of , does not hold for all the continuants. It follows that in bfo one can have continuants for which is not defined. Fourth, while in dolce all the perdurants must have participants at every time they exist (a56), bfo guarantees the existence of participants for processes but not for process boundaries (see (a113)). Furthermore, in bfo, continuants do not necessarily participate in processes while in dolce all the endurants participate in a process when they exist (a57).
Grounding OWL Mappings on FOL Mappings
For both and an OWL version exists. These versions are used when usability and tractability become important applicative factors that prevail over the deep and explicit characterisation of ontological commitments. Given the low expressive power of OWL, the majority of the axioms in the FOL versions and the majority of the mappings introduced in the previous sections – mappings that, in their turn, are expressed as FOL syntactic definitions – cannot be preserved or can be only very roughly approximated. The semantic correspondences established via the FOL mappings can however be used to ground and justify OWL mappings.
In OWL, mappings are arguably among the most important ones. A mapping assures that a class of an ontology is subsumed by a class of another ontology. To justify, for instance, a mapping from to (where is a category and is a category) the following two conditions must hold (where is the set of all the subcategories of in the taxonomy in Figure 2 that have been explicitly considered in the mapping from to ):
(c1)
(c2) , for all
Condition (c1) assures that is subsumed by while (c2) assures that is not subsumed by any subcategory of , that is, (c1)+(c2) guarantee that is the minimal category subsuming .
For instance, the holding of:
justifies a mapping from material entities to endurants (the category of arbitrary sums has been ruled out from the mapping from to ).
A similar reasoning can be done for and mappings. Note however that sub-properties mappings often require to restrict the domain and range of the source relation to conjunctions of classes. To explicitly express this kind of mappings, the Semantic Web Rule Language (SWRL)29 might be needed.
The construction of an ontology ecosystem for information classification and exchange requires formal alignments across ontologies. From the theoretical viewpoint, the most essential alignments are those across foundational ontologies. This paper has presented a general strategy and has motivated some methodological assumptions for such a construction. Furthermore, it has applied this approach to two well-known ontologies, namely bfo and dolce, which might seem to provide an ideal test for formal alignments due to their overall similarity. The analysis has revealed that similarities may hide deep differences and that even apparently minor modelling differences have a relevant impact when formal alignments are sought.
The applied ontology research community has not yet investigated in depth the formal alignment of foundational ontologies. One can imagine the development of a toolkit of strategies, assumptions and methodologies to guide the construction of formal ontology alignments depending on what one aims to preserve across the systems. Some alternatives have been already highlighted in the paper, including a discussion of their consequences. Yet, there are many alternatives and we lack even a general framework to systematically classify them, leaving aside comparing them. Overall, the study in this paper has made clear that the strong interactions existing across the notions used in a foundational ontology, let them be categories or relations, may turn even small changes in the adopted definitions into important changes in how much of the two systems the mapping may hope to align.
As foundational ontologies mature in terms of their formalisation and conceptual coherence, the exploitation of formal alignments across these systems acquires more relevance and gets the community closer to the construction of an ontology-based marketplace for reliable data and information exchange.
In the future, we plan to deepen the analysis here presented comparing what can be gained and lost depending on the strategy, assumptions and methodologies. The deliverable D2.7 of the OntoCommons project (Masolo et al., 2023) investigates formal alignments between and the ‘Elementary Multiperspective Material Ontology’ (emmo)30 that embraces a completely different ontological commitment.
Footnotes
Acknowledgement
The authors thank the participants of the OntoCommons project for their support and in particular Emanuele Ghedini, Francesco A. Zaccarini, Luca Biccheri, Nicola Guarino, Daniele Porello, Emilio M. Sanfilippo, and Laure Vieu.
ORCID iDs
Francesco Compagno
Stefano Borgo
Claudio Masolo
Funding
The author(s) received the following financial support for the research, authorship and/or publication of this article: This article has been developed within the OntoCommons project (GA 958371, ontocommons.eu) and in part within the project ERC Advanced Grant C-FORS (GA 101054836).
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Notes
‘+’: preserved axiom
‘∼’: preserved a weaker version of the axiom (the version proved in reported between ‘[⋯]’)
‘°’: non preserved axiom
‘ ’: axiom discarded because it contains non-mapped primitives
The Theory B and Axioms Preservation from Db
+
ab1
eto
par(x)∨uni(x)
+
ab2
qkp
par(x)→¬uni(x)
∼
ab3
ekp
x::tu→par(x)∧uni(u)∧tm(t) [only for the universals considered in the mapping]
+
ab4
oap
EX(x,t)→par(x)∧tm(t)
∼
ab5
bee
EX(x,t)↔∃u(x::tu) [only for the universals considered in the mapping]
ArpR.SmithB.SpearA. D. (2015). Building ontologies with basic formal ontology. The MIT Press.
2.
BorgoS.FerrarioR.GangemiA.GuarinoN.MasoloC.PorelloD.SanfilippoE. M.VieuL. (2022a). DOLCE: A descriptive ontology for linguistic and cognitive engineering. Applied Ontology, 17(1), 45–69.
3.
BorgoS.GaltonA.KutzO. (2022b). Foundational ontologies in action. Applied Ontology, 17(1), 1–16.
4.
ChuiC.GrüningerM. (2014). Merging the dolce and psl upper ontologies. In Proceedings of the international joint conference on knowledge discovery, knowledge engineering and knowledge management - volume 2, IC3K 2014, (pp. 16–26). SCITEPRESS - Science and Technology Publications.
Fernández-LópezM.Gomez-PerezA.JuristoN. (1997). Methontology: From ontological art towards ontological engineering. Engineering Workshop on Ontological Engineering (AAAI97), AAAI Technical Report SS-97-06, 33–40.
7.
GardnerS. P. (2005). Ontologies and semantic data integration. Drug Discovery Today, 10(14), 1001–1007.
8.
GrenonP.SmithB.GoldbergL. (2004). Biodynamic ontology: Applying bfo in the biomedical domain. Studies in Health Technology and Informatics, 102, 20–38.
9.
GuarinoN. (2017). BFO and DOLCE: So far, so close.... Cosmos+Taxis, 4(4), 10–18.
10.
ISO 21838-2 (2021). Information technology – top-level ontologies (tlo) – part 2: Basic formal ontology. Standard ISO/IEC 21838-2:2021, International Organization for Standardization, Geneva, CH.
11.
ISO 21838-3 (2023). Information technology – top-level ontologies (tlo) – part 3: Descriptive ontology for linguistic and cognitive engineering (dolce). Standard ISO/IEC 21838-3:2023, International Organization for Standardization, Geneva, CH.
12.
ISO 24707 (2007). Information technology – common logic (cl): A framework for a family of logic-based languages. Standard ISO/IEC 24707:2007, International Organization for Standardization, Geneva, CH.
13.
JarrarM.MeersmanR. (2009). Ontology engineering – The DOGMA approach. In T. S. Dillon, E. Chang, R. Meersman, & K. Sycara (Eds.), Advances in web semantics I: Ontologies, web services and applied semantic web (pp. 7–34). Springer. https://doi.org/10.1007/978-3-540-89784-2_2.
OukselA.ShethA. (1999). Semantic interoperability in global information systems: A brief introduction to the research area and the special section. SIGMOD Record, 28, 5–12.
18.
SmithB.AlmeidaM.BonaJ.BrochhausenM.CeustersW.CourtotM.DipertR.GoldfainA.GrenonP.HastingsJ.HoganW.JacuzzoL.JohanssonI.MungallC.NataleD.NeuhausF.OvertonJ.PetosaA.RovettoR.RuttenbergA.ResslerM.RudnikiR.SeppalaS.SchulzS.ZhengJ. (2020). Basic formal ontology 2020 - specification and user’s guide (revised version from september 9, 2020). Technical report.
19.
Suárez-FigueroaM. C.Gómez-PérezA.Fernández-LópezM. (2012). The NeOn methodology for ontology engineering. In M. C. Suárez-Figueroa, A. Gómez-Pérez, E. Motta, & A. Gangemi (Eds.), Ontology engineering in a networked world (pp. 9–34). Springer. https://doi.org/10.1007/978-3-642-24794-1_2.
20.
SutcliffeG. (2016). The CADE ATP system competition - CASC. AI Magazine, 37(2), 99–101.
21.
Van BenthemJ. (1983). The logic of time. Kluwer Academic Publisher.
22.
W3C OWL Working Group (2012). OWL 2 web ontology language document overview (second edition).