Abstract
The re-engineering of vocabularies into ontologies can save considerable time in the development of ontologies. Current methods that guide the re-engineering of thesauri into ontologies often convert vocabularies merely syntactically and ignore problems arising from interpreting vocabularies as ontologies, i.e. as sets of statements of facts. Current re-engineering methods also do not make use of the semantic capabilities of formal languages in order to detect logical mistakes and improve vocabularies. In this paper, we introduce a content-focused method for building domain-specific ontologies based on a thesaurus, a popular type of vocabulary. Application of the method results in an ontology that not only adheres to the semantics of the description logic OWL, but also contains a semantically rich description of the modeled entities, enables non-trivial, automated reasoning, and can be integrated with other ontologies following the same development principles. We explain the motivation and sub-activities for each of the steps in our method and illustrate their application through a case study in the domain of agricultural fertilizers based on the ACROVOC Thesaurus. Our method shows, first and foremost, that a considerable manual effort is required to derive a semantically rich ontology from a thesaurus, particularly in connection with the alignment to a top-level ontology as well as for the identification and formal specification of membership conditions. Applying our method will likely change the structure of a thesaurus considerably. Our method is particularly useful where a highly reliable is-a hierarchy or consistent definitions are crucial.
Introduction
The creation of knowledge-dense ontologies can take tremendous time [80]. For this reason, it is desirable to re-use existing models as ontologies [79]. Also, the re-engineering of non-ontological models for their use as ontologies has become popular. Controlled vocabularies (referred to as “vocabularies” in the following), more recently known as knowledge organization systems and often referred to as terminologies, are examples of non-ontological resources and are generally considered interesting candidates for re-use as ontologies [91,92]. The reason is that such vocabularies have often matured over decades and contain several thousand or even hundreds of thousands of concepts and natural language terms. This eliminates or at least reduces the effort of eliciting concepts in the ontology development process. Second, the concepts in a vocabulary are generally structured through a number of relationships. These relationships can be used as a starting point for developing the structure of an ontology.
Today’s practices of re-engineering vocabularies into ontologies differ significantly. The main reason is that there are different understandings of ontologies. In this paper, we refer to ontologies as statements of necessary and general features of a certain domain of reality in a computable formal language like
Although there are numerous ontologies that are coded in OWL-DL, for example the ontologies published by the OBO Foundry [58], we are not aware of any publication that describes an
Cardillo et al. [13] developed a script that converts a thesaurus into an OWL representation without any further consideration of the correctness of the results or using the expressive potential of OWL to define the meaning of classes. The report of the NeOn project [3] does mention the re-engineering of a vocabulary into an OWL-DL ontology as “TBox re-engineering”, but only refers to the Scarlet program [71] and the use of WordNet without detailing any exact procedure.
There are numerous general ontology engineering methods, some of which have the specific focus of developing an OWL-DL ontology, e.g., Borgida and Brachman [8] or Noy and McGuinness [56]. Moreover, there are content-focused methods that provide guidance for
Our method aims not only at compliance to OWL standards, but also at developing
make full use of the semantic expressivity of OWL,
are consistent and provide true reasoning results and
can be integrated with other ontologies following the same development principles.
The method we will present guides specifically the re-engineering of a
The paper is structured as follows: In the subsequent Section 2 we explain the distinctive characteristics of OWL DL-compliant ontologies and how they shape our re-engineering approach. Section 3 details how the re-engineering method was derived. Section 4 discusses the individual steps of our re-engineering method. In an earlier paper [43], we already provided an outline of the method. The present paper presents a matured version of the method in much more detail. In Section 5 we will reflect on the method as a whole and summarize our results in Section 6.
Characterization of the re-engineering method
Our goal is the re-engineering of a thesaurus into a semantically adequate OWL-DL ontology. In this section, we describe the essential properties of OWL-DL ontologies and contrast these with other approaches to pinpoint the distinguishing features of our re-engineering approach.
Characteristics of OWL-compliant engineering
The OWL syntax and the associated description logic semantics constitute a formal system that supports automated reasoning based on membership conditions and other features. Automated reasoning can be used both for automated consistency checks of ontologies (i.e. for proving the absence of contradictions) and inferring facts that have not explicitly been asserted [4]. Advantages of OWL-DL are its computational tractability, the reasoning support for consistency checking and for the generation of the inferred class hierarchy, as well as the use of XML-based syntaxes and unique identifiers (IRIs and URIs). These are also the reasons for our focus on OWL-DL ontologies in this paper.
A complete overview of the characteristics is given in the OWL specification [93]. Of particular importance are the strict separation of TBox and ABox, i.e., of classes and individuals, and the strict semantics of relationships.
OWL-DL standardly distinguishes between instances (individuals) and abstractions of them (classes). This separation corresponds to the distinction of the so-called TBox and ABox: The TBox, the ‘terminology component’, “contains intensional knowledge in the form of a terminology and is built through declarations that describe general properties of concepts”. By contrast, the ABox, the ‘assertion component’, “contains extensional knowledge – also called assertional knowledge – knowledge that is specific to the individuals of the domain of discourse” [54]. In other words, the TBox concentrates on the intensional specification of classes using previously specified relationships while the ABox uses the definitions made in the TBox to describe particular things (individuals). The TBox acts, thus, as a metamodel for the ABox, “a model that consists of statements about models” [40]. We follow Guarino et al. [25] and Schulz and Jansen [39] in considering only intensional knowledge (the TBox) to be part of an ontology proper.
Concepts in thesauri are – with some exceptions – intensional entities that are labeled by general terms, terms that are predicable of more than one individual [12]. As is shown in Fig. 1, re-engineering thesauri into ontologies means that the majority of the thesaurus content (see (b) in Fig. 1) ends up in the TBox (2). Only few thesaurus concepts, in specific references to particular things such as the Mekong River or Rocky Mountains, end up in the ABox, but are then not considered part of the ontology (TBox).

TBox re-engineering process for thesauri and other types of vocabularies.
A thesaurus uses a data model (see (a) in Fig. 1) to determine its logical structure and organize its content [16]. Shifting the content of the thesaurus into the TBox requires structural re-engineering that is caused by the differences between the thesaurus data model (a) and the metamodel that underlies the formal system and, thus, the ontology language (see (1) in Fig. 1).
A specific use of relationships is another significant characteristic of OWL-compliant ontologies. When using relationships in the TBox of such ontologies, one needs to respect the following three characteristics:
Relationships involve implicit or (in OWL) explicit quantification, which is relevant for the semantics of relational expressions [76]. Statements like
Any relationship from a class
This implies that the relationship ‘
In OWL, these rules also apply to the ABox, with the difference being that the relationships describe characteristics of the instances of classes, not membership conditions. The rules apply for all relationships except for the subclass relationship and the instantiation relationship, which are built-in features of OWL-DL.
OWL brings with it certain limitations, which are often discussed as
OWL is limited to countable quantifiers (
Unlike some forms of modal logic, OWL has no primitives that could express the modality of a statement, i.e. a statement cannot be qualified by through modal operators such as “It is
OWL has no primitives that can express the tense or aspect of a statement, e.g., statements like “John
As far as the definition of general terms through classes is concerned, OWL can only provide statements that are true for all members of the class, not just some members; i.e. the (true) statement “Some fertilizers pollute soil” cannot be expressed, only the (false) “All fertilizers pollute soil”.
As OWL has no variables, it has peculiar problems with nested quantification and identity statements.
These limitations are particularly significant when comparing ontologies described in OWL with thesauri and hence represent problems for any project of re-engineering a thesaurus into an ontology, including the present case study.
In Knowledge Engineering, the term “ontology” is often used in a loose sense to describe different types of models that do not share the characteristics of OWL-DL ontologies which we described in the preceding section. There are at least two groups of approaches in the re-engineering literature according to the respective ‘loose’ sense these papers connect with the term “ontology”.
A first group of re-engineering publications does not refer to any specific description language of ontologies, but rather associates ontologies with the freedom of defining customized relationships between concepts, which is not normally the case with thesauri and other types of vocabularies. This re-engineering approach, thus, focuses on changing the content of a given thesaurus, mainly through a refinement of relationships without changing the overall structure of a thesaurus. An example is Soergel et al. [83], whose approach also underlies the publications of Kawtrakul et al. [42] and Sánchez-Alonso and Sicilia [73] and has similarities with ontological augmenting of thesaurus relationships described by Tudhope et al. [89].

ABox “re-engineering” process for thesauri and other types of vocabularies.
In a second group of re-engineering publications, an ontology is a description of a model that uses the Resource Description Framework (RDF) [64] and adheres to the RDFS Semantics [29]. According to this approach, as displayed in Fig. 2, re-engineering a thesaurus or other vocabulary type into an ontology means, first, to describe the data model of a thesaurus or other vocabulary types (a) in a schema (2) using RDFS (1). In the case of thesauri it can suffice to adopt the Simple Knowledge Organization System (SKOS) [34], which is closely oriented on the thesaurus data model described in ISO 25964-1:2011 [36]. The schema is then used to describe the content of a domain-specific thesaurus or vocabulary (b), which does not require any structural changes in the thesaurus or vocabulary [3] and, thus, does not check the adequacy of the content. This kind of re-engineering is, therefore, easy to automate and re quires relatively little manual work. Such re-engineering is described by Villazón-Terrazas [92] and used by the NeOn project [3] and van Assem [91]. Both Villazón-Terrazas and the NeOn project use the terms “TBox re-engineering” and “ABox re-engineering” in ways that do not comply with DL semantics.
Neither of the two groups of re-engineering approaches separates classes from individuals. E.g., in RDFS, instances of classes can themselves be classes. A datatype (rdfs:Datatype), for example, is both an instance (rdf:type) of a class (rdfs:class) and a subclass of such class (rdf:SubClassOf). Moreover, instances can have instances, which is not possible in OWL-DL. For example, the sub-class relationship (rdf:SubClassOf) is declared an instance (rdf:type) of a property (rdf:Property) and a property (rdf:Property) is declared an instance (rdf:type) of a class (rdfs:class) [29].
Moreover, neither of the two approaches recognizes the importance of defining necessary membership conditions of classes. As a result, the ontologies produced by these re-engineering approaches cannot be used as OWL-DL ontologies (while it is not difficult to represent OWL-DL ontologies in RDFS using the mappings from the OWL specification [34]). Thus far, the re-engineering strategy presented here differs significantly from these two groups of approaches.
The re-engineering method that we present in this paper was developed in two phases: We started with (1) developing a naïve re-engineering method based on previous literature and then (2) refined and validated the method during the case study. In the first phase, we compared the structure of thesauri with the structure of ontologies theoretically. More specifically, we compared the thesaurus structure described in the thesaurus standard ISO 25964-1:2011 [36] with the structure of realist ontologies [44] and their specific representation in the description logic OWL [32,53]. Based on this structural comparison, we translated the identified differences and similarities into an initial set of steps for re-engineering thesauri into ontologies.
Additionally, we elicited certain steps for the general development or engineering of semantically adequate ontologies from the literature. However, we did not find any single method comprising all the steps that we have adopted. The combination of the steps from the theoretical analysis and the general ontology engineering literature constituted the naïve re-engineering method and is laid out in Appendix A.
In the second phase of refining and validation, we applied the naïve re-engineering method in a case study in order to re-engineer a portion of an existing thesaurus into a semantically adequate ontology. In this course, we added, merged or removed certain steps, changed their sequence and introduced sub-activities. Appendix A provides an overview of the changes by showing how the steps of the naïve re-engineering method are related to the steps in the final re-engineering method that we will introduce in the following section.
During re-engineering we were confronted with two challenges. First, re-engineering was highly time-consuming, which was anticipated. This challenge limited the number of representational units that could be feasibly re-engineered in the case study. In a real-world scenario, time is, of course, correlated with costs. Second, a variety of skills are required for the re-engineering that is rarely concentrated in a single person: knowledge of the structure of thesauri, experience in logic-based modeling (here: experience in the correct use of the modeling language OWL), familiarity with an appropriate modeling tool, knowledge of specific philosophical notions, familiarity with specific existing top-level and domain-specific ontologies, but also knowledge in the domain of the thesaurus to be re-engineered (here: agriculture). This challenge we met by working in a team to cover the required skills.
For the case study we chose the fertilizer branch of the AGROVOC thesaurus [2], which comprises 31 concepts subordinated to ‘Fertilizers’. Additionally, we re-engineered a number of other concepts from the AGROVOC thesaurus that are closely related to fertilizers and were frequently needed when defining membership conditions of fertilizer types (step 3 of our method) and formalizing these (step 5), for example ‘plant nutrient’. We chose the fertilizer-related portion of the AGROVOC thesaurus because of the specific interest in a project participant in a fertilizer ontology and because the AGROVOC is a mature and widely used thesaurus.

Method for engineering quality ontologies based on thesauri.
Our re-engineering method consists of seven steps that are shown in Fig. 3. In the figure, the big arrows connecting the steps indicate that the method is expected to be applied sequentially, except for steps 3 to 6 that form a block of strongly interdependent and, thus, iteratively applied steps. Appendix B provides a more detailed overview of the method by summarizing the sub-activities for each step. The following subsections will, for each of the steps, discuss the purpose, provide an explanation of the activities involved, demonstrate the step to re-engineer the chosen portion of the AGROVOC thesaurus, and, finally, discuss the respective step. The demonstration of each step is structured according to the sub-activities that we will introduce in the explanation of the step.
By applying the re-engineering method, the thesaurus gradually becomes an ontology. We will refer to this model-in-transition as the “re-engineered thesaurus” or the “emerging ontology” to avoid confusion. Steps 3, 5, 6 and 7 will likely be realized using an ontology editor like Protégé [62].
Step 1: Preparatory refinement and checking of the thesaurus
Purpose
We base our re-engineering method on the thesaurus standard ISO 25964-1:2011 [36]. However, thesauri are not necessarily in line with this particular standard: thesaurus standards have been developed and changed over time, whereas the data structure of an actual thesaurus system is practically inert after it has been implemented. Hence it is possible or even likely that domain-specific thesauri may often not have adopted all the changes in the standards and re-engineering should begin with checking and refining the thesaurus so that further steps can rely on a stable basis. Furthermore, some of the optional features of thesauri-like node labels for indicating characteristics of division of the thesaurus concepts can be helpful for later analytical steps; for this reason, we encourage their use at this point.
In some cases, the refinement of the thesaurus may be impeded by the specific thesaurus management software used. For this reason, this step may be customized, combined with other steps or even skipped if the specific case of the re-engineered thesaurus requires or allows doing so. Nevertheless, various activities of this step are pivotal for deriving a useful basis for the is-a hierarchy of an ontology.
Actions to be taken
In accordance with the ISO thesaurus standard ISO 25964-1:2011, a thesaurus should possess the following features, and we will now discuss the matching sub-activities to ensure these features:
Distinction between concepts and terms.
Distinction between different types of hierarchical relationships.
Rejection of invalid relationships.
Removing hierarchical cycles.
Assigning orphans to the thesaurus hierarchy.
Identification of arrays of concepts based on common characteristics of division.
(a) The distinction between
(b)
(c) In the course of differentiating hierarchical relationships, relationships that fail to conform to the semantics of “relationship”, as defined in the thesaurus standards, should not be represented in the emerging ontology. Also,
(d) The thesaurus should also be analyzed for
(e)
(f) For later steps in the re-engineering method it is worth introducing
Thesauri may contain further kinds of errors such as unidirectional relationships between concepts, different thesaurus relationships between the same pair of concepts, terms with exactly the same spelling assigned to different concepts, or hierarchical or associative relationships between non-preferred terms in term-based thesauri. Such errors may become the source of populating structural problems in thesauri that may be difficult to resolve later. They also result in mistakes when adopted in the ontology and should be detected by thesaurus management software [36]. We will not further discuss such errors here.
Application of the step to the fertilizer ontology
(a) The AGROVOC thesaurus does not distinguish between concepts and terms. Unique identifiers (term codes) are provided for terms, but not for concepts. Figure 4 shows how the thesaurus terms have been transformed to be compatible with the concept-based thesaurus structure recommended in ISO 25964-1:2011. While non-preferred terms point to a preferred term in the original term-based thesaurus, a concept is introduced for every preferred term when changing to a concept-based thesaurus. The preferred term and the non-preferred terms point to the concept in a concept-based thesaurus and their status as either preferred or non-preferred terms is indicated through different relationships or in meta-information about a term. The described separation between terms and concepts did not require a distinct effort, but could be realized implicitly in the course of the naïve conversion (step 2).

Conversion process from a term-based thesaurus like the AGROVOC to a concept-based thesaurus.
(b) Like many other thesauri, AGROVOC does not distinguish between different types of hierarchical relationships. However, our analysis revealed that all hierarchical relationships between ‘fertilizer’ and its subordinated concepts are, by chance, proper generic relations between super-concepts and sub-concepts, as shown in Fig. 5. Other parts of the AGROVOC thesaurus do also display the other types of hierarchical thesaurus relationships like the instance relationship (Colorado River–Rivers) or the hierarchical part-of relationship (Root hairs–Roots).

Concept hierarchy in the re-engineered thesaurus. (Capitalization follows AGROVOC).
(c) We noted some erroneous relationships amongst the fertilizer-related concepts. Specifically, some concepts were hierarchically related and associated at the same time. For example, ‘Biofertilizers’ was not only associated with ‘Fertilizers’, but also hierarchically subordinated to ‘Fertilizers’ (along the path of ‘Organic fertilizers’). The erroneous associative relationships were simply ignored in our case study because, as we will argue in Section 4.2, they will not be transferred into the ontology. We did not encounter relationships using a non-preferred term as a relatum that we would have to consider as a structural relationship in the ontology. We found only one scope note connected to a non-preferred term; it could easily be assigned to the concept itself because there was no competing scope note for the preferred term.
(d–e) We could not detect any
(f) The AGROVOC thesaurus does not contain any
Our analysis revealed that checking and refinement of a thesaurus against standards is necessary to ensure a reliable basis for subsequent steps of the re-engineering process. At this stage, the fertilizer-related part of the AGROVOC thesaurus now conforms to the ISO standard.
Purpose
Naïve conversion aims at a representation of the thesaurus in the formal language OWL-DL, a standard format that allows an unambiguous interpretation of the emerging ontology. Moreover, the formal representation in OWL-DL allows using automated reasoning tools to check the ontology for consistency (the absence of contradictions from the joint assertions made in an ontology), and infer the full class hierarchy in later steps.
Because of some fundamental structural differences between the thesaurus data model and the structural specification of OWL-DL, we connect the syntactic conversion with some initial structural changes of the thesaurus. Nevertheless, at this stage these, changes are implemented naïvely or mechanically only. It is, thus, possible that the model resulting from the syntactic conversion shows inconsistencies and contradictions that can later be detected through automated reasoning. The correction of these inconsistencies and contradictions is the subject of the following re-engineering steps.
Actions to be taken
In this step, two actions are to be applied sequentially:
Choice or development of conversion tools.
Conversion of the thesaurus into the formal language.
(a) It is desirable to carry out the described syntactic conversion automatically with conversion tools, particularly when the goal is to re-engineer a complete thesaurus. The likelihood of being able to use existing tools, e.g., the one by Cardillo et al. [13], instead of needing to write customized scripts or programs is higher if the thesaurus is available in common exchange formats such as SKOS [35].
(b) After the refinement in step 1, the thesaurus is assumed to be concept-based according to ISO 25964-1:2011. On this basis, we can convert the thesaurus syntactically into a representation through a formal language by applying the mappings between representational units in thesauri and OWL as shown in Fig. 6. The diagram is to be read as follows: some concepts (in thesauri) reference individuals (in OWL). The name of the relation (
Thesaurus concepts and facets in the role of top-level elements, can either correspond to an intensionally specified class or an intensionally specified datatype. In some cases, thesaurus concepts do not correspond to intensionally specified classes, but rather refer to individuals (e.g., the Yangtze River) or a particular collection of individuals (e.g., the Rocky Mountains as a particular collection of mountains). When defining the classes or the individuals in the emerging ontology, it is advisable to adopt RFC 3986 [67] or other conventions for the names of the entities (the identifiers, called URIs and IRIs in OWL).

Relatedness of the relata in thesauri and the relata in OWL.
The terms of a thesaurus and the labels of the facets now become labels of classes. Language tags can be used to distinguish the languages of the labels. Subtypes of labels need to be defined if it is desired to keep the distinction between preferred and non-preferred terms. Definitions, scope notes, and other notes and housekeeping information can be transferred to comments or appropriately defined subproperties thereof. It might also be desirable to transform node labels into
Figure 7 shows mapping for relationships using the same notation. The

Relatedness of relationships in thesauri and relationships in OWL.
Correct
OWL allows ordering relations hierarchically by means of subproperty axioms. Such an axiom expresses an is-a relationship between two relations, e.g., the structural-part-of relation can be said to be a subproperty of the part-of relation. Such hierarchies of relation are, however, not typical for thesauri.
Application of the step to the fertilizer ontology
An automatic syntactic conversion was not worth the effort in our re-engineering case, first, because of the particular export formats, second, because of the small portion of the thesaurus that we actually wanted to target. Although the AGROVOC website offers an OWL version of the AGROVOC thesaurus, this file has (1) computing problems as well as (2) structural problems:
(1) With a size of about 400 Megabytes, the file was far too large to be processed efficiently. It required a computer with 8 processing cores and 8 GB of free memory to load the file in less than an hour. We know of no programs that support splitting ontology files of such a size into smaller portions.
(2) The way the OWL file is structured is not useful for our purpose. Most classes are direct siblings of the top concept “Thing” and very few classes are subordinated by the subclass axiom. We wanted to start with the class hierarchy mirroring that of the original AGROVOC thesaurus. An even bigger problem is that the class labels were not attached to the classes in a way that Protégé could display them.
For these reasons, and since we wanted to re-engineer a relatively small portion of the AGROVOC thesaurus only, it was faster for us to enter the class hierarchy for the ‘fertilizer’ tree manually using the Protégé-OWL editor. We started the conversion by creating classes for all fertilizer concepts. We decided not to introduce any arrays or household nodes into the ontology.
In a second step, we added the terms as labels to the classes. We retained the distinction between preferred and non-preferred terms by assigning them to the annotation properties “preferred term” and “non-preferred term” respectively and by introducing a subproperty of the default property “label”. We also copied the preferred term to the “label” annotation property for further modification (see step 6). Furthermore, we defined a “scope note” as a subproperty of the default “comment” annotation property and copied the scope notes for the concepts into this field. Terms and notes in languages other than English were omitted when entering the thesaurus terms as class labels.
The fertilizer branch of the AGROVOC thesaurus contained generic relationships only. We adopted these as is-a relationships in our fertilizer ontology, though they are subject to further validation in later steps.
Purpose
Since the thesaurus hierarchy may have been a mix of generic relationships and other kinds of relationships, there may be several disconnected portions of classes organized through is-a relationships after the naïve conversion in step 2. Leaving things in this state would often mean that the upper-most classes of these portions are all assigned to the most general class, i.e. to the class ‘Thing’ (as it is called in Protégé). It is the goal of the present step to organize these portions into a coherent is-a hierarchy and, thus, to connect them – directly or indirectly – by aligning them to a common top-level ontology.
Alignment to top-level classes is of considerable importance, for it is meant to establish interoperability with other ontologies aligned to the same top-level ontology. It also allows an economic specification of membership conditions of classes because subordinate classes inherit membership conditions and other formal specifications from superordinate classes (e.g., disjointedness from other classes) through is-a relationships. The alignment also establishes connections in the emerging ontology that are required for non-trivial automated reasoning.
Actions to be taken
The alignment comprises the following sequence of activities:
Choice of an existing top-level ontology.
Alignment of the separated portions of the emerging ontology to the top-level ontology.
(a) The choice of an existing top-level ontology is the most fundamental step. It involves getting an overview of existing top-level ontologies. Some of the commonly cited top-level ontologies include the
(b) Aligning disconnected portions of the emerging ontology to the chosen top-level ontology aims for a state where every such portion of hierarchically connected classes is either directly or indirectly (through the class hierarchy) connected to the top-level ontology without creating any hierarchical cycles. To achieve this situation, each top-level class of a thesaurus portion is asserted to be
Special consideration should be given to poly-hierarchies, i.e. to classes with more than one parent in the is-a hierarchy. As described in [65] and [50], poly-hierarchies do often come with conflicting membership conditions inherited from the various hierarchical paths. Thus, the existence of poly-hierarchies frequently indicates mistakes in the is-a hierarchy. Non-conflicting poly-hierarchies (also referred to as “multiple inheritance” or “diamond hierarchies”) [41] are addressed in step 7.
Application of the step to the fertilizer ontology
(a) While most top-level ontologies are domain-independent, there are also so-called upper-domain ontologies that describe general kinds of certain domains. Since fertilizers belong to the field of biochemistry, we decided to use BioTop [6], an upper-domain ontology for the life sciences. BioTop is particularly suited for our purposes because it provides (1) a fine-grained distinction of material entities, (2) a comprehensive set of formally defined relationships [75], and (3) bridges to the most common top-level ontologies in the life sciences, i.e. BFO and DOLCE. As a result, our re-engineered fragment of AGROVOC can be used in combination with either of these two top-level ontologies.
(b) Our naïve conversion in step 2 of our method resulted in a single portion of hierarchically connected classes. We decided to align the class ‘fertilizer’ to be a subclass of the BioTop class ‘compound of collective material entities’. A collective material entity is an aggregate of material entities that belong to the same type [41]. But as fertilizers are mixtures of different kinds of molecules, we decided to model them as compounds of collective material entities. In fact, the thesaurus was designed to describe real-life fertilizers that can be purchased as products, and these will never be the pure substances.
Discussion
While some authors have doubts about the utility of top-level ontologies [48], our experience with the alignment is that top-level ontologies have an important guiding function by asking us to make categorial distinctions and decisions. BioTop, in particular, presented itself as a bundle of helpful micro-theories about ontological problems, for example, the distinction between dependent and independent entities. Thus, the categorial distinctions built into BioTop reduced the burden of decision-making and prevented misclassifications.
Naturally, adopting a top-level ontology implies a commitment to the specific theories that underlie the distinctions of the categories and relations. Even without weighing the advantages and disadvantages of BioTop against potential alternatives (e.g., adopting DOLCE [21] or BFO [84] and the Relation Ontology [82]), our choice of BioTop added considerable semantic information to our fertilizer ontology. Obviously, domain ontologies that are aligned to the same top-level ontology can be more easily integrated and related to each other. Thus, top-level ontologies have the advantage of securing similar design standards across ontology projects.
Step 4: Identification of membership conditions
Purpose
The unique advantage of logic-based ontology languages like OWL is that they allow specifying the meaning of a class through
In order to clarify the meaning of the classes, we suggest beginning with an informal (natural language) specification of the classes with membership conditions. It is not only the basis for the formal specification of the membership conditions in step 5, but also provides a stronger foundation and permits revisions of the alignments described in step 3.
Actions to be taken
Up to four iteratively applied actions may be necessary in this step:
Identifying the meaning of the classes.
Collection of definitions in natural language.
Deciding for principles of including classes and resolving ambiguity.
Extraction or definition of membership conditions.
(a) The primary step in the definition of membership conditions is to understand to what the concepts in the thesaurus precisely refer. For this purpose, we exploit all the means that the re-engineered thesaurus offers to express the meaning of its concepts: assigned natural language terms, hierarchy, associative relations, qualifiers, scope notes, definitions, as well as the purpose or focus of the thesaurus overall.
(b) Consulting natural language definitions from encyclopedias and dictionaries is helpful in various ways: (i) A concept may lack a definition in the thesaurus, (ii) the defining phrase of a thesaurus definition may be ambiguous or otherwise difficult to understand, (iii) there might be need for additional information to identify additional membership conditions for classes, or (iv) one wants to uncover possible ambiguities of the term to be defined. These definitions should be as domain-specific as possible in order to have a qualitatively good basis for the definition of membership conditions. Any definition needs to be in line with the meaning of a thesaurus concept. Where there are no useful definitions, it may be necessary to consult domain experts to create explicit definitions.
(c) In a thesaurus, at least one natural language term has to be assigned to a concept. For human users, these labels are the most important indicator of the meaning of a concept. Unfortunately, terms in natural language are almost always ambiguous and have different meanings in different communities and cultural contexts. Moreover, the specific understanding of a term may change over time while the term remains assigned to the same concept in the thesaurus. Sometimes terms have multiple meanings even in a single community or discipline, especially if there are different schools of thought. For example, one will generally have an intuitive idea of what a concept labeled “water” represents. If one asks whether a class “water” shall include instances such as water ice cubes, water in a plasma aggregate state, waste water or salt water, there may be differing opinions.
If alternative interpretations of a concept are not disambiguated by the information collected in the previous two actions, the ontology (re-)engineer is forced to make a decision between
including several classes for a given term, each for every meaning,
selecting one out of several possible meanings, or
rejecting the inclusion of a given class entirely if it is not possible to identify any meaning that makes sense in the given context.
There are no objective criteria for such a decision. Decisions will rather depend on factors like a specific interest in using the ontology, a specific viewpoint of the ontology (re-)engineer or the relevant peer group. We recommend that such decisions be consistent and guided by transparent principles. Such principles could be, e.g., specifying material entities from a chemical point of view only or always including several classes in the case of conflicting definitions in the subject field.
(d) There exists little practical guidance for deciding whether or not (i) a membership condition is a valid
Sometimes the hierarchical whole-part relationships or the associative relationships in the thesaurus can be adopted as membership conditions. There may, however, be kinds of entities for which no formal membership condition can be stated. In such cases, definitions should be provided in natural language or First Order Logic, which are more expressive than OWL. If need be, examples or typical characteristics can be included as comments instead of a definition. Natural language definitions and other comments are, in any case, helpful for both ontology maintainer and user, even though they are not visible for automated reasoning programs.
Application of the step to the fertilizer ontology
(a) We initially attempted to understand the meaning of the fertilizer concepts in the AGROVOC thesaurus. While there are natural language terms (with or without qualifiers) and hierarchical and associative relationships for all of the concepts in AGROVOC, there are no definitions and just a few scope notes (which in AGROVOC have the character of definitions, but are rarely provided). This turned out to be a major issue for grasping the precise meaning of a concept and strongly impeded the extraction of membership conditions.
(b) We compensated for the lack of definitions in the AGROVOC thesaurus by encyclopedic and regulatory definitions. More specifically, we obtained definitions primarily from
(c) The definitions gathered in the previous action did not suffice to disambiguate the meaning of all concepts. Because we did neither have a specific application of the ontology in mind nor a sponsor with a specific interest, we had to decide on principles which we could follow. Since the majority of definitions adopted from
(d) The collected definitions allowed us to grasp the meaning of concepts more precisely and to extract membership conditions. We will discuss this in detail for the concept ‘fertilizer’ before summarizing our work for specific fertilizer types and concepts closely related to fertilizers.
Information revealing the meaning of ‘fertilizer’ in the AGROVOC thesaurus
Information revealing the meaning of ‘fertilizer’ in the AGROVOC thesaurus
The encyclopedia definition as well as the definition by the EC point to three conditions:
being a material
being involvable in (chemical) processes improving the plant nutrient level of soils
containing nutrients for plants.
With condition (a) we summarized the description “natural or manufactured material” in the encyclopedia definition. We disregarded the limitation to “a solid
The condition (b), as it is formulated, is not sufficient. There are fertilizers that are put directly onto plants, more specifically onto those parts of a plant that are not underground (i.e. not on the roots), so that the nutrients do not have to use the chemical reaction path via the soil. For this reason, we re-formulated the condition (b) to express what fertilizers have to be capable of:
being able to release plant nutrients.
We acknowledge that this condition may have to be further detailed, e.g., by a property of ‘being water soluble’ in the case of fertilizers applied on soils and a property of ‘being liquid’ in the case of fertilizers applied on plant leaves. This requires further detailed investigation, which we did not pursue.
The formulation of condition (c) is not satisfactory either. It is not enough for a material to contain
containing a significant mass proportion of plant nutrients.
A more precise way of expressing the modifier “significant” is to indicate a minimum amount of plant nutrients per weight unit. For this purpose, we analyzed the fertilizer-related regulation of the European Commission [19] and the official regulation in Germany, the
containing a minimal mass proportion of 0.168% plant nutrients.
Note, however, that these conditions are not jointly sufficient for being a fertilizer because conditions (a)–(c**) are true for many water-soluble substances with little amounts of any plant nutrient (e.g., nitrogen) that would not be considered fertilizers, e.g., various medicaments. For this reason, we will characterize fertilizers with necessary conditions only.
We faced similar problems to those we encountered with the class ‘fertilizer’ when identifying membership conditions for the classes ‘compound fertilizer’ and ‘micronutrient fertilizer’. Compound fertilizers need to contain a minimum mass proportion of 0.27% of two or more different
Fertilizer classes characterized by specific nutrients such as ‘calcium fertilizer’ or ‘nitrogen phosphorus fertilizer’ had the same pattern in terms of their analysis and generally refer to two membership conditions: containing a minimum mass proportion of the characterizing chemical element or molecule (e.g., 14.30% calcium or 4.50% nitrogen). These fertilizer types we could specify with necessary and sufficient conditions. An exception are the classes ‘ammonium fertilizer’, ‘nitrate fertilizer’, ‘rock phosphate’, ‘superphosphate’ and ‘nitrophosphate’. We could specify them with necessary conditions only because we lacked sources that indicate minimum mass proportions of molecules by which these fertilizer types are characterized.
There are at least two different interpretations for the term ‘organic fertilizer’. For the ‘social’ interpretation, the term refers to naturally occurring or naturally derived fertilizers; for the ‘scientific’ interpretation, it refers to fertilizers containing a significant mass proportion of the chemical element carbon. The social and the scientific interpretation are incompatible in the sense that they do not have the same extension in reality: unprocessed, naturally occurring mineral materials such as rock phosphate do not contain carbon – or if they do, then only in irrelevant amounts that are not type-defining. Since AGROVOC did not provide any disambiguating hint, we followed the previously established principle to use the scientific interpretation. However, we were not able to specify the carbon amount necessary for an organic fertilizer more precisely.
Specific subtypes of organic fertilizers (‘biofertilizer’, ‘compost’, ‘fish manure’, ‘green manure’ and ‘guano’) are generally characterized as the outcomes of specific processes with specific inputs. For example, fish manures are fish carcasses or parts of fish (offal) that have undergone the process of drying and crushing or powdering. Biofertilizers have the peculiar feature that in the very moment they are sold they are not fertilizers in the strict sense because biofertilizers are active microorganisms, bacteria or fungi that develop a symbiotic relationship with plants. At that time, they do not contain the relevant amount of plant nutrients, which conflicts with our membership conditions for the class ‘fertilizer’. It is only in the course of active processes that biofertilizers
The class ‘inorganic fertilizer’ could only be defined as not being an organic fertilizer, which negates the containment of carbon. Organomineral fertilizers contain significant mass proportions of organic fertilizers and inorganic fertilizers; again, it was not possible to state proportions precisely enough to specify necessary and sufficient membership conditions. Liquid fertilizers and liquid gas fertilizers are characterized by the specific aggregate state in which they are applied. Slow-release fertilizers are characterized by their disposition to release plant nutrients slowly, but there is no explicit and clear-cut maximal velocity for this. Fertilizer-pesticide combinations also contain significant amounts of pesticides.
Some of the concepts in the fertilizer branch have not been included as subclasses of ‘fertilizer’ in the ontology, namely ‘potting compost’ and ‘fertilizer combination’. Potting composts do not necessarily contain significant amounts of plant nutrients and fertilizer combinations are hard to be delineated precisely from other materials.
The tables in Appendix D provide a concise summary of all fertilizer-related classes, their membership conditions, as well as an indicator of whether the conditions are necessary ones only or whether they are also sufficient conditions expressing a definition for the class. It also contains further information about the results of the alignment process discussed in the next step.
Discussion
The identification of membership conditions, which underlies all subsequent steps, turned out to be the most difficult step in the re-engineering process. The first observation from our case study is that natural language definitions can facilitate the identification of necessary membership conditions. The second observation is that identifying membership conditions stimulates thinking about the precise meaning of thesaurus concepts and whether the class hierarchy of generic relationship that was adopted from the thesaurus is free of contradictions and consistently restricts the intension.
Some of the terms we dealt with can, in natural language, refer to things in quite different states. For example, “compost” may refer to the stuff in compost piles before and after its degradation through microorganisms and “biofertilizer” may refer to either (a) a product containing organisms or (b) the product once it has been applied to the field and has bound or solubilized plant nutrients. Only the second meaning fits our definition of ‘fertilizer’. In some cases, like ‘potting compost’, we could not think of any way in which the real-life entities could fulfill the membership conditions to be considered a fertilizer and rejected them as subclasses of ‘fertilizer’. Such issues raise the question whether we have to improve the membership conditions that we specified for ‘fertilizer’ or other subordinate classes. They also challenge modeling decisions that have to be made between conflicting definitions. For example, we had to choose between different interpretations of ‘organic fertilizer’ and limit the membership conditions of the class ‘plant micronutrient’ to the commonalities that we found in partially conflicting definitions. Overall, the identification of membership conditions confronts one with eventual ambiguities in the thesaurus or in language in general.
Sometimes it was also difficult to decide whether a given set of necessary membership conditions is sufficient to define a class. Decisions in this respect have consequences for the reasoning results. Reviewing the inferred class hierarchy as the outcome of the reasoning included in step 5 made us revise and rethink our membership conditions. For example, we wondered whether ‘compost’ is the outcome of instances of the same type of decomposition as ‘guano’ or ‘green manure’ or not. It exceeded the possibility of our study to research this question further and we assumed that there is a general decomposition process or a group of such processes.
While the collection of natural language definitions from existing sources can be pursued quite mechanically, one may end up with incoherent or conflicting results. For this and other reasons, precisely specifying the frequently encountered membership condition of containing “significant amounts” of certain plant nutrients turned out to be a complex endeavor. Therefore, membership conditions cannot be considered a “nice to have” feature of an ontology. Instead, the richness of membership conditions must be acknowledged as a key characteristic that describes the quality of an ontology and the intellectual effort that has been invested in the development of an ontology. In our case, the identification of membership conditions was tremendously time-intensive.
Purpose
Step 5 aims to express all membership conditions gathered in the previous step in the formal language OWL using a common set of formally well-defined relationships. For this purpose, relationships as well as domain-specific ontologies are adopted and eventually amended. Because eventually-adopted domain-specific ontologies are aligned to the same top-level ontology chosen in step 3, the end result of this step is a state where the emerging ontology, the top-level ontology as well as other adopted and eventually-amended domain-specific ontologies are integrated, that is, they are commonly densely interlinked through membership conditions. Based on the formal specification of the membership conditions, an automated reasoning program (or, for short, a reasoner) can interpret and check them in combination with the class hierarchy.
Actions to be taken
The formal specification of classes/membership conditions can be subdivided into the following activities that are likely to be applied iteratively:
Choice of formal relations.
Choice of existing domain-specific ontologies to be re-used (and, if necessary, their alignment to the top-level ontology).
“Amendment” of the external ontologies.
Formalization of the class specifications.
Adding natural language definitions and comments as class annotations.
Consistency check and plausibility check of the fully inferred class hierarchy.
(a) A fixed set of formally defined relationships (object properties in OWL) should be adopted, such as the Relation Ontology [82] or the relationships defined in BioTop [6]. The relationships are a necessary component to formally express the membership conditions that we collected in step 4. This avoids making mistakes in defining semantically precise and consistent relationships, but also enables the integration of ontologies. The adopted relationships should have a strong tie with the adopted top-level ontology because many relationships are, and should be, constrained in their domain and range with reference to a top-level ontology. Which relationships are necessitated depends on the domain at stake, but a useful set of formally defined relationships in ontologies will generally comprise spatial, mereological and temporal relations. Most fundamental is the subclass-of (is-a) relation, which is a pre-defined part of OWL.
(b) Classes from the top-level ontology (chosen in step 3) will be useful to state very general membership conditions. For example, we will want to state that portions of agricultural fertilizers are
Terms can be imported from several ontologies, eventually using the support of tools like OntoFox [96] or MIREOT [14]. When one term is covered by several domain ontologies, the ontology that fits best to our method should be preferred. Particular advantages are formal membership conditions, alignment to the same top-level ontology and shared types of relationships. Otherwise, the domain ontologies that are adopted in this step need to be aligned to the same top-level ontology that was adopted in step 3 – in the same way the classes derived from the thesaurus have been aligned to it.
(c) If needed classes and relationships are not found in existing ontologies, there are two options: either ontology developers can suggest these classes to be added to the respective domain ontologies or the classes are created within the emerging ontology itself. Newly created classes should, of course, not duplicate what is already contained in the imported ontologies. The introduction of new classes is, of course, unavoidable if a new domain is to be described. However, introducing new types of relationships should be avoided and considered a last resort because idiosyncratic relationships are a major obstacle for interoperability. Proliferating relationships in OWL can also severely impede the performance of the reasoning algorithms. In many cases, however, the urge to introduce new relationships is due to an insufficient ontological analysis. E.g., the relationship ‘digests’ need not be introduced as a new basic type if there are already generic relations like ‘agent of’ and ‘patient of’, by means of which ‘digest’ can be expressed as
Newly introduced classes also need to find a place in the class hierarchy. They should be subsumed under a class in the top-level ontology or under a class in one of the (aligned) domain-specific ontologies. The assignment should be done with care because the aligned class will inherit all membership conditions of the superordinate classes. In cases of doubt, the class in question should be subsumed under a more general class.
When introducing new classes, it would, on the one hand, be desirable to fully specify them with membership conditions to enhance consistency checking and infer implicit class subsumptions. On the other hand, this is as time-consuming as re-engineering the thesaurus concepts. Moreover, the membership conditions for new classes will, in turn, refer to still other classes and so forth. We, therefore, recommend only specifying membership conditions for classes that are at the heart of the modeled domain and leaving fringe classes to specialists in these other domains. Nevertheless, in an ideal world, the membership conditions of all classes both within a single ontology and across different ontologies form a complex and interdependent network.
(d) The formal specification of classes is realized by adding the necessary membership conditions identified in step 4 as so-called anonymous superclasses using the subclass axiom. If only necessary membership conditions are stated, a class is called a
When formalizing membership conditions, one has to respect the formal properties of the relationships, such as their domain, range, transitivity, disjointness, inverse implication or reflexivity [53]. In cases where hierarchical whole-part relationships or associative relationships from the thesaurus have been adopted into the ontology as membership conditions, they will normally have to be refined at this stage to be matched to semantically precise formal relationships.
The formal specification of classes by membership conditions is also the step where guidelines for the correct and complete use of OWL [8,33,66,77] or ontology design patterns (ODPs) for standardized ontology engineering or circumventing expressivity problems of OWL [61] should be applied. Following the guidelines may also imply defining additional axioms such as the disjointness of classes or the transitivity of relationships.
(e) Natural language definitions should be added at least when no formal specification is possible. Comments may, e.g., detail membership conditions that could not be formalized. It is important to bear in mind that information in natural language cannot be processed by automated reasoning programs.
(f) Automated reasoning allows checking the consistency of the emerging ontology on a regular basis during re-engineering. A reasoner can automatically infer new subsumptions, equivalences or other axioms if they are entailed logically by the explicitly asserted ontology.
The formalized membership conditions and the densely linked ontologies resulting from the previous activities in step 5 lay the ground based on which a reasoner can effectively check the consistency of the emerging ontology. Using a reasoner that is appropriate for the chosen OWL-DL semantics, possible contradictions can be tracked and removed.
Using a reasoner also allows distinguishing between the asserted ontology and the inferred ontology. The asserted ontology comprises the explicitly asserted statements only, while the inferred ontology also comprises all inferred statements. When speaking about “the” ontology, the reference is generally to the asserted ontology in this paper.
Application of the step to the fertilizer ontology
(a) Our need for formally defined relationships was entirely satisfied with the relationships contained in BioTop [75], the upper-domain ontology adopted in step 3. This was fortunate because the membership conditions in BioTop are formally specified and refer to the classes defined in BioTop. A reasoner can, thus, use the relationship specifications to check the consistency of our emerging ontology.
(b) Since our formal specifications frequently refer to chemical entities, we adopted ChEBI (=Chemical Entities of Biological Interest) [17], an ontology from the chemistry domain, the major feature of which is the completeness and hierarchical organization of the chemical elements, molecules and other entities that it models. While it was helpful that ChEBI is available in OWL format, a disadvantage of ChEBI was that it does not give explicit membership conditions (as of March 2012).
Since the range of molecules is enormous, ChEBI is a very large and complex ontology. In order to keep our ontology tractable for the automated reasoner, we extracted a fragment of less than 10% of ChEBI’s original size that contains the chemical entities that are relevant for us. The slimming down was challenging, since ChEBI makes intensive use of multi-hierarchies and there was a high risk of (unintentionally) deleting branches that were to be retained because they are connected with other relevant paths at a lower level. Nevertheless, this may be considered a weakness of our chosen ontology editor, Protégé, because classes from a hierarchical path should not be deleted without user interventions if they also belong to other hierarchical paths.
Though we may have been able to adopt more ontologies, we did not search for other ontologies that could have been suitable. Searching and evaluating ontologies can be a time-consuming effort. Because our main interest was to illustrate the process of choosing and aligning external ontologies, we limited ourselves to ChEBI.
As BioTop already provides bridge classes for this purpose [6], it was not difficult to align ChEBI classes to BioTop, our chosen top-level ontology. The first three entries in Table 2 show the classes that were aligned (implicitly aligning the subordinate classes) indicating the alignment axioms in the second column. We also amended some membership conditions for specific classes in ChEBI. The amended classes are listed in the last three rows of Table 2. The respective entries in the 2nd column indicate the newly-asserted membership conditions.
Amendments to imported ChEBI classes
Amendments to imported ChEBI classes
Newly added classes
(c) The specifications of the various fertilizer types required the introduction of the classes listed in Table 3. Only the classes that are central to the fertilizer domain were specified with membership conditions (as done for fertilizers in step 4). A first group of these classes are the ones listed in the first row of entries in Table 3 (‘plant nutrient’, etc.). The members of these classes are characterized by their ability to be picked up as nutrients by a plant. They differ in terms of the chemical elements they comprise and group the chemical elements by the quantity in which they are required by plant nutrients.
We also introduced classes for processes and dispositions [69]. E.g., we introduced a class ‘plant nutrient uptake disposition’ comprising all instances of the disposition to
We did not introduce new relationships in the current development step because BioTop already contained all relationships needed – with the exception of some relationships that were tentatively used in sub-activity (d) to represent certain features as data properties.
Table 3 indicates the ontology (BioTop or ChEBI) to which the newly introduced classes have been aligned. While not listing the precise alignments here, we always chose the most specific class in the ontology to which we aligned. Nevertheless, we only stated alignments about which we were confident; for this reason, we sometimes aligned to quite general classes.
(d) The natural language formulations of the membership conditions are concisely summarized in Appendix D. They are already formulated with an eye on the relationships provided by BioTop, so that they translate well into OWL class expressions. Only some classes like ‘fish manure’ and ‘guano’ have complex membership conditions and, thus, also complex formal expressions. The phrase ‘being a’, as used in the natural language formulations of membership conditions in previous steps, translates into the OWL axiom ‘SubClassOf’. In the case of classes that are defined with necessary and sufficient conditions, the ‘EquivalentTo equivalentTo’ axiom is used; the subclass conditions become part of the class expression that is asserted to be equivalent.
The formal specification of the membership conditions to contain a minimal proportion of plant nutrients turned out to be problematic because the expressivity of OWL2 does not provide a straightforward way to express proportions. Simply adding annotations is easy to implement, but the quantification is not machine-readable then. Using the minimum modifier for a relationship (the ObjectMinCardinality axiom), e.g.,
Automated reasoning algorithms cannot recognize the intended semantic difference in the relationship label and would still interpret the modifier as a condition in a countable sense. As a workaround, we tentatively resorted to Data properties in OWL (the DataMinCardinality axiom), e.g.,
A similar problem for working with relationships in OWL and Protégé is that they cannot be combined with quantities in percentages, but only with cardinalities given in natural numbers. This problem can be solved by scaling the values and expressing them as parts per million (abbreviated ppm) with respect to the mass proportion as was done in the examples above. The minimum plant nutrient proportions in percentages were transferred into a parts per million (ppm) measure, i.e. a value of 1680 refers to a share of 1680 millionths of the number of particles (=0.168%). All measures, including the ppm measure, refer to mass proportions (as opposed to a volume proportion).
(e) We refrained from adding the natural language definitions identified in step 4 as class annotations because we often modified these definitions with the goal of expressing more precise membership conditions.
(f) Based on the formal specification of the aligned ontology with its membership conditions for the various classes, we were able to check the ontology for consistency in a non-trivial way and infer subsumptions in the class hierarchy that have not already been asserted. For this purpose we used the reasoner Hermit [30], which is available as an embedded plug-in for the Protégé-OWL editor.
The reasoning process revealed various initial modeling mistakes that we subsequently resolved. Many mistakes were technical ones, similar to the typical mistakes described in guidelines [8,33,66]. Other mistakes revealed an insufficient understanding of the adopted (imported) ontologies and relationships. The reasoning results also made us wonder whether we made mistakes in asserting is-a relationships and membership conditions. For example, we felt uncertain about the correctness of the inferred subsumptions of ‘NPK fertilizer’ and ‘green manure’ indicated in Fig. 8, but could not finally find arguments against them. In the case of the NPK fertilizers, this uneasiness is, of course, due to the ambiguity connected to superordinate terms like ‘Compound fertilizers’.

Inferred fertilizer class hierarchy after alignment.
It also turned out that there are considerable problems with reasoning over the data properties that we introduced as described above. When defining values for the data properties that are greater than 1000, Hermit aborted the initialization of the reasoning process with error messages. Moreover, the computing time increased tremendously when using data properties in the fertilizer class definitions. While the first problem could have been avoided by indicating the mass proportions in per mill (thousandths) instead of millionths and rounding them, attempts to improve the performance by dissolving the data property hierarchy were not successful.
It is outside the scope of this paper to determine whether the problem with the data properties is a general one or a particular problem of the Hermit reasoner. In the end, the data properties had to be removed from the class specifications to be able to use the reasoner. In consequence, the concerned class specifications became primitive ones with insufficient membership conditions. This, in turn, results in the loss of desirable reasoning inferences since new class subsumptions can only be inferred under classes defined with necessary and sufficient conditions.
We checked manually whether valid subsumptions can be inferred based on the minimum plant nutrient levels. For this purpose, we left out all conditions involving data properties. This would normally imply that the defined classes become primitive classes, but we kept their status as defined classes, which expectedly led to wrongly inferred subsumptions. We addressed the problem by manually sorting out wrongly inferred subsumptions, which are struck through in Fig. 8. After the critical review, we had left only very few (correctly inferred) new is-a relationships that were not stated in the previously asserted class hierarchy (indicated in bold font in Fig. 8). The NPK fertilizers were subsumed deeper in the hierarchy under nitrogen phosphorus fertilizer, nitrogen potassium fertilizer as well as under phosphorus potassium fertilizer, which is a plausible result.
The subsumption of ‘green manure fertilizer’ under ‘compost’ appears more debatable. It results from the assumption that plants used as green manure undergo the same degradation process as other material that is usually referred to as “compost”.
Discussion
BioTop provided a comprehensive set of part-of and other relationships whose formal specifications refer to the very same top-level categories that we adopted for our alignment in step 3. The relationships satisfied our modeling needs entirely and, again, took decisions from us and potentially avoided wrong conclusions and modeling mistakes.
The problems faced with the formal specification of classes through membership conditions demonstrated clearly that the expressivity of a formal language can impede the formally correct specification of membership conditions. One can even be forced to remove membership conditions that have been identified earlier. In consequence, classes may lose their quality of being specified through necessary and sufficient membership conditions (being defined classes).
Purpose
In this step, a naming convention is chosen and the labels of classes and other entities are adjusted accordingly. This improves both readability and intelligibility of the ontology for ontology developers and users. Further, one can observe that the labels in ontologies are meant to express the context-free meaning (intension) of a class as precise as possible. While being highly recommended for maintenance and other possible usage reasons, the labeling does not change the semantics of a class for computers.
Actions to be taken
The adjustment involves two sequential steps:
Choice of a labeling convention.
Adjusting the class labels.
Currently, there are no universally accepted conventions on how ontology classes should be labeled [87]. Nevertheless, common practices have been summarized [74] and it ought to be checked whether similar conventions exist in one’s field. For example, it appears to be generally accepted that names for ontology classes should be in their singular form. In any case, care should be taken to apply a single naming style consistently for all classes of the ontology.
It should be noted that the labeling described here does not concern the unique identifier (URI/IRI) of the classes or properties as specified in RFC 3986 [67]. Synonym sets from the source thesaurus could be transferred to the ontology using the labeling provisions of the respective ontology language, but this does not belong to the ontological content in the strict sense, though the integration of synonyms may be useful for some applications of the ontology.
Application of the step to the fertilizer ontology
We adopted common conventions in biomedical ontologies for the class labeling summarized by Schober et al. [74]. The application of the conventions often changed the first letters from upper case to lower case and also the plural forms, which are often used in thesauri, have been changed into the singular form of the nouns. The abbreviation ‘NPK’ (standing for nitrogen, phosphorus and potassium) is an exception and we left it unchanged because lower case letters would make the class label confusing. For example, the thesaurus concept with the preferred term ‘Fertilizers’ was labeled ‘fertilizer’ when modeled as a class in the ontology.
The identified membership conditions motivated us to change the formulations of some class labels. All fertilizer types were re-labeled to begin with “portion of” to emphasize that we deal with amounts of materials, not with countable objects. The word “fertilizer” was added to the class labels “rock phosphate”, “superphosphate” and “nitrophosphate” to indicate their use as fertilizers. The ending “fertilizer” was also added to the labels of various subclasses of the ‘organic fertilizer’ class: ‘compost’, ‘fish manure’, ‘green manure’ and ‘guano’. In these cases, the ending “fertilizer” emphasizes that it is not the bare organic material put on a compost heap, the unprocessed fish manure, the plant biomass called ‘green manure’, or the excrements of certain animals themselves that act as the fertilizer, but rather only the outcome of specific processes to which the previously mentioned materials are input. In the case of ‘fish manure’, we adopted the commonly used term “fish fertilizer”. Appendix E provides a complete overview of the labeling changes.
Step 7: Dissolving poly-hierarchies
Purpose
In order to achieve an ontology that can easily be maintained, an asserted ontology should consist of a single is-a hierarchy. Thus, poly-hierarchies should be dissolved. Recall that we removed semantically incorrect poly-hierarchies that lead to contradictory membership conditions already in step 3. Dissolving semantically correct poly-hierarchies is an optional step insofar as it changes the structure but not the semantics of the ontology. Mono-hierarchies are easier to implement and maintain, but sometimes it might be intellectually challenging to decide which is-a relation is to be dissolved.
Actions to be taken
Assume that a target class X has two or more direct superclasses. In order to dissolve the poly-hierarchy, we have to solve two problems: Which class is to be kept as a direct superclass? And how do we retain the information given by the subclass-of relationship connecting X to the other superclasses? A clear case for discarding direct superclasses is when they are specified with necessary and sufficient membership conditions that can also be directly applied to X. In general, the strategy for retaining the information is to (a) add the restrictions of the classes along the dissolved class paths to the specification of the target class X and (b) remove any subsumption of the target class under classes of the dissolved class path from the specification of the target class X. This way, the poly-hierarchy is dissolved in the asserted ontology, but it will be restored by generating the inferred ontology through automated reasoning. Dissolving poly-hierarchies in such a way is one aspect of the
Application of the step to the fertilizer ontology
In the ontology that we have modeled, there are only two classes that are poly-hierarchically subsumed under several classes: ‘liquid fertilizer’ and ‘liquid gas fertilizer’. Since dissolving the poly-hierarchy is to be handled in the same way in these two cases, we will only discuss the poly-hierarchy of the class ‘portion of liquid fertilizer’ here, illustrated in Fig. 9.

Poly-hierarchy for ‘liquid fertilizer’ (the dotted arrow indicates the is-a relationship dissolved by us).
We decided to resolve the poly-hierarchy by making ‘liquid fertilizer’ primarily belong to the class ‘fertilizer’. Thus, we replaced the hierarchical subsumption under ‘portion of heterogenous liquid’ (indicated through a dotted arrow in Fig. 9) by adding a membership condition to the specification of the class ‘liquid fertilizer’, namely
Of course, membership conditions that are already part of the ‘liquid fertilizer’ specification or its superclasses along the retained class path do not have to be added again to the specification. The formal specification of the class changes as follows:
In the previous section, we have discussed the various steps of our re-engineering method. They are concisely summarized in Appendix B, including all sub-activities. In this section, we will reflect on the method as a whole, in particular the benefit and effort of applying it, its generality and limitations.
The overarching motivation for the steps in our method was to re-engineer thesauri into semantically adequate ontologies that (a) make full and correct use of the semantic expressivity of OWL, (b) facilitate the integration of the ontologies with other ontologies following the same development principles, and (c) are consistent and provide reasoning results that correspond to the represented domain. Our method achieves this goal by addressing the following requirements:
The ontology is described in a well-defined syntax and adheres to the description logic semantics (steps 2 and 5).
The meaning of classes is expressed by means of membership conditions (steps 4 and 5).
Newly created as well as imported classes are aligned to a top-level ontology; a common set of formal relationships is used (steps 3 and 5).
The ontology is logically consistent and inferences from the asserted ontology are plausible (step 5).
The ontology has a rigorous is-a hierarchy in which the intension of classes (the specification of the classes) becomes more restrictive at every subordinate level (steps 3–5).
Natural language terms either reflect the meaning of a class as precisely as possible
Requirement (5) is probably the least obvious; it is put into effect by the adoption of the generic relationships in a thesaurus as an is-a hierarchy and its gradual refinement by basing it on membership conditions (step 5), adopting high-level membership conditions through the alignment to a top-level ontology (step 3) and, finally, checking the is-a hierarchy for its consistency (step 5).
The overall benefits of a semantically adequate ontology, as opposed to a thesaurus, need to be further investigated. The rigorous is-a hierarchy makes ontologies especially suited for automated processing, like automatic classifications and clustering. Another particular usage of an ontology is to assure interoperability among databases. Moreover, it might also be easier to maintain an ontology than a thesaurus. The comparative performance of thesauri and ontologies in natural language processing or information retrieval may depend on the specific application scenario. Because of the many structural changes and the removal of many relationships from a thesaurus, an ontology cannot always be assumed to be better than a thesaurus.
The effort of applying our re-engineering method was considerable. By far the biggest effort lies in specifying the intension of the respective concepts/classes with necessary and possibly sufficient membership conditions (steps 4 and 5). Determining minimum proportions of plant nutrients in fertilizers and formalizing these in OWL has literally become a study in its own right. It also took considerable time to become adjusted to the framework of BioTop and the ChEBI ontology to express the membership conditions using these ontologies (step 5). For example, it was not clear to us whether we should model fertilizers as having a
The effort of thesaurus re-engineering and ontology engineering in general can be reduced under certain circumstances:
The effort to prepare and check the thesaurus (step 1) depends on the quality of the existing thesaurus. Ideally, it can be skipped entirely.
The involvement of domain experts can save time during the identification of membership conditions (step 4).
Experience with the chosen top-level ontology and imported ontologies reduces the alignment effort (step 3) as well as the effort to formalize the membership conditions (step 5).
Experience in modeling with OWL reduces the effort to formally specify membership conditions correctly (step 5).
Optional steps and sub-activities such as adjusting entity labels (step 6) or dissolving poly-hierarchies (step 7) may be omitted (see Appendix B for an overview of optional steps).
Steps 2, 6, and 7 may be at least partially automatable while the other steps appear to have no automation potential at the current state of the art without substantial quality losses.
During re-engineering, we faced various difficult decisions such as which top-level ontology to adopt and when to adopt classes from other domain-specific ontologies (as opposed to creating new classes), and which sources to choose. These decisions could be facilitated by way of standardization as suggested by the strategy of the OBO Foundry to collect orthogonal reference ontologies for well-defined domains [58]. Another problem was to decide for which of the classes we should give detailed membership conditions (as opposed to simply subsuming them under some existing classes). In practice, this question will often have to be answered with an eye on the prospective use of the ontology.
The alignment to a top-level ontology (step 3) as well as defining and formalizing membership conditions (steps 4 and 5) are the key features of our method because they express the meaning (the semantics) of entities and improve the is-a hierarchy. For the most part, it is the application of these three steps in ontology engineering that makes OWL-DL-compliant ontologies differ from thesauri and distinguish our method from the re-engineering approaches mentioned in Section 2.2. We believe that these steps are essential for a reliable integration of classes and class hierarchies on the Semantic Web.
Re-engineering a thesaurus into an ontology based on our method can change the structure of the thesaurus considerably. Not only the hierarchy of the thesaurus may be changed fundamentally, but also many relationships of the thesaurus may have to be rejected. These and further differences between thesauri and OWL-DL ontologies are described in greater detail in Kless et al. [45]. Differences and similarities between thesauri and ontologies have already been theoretically anticipated in a prior comparative study of relata and relationships in thesauri and ontologies [44]. However, we did not face all these differences in this case study. For example, there was no need to set apart generic relationships (is-a relations) from other types of hierarchical thesaurus relationships. The method describes the need to address such issues, but had no opportunity to collect practical experience during the re-engineering of the fertilizer branch.
Many of the steps that we have adopted in our re-engineering method have been successfully applied in ontology engineering in the life sciences. It is an open question whether one faces bigger problems when applying our method in other domains such as the social sciences. For example, it is not at all trivial to align phenomena like ‘freedom’ or ‘success’ to a top-level class. This does not question the applicability of our re-engineering method as such, but rather points to ontological problems that arise when dealing with such phenomena.
Our case study used a thesaurus as a starting point for re-engineering and could, thus, rest on a given number of existing concepts, terms and relationships. Nevertheless, a great part of the method is not specific to thesauri, but could be seen as a method of ontology engineering and re-engineering
Conclusions
We presented a method with seven steps and numerous sub-activities for re-engineering thesauri into semantically adequate ontologies using the description-logic-based Web Ontology Language (OWL). We motivated each step in our method and gave a detailed explanation of the activities for its realization. Further, we demonstrated the applicability of the method by applying it to a portion of the AGROVOC thesaurus that is concerned with agricultural fertilizers.
The method is applicable to all thesauri that follow the basic structure laid out in the current ISO standard for thesauri and its predecessors. It differs from previous re-engineering approaches by making full use of OWL to specify the meaning of concepts. The major strength of this method lies in producing ontologies that are truthful representations of things in reality and can be integrated consistently with each other. These benefits are achieved by imposing a rigid is-a hierarchy and removing relationships and other content from thesauri which are not appropriate for a formal ontology.
Footnotes
Acknowledgements
The research of D.K. has been enabled through the David Hay Memorial Fund and the PORES travel and research grant provided by University of Melbourne, with special thanks to Edmund Kazmierczak and Simon Milton for their support in setting up the research visit. The work of L.J. has been supported by the German Research Foundation (DFG) under the auspices of the GoodOD project.
The authors very much thank Jens Wiebensohn from the Agricultural Science department at the University of Rostock for sharing his knowledge about fertilizers.
Source of the steps for the re-engineering method
In Section 3 we detailed that the steps in our re-engineering method are the results of the practical application of a naïve re-engineering method. The steps in the naïve re-engineering method stem from (a) a theoretical comparison of thesauri and ontologies [44], (b) an analysis of ontology re-engineering literature (Hahn [27], Wroe et al. [95], and Guarino and Welty [26]) and (c) an analysis of general ontology-engineering literature. These steps are summarized in Table 4, which also lists the respective authors and publications. We included only steps that are content-focused as well as precise and actionable.
General steps for the development of qualitatively good ontologies Hahn advises to remove cycles in the is-a hierarchy, which is part of the establishment of a (correct) is-a hierarchy.
General ontology engineering step
Reference backing the step
a)
Distinction of intensional and extensional entities (universals and particulars in ontological realism)
Smith and Ceusters [81], OBO Foundry principle under discussion, Borgida and Brachman [8]
b)
Establishment of an is-a hierarchy
Noy and McGuinness [56], Borgida and Brachman [8], Staab et al. [85], Hahn [27]
A
c)
Alignment to a top-level ontology
Uschold and King [90], Smith and Ceusters [81], OBO Foundry principle under discussion, Jansen and Schulz [38], Hahn [27]
d)
Application of the OntoClean method
Borgida and Brachman [8], Guarino and Welty [26]
e)
Establishment of a single inheritance hierarchy
Rector [65], Smith and Ceusters [81], OBO Foundry principle under discussion
f)
Adoption of a well-founded set of ontological relationships that harmonize with the chosen top-level ontology
Borgida and Brachman [8], Accepted OBO Foundry principle
g)
Definition of a rich set of membership conditions as a basis for the ontology’s hierarchy (the is-a relationships)
García et al. [22], Noy and McGuinness [56], Borgida and Brachman [8], Staab et al. [85], Hahn [27], Wroe et al. [95]
h)
(Correct) Codification in a formal representation language
Uschold and King [90], Fernández-López et al. [20], García et al. [22], Borgida and Brachman [8], Staab et al. [85], Rector et al. [66], Accepted OBO Foundry principle, Hahn [27], Wroe et al. [95]
i)
Provision of metadata for all classes and relationships such as textual definitions and labels
Accepted OBO Foundry principle, Jansen and Schulz [38]
j)
Adhering to naming convention for the labels
Accepted OBO Foundry principle, Jansen and Schulz [38]
k)
Delineation from existing ontologies
Smith and Ceusters [81], Accepted OBO Foundry principle
Figure 10 shows the steps of the naïve re-engineering method on the left hand side and relates them to the steps in the final re-engineering method that we introduced in this paper. The relationships indicate that a step in the naïve re-engineering method is either equivalent to the indicated step in the final re-engineering method or that it is a direct or indirect part of the step in the final re-engineering method. It should be noted that Fig. 10 does not show the various sub-activities of the steps in the final re-engineering method (summarized in Appendix B).
Relation between steps in the naïve re-engineering method and the final re-engineering method.
We did not adopt the OntoClean method [26] from the naïve re-engineering into our final re-engineering method because we did not detect any errors in the is-a hierarchy when applying OntoClean. The reason for not gaining a benefit from OntoClean may be that improving the is-a hierarchy is an implicit result of steps 3, 4 and 5 of our method. In particular, the alignment to a top-level ontology in step 3 may affect the is-a hierarchy in a way that is comparable to an application of the OntoClean method. Nevertheless, the degree of overlap depends on the top-level ontology, but also on a correct application of the top-level ontology and its corresponding set of relationships. It requires further investigation to determine, whether the effects of applying OntoClean are the same as applying our method, or whether OntoClean should be added as an additional step to our method.
Overview of the steps and sub-activities of the re-engineering method
The re-engineering method explained in Section 4 consists of various steps and sub-activities. We provide an overview of the sub-activities for every step here.
Preparatory refinement and checking of the thesaurus∗ Naïve conversion Choice or development of conversion tools∗ Conversion of the thesaurus into the formal language Alignment to a top-level ontology Choice of an existing top-level ontology Alignment of the separated portions of the emerging ontology to the top-level ontology Identification of membership conditions (in natural language) Formal specification of membership conditions Choice of existing domain-specific ontologies and their alignment to the top-level ontology∗ Choice of a set of formal relations “Amendment” of the external ontologies∗ Formalization of the class specifications Adding natural language definitions and comments as class annotations∗ Optional step, the usefulness of which depends on the characteristics of the thesaurus (steps 1, 4.b/c), its storage format and storage system (step 2.a), the availability and quality of existing ontologies (step 5.a/c/e), the intended usage of the ontology (steps 6 and 7), and personal preference in general. Consistency check and inference of class hierarchy Adjustment of spelling, punctuation and other aspects of entity labels∗ Dissolving poly-hierarchies∗
Defined arrays of fertilizer concepts in the AGROVOC thesaurus
This overview presents the arrays that we defined in the course of preparing and checking fertilizer concepts in the AGROVOC thesaurus during the case study. The respective step was discussed in Section 4.1. The node labels that indicate the arrays are highlighted in italic font.
Membership conditions after alignment (step 4)
This appendix summarizes the membership conditions for fertilizer concepts and fertilizer-related concepts in the AGROVOC thesaurus. The membership conditions are fundamentally based on their extraction from natural language definitions as described in Section 4.3. Nevertheless, the status presented here was only achieved after the alignments of the fertilizer classes and adopted ontologies to a top-level ontology. This was elaborated in Section 4.3.
In addition, Table 5 indicates the plant nutrient levels extracted from the fertilizer regulation by the European Commission [19] and the German fertilizer regulation “Düngemittelverordnung” [18].
In Tables 6–7, we use natural language to describe the membership conditions. The wording is as close as possible to the class names, relationship names and property names of the imported ontologies. We have split the complex conditions of some classes (‘Fish manures’ and ‘Guano’) into several dependent conditions using some auxiliary classes (indicated in italic font) in order to improve the readability. These classes do not appear in the formal specification where they are simply nested into each other, that is, the name of the auxiliary classes is replaced by their respective definitions.
Necessary parts of element- or molecule-focused fertilizers in relation to ChEBI Classification as primitive class specified with necessary conditions or as defined class specified with necessary and sufficient conditions. ‘Apatite’ is subsumed under ‘collective material entity’ in ChEBI, so that no reference to the granular part is necessary. Membership conditions of fertilizer classes Classification as primitive class specified with necessary conditions or as defined class specified with necessary and sufficient conditions. This condition has been amended in line with the ‘fertilizer’ specification. Membership conditions of classes closely related to agricultural fertilizers Classification as primitive class specified with necessary conditions or as defined class specified with necessary and sufficient conditions.
Class/fertilizer type
Granular part (as defined in ChEBI)
pm of atom
Necessary/sufficient
A
Calcium fertilizers
‘calcium molecular entity’
143000 Ca
necessary and sufficient
NPK fertilizers
‘phosphorus molecular entity’ and ‘potassium molecular entity’ and ‘nitrogen molecular entity’
654 P
necessary and sufficient
Nitrogen phosphorus fertilizers
‘phosphorus molecular entity’ and ‘nitrogen molecular entity’
654 K
necessary and sufficient
Nitrophosphates
‘calcium hydrogen phosphate’ and ‘ammonium nitrate’ and ‘diammonium hydrogen phosphate’
n/a
necessary
Nitrogen potassium fertilizers
‘potassium molecular entity’ and ‘nitrogen molecular entity’
2075 K
necessary and sufficient
Phosphorus potassium fertilizers
‘phosphorus molecular entity’ and ‘potassium molecular entity’
654 P
necessary and sufficient
Magnesium fertilizers
‘magnesium molecular entity’
84600 Mg
necessary and sufficient
Phosphate fertilizers
‘phosphorus molecular entity’
30956 P
necessary and sufficient
Rock phosphate
‘apatite’
B
n/a
necessary
Superphosphate
‘calcium sulfate’ and ‘calcium bis(dihydrogenphosphate)’
n/a
necessary
Potash fertilizers
‘potassium molecular entity’
58100 K
necessary and sufficient
Sulphur fertilizers
‘sulfur molecular entity’
55000 S
necessary and sufficient
Nitrogen fertilizers
‘nitrogen molecular entity’
45000 N
necessary and sufficient
Ammonium fertilizers
‘ammonium compound’
n/a
necessary
Nitrate fertilizers
‘nitrates’
n/a
necessary
Class/fertilizer type
Membership conditions
Necessary/sufficient
A
Fertilizers
being a compound of collective material entities
necessary
fertilizer types listed in Table 5, e.g., calcium fertilizer
being a fertilizer
see Table 5
Compound fertilizers
being a fertilizer
necessary
Micronutrient fertilizers
being a fertilizer
necessary
Organic fertilizers
being a fertilizer
necessary
Biofertilizers
being a fertilizer
necessary and sufficient
Composts
being a fertilizer
necessary and sufficient
Fish manures
being a fertilizer
necessary and sufficient
Green manures
being a fertilizer
necessary
Guano
being a fertilizer
necessary
Inorganic fertilizers
being a fertilizer
necessary
Organomineral fertilizers
being a fertilizer
necessary
Liquid fertilizers
being a fertilizer
necessary and sufficient
Liquid gas fertilizers
being a fertilizer
necessary and sufficient
Slow release fertilizers
being a fertilizer
necessary and sufficient
fertilizer pesticide combinations
being a compound of collective material entities
B
necessary
Class
Membership conditions
Necessary/ sufficient
A
plant nutrient
(a)
being a molecular entity
necessary and sufficient
(b)
being either a primary plant nutrient or secondary plant nutrient or plant micronutrient
(c)
bearing the disposition to be picked up by plants
plant micronutrient
(a)
being a plant nutrient
necessary
(b)
being a molecule that contains either boron or copper or iron or manganese or molybdenum or zinc
primary plant nutrient
(a)
being a plant nutrient
necessary and sufficient
(b)
being a molecule that contains either phosphorus or potassium or nitrogen
secondary plant nutrient
(a)
being a plant nutrient
necessary
(b)
being a molecule that contains either calcium or magnesium or sulfur
plant nutrient disposition
(a)
being a disposition
necessary
(b)
being realizable by a plant nutrient uptake process
plant nutrient uptake process
(a)
being a kind of bio molecular process, the locus of which is a plant and the participants in the process are plant nutrients
necessary
(b)
realizing some disposition of being a plant nutrient
plant nutrient release disposition
(a)
being a disposition
necessary
(b)
being realizable by a plant nutrient release process
plant nutrient release process
(a)
being a process
necessary
(b)
realizing some disposition to release plant nutrients
plant nutrient slow release disposition
(a)
being a plant nutrient release disposition
necessary
Adjustments of labels (step 6)
Table 8 presents the adjustments to the class labels as described in Section 4.6.
Comparison of class labels with former preferred terms in the AGROVOC thesaurus
Preferred term in the AGROVOC thesaurus
Class label in the fertilizer ontology
Fertilizers
portion of fertilizer
Nitrogen fertilizers
portion of nitrogen fertilizer
ammonium fertilizers
portion of ammonium fertilizer
nitrate fertilizers
portion of nitrate fertilizer
Phosphate fertilizers
portion of phosphate fertilizer
Rock phosphate
portion of rock phosphate fertilizer
Superphosphate
portion of superphosphate fertilizer
Potash fertilizers
portion of potash fertilizer
Calcium fertilizers
portion of calcium fertilizer
Magnesium fertilizers
portion of magnesium fertilizer
Sulphur fertilizers
portion of sulphur fertilizer
Compound fertilizers
portion of compound fertilizer
NPK fertilizers
portion of NPK fertilizer
Nitrogen phosphorus fertilizers
portion of nitrogen phosphorus fertilizer
Nitrophosphates
portion of nitrophosphate fertilizer
Nitrogen potassium fertilizers
portion of nitrogen potassium fertilizer
Phosphorus potassium fertilizers
portion of phosphorus potassium fertilizer
Micronutrient fertilizers
portion of micronutrient fertilizer
Organic fertilizers
portion of organic fertilizer
Biofertilizers
portion of biofertilizer
Composts
portion of compost fertilizer
Fish manure
portion of fish fertilizer
Green manures
portion of green manure fertilizer
Guano
portion of guano fertilizer
Organomineral fertilizers
portion of organomineral fertilizer
fertilizer pesticide combinations
portion of fertilizer pesticide combination
Inorganic fertilizers
portion of inorganic fertilizer
Liquid fertilizers
portion of liquid fertilizer
Liquid gas fertilizers
portion of liquid gas fertilizer
Slow release fertilizers
portion of slow release fertilizer
seabirds
seabird
Goats
goat
whales
whale
plant
plant
degradation
degradation
solubilization
solubilization
crushing
crushing
drying
drying
Excretion
excretion
pesticides
pesticide
