Abstract
This paper describes the construction of the LOTED2 ontology for the representation of European public procurement notices. LOTED2 follows initiatives around the creation of linked data-compliant representations of information regarding tender notices in Europe, but focusing on placing such representations within their legal context. It is therefore considered a legal ontology, as it supports the identification of legal concepts and more generally, legal reasoning. Unlike many other legal ontologies however, LOTED2 is designed to support the creation of Semantic Web applications. The methodology applied for building LOTED2 therefore seeks to find a compromise between the accurate representation of legal concepts and the usability of the ontology as a knowledge model for Semantic Web applications, while creating connections to other relevant ontologies in the domain.
Introduction and motivation
Over the last twenty years, AI & Law research has widely investigated the field of legal ontology. Up to now, efforts have focused on the development of legal ontologies, aimed at the representation of fundamental basic concepts of legal knowledge and at the description of specific legal domains, to support legal case-based reasoning, decision support systems, legal compliance checking, and more generally to map the complexity of legal knowledge to formal languages such as OWL. Specific methodologies and approaches used for building these ontologies show that legal ontology engineering has its own peculiarities [11], requiring particular conceptual structures for the representation of legal concepts, including notions of legal rules, preconditions, legal consequences, etc. [38,41].
At the same time, the Linked Data trend and the emergence of second generation Semantic Web applications have highlighted the significance of intelligence arising from the integration of dispersed and heterogeneous data from many sources, rather than from closed knowledge based systems [14]. Thus for this kind of applications, there is a need for flexible and modular ontologies, which can be easily integrated to discover non trivial connections between data.
Since law impacts and affects the everyday life of all individuals, representing legal knowledge in the Semantic Web scenario is both a timely need and a challenge. To see this, one has only to consider the huge amount of Open Government Data released at present. Indeed, the e-Government sector is one of the major drivers in the emergence of Open Linked Data and governments are making accessible a large number of datasets, about a wide range of topics, such as spending reports, administrative staff organizations, public healthcare, etc. However, without the right interpretation, data are only raw data, which by themselves may not provide useful information.
Open Government Data are in many cases related to the legal domain and, as a result, legal ontologies may play a key role uncovering the semantics of these data and driving the integration of this information with other datasets. Thus, it would be possible to build semantic e-Government applications, which may provide a significant contribution in bridging the gap between citizens and institutions [24].
Nevertheless, as recently emphasized by some scholars, the shift towards the Linked Data framework and the new paradigm of Semantic Web applications implies a reconsideration in the legal ontologies’ role and utility, “questioning the need for a highly axiomatised and unified knowledge representation” and conceiving instead “a new way of designing legal ontologies and of embedding them into architectures for legal information systems and other web services” [10].
As far as we know, at present time no explicit attempts have been made to figure out alternative methodologies for building legal ontologies aimed at supporting web services and, in particular, Semantic Web applications. It is still not clear how we may or should rethink the approach to legal ontologies so as to meet current and growing demand for better access to the legal information by disclosing the semantics of the increasing amount of legal (and legal-related) data. Moreover, we do not know to what extent it is possible for the legal ontology engineer to come to terms with the new challenges, meeting the need of Linked Data and Semantic Web advanced scenarios, while at the same time, preserving the richness of contents, the constraints and the other peculiar features of the legal domain. Furthermore, we are still not able to predict what a change of mind in legal ontologies’ role, design, context and use may entail in terms of both benefits and risks.
All things considered, in an attempt to clarify at least part of these uncertainties, in this paper we explore a novel approach to the design of legal ontologies, having in mind their use in the Linked Data framework and specifically in Semantic Web applications as means for delivering better access to the legal information enclosed in Open Government Data. The approach we advocate here represents a sort of compromise between an accurate representation of the legal knowledge and the requirements demanded by the Web of Data, since we try to represent legal concepts in such a way as to drive the construction of Linked Data applications (or lightweight Semantic Web applications). Besides, and against the backdrop of the challenges arising from the Legal Semantic Web applications’ design, the work inspires and motivates a discussion on the peculiarities of legal ontologies: their relationship with the Semantic Web and their ability to be integrated with (even) non-legal ontologies in the Linked Data framework.
The application scenario for the investigation of this approach is the European public contracts domain. The public procurement domain is a complex and very technical legal field but also a strategic one, since it accounts for 17% of the EU Gross Domestic Product (GDP). As such, it represents one of the major public finance levers: a significant boost to the European countries economies depends precisely on this public expenditure. Because of its significance, public procurement concerns many stakeholders including:
Data on public contract notices are open data by their nature; by law they must be accessible, because the whole public procurement process should be based on transparency and the advertising of contract notices is an essential step of the public procurement transparent procedure. Indeed, public administrations must aim to get ‘value for money’ (public money, taxpayers’ money) by choosing, through transparent procedures, the best offer among the largest possible number of candidates. Therefore, any contract notice is issued with the intention to reach the largest number of potential candidates, because the more economic operators take part in the tender, the more competition increases. Naturally, as the competition increases, the chance to get better products and services increases as well.
Today, public institutions at all levels (regional, national or supranational) publish their procurement notices on the Web. In the vast majority of cases, the advertisement on the institutional web-sites is an essential condition, required by law, for the conduct of tenders. In the European Union, tender notices for the award of public contracts ‘above the EU threshold’1 For the current values of the thresholds, see http://www.ojec.com/Threshholds.aspx.
In the recognition of these potentialities, we have developed an OWL 2 DL ontology of European public procurement notices, whose main purpose is to serve as a means for ensuring a better access to procurement information. Though information about procurement notices is nowadays largely available on the Web (and specifically on the TED web site with regards to the European procurements), we believe that in many cases it is not fully understandable (and therefore usable) by non-experts or public procurement newbies, notably by small and medium enterprises. Accordingly, the ontology we have built has been conceived to offer – paraphrasing Dietz’s words – a new understanding of procurements “such that one would be able to look through the distracting and confusing appearance [of them] right into their deep kernel” [18]. At the same time, we decided not to hide the legal complexity of the domain, by reducing the ontological commitments. Rather we tried, accepting a sort of compromise, to preserve the legal concepts’ representation behind the “little semantics” necessary to engage a larger audience than legal experts. Such an expressive modeling of the domain allows the discovery of connections with other domains, e.g., business domains, and the integration with other relevant ontologies, specifically with Good Relations [28].
We call this new ontology LOTED2, since it can be considered as an evolution of LOTED3
The rest of this paper is structured as follows: Section 2 outlines related work on reusing Linked Open Data associated with the European public procurement notices and on other existing ontologies on public contracts. In Section 3 we describe the methodology we used to build the LOTED2 ontology, discussing its aims and its design. In Section 4 we describe the ontology in detail, its modularity structure and the modules by which it is composed; in Section 5 we discuss the integration of LOTED2 with the most widely used ontology for describing e-commerce scenarios, namely Good Relations, while in Section 6 we provide some examples of instantiation of the ontology with TED data. Finally, in Section 7 we reiterate the main results from our work on LOTED2, emphasizing the lessons learned and highlighting further research questions.
LOTED ontology
The LOTED ontology4
Although the LOTED ontology satisfies the requirement of usability and is very suited to the purpose for which it has been designed, it does not actually represent knowledge about the domain, as it merely defines the structure of data objects in the domain. However, authors were aware of this aspect and a more expressive representation of the domain of procurement was planned as part of future work on the LOTED project. The LOTED2 ontology has been developed to achieve this goal.
Note that the version of the Public Contract Ontology we are referring to was available at the time of the first submission of this article (3/09/2012) at http://purl.org/procurement/public-contracts#. The Deliverable 9a.1.1 (“Framework for Creating Linked Data in the Domain of Public Sector Contracts” – available at http://static.lod2.eu/Deliverables/deliverable-9a.1.1.pdf) describes in details this earlier version of the ontology. However, a new version of PCO has been released recently (see http://opendata.cz/public-contracts-ontology), which substantially differs from the previous one. In particular, the authors of the ontology seem to have addressed many of the points we raise in this section.
LOD26
At the heart of this infrastructure there is the Public Contracts Ontology (PCO). The authors state they are not interested in modeling every aspect related to a contract, but only “information which is available in existing systems on the Web” and “which will be usable for matching public contracts with potential suppliers” [34]. In other words, the goal of this ontology is to model a public contract as a whole, but without going into details of the domain.
PCO is more articulated than LOTED: it is not built to model the data structures of a particular system (TED), but rather tries to represent a variety of aspects of the domain, taking into account the integration with other ontologies (Good Relations, VCard,8
Call for Anything, which is available at
Nevertheless, PCO shows some weaknesses that cannot be ignored, even while keeping in mind that it is not a legal ontology. In particular, the ontology aims to provide a conceptual description of both the tendering phase and the phase of the execution of the contract. However, there is not a clear conceptual distinction between these different scenarios in the PCO. Indeed, in the ontology the different concepts of call (call for tenders, such as contract notice) and contract (public contract) are often confused.
First of all, the class
Conceptual confusion increases by defining the Class Within PCO this term denotes the tender bid submitted by the economic operator, that in the same ontology is called “Supplier”.
Summarizing, this ontology tackles complex subjects through a too simplistic approach.
‘10ders Information Services’12
Basically, the commercial services offered to small and medium enterprises span from tenders alerting systems on the basis of the subscriber profile model, to the offer of reports on the major public buyers for their products and services. It also provides a customized data mining analysis of public procurement tailored to the interest of each client.
Aside from the commendable purpose to provide an unique access point (‘a pan-European platform’) both for EU relevant tender notices published on TED, and for notices below EU threshold published on a wide range of buyer profiles of national, regional and local levels, it should be considered that there are many services providing mail alerts about tenders and other mechanisms of this kind. However, the novel approach of this project is that the platform system is built using structured open data instead of screen-scraping techniques.
From the same research academic group (WESO) comes also the MOLDEAS (‘Methods on Linked Data for E-procurement Applying Semantics’) work [2]. This project can be considered as a broader framework, which includes also the experience gained in developing Euroalert.net. Indeed, MOLDEAS aims to apply the Semantic Web and Linked Open Data approaches to public procurement notices, defining a set of goals. The first goal is to transform government controlled vocabularies such as CPV,14
http://simap.europa.eu/codes-and-nomenclatures/codes-cpv/codes-cpv_en.htm
http://simap.europa.eu/codes-and-nomenclatures/cpc/index_en.htm
The main advantages claimed by authors of Euro-alert and MOLDEAS are essentially the decrease of information dispersion (arising from the different sources where contract notice above and under EU threshold are published); the unification of data models and formats and the support to multilingual issues (through EUROVOC resources used for the enrichment of public procurement notices). An ontology about public contracts is still in development, as part of WESO group’s future work.
Looking at existing work on ontologies of public procurement domain, there is no mistaking the fact that these artifacts are suitable to be employed in Semantic Web applications. However, as mentioned above, these ontologies do not really take into account the legal aspect of the domain. This is not a fault in itself, provided the domain is described in accordance with the conceptualization dictated by the legal sources.
It is also interesting to note that none of the considered work shows a reference to legal ontologies. Given that public procurement is a legal domain, it might be worth asking why legal ontologies and the Linked Data world are so distant. A brief survey on legal ontologies and an analysis of the peculiarities related to the legal knowledge representation may help to clarify the background.
The larger group of legal ontologies developed up to now, can be classified as two diverse categories.
The first is represented by core legal ontologies, namely ontologies that draw largely on legal theory in order to extract general patterns for describing the main concepts of legal knowledge: agents, roles, norms, normative effects (such as obligations, permissions and prohibitions) and temporal properties (e.g., about normative rules validity, efficacy). Examples of this type of ontologies are: the LRI-Core ontology[8], DOLCE+CLO (Core Legal Ontology) [22], the Ontology of Fundamental Concepts [40] and LKIF-Core [32].
The second group of legal ontologies is represented by specific legal domain ontologies, i.e., conceptualizations of a particular field of legal knowledge drawn from authoritative sources such as national or European laws (e.g., concerning privacy, intellectual property, copyright, etc.). Examples are the IPRONTO ontology [17], the Copyright Ontology [23], the Customer Complaints Ontology [33], the Consumer Protection Ontology [44].
The aim of core legal ontologies is to provide an interoperability framework from which other legal ontologies could inherit general categories, and in that way trying to overcome also comparative and multilingualism issues, among diverse legal systems. On the other hand, legal domain ontologies are built with a specific application-perspective in mind, usually grounded in closed systems. As a consequence, they are focused on the reasoning potentialities more than on the integration with other ontological resources in the open, heterogeneous and large scale Semantic Web environment.
The experience gained so far shows that to deal with the complexity of legal knowledge through ontologies and within the limitations of the Semantic Web languages, requires an intensive design process. Legal ontologies are usually made of a significant amount of axioms and restrictions upon classes, which represent legal concepts, even because of rigid conceptual constraints featuring the legal domain. In fact, the conceptualisation of the legal domain depends on legal sources, which dictate the semantics of legal categories through – more or less precise and explicit – definitions and descriptions, in that way constraining the interpretations (or, at least, part of them).
In contrast, the domains described by the ontologies widely used by the Linked Data community, like for example FOAF18
First of all, legal knowledge cannot be bridled into propositional structures (atomic propositions and propositional connectives): the truth preservation in law does not make sense, because categories of true or false cannot be applied to normative propositions. Secondly only a small part of legal knowledge may be expressed through propositional logic, which is not adequate to support legal reasoning [39]. The reason is that the first step of legal reasoning is to identify a legal concept, and the utility of ontological legal concept representation is precisely to provide a way to classify individuals.
Given this background, we have the impression that there is ‘no room’ for legal ontologies in the Linked Data scenario, where lightweight ontologies like FOAF or Good Relations seem to rule the scene. Because of their characteristics, legal ontologies look set to dedicated and closed systems, to perform complex tasks, but not of integrating themselves into the Web of Data. Nevertheless, some features of legal ontologies are not necessarily incompatible with Linked Data requirements.
On the one hand, the representation of legal concepts and relations among them is useful to verify and fix the correctness of Linked Data statements on legal content, through the use of reasoners such as Hermit [42] or Pellet [43]. On the other hand, the new triples generated, i.e. inferred or materialized, can be written back into the RDF model, adding a new level of granularity in the data. This type of information is just what is useful to drive the integration of ontologies with other related ontologies and datasets.
There are still grounds, however, for asking whether the structural diversity among legal ontologies and the common Linked Data models represents an obstacle to the realization of legal Semantic Web applications and, when needed, to the integration of legal ontologies with other kinds of semantic resources. If so, it is not trivial to figure out how the shift towards a new way of conceiving legal ontologies may be attained. The work we are going to describe here, namely the LOTED2 ontology, can indirectly be considered as an ‘experiment’ carried out having in mind these questions, and thus, trying to give them an answer.
As said in the introduction, the public procurement domain is a peculiar one: it is a legal domain, i.e. a domain regulated by norms, but it is also a public finance domain, since it represents the means by which public agencies seek and acquire goods and services on the market, and then, one of the way in which public money is spent. In fact, when we speak about public procurement, we are also referring to the commerce transactions between public agencies and businesses (
Therefore, the original intention that has inspired the LOTED2 ontology’s design has been that of reusing both the most widely used core legal ontology (at the state of the art of legal ontological engineering), namely LKIF-Core, and the most widely used ontology for the e-commerce (at the state of the art of Semantic Web applications and the Linked Data experience), namely Good Relations. This with the purpose of producing an ontology of public procurement notices consistent with the upper legal categories – inherited by a core legal ontology – and with the e-commerce Linked Data vocabulary.
Nonetheless, to the best of our ability and effort, a full integration of the two ontologies was not possible. Therefore, the attempt to capitalize on the most representative domain-related ontological resources, has led us to set the design and the content of LOTED2, by agreeing some kind of compromise between the ontology in development and the two conceptual models that we have taken into account as our points of reference.
Although LOTED2 does not inherit, as legal domain ontology, its upper classes from LKIF-Core ontology, it explicitly refers19 See annotations to the ontology.
To do this, the ontological modeling activity has been conducted by the domain expert and relies heavily on a comprehensive analysis and interpretation of the legal sources governing the domain. That is why we call LOTED2 a legal ontology.
In the following, we outline in details the aims of the ontology along with the methodology followed for its construction.
LOTED2 has been designed for the following specific purposes:
to express the (main) legal concepts of the domain of public contract notices as defined in legal sources (European Directives on public contracts); to support rich semantic annotation, indexing, search and retrieval of tenders documents, such as contract notices; to make possible the reuse of semi-structured data extracted from the TED system; to enable the integration with other ontologies and vocabularies about related domains.
The ultimate goal of the ontology is to permit the construction of Legal Semantic Web applications that support public procurement by matching demand and supply.
Methodology used for building LOTED2: A backward path in re-constructing the procurement data semantics
Two main Directives cover the European public contracts domain: the Directive 2004/18/EC20
http://eurlex.europa.eu/LexUriServ/LexUriServ.do?uri=CELEX:32004L0018:en:HTML
http://eurlex.europa.eu/LexUriServ/LexUriServ.do?uri=CELEX:32004L0017:en:HTML
These two legal sources represent the reference point to derive the exact meaning of terms used to describe the procurement domain, and to extract the information needed to build any logical theory which would formalize the domain knowledge. Besides this fundamental consideration (albeit an obvious one), another aspect concerning the European tender notices published on the TED system must be taken into account. According to the Directives, the tender notices sent by contracting authorities/entities to the European Commission shall be formulated in accordance to standard forms.22 These forms, required by Commission Directive 2001/78/EC of 13 September 2001 on the use of standard forms in the publication of public contract notices, are available at http://simap.europa.eu/buyer/forms-standard/index_en.htm.

Tenders Electronic Daily tabular summary of Contract notice n. 382532-2011.
When there are standard forms for drafting a certain type of document, these can be used as starting point for the analysis of each individual document drafted according to the established format. In a certain sense, model forms are descriptions of classes of documents and, each new document drafted, is an instance of one of these classes. The analysis of these model forms may tell us information about the terms identifying parts of the document and the interrelationships among each other. In many cases those terms identify ontological classes and the structural relationships between terms can be considered as object properties or data-type properties. Thus, filling out a new document means the creation of new instances for each class.

Excerpt of a standard form for contract notices.
On the other side, we can derive the exact meaning of terms contained in standard forms only through the analysis of the authoritative sources: the sources of law, indeed, contain the definitions of the relevant terms pertaining to the domain they cover (explicit knowledge), and at the same time, they provide us with information about the nature of legal concepts: what are their features (or properties) and their relations with other concepts, in the whole context of the domain (implicit knowledge). The first is a plain literal analysis, the latter, instead, is the result of a deeper domain-expert’s interpretation.
As extensively discussed (among others) in [11], various approaches and methodologies have been adopted in legal ontology engineering: in particular, ontologies which rely on the interpretation of the sources of law usually are built following a top-down approach; constrastingly, ontologies which rely on textual analysis of legal texts are built following a bottom-up approach.
It is well known, even by the legal ontological engineering community [19], that both approaches show points of strengths, as well as some weaknesses. Legal ontologies built following a top-down approach, may be reused across different application scenarios, because they provide conceptual frameworks generic enough for being reused or specialized by other ontologies. However, legal ontologies built following such an approach may be not adequately linked to textual sources or to real data structure, since they usually contain many theoretical (or legal foundational) definitions.
Instead, following a bottom-up approach (e.g. by extracting a vocabulary from standard forms and analyzing only the syntactic relations among terms), it is possible to discover useful terminological information at a larger scale and in an easier way. However, most of the times, following a bottom-up approach, the obtained result is too detailed, typically described by means of propositional logic. As a consequence, it becomes extremely difficult to discover fine-grained abstractions concerning broader legal concepts and, at the same time, the risk of inconsistency increases. Moreover, the bottom-up approach does not help in discovering and, then, representing more complex legal conceptual structures, which are necessary to emulate the legal reasoning.
Taking into account all these premises, the design approach of LOTED2 is based both on a top-down approach (extraction of legal concepts from legal sources) and on a bottom-up one (analysis of standard forms). We believe that, given the existence of different kinds of textual sources (standard-forms and legal sources) in the domain of interest, in this way it is possible to achieve a better match between language and conceptualization, and then, a good level of correspondence between terms, which identify data, and ontological classes.
Firstly: the raw data structure, as shown in the tabular summaries, is
Secondly: the entries contained in the standardized notices are interpreted according to the articles of the Directives, where they are described in detail (explicit knowledge), and also, through a comprehensive analysis of the domain, as it emerges from the interpretation of the whole legal sources covering the domain (implicit knowledge).
Thirdly: the resulting re-composed “puzzle” is formalized as much as possible through the language used for coding the ontology.
Consider for example, the excerpt of the tabular summary related to the contract notice n. 382532-2011 (see Fig. 1) the data ‘AA’, labeled ‘Type of Authority’ (which in this specific case is a ‘body governed by public law’), is extracted from the entries of the standard form as shown in Fig. 2.
Then, an analysis of the Directives is conducted in order to search definitions or other kind of information pertaining to the “Type of Authority” record (or ontological class), especially by analyzing
In a certain sense, such a mixed approach facilitates the
This path (analysis of data ⇒ analysis of standard-forms ⇒ analysis of legal sources) is documented inside the LOTED2-core ontology, through the annotation properties. In fact, a comment (

Dependencies between LOTED2-core modules and external ontologies (GoodRelations4Tenders and VCard) in LOTED2-extended.
However, the semi-structured data rendered by the TED system represent only the essential part of all the information contained in these documents. Other relevant information is lost. This is the case, for example, when a proposal for the acquisition of similar or related supplies/works/services may result in contracts being awarded at the same time in the form of separate lots. Since in some cases, economic operators may submit an offer for a single lot only (in those cases in which it is admitted a partial type of bid, for example) this type of information may be very helpful for participation in tenders of SMEs. An XML standard for legal documents, which allows also RDFa assertions in order to link the structural part of the text with ontological classes, such as AkomaNtoso [3,46] or CEN Metalex [6], may be used in order to tag the full content of tender documents.
In this section, we describe the legal ontology of European Public Procurement notices we have produced, namely LOTED2-core. The integration of LOTED2-core with the Good Relations ontology is called LOTED2-extended and is described in Section 5.
In order to facilitate maintenance, LOTED2 has been designed with a modular approach: specifically, ten modules (Fig. 3) compose the LOTED2-core ontology. Modules have been conceived to be ‘self-contained’, ‘independent’ and ‘reusable’ [15].
An extended version of LOTED2 ontology is composed by LOTED2-core ontology, GoodRelations4Tenders and VCard.
GoodRelations4Tenders is a version of the GoodRelations ontology compliant with the public procurement domain, while, as well known, VCard describes a mapping of the VCard specification to RDF/OWL. The goal of the integration of LOTED2-core ontology with VCard is to promote the use of VCard for the description the description of ‘points of contacts’ and addresses of awarding authorities. GoodRelations4Tenders, instead, represents the
LOTED2-core is a framework module (as well as LOTED2-extended module). Its function is to hold together the modules by which is com- posed LOTED2-core ontology. Both the LOTED2-core ontology and its extended version (called LOTED2-extended) are available at http://loted.eu/ontology2 and at https://code.google.com/p/loted2/source/browse/.
In the following, we provide an overview of the modules composing LOTED2-core and a description of the main inferences that the ontology supports.
Procurements Subjective Scope
The Procurements Subjective Scope module describes the classes of legal persons who are empowered to issue a tender notice (generally called ‘call for tenders’) and to award a public procurement contract, i.e. to play the role of awarding legal entities. An enumeration of these entities is contained in art. 1 (9) Directive 2004/18/EC and in art. 2 of Directive 2004/17/EC. The first type of entities may play the role of contracting authorities; the second type of entities may play the role of contracting entities. Although both the Directives refer to ‘contracting authorities’ and ‘contracting entities’ as two different types of entities, these terms denote, from an ontological point of view, roles rather than types of entities.
Consider for example the case of a body governed by public law, that, on the one hand, issues a notice for the award of a public contract and, one the other, may submit a tender bid in a tender announced by another entity.23 The forth whereas of Directive 2004/18/EC and the eleventh of Directive 2004/17/CE, in this regard, state that Member States should ensure that the participation of a body governed by public law as a tenderer in a procedure for the award of a public contract does not cause any distortion of competition in relation to private tenderers.
However, for the purposes of the European legislator, the distinction between types of entities, which may play the role of contracting authority, and entities, which may play the role of contracting entities, is very relevant. From this distinction, in fact, depends the application of Directive 2004/18/EC (on ordinary sectors) or of Directive 2004/17/EC (on utilities sectors). Hence, LOTED2 includes two classes that refer to intermediate legal concepts, namely ‘entity operating in ordinary sectors’ and ‘entity operating in utilities sectors’. If an entity falls in the class of the first type of entity, then it may play the role of contracting authority. Instead, if an entity falls in the second type, then it may play the role of contracting entity. The definition of these classes is based on the main activity carried out by an entity. A detailed list of ordinary sector activities and utilities activities is obtained by standard forms, but should not be considered as a

WTO Members and observers in the Government Procurement Agreement (source: Wikipedia).
The Procurement Subjective Scope module is connected to the Tender Document module through the object property
Data contained in the tabular summaries of TED are basically related to tender notices. So, this module, more than others, has been built emphasizing the bottom-up approach. As in the Procurement Subjective Scope module, also in the Tender Document module the annotation property
Procurement regulation
This module describes the legislative sources that regulate the public procurement domain. Apart from the Directives, many other legislative sources regulate the European procurements domain. The most important is the Government Procurement Agreement (GPA), a pluryilateral treaty signed by a number of WTO (World Trade Organization) parties (Fig. 4), with the purpose to open up as much as possible public procurement business to international competition [7,16]. The scope and coverage of GPA is based on the type of procurement, the type of entity and the monetary threshold defined in Appendix I of the Agreement.
It is worthwhile to emphasize that signatories may negotiate the coverage of GPA with other parties, on the basis of reciprocity. For example, the utilities sector is not covered by GPA with respect to Canada and there are many limitations with respect to USA and Japan too, just to name the most relevant. This means that we cannot consider the GPA as a unique treaty for each signatory, since there are rather many bilateral agreements (Canada-EU, USA-EU, et.) negotiated by parties under the GPA framework. Thus, simply saying that a tender is within the coverage of the GPA is not enough to clarify the geographical scope of the application of each single bilateral agreement to the single contract notice. These significant divergences in the application of GPA entail a considerable complexity in defining which specific regulation covers a tender document.
Unfortunately TED system data do not provide an effective help in this direction since they are articulated in an incoherent manner. In fact the field of ‘Regulation’ data is referred once to the political geographical area of the country in which the entity that issued the tender notice is based (European Union, European Economic Area); once to the type of authority that issued the notice (European Investment Bank, European Bank for Reconstruction and Development, European Monetary Institute, European Institution/Agency or International Organization); once to the reason for which the notice is issued (External aid and European Development Fund) and finally once to the actual regulation, although not always identified with the exact wording (Agreement between the European Community and the Swiss Confederation, GPA, etc.).
The Procurement Regulation module of LOTED2-core ontology aims to provide the right interpretation of TED ‘Regulation data’, i.e. which legal source covers the single tender document. This type of information is necessary to define the jurisdiction of the regulation, namely the geo-political reference within the norm is applied and its effects are binding [9,25]. An additional module covering this aspect will be part of our future work.
Procurement competitive process
A notice is issued by an entity acting as contracting authority or contracting entity in order to announce a competition. There are many types of competitions, based on the type of notice. The most relevant type of competition is the tender (i.e. the competitive bidding) that is announced through a contract notice (but not only). Instead, a design contest is announced through a design contest notice. A qualification system is announced through a notice on the existence of this type of system, that we can consider as a competitive procurement process. This system, in fact, is used by only entity operating in utilities, for seeking qualified economic operators, which meet predefined qualification criteria that must be satisfied by potential providers of specific types of works, services and supplies. An applicant of this type of notice, which satisfies these criteria, is registered in the system as potential candidate or contractor for the particular type of contract. A notice on the existence of a qualification system with call for competition is a notice through which is announced both a qualification system and a tender, in which may participate only operators recognized as qualified according to the system.
A procurement competitive process takes place with an established administrative procedure, the award procedure that can be of different types (open, restricted, negotiated, etc.). LOTED2-core ontology does not describe award procedures.
Subjective legal situations
This module describes roles played by agents in procurements competitive processes and in organizations. As highlighted in the description of the Procurements Subjective Scope, terms such as Contracting Authority or Contracting Entity denote roles rather than types of entities. In particular they denote roles than only certain entities may play: only entities operating in ordinary sectors may play the role of contracting authority while only entities operating in utilities sectors may play the role of contracting entities. The two terms indicate basically the same concepts, namely a property that an entity assumes when awards a public contract and when carries out all the set of actions required for the awarding process of a contract. The first action is to issue a notice.24 Please note that in some specific cases public authorities may award a contract without issuing a notice: this is the case in which a tender takes place with a negotiated procedure without contract notice. Of course, this case is not examined in this paper, since LOTED2 is an ontology for describing tenders notices published on TED.
So, by issuing a contract notice certainly an entity starts to play the role of awarding legal entity. Apart from the role of awarding legal entity, this module of LOTED2 enables us to describe also roles played by ‘business entities’ in the competitive processes of procurements, and in organizations. For example, a natural or legal person (operating on the market as ‘economic operator’) who has submitted a tender bid for the award of a proposed public contract is a natural or legal person who plays the role of ‘tenderer’. And if this agent has submitted the best tender bid, then assumes the role of ‘successful tenderer’. Another important aspect in procurements’ role modeling deserves to be emphasized. Just as a body governed by public law may play the role of both awarding legal entity and tenderer, so business entities may play many roles in different procurements competitive processes.

Subjective Legal Situations and Roles in LOTED2-core.
Consider that the eligible customers of many business entities are public authorities and so their core activity is precisely the participation in tenders, even more than one at the same time. Then, this matter is not trivial. In order to represent also these cases, the described module represents (legal) situations more than simply roles. An agent may have more than one subjective legal situation that is related to a role played in a context. In the case of procurement the context is the procurement competitive bidding. These concepts and relationships have been modeled reusing and adapting the ‘Social Reality pattern’ [30,31], one of the proposed content patterns available from the catalogue of Ontology Design Patterns initiative [20,21].
Since LOTED2 is an ontology of public procurement notices, it does not represent public contracts per se. In fact, LOTED2 aims to describe the semantics of notices concerning the award of public contracts and not that of public contracts in themselves. A contract notice is the means whereby a competitive bidding for the award of a public contract is announced. So, the commitment of the ontology is to capture the information of public contract to be awarded (or proposed contract) not of the public contract awarded or in its execution. For this reason, the ontology specifies the class of
Consider the case in which the tender has been declared unsuccessful: can we speak about a contract or not? Of course not, because the contract has not been awarded and then has not been signed by parties. So, in the stage of notice publication there is not a contract, but a contract to be awarded. This is also clear if we consider that a contract notice is also known in legal doctrine as
The analysis of standard forms for contract notices and of the other tender documents shows that the object of contract is intended in the second sense, i.e. as a description of the type of good, service or work that the entity issuing the notice seeks. A rather significant aspect of procurement domain is the division of proposed public contract in lots. In some cases and under certain conditions, entities issuing a contract notice for the acquisition of similar or related supplies/works/services may decide to split the proposal into separate single proposed contracts to be awarded with the same tender. These single proposals that are also parts of a general proposal are called lots.
Since in some cases, economic operators may submit an offer for a single lot only (namely in all the cases in which is admitted a partial type of bid) this type of information may be very helpful for participation in tenders of SMEs. In fact, usually a small enterprise is specialized in one particular sector with one main offering and so it might be interested in bidding for the single lot rather than for the global proposal. Therefore, one of the commitments of LOTED2 ontology is to accurately model this particular aspect of the domain, namely lots, even by ensuring that through ontology were made possible certain inferences.
Lots are proposals of contract as well as proposal of contracts not divided into lots, but they are different in the sense that they depend – in the sense of existential dependence – by the general proposal which include them. At the same time, the general proposal is also a particular one, since it is ontologically dependent on its parts (lots, indeed); otherwise it would be empty with respect to the object of the contract. In other words, there is an ontological mutual dependence between them. Furthermore, the lot and the general proposal share the same nature: they are both proposal of contract. On the other hand, lots are also different and independent from each other, because they have different subject matter and they can be awarded to different tenderers. Therefore, it is not easy to understand if their being part of another proposal should be understood in the sense of
When a proposed contract is divided into lots through the contract notice is announced a tender for award also single lots. Through LOTED2 it is possible to infer that through the same contract notice is announced a tender for award each single lot. This result has been achieved through a set of General Class Axioms combined with property chains.
Two object properties are inferred in order to show which lots will be awarded through the same tender
Tender bid
This module describes the tender bid, namely the offer that may be submitted by the economic operator in the competitive bidding for awarding a public contract. Note that, in english, the same word ‘tender’ denotes the ‘race’, i.e. the ‘competition’ for the public contract, and also the meaning of the offer for a public contract. It is an ambiguous term and, arguably, this ambiguity has led the PCO developers to confuse the ‘race’ with the ‘offer’. For this reason, we have decided to call the offer
An important aspect is that tender bids may be of three different types: a tender bid, indeed, may be either a
A tender bid is evaluated on the basis of an award criterion: the lowest price or the most economically advantageous offer. The first is based only on the criterion of price; the second one is based on a set of combined criteria defined by the authority issuing the contract notice. Furthermore a tender bid will be opened in a certain place (defined into the contract notice) and at a certain date-time; and it may be drawn up in a specific EU language or not.
Business entity
This module describes the class of the entities playing the role of economic operators (to simplify hereinafter called ‘economic operators’). Economic operators are the subjects to whom the invitation to submit an offer for a proposed public contract is addressed. In other words, an economic operator is the potential counterpart of the awarded contract.
However, not every economic operator can sign a public contract. There are several eligibility requirements, based on certain criteria that must be fulfilled by an economic operator in order to participate in a competitive bidding. Another module describing these requirements will be part of our future works. This module describes private legal persons who are business entities, since they assume the legal form of ‘incorporates’, ‘society’, ‘cooperative’, etc. These classes are modeled following the taxonomy of LKIF-core ontology.
Top
This is an upper module in which are contained abstract classes, even useful to match the LOTED2 ontology with core ontologies and in particular with core legal ontologies, in order to foster interoperability. In particular, many classes of the ‘Procurement Top Classes’ module are modeled following the LKIF-core ontology schema. This relationship between LOTED2-core ontology and LKIF-core is what we call
Given the importance of time factor in legal domain, LKIF-core represents time (such as ‘date of publication’) as classes, while Good Relations represents time through data type properties such as
Furthermore, we believe that the LKIF-core ontology has been mostly conceived for representing legislative documents and not also administrative documents, such as tender notices. In fact, every legal document is also a legal source and there is not a clear distinction between legislative sources (which are sources of law in the proper sense) and legal sources, such as a contract (or a proposal of contract), which are sources of law only
On the other hand, in GoodRelations, a business entity is a “
For all these reasons, LOTED2-extended maintains a sort of ‘conceptual independence’ from both LKIF-core and Good Relations.
LOTED2-extended: The integration of LOTED2-core with GoodRelations4Tenders
Works carried out until now [2,34] bring up the integration of ontologies about public procurements with Good Relations ontology. In particular, as we have seen in Section 2, LOD2 ontology reuses some classes of Good Relations (Offering, Business Entity, etc.) and also WESO Research group pays attention to an integration with Good Relations, as part of its future works. Even the creator of Good Relations, Martin Hepp, often makes reference to public procurement as an interesting application domain for his ontology [29]. It is generally agreed that the object property
Good Relations is, currently, the best candidate ontology to complement an ontology of public procurement notices because it represents a strategic domain, such as the e-commerce in a non trivial way, addressing a complex domain and covering “the many intricate situation that this domain requires”, “is widely used currently in the e-commerce and linked data communities”, “it is easily applicable, actually applied and recommended by the stakeholders from the targeted domain” [13].
This is certainly true and in addition to that, we think that is also an ontology that inspires the recognition of some isomorphic patterns between legal and economic concepts, namely between the market and its legal superstructure. Nevertheless, it should be remembered that Good Relations is an ontology built with the purpose to meet the needs of B2C (Business to Consumer) or B2B (Business to Business) scenarios. In other words, Good Relations has been developed bearing in mind only the private sector. Therefore, it is not fully adaptable to the PA2B (Public Agencies to Business) scenario, because the public procurement domain has its own peculiarities that cannot be ignored, if one wants to build a Semantic Web that does not distort the principles of EC law.
Consider the principle of equality of treatment on which the Art. 23 of Directive 2004/18/EC is inspired. This article states that, unless justified by the subject-matter of the contract, technical specifications shall not refer to a specific make or source, or a particular process, or to trade marks, patents, types or a specific origin or production with the effect of favoring or eliminating certain undertakings or certain products. Such reference shall be permitted only on an exceptional basis, where a sufficiently precise and intelligible description of the object of the contract is not possible. In these specific cases, such reference shall be accompanied by the words “or equivalent”. Since Good Relations, according to its purposes, describes brands, types of products (such as models), and obviously also the origin of products, the integration of an ontology about public procurements like LOTED2 with this ontology could result in potentially law distorting effects.
Indeed, these aspects come overtly into conflict with EC procurement principle of equality of treatment. Moreover, there are also other aspects whose incompatibility with the principles of law is less clear, even though equally potentially harmful.
For example, Good Relations defines the object property ‘image=depiction’, through which it is possible to link a product to its image available on the web. This is a very useful class in the private market scenario. In the PA2B context, instead, the use of this class may create problems. Indeed, with an extensive interpretation of the Article 23 one may argue that a link to a certain image could be used in order to indicate exactly one specific type of product. This can be a sort of trap that may lead into a conflict with the principles of Procurement Regulations.

Inferences supported through the integration between LOTED2-core and GoodRelations4Tenders in LOTED2-extended.
Given these remarks, we have amended the Good Relations ontology, removing all parts non compliant with EC procurement principles. We have called this version ‘Good Relations 4 Tenders’. Apart from EC Directives-non compliant classes like
Changes in this release compared to the original Good Relations ontology consist of just the elimination of certain classes and properties. The original structure of Good Relations has been preserved.
Another point on which is worth dwelling is about how an integration between an ontology of public contracts and Good Relations should be understood. For example, Public Contracts Ontology (PCO) developed by LOD2 group adheres completely to the conceptual model of Good Relations, detrimental to the particular (and different) domain of public contracts. Instead the aim of LOTED2 ontology is to represent as closely as possible legal concepts pertaining procurements domain. Such a conceptual analysis allows the discovering of the possible connections with concepts of other domains.
So, by explicitly specifying concepts of proposed contract and of
This information is an useful one for matching public demand side with the offerings side of market. Recently, an initiative has started to make accessible information about corporations as Open Data: the ‘Open Database of the Corporate World’ (
We assume, indeed, that there are many economic operators (especially small and medium enterprises), which, looking at the world of public procurement, are skeptical or discouraged. They simply do not understand from where and how they can enter into this market, because, for instance, they are not aware of the existence of the TED system or because they do not have confidence with it. Therefore, they are not informed enough about the business opportunities coming from the public sector. To be informed in a simple way or just encouraged in participating in the European tenders, they need to know which public agency is searching for the specific product or service they sell.
In our opinion, this is the
Nevertheless, the LOTED2 ontology helps in accessing also other information, as we show in the following by instantiating the model with some examples of TED data.
In this section, we give some elements of the instantiation of TED data with respect to the ontology that we have just presented, highlighting in particular the inferences that LOTED2 supports. The instances used as examples are extracted from the semi-structured data contained in the tabular summaries and also from the full text of two real contract notices published (according to the standard forms) on the TED system. As said in the Section 3.2, the information contained in the full text of the tender documents can be marked-up by using a Legal XML standard, or just extracted by adopting scraping techniques.
As said, entities are classified on the basis of the main activity they carry out in the two diverse categories of entities operating in the utilities and entities operating in the ordinary sectors. The LOTED2 core ontology lists a set of instances of activities, which are included in the class of ordinary sectors activities and in the utilities. According to the contract notice n. 382532-2011, the main activity of The Open University is Education. Since Education is an Ordinary Activity, the ontology correctly infers that The Open University is an Entity Operating in Ordinary Sectors. The inferences of this case are represented in the Fig. 8.
The dashed rectangle represents the inferred classification of the instance, which is indeed an Entity Operating in Ordinary Sectors, while the arrow with the empty tip represents the subProperty relation between the two represented object properties. The dashed arrow represents the inference on the property ‘has activity’. Indeed an entity may carry out more than one activity, but only one is the main activity.
The classification of the entity on the basis of its main activity is particularly useful to verify that ‘contract notices-public (or ordinary) sectors’ are issued by entities operating in the ordinary sectors, while ‘contract notices-utilities’ are issued by entities operating in utilities. There are indeed different standard forms for these two categories of contract notices. This aspect is addressed through a DL restriction on the two classes, namely:
(
(
Moreover, as explained in the previous sections, the application of the EU Directives 2004/18 or 2004/1726 See the two General Class Axioms in the ontology.
Another inference allows to classify a proposed public contract as ‘proposed public service contract’, ‘proposed public works contract’ and ‘proposed public supply contract’ on the basis of the declared object of the contract. The Fig. 7 shows an example of this type of inference. It must be emphasized that the contract notices do not indicate explicitly whether a proposed contract is a mixed one (i.e. a proposed contract with both services or supplies, or works and services, or works and supplies as object) because the awarding authority qualifies the type of contract (for instance as services public contract) before the publication of the contract notice, on the basis of some detailed rules. For this reason, a public service contract is expressed in the ontology as a public contract that has at least one Service as Object of Contract, i.e. not excluding, for instance, that a proposed public service contract may have works as object of contract.

Inferred classification of the proposed public contract n. 382532-2011 under the category of proposed public service contract.

Inferred classification of The Open University under the category of Entity Operating in Ordinary Sectors.
Now, consider the contract notice n. 382532-2011, whose tabular summary is shown in Fig. 1. The contract notice is issued by The Open University and it announces the tender 382532-2011 for the award of a proposed contract, whose object is cleaning services and related. Figure 9 shows the matching between the data coming from this contract notice and the OpenCorporates database. The information coming from OpenCorporates can be filtered by searching for all the active (i.e. non in-liquidation and non inactive) corporations registered in the UK, whose sector of activity, according to the Standard Industrial Classification (SIC)27 Note that the SIC system of classification can be matched with the Common Procurement Vocabulary (CPV) that is a system of classification for the description of the object of the contract to be awarded. The Common Procurement Vocabulary identifies more than 9400 products with a code composed of 9 numbers. The aim of the CPV is to standardize, by means of a single classification system for public procurement, the terms used by contracting authorities and entities, describing the object of contracts through a uniform nomenclature. The CPV is translated into 22 official languages of the European Community. It simplifies the task of drafting notices, since it describes the subject matter of contracts, and helps also the drafting of statistics on public procurement, because it is compatible with trade monitors used throughout the world (especially those used by the United Nations). The CPV code is also included in the tabular summaries and in LOTED2-core ontology is associated to the object of contract through a dataType property.

An example of instantiation of LOTED2-extended ontology with TED and OpenCorporates data.
As explained in Section 4.6, in some cases, the proposal of public contract is split into (more than one) smaller single proposals. These proposals, called lots, are basically the result of the subdivision of a single bigger proposal into many. This aspect is particularly relevant because the division in lots enables SMEs (small and medium enterprises) to access also EU tenders that otherwise would be accessed only by large turnover’s enterprises. In fact, the EU Commission considers the sub-division of contracts in lots as a good practice in tenders’ management and encourages its use.28 See, for instance: http://ec.europa.eu/internal_market/publicprocurement/docs/sme_code_of_best_practices_en.pdf.
Therefore, we have paid close attention to this aspect in LOTED2. Figure 10 shows an instantiation of the contract notice n. 120168-201429 The contract notice is available at http://ted.europa.eu/udl?uri=TED:NOTICE:120168-2014:TEXT:EN:HTML.

An example of inferences concerning lots with an instantiation of contract notice n. 120168-2014.
Currently, more and more Open Government Data are available on the web. Since, in many cases, these data cover subjects strongly related to the legal domain, there are grounds for asking whether and how legal ontologies might support the construction of Legal Semantic Web applications that could be used for better delivering public sector information to the citizens. To that end, namely trying to give an answer to these questions, we have carried out an ‘experiment’ in the European public procurement field, building an ontology called LOTED2, whose main purpose is to support the construction of a Legal Semantic Web application for enhancing the meeting between the public demand and the market offering.
LOTED2 is a legal ontology inasmuch it tries to represent legal concepts as they result from the interpretation of the legal sources. However, unlike many other legal ontologies, LOTED2 tries to find connections with other Linked Data models relevant to the domain and, just in order to achieve this purpose, it has been built seeking for a balance between the accurate representation of legal concepts and the usability of the ontology as a knowledge model for Semantic Web applications.
Figure 11 shows the sketch of a Semantic Web application that aggregates and reconciles information about EU public procurement notices and business entities, drawing upon the ontology we have developed. The right side of the figure represents real world entities, as public institutions and economic operators. When a public institution issues a contract notice, the data pertaining to this contract notice are instantiated into the LOTED2 ontology; data from OpenCorporates become instances of LOTED2 as well (as shown in Fig. 9). Then, thanks to the inferences that LOTED2 supports and by means of a reasoner, the offering and demand side are matched.
In principle, thus, there is a place for legal ontologies in the Linked Data and Semantic Web applications scenario, but, in practice, there are still other broader issues to take into consideration.
It is worth noting that OpenCorporates makes available also data about non-European business entities and, as we have highlighted in the previous sections, under certain circumstances even the non-European traders may participate in EU tenders. The information about jurisdictions and regulations, therefore, is essential for the construction of a Semantic Web application (like the one drawn in Fig. 11) which aims at including also non-European business entities’ data. Otherwise, by linking data of companies from states not involved within the scope of application of the regulation that covers the contract notice, a sort of
Consider, for example, the contract notice 2011/S n. 236-382532 (Fig. 1) issued by ‘The Open University’ (that is a body governed by public law) for the award of a public contract concerning cleaning services. There are, indeed, two key questions related to this contract notice, with not easy solutions for traders who are interested in the application for this notice.
First: the Sigma Corporation is registered in Canada. Can the Sigma Corporation apply for this notice issued by The Open University (UK)?

Sketch of a Semantic Web application employing the LOTED2 ontology.
Second: the Gamma S.r.l. is registered in France. What types of documents should the Gamma S.r.l. submit to The Open University for participating in the tender n. 236-382532?
The answer to the first question depends on the type of ‘Regulation’ that covers the contract notice. The contract notice states that the Government Procurement Agreement covers this particular type of contract, but this is not enough to give an answer to the first question. It is necessary to consider all the exceptions agreed by single parties to the application scope of the GPA (in this case between Canada and EU). The answer to the second question, instead, depends on the criteria that must be met by business entities that want to participate in EU tenders.
Thus, in order to avoid alerting economic operators which are not admitted to the participation of this specific tender (e.g. because they have their registered headquarter in countries which are not covered by the GPA) an additional part of reasoning (which currently is not addressed by this LOTED2 version) is needed (step no. 4 in Fig. 11). Furthermore, it would be useful to provide also information concerning the requirements needed for participating in the specific tender, customized on the basis of the economic operator’s nationality.
It is worth mentioning that many initiatives have been started by the European Commission in order to improve the access of traders (in particular of SMEs) to the public procurement market in EU. This kind of vision has inspired, for example, the Peppol EU Project,30
On the other side, even if the use of a Semantic Web application like the one shown in Fig. 11 is limited within the European borders, there are other issues to take into careful account.
It is not by chance that we decided to show in Fig. 9 just the OpenCorporates data filtered with respect to the UK jurisdiction. In fact, in accordance with the “Open Company Data Index”32
There are important legal questions related to this issue. Is there an
In our opinion, this could actually be the case, and the work we have done so far should be considered just the first step towards this direction. For the time being, there is a number of lessons that we have learnt form our work.
In open systems the heterogeneity, the scale, the data quality, the reliability of information raise significant problems of control of the legal effects of heterogeneous linked data, or ‘smart’ data. Moreover, the inequalities in data accessibility affect the right of equal access to core information. Because of this, building Legal Semantic Web applications will require the ability to foresee and to cope with all kinds of risks that may emerge, in order to avoid that the challenge will turn into a danger.
Semantic interoperability among legal ontologies and Linked Data models
The integration of legal ontologies with ontological resources related to different domains represents a fairly unexplored field. A sort of
The value of “little semantics” in the legal domain
The attempt in integrating LOTED2-core and Good Relations shows also other things. On the one hand, through the properties “seeks” and the mirror inverted “offers”, Good Relations represents in an intuitive way the intents of parties when they issue an invitation to treat or a proposal for the conclusion of a contract (an ‘offer’ in legal terms). In a certain sense, Good Relations shows the final part of the legal superstructure behind the terms “seeks” and “offers”. On the other hand, the integration of the LOTED2 ontology with Good Relations shows that also in the legal domain “a little semantics goes a long way” [27]. Nevertheless, it must be emphasized that in the legal domain this little semantics is just the superficial layer of the whole of legal knowledge upstream. Many other relevant legal issues (like that of requirements for the participation in tenders) require an intensive design to be addressed within the limitations of the Semantic Web languages. That is because, in the legal domain, question answering is not only information retrieval. Information retrieval is not enough, since “question requires some deduction or inference before an appropriate answer can be given” and “regulations may contain many different articles about the same topic and one can only assess whether something is permitted or not by understanding the full documentation”. “A rather detailed understanding is required, in particular, because regulations generally contain complex structures of exceptions” [4]. In other words, question answering in the legal domain is not a trivial matter.
Compliance by design in legal ontologies modeling
The variety of related work about procurements (up to now there are three ontologies of public contracts, including LOTED2, and another is in development) reveals the strong interest of the Semantic Web community in representing legal knowledge. However, the Semantic Web community should consider the matter more carefully and the Legal Informatics community should try to take up the challenge. In this respect, it is worthwhile to emphasize that an exciting research field is emerging on the use of ontologies for checking compliance of legal documents or processes with norms regulating them [12,33], even by combining ontologies with rules formalized through interchange standards specific for the legal domain [37]. Nevertheless, at the same time, the aspect of compliance by design should be more stressed in legal ontology engineering. Ontologies for compliance and ontologies in compliance might be considered as two sides of the same coin.
Nowadays, technologies in general, and even Semantic technologies are causing so many fast and unexpected changes that the Law is under constant and severe pressure. We can not pretend that these major changes do not also require legal solutions and a way to re-think our approach to the legal knowledge. Surely, re-thinking the legal knowledge in the Semantic Web is a challenging task, not at all simple. However, it is worth to face the challenge, starting from the lesson we have learnt so far.
Footnotes
Acknowledgements
This work has been carried out when the first author was visiting Knowledge Media Institute (KMi), The Open University, thanks to a ‘Marco Polo’ grant provided by the University of Bologna.
The authors are indebted to Gioele Barabucci, Monica Palmirani and Silvio Peroni for their contribution to the ideas discussed in this paper. Isabella Distinto is also grateful to Roberta Ferrario, Claudio Masolo and Nicolas Troquard for their precious comments. Furhtermore, the authors wish to thank Rinke Hoekstra and the two anonymous reviewers for their helpful suggestions and thorough revisions.
