Sage Journals: Discover world-class research

Abstract

The Second World War (WW2) is arguably the most devastating catastrophe of human history, a topic of great interest to not only researchers but the general public. However, data about the Second World War is heterogeneous and distributed in various organizations and countries making it hard to utilize. In order to create aggregated global views of the war, a shared ontology and data infrastructure is needed to harmonize information in various data silos. This makes it possible to share data between publishers and application developers, to support data analysis in Digital Humanities research, and to develop data-driven intelligent applications. As a first step towards these goals, this article presents the WarSampo knowledge graph (KG), a shared semantic infrastructure, and a Linked Open Data (LOD) service for publishing data about WW2, with a focus on Finnish military history. The shared semantic infrastructure is based on the idea of representing war as a spatio-temporal sequence of events that soldiers, military units, and other actors participate in. The used metadata schema is an extension of CIDOC CRM, supplemented by various military history domain ontologies. With an infrastructure containing shared ontologies, maintaining the interlinked data brings upon new challenges, as one change in an ontology can propagate across several datasets that use it. To support sustainability, a repeatable automatic data transformation and linking pipeline has been created for rebuilding the whole WarSampo KG from the individual source datasets. The WarSampo KG is hosted on a data service based on W3C Semantic Web standards and best practices, including content negotiation, SPARQL API, download, automatic documentation, and other services supporting the reuse of the data. The WarSampo KG, a part of the international LOD Cloud and totalling ca. 14 million triples, is in use in nine end-user application views of the WarSampo portal, which has had over 690 000 end users since its opening in 2015.

Keywords

Linked Open Data semantic web military history World War II Finland cultural heritage digital humanities

1. Introduction: Military history as linked data

Plenty of information about WW2 is published every year in books, articles, news, web sites and services, documentaries, and films for humans to consume. This information is scattered in various military, governmental, cultural heritage, and other organizations, making it hard to find and use. Furthermore, the information is seldom published as data for reuse in computational analyses and applications. Gathering, extracting, and harmonizing information about the war is needed in order to create comprehensive global views of the war and history but this is not a simple task. This applies also to microhistory: for example, finding out the details of what happened to a perished relative during the war can be quite tedious, involving studying and aggregating data about him/her from several registries and data sources. Without harmonized, clean data, the data analysis of large military historical datasets, such as death records, would be difficult in Digital Humanities Research [5,13]. Combining information from various sources facilitates answering the complex societal research questions of “new military history” scholars [2,6].

WarSampo Initiative and Project Series. The goal of the WarSampo – Finnish Second World War on the Semantic Web initiative1

¹
The initiative and publications are presented in the initiative homepage: https://seco.cs.aalto.fi/projects/sotasampo/en/.
is to study and show how Linked Data [3] (LD) can help in solving tasks like these [18]. The initiative collects military historical data related to Finland in the Second World War (WW2). The data is published as Linked Open Data (LOD) in an open SPARQL endpoint on top of which the WarSampo portal2 ²
http://sotasampo.fi/en
has been created, including nine application perspectives to the data. The portal, targeted to both researchers and the public at large was opened in 2015. The WarSampo data service and portal were awarded with the LODLAM Challenge Open Data Prize in 2017 in Venice. The data forms an integrated interlinked 5-star LOD publication, and is part of the global LOD Cloud.3 ³
http://linkeddata.org

The WarSampo knowledge graph (KG) was published initially in 2015. The KG was first used by seven different application perspectives in the WarSampo portal, via only the SPARQL API [18]. The idea was to show that anyone could easily use the data dynamically on the client side. In 2017, by the centennial of Finnish independence, a new eighth application perspective of war cemetery data and related photographs4 ⁴
https://seco.cs.aalto.fi/projects/sotasampo/hautausmaat/
was released [26], a further demonstration of this idea. Finally, a ninth application based on a dataset of 4200 prisoners of war was aligned with the WarSampo KG and was released [33] in November 2019.

Related Work. The problem of combining and using heterogeneous cultural heritage datasets is a common problem in using Linked Data for Digital Humanities [17,36] and in Digital History [9]. Historical knowledge contextualization and visualization with experiences from the VICODI project are represented in [37], which also discusses general problems faced when modelling history with ontologies. Several humanities and cultural heritage related projects have used the CIDOC Conceptual Reference Model (CRM)5 ⁵
A list of CIDOC CRM use cases can be found at: http://www.cidoc-crm.org/useCasesPage.
[11].

Several projects have published linked data about the World War I on the web, such as Europeana Collections 1914–1918,6 ⁶
http://www.europeana-collections-1914-1918.eu
1914–1918 Online,7 ⁷
http://www.1914-1918-online.net
WW1 Discovery,8 ⁸
http://ww1.discovery.ac.uk
CENDARI9 ⁹
http://www.cendari.eu
[4], Muninn,10 ¹⁰
http://blog.muninn-project.org
and WW1LOD [35]. There are also a few works that have used the Linked Data approach to WW2, such as [8,10,46] and a LOD system on WW2 holocaust victims [42].

Our own previous research on WarSampo first presented the vision and overview of the system especially from the use case and end-user application perspectives [18,23]. In [16] data integration was concerned from the named entity linking (NEL) point of view. The maintenance problem of the interlinked dataset has been explored in [32]. Work on creating and using individual parts of the KG has been presented in several previous publications [24,26,29,31,33,34]. This dataset description complements our other publications about WarSampo by presenting in detail the KG, including the process of maintaining the data.

This article is organized as follows. The next section presents the source datasets. Section 3 discusses how the information in the source datasets was harmonized and presents the event-based data model. The data transformation process is presented in Section 4. An analysis of the data quality is given in Section 5. The stability and usefulness of the data are discussed in Sections 6 and 7, respectively. Conclusion is provided in Section 8.

Table 1
Source datasets of WarSampo, grouped by providing organization. Numbers in the article are rounded to 3 significant digits

# Source dataset Providing organization Used content Source format

1 Casualties of WW2 The National Archives of Finland 94 700 person records spreadsheet

2 War diaries The National Archives of Finland 26 400 war diaries with metadata, 9850 units, and 12 people spreadsheet

3 Senate atlas The National Archives of Finland 414 historical maps of Finland digital images

4 Municipalities The National Archives of Finland 625 wartime municipalities digital text

5 Organization cards The National Archives of Finland 132 military units & 279 people & 642 battles digital images, PDF documents

6 Units of The Finnish Army 1941–1945 The National Archives of Finland 8810 military units digital text, PDF document

7 Wartime photographs The Finnish Defence Forces 164 000 photos with metadata, 1740 people spreadsheet, API access

8 Kansa Taisteli magazine articles The Association for Military History in Finland, Bonnier Publications 3360 articles by war veterans spreadsheet, PDF documents

9 Karelian places The National Land Survey of Finland 32400 places of the annexed Karelia spreadsheet

10 Karelian maps The National Land Survey of Finland 47 wartime maps of Karelia digital images

11 Finnish Place Name Register The National Land Survey of Finland 798 000 contemporary place names XML

12 National Biography The Finnish Literature Society 699 biographies spreadsheet

13 War cemeteries The Central Organization of Finnish Camera Clubs 672 cemeteries & 2450 photographs spreadsheet, digital images

14 Prisoners of war The National Prisoners of War Project 4200 person records spreadsheet

15 Wikipedia Wikimedia Foundation 3010 people, 255 military units API, web pages

16 Knights of the Mannerheim Cross Knights of the Mannerheim Cross Foundation 191 people, 1120 medal awardings API, web pages

17 Military history literature (9 sources) - 1050 war events, 2900 military units, 585 people printed text

18 Finnish Spatio-Temporal Ontology Aalto University 488 polygons of wartime municipalities RDF

19 AMMO Ontology of Finnish Historical Occupations Aalto University 3090 occupational labels RDF

2. Source datasets

#	Source dataset	Providing organization	Used content	Source format
1	Casualties of WW2	The National Archives of Finland	94 700 person records	spreadsheet
2	War diaries	The National Archives of Finland	26 400 war diaries with metadata, 9850 units, and 12 people	spreadsheet
3	Senate atlas	The National Archives of Finland	414 historical maps of Finland	digital images
4	Municipalities	The National Archives of Finland	625 wartime municipalities	digital text
5	Organization cards	The National Archives of Finland	132 military units & 279 people & 642 battles	digital images, PDF documents
6	Units of The Finnish Army 1941–1945	The National Archives of Finland	8810 military units	digital text, PDF document
7	Wartime photographs	The Finnish Defence Forces	164 000 photos with metadata, 1740 people	spreadsheet, API access
8	Kansa Taisteli magazine articles	The Association for Military History in Finland, Bonnier Publications	3360 articles by war veterans	spreadsheet, PDF documents
9	Karelian places	The National Land Survey of Finland	32400 places of the annexed Karelia	spreadsheet
10	Karelian maps	The National Land Survey of Finland	47 wartime maps of Karelia	digital images
11	Finnish Place Name Register	The National Land Survey of Finland	798 000 contemporary place names	XML
12	National Biography	The Finnish Literature Society	699 biographies	spreadsheet
13	War cemeteries	The Central Organization of Finnish Camera Clubs	672 cemeteries & 2450 photographs	spreadsheet, digital images
14	Prisoners of war	The National Prisoners of War Project	4200 person records	spreadsheet
15	Wikipedia	Wikimedia Foundation	3010 people, 255 military units	API, web pages
16	Knights of the Mannerheim Cross	Knights of the Mannerheim Cross Foundation	191 people, 1120 medal awardings	API, web pages
17	Military history literature (9 sources)	-	1050 war events, 2900 military units, 585 people	printed text
18	Finnish Spatio-Temporal Ontology	Aalto University	488 polygons of wartime municipalities	RDF
19	AMMO Ontology of Finnish Historical Occupations	Aalto University	3090 occupational labels	RDF

Table 1 lists the heterogeneous source datasets of WarSampo. The data comes from several Finnish organizations, such as the National Archives of Finland, the Finnish Defence Forces, and the National Land Survey of Finland. Some source datasets have been created as part of the WarSampo project and related research. The source datasets are in different formats, e.g., spreadsheets, text, web pages, images, application programming interfaces (API), Extensible Markup Language (XML) documents, Portable Document Format (PDF) documents, and Resource Description Framework (RDF) graphs.

The core dataset of the system is the casualty database (source number 1 in Table 1) of the National Archives that contains detailed information about virtually every person killed in military action in Finland during the WW2. A key goal of WarSampo is to reassemble the life stories of the soldiers by gathering information about them via data linking. For this purpose, data about the military units (5) and their history (6), including original war diaries (2) are of central importance. Other integrated datasets include, among others, a massive collection of wartime photographs (7), memoirs of soldiers (8), historical maps (10), biographies (12), etc. In addition to people and units, historical (4, 9) and contemporary (11) places, are widely used for data linking. The semantic backbone of WarSampo is the 1050 WW2 events based on military history literature (17).

3. Data model

The source datasets of Table 1 were transformed into RDF and harmonized into a coherent whole using an event-based data model. Here the concepts in the source datasets are described using metadata schemas [12,47], e.g., DCMI Metadata Terms (DCT), and vocabulary models, such as SKOS and RDF Schema (RDFS). This section first motivates the event-based modeling approach used in WarSampo and then presents in more detail the model, the main entity types, and the properties used.11

¹¹
The data model is available on GitHub: https://github.com/SemanticComputing/Warsampo-schema.

Fig. 1.
The CRM based WarSampo data model for representing military history as events.

Representing Wars as Events. Since wars are essentially sequences of events, an obvious choice for representing military history is event-based modeling. There are many approaches to modeling events [38–41,44]. We use CRM with extensions to military history concepts as the conceptual framework. There are many reasons for this: Firstly, as a strongly event-based model, CRM is suitable for harmonizing the history of wars, Secondly, CRM is an ISO standard (21127:2014), which means that “reinventing the wheel” can be minimized in data modeling. Documentation and tooling are readily available for the standard and reuse of the data by others is easier. Thirdly, as CRM describes the real world rather than documents about it, it can be used effectively for harmonizing the heterogeneous source data for a unified representation of the wars and related materials. Using events also makes it possible to describe the changes of status of different entities, such as people and military units. Furthermore, using a common model for all the datasets makes querying the data more uniform.

Table 2
Namespaces of WarSampo classes and their main properties

Prefix Namespace

crm http://www.cidoc-crm.org/cidoc-crm/

dct http://purl.org/dc/terms/

foaf http://xmlns.com/foaf/0.1/

rdfs http://www.w3.org/2000/01/rdf-schema#

skos http://www.w3.org/2004/02/skos/core#

hipla http://ldf.fi/schema/hipla/

: http://ldf.fi/schema/warsa/

Fig. 2.
WarSampo main entity types with instance counts and linkage between the instances. Events have associated time spans that are not depicted in the figure.

The used CRM classes and their subclasses are presented in Fig. 1 and the used namespace prefixes in Table 2. The class structure was designed and extended iteratively, as the amount of source datasets and links between them increased. In Fig. 1, the RDFS subclass relation is represented with a white headed arrow. The relationships between class instances are presented with various properties in the KG, which are divided into two categories based on their certainty: 1) relations that are generated directly from the source dataset information (solid arrows), e.g., a birth event created from a person’s birth date in a death record, and 2) relations that are generated using entity linking methods (dotted arrows), e.g., to link a person mentioned in the caption of a photograph. Entity linking methods use heuristics and produce a small amount of erroneous links, which is discussed in Section 5.

CRM has an internal way of representing the types of entities, with the property crm:P2_has_type. However, the common way of representing specific types in LD is by introducing classes and subclasses for each specific type, and using rdf:type to state that a resource is an instance of a class. This approach is used in WarSampo, as it is more expressive, allowing multiple inheritance. In WarSampo, CRM is extended by creating new subclasses for representing the military history domain. The modeling decision is based on the need to use custom properties for the subclasses, that would not be valid for a whole CRM class. This facilitates interoperability with other systems based on CRM.

Events are represented strictly as subclasses of crm:E5_Event depicted on the right in Fig. 1. Also the other core classes in the data model are from CRM. However, for some information in the source datasets, modelling them using CRM is not feasible, e.g., marital statuses, or nationalities, as the way to model them with CRM is using groups and events, which is not in line with how people intuitively organize this kind of information [35]. In such cases, the information is annotated using simple SKOS vocabularies.

Literal names of the WarSampo resources are represented using properties skos:prefLabel and skos:altLabel, instead of the more verbose CRM label appellations, as there is no metadata available about the appellations in the data sources. Information sources are given with the property dct:source, and textual descriptions with dct:description. The data model can be extended with new CRM subclasses as needed, e.g., when integrating new datasets into the KG.

The Main Entity Types. The main entity types are presented in Fig. 2, with instance and link counts between the class instances. The arrow direction depicts the direction of linking and LOD Cloud refers to the global LOD Cloud. Next, each main entity type is explained, highlighting its most important properties. The main entity types contained in domain ontologies (DO)12 ¹²
DOs define the basic concepts used in populating metadatasets and are shared by them. DOs include, e.g., ontologies for subject matter concepts (keyword thesauri), places, people, and events.
are shown as green rectangles and the RDF metadatasets (MDS)13 ¹³
MDSs describe objects or other things in an application domain in terms of a metadata schema. Collection metadata in libraries, museums, and archives, or their harmonized aggregated versions are typical examples of MDSs.
using the DOs are shown with yellow rounded rectangles.

Person. (sources 1, 5, 7, 12, 14, 15, 16, 17 in Table 1) Person instances have been created [34] from multiple source datasets. The source datasets provide varying levels of detail about people. For most of the people (sources 1 and 14) we have ample biographical metadata, but in some cases the level of detail is not sufficient for disambiguating a person, e.g., only surname and military rank may be known.

The person resources are modeled as instances of :Person, a subclass of crm:E21_Person. Names are expressed using foaf:familyName and foaf:firstName. Person resources are further enriched with events created from the source information, e.g., :Birth, :Battle, :Death, :PersonJoining, :Promotion, or :MedalAwarding.

Military Unit. (sources 2, 5, 6, 15, 17) The military unit resources are modeled as instances of :MilitaryUnit, a subclass of crm:E74_Group. Unit activity is expressed as various related events, e.g., :Formation, :Dissolution, :Battle, and :TroopMovement.

During the WW2, changes were made to the army hierarchy: the unit identification codes and unit names were changed occasionally in order to confuse the enemies, and different units have even used identical names. The army hierarchy, including the temporal changes made in it, is modeled with :UnitJoining events that link a unit into its superior unit [34].

Death Record. (source 1) The death records (DR) contain information about the ca. 94 700 fallen in the Finnish fronts in WW2 [31]. They have served as the primary source of person instances in WarSampo. The data model of person instances is extended based on the DRs, to contain events of wounding and disappearing.

The DRs are modeled as instances of :DeathRecord, which is a subclass of crm:E31_Document. From each DR, there is a crm:P70_documents relation to the corresponding person instance. The DRs are described with custom properties that correspond to the columns of the source spreadsheet, while each DR corresponds to a spreadsheet row. The DR properties convey information about, e.g., the person’s occupation, the number of children, marital status, and burial place, using custom SKOS vocabularies. The property values are linked, when possible, to corresponding shared DOs (e.g., Places).

Prisoner Record. (source 14) Prisoner Records (PR) contain information of the ca. 4200 people captured as prisoners of war by the Soviet Union [33]. They are modeled as documents (class :PrisonerRecord) similarly as the DRs. Some properties are shared between the PRs and DRs, but in most cases the semantics is different and separate properties are used, that share a common superproperty. Typically, the PR properties depict the person’s situation at the time of capture, whereas the DRs depict the situation at the time of death.

The PRs contribute new person instances and extend the person data model with the capturing events. The PRs often contain multiple values for a property, and source annotations for property values, modeled as RDF reifications.

Event. WarSampo events have been classified into 19 subclasses of the class crm:E5_Event, which are shown in Fig. 1. They are used to model 1) war events (source 17), e.g., battles and bombardments, 2) political activities (source 17), and 3) events that describe the history of the actors in the war (all actor-related sources).

Each event is an instance of :Event or one of its subclasses (e.g., :PoliticalActivity, :Battle, :Bombardment). Events are described with textual representations via dct:description, time spans, and places of occurrence, if applicable, linking the events to Places DO. The events are linked to actors by several properties, e.g. crm:P11_had_participant, crm:P14_carried_out_by, and crm:P100_was_death_of. Time spans are instances of crm:E52_Time-Span and are represented using the properties crm:P82a_begin_of_the_begin and crm:P82b_end_of_the_end to describe the beginning and end times respectively.

Place. (sources 3, 4, 9, 10, 11, 18) WarSampo employs four distinct types of geographical data: 1) The National Archives’ list of counties and municipalities in 1939–1945, enriched with polygon boundaries from the Finnish Spatio-Temporal Ontology,14 ¹⁴
http://seco.cs.aalto.fi/ontologies/sapo/
2) Geocoded Karelian map names, 3) War cemeteries, and 4) the current Finnish Place Name Register. In addition, 461 historical map sheets were rectified on modern maps [25].

The geographical data within WarSampo is modeled with a simple schema [19], which contains properties for the place name: coordinates, a polygon, a place type, and part-of relationship of the place. Each place is an instance of a subclass of crm:E53_Place. The Finnish Place Name register is used as an external DO, served on a separate endpoint.15 ¹⁵
http://ldf.fi/pnr/sparql

Photograph. (source 7) WarSampo contains 164 000 wartime photographs with their metadata, taken by Finnish soldiers, as well as 2450 recent photographs of the Finnish war cemeteries. The photographs are represented as instances of the :Photograph class. Photography events (class :Photography) represent the taking (i.e., creation) of photographs, so that photographs that have been taken the same day and have the same description are grouped in the same event. Modeling the photographs using events has the benefit of making it possible to handle them the same way as other event-based entities and placing them on timelines. Property values link photographs to the DOs of people, military units, and places.

War Diary. (source 2) Metadata of hand-written war diaries are given as instances of the :WarDiary class, including dct:hasFormat links to the corresponding digitized online documents provided by the National Archives of Finland. The property crm:P70_documents links to related military units or people.

Article. (source 8) Metadata of the Kansa Taisteli war veteran magazine articles are given as :Article instances. The article metadata is linked to WarSampo DOs of people, military units, and places.

Occupation. (source 19) The AMMO Ontology of Finnish Historical Occupations [29] harmonizes the diverse occupational labels present in the DRs and PRs. AMMO provides the means to study people using social stratification measures via links to the international HISCO [45] classification of occupations, and to another national level classification.
4. Populating the data model

Prefix	Namespace
crm	http://www.cidoc-crm.org/cidoc-crm/
dct	http://purl.org/dc/terms/
foaf	http://xmlns.com/foaf/0.1/
rdfs	http://www.w3.org/2000/01/rdf-schema#
skos	http://www.w3.org/2004/02/skos/core#
hipla	http://ldf.fi/schema/hipla/
:	http://ldf.fi/schema/warsa/

The process of creating the WarSampo KG started with the creation of shared DOs [16], shown on the top of Fig. 3. The MDSs created from the source datasets, were then linked to the DOs. Some of the early DOs, i.e., 5610 people, military units, military ranks, and medals, have involved manual work, and the processes used to create them are not repeatable. This is also true for person record specific lightweight ontologies used by the death records and the prisoner records. These DOs are maintained directly in RDF and a repeatable data transformation pipeline was built on top of those.

To create a harmonized view of the wars, it is vital to reconcile the entities in the source datasets, by using the shared DOs. In most cases, the reconciliation is based on probabilistic NEL [15], in which a small number of erroneous or missing links is not considered a problem. As a general principle, we have tried to link more rather than less, focusing on recall rather than precision. This enables us to provide at least the relevant links for the users of the data to find more information that they might be interested in. If we emphasized precision more, some relevant information might not be found. We trust in the users’ ability to evaluate the links and give feedback if a link is wrong. In some cases, like when disambiguating references to people, we pursued to maximize both recall and precision.

When NEL is used to link literal values to resources, the original values are preserved with a separate property, in order to provide enough information for the user of the data to evaluate whether the generated link might be incorrect.

Transformation Pipeline. A repeatable data transformation pipeline is used for building the majority of the KG from the source datasets. The processes in the pipeline align and transform the source datasets into the WarSampo data model and link entities to the DOs.

If the source datasets are updated, the pipeline can be used to update the KG. By recreating the KG, the pipeline also handles change propagation caused by changes in the MDSs or DOs [32,43], which may cause other parts of the KG to need to adapt to the changes or the linking between resources may become invalid. Several of the data transformation processes employ Docker to increase reproducibility [7].

Figure 3 shows the data transformation pipeline, and links created by the entity linking to the DOs. The boxes represent structured data and the cylinders RDF data, with the yellow color depicting DOs and the green color depicting MDSs. The boxes from which the processes start show the corresponding source numbers from Table 1.

Fig. 3.

The 5-step WarSampo data transformation process. Dashed arrows represent entity linking, while solid arrows convey data flow.

Because of the interlinking between datasets, different change propagation scenarios emerge when updating the source datasets and DOs. The general strategy for handling the change propagation scenarios is to 1) transform DOs, 2) transform the datasets which both link to the Person DO and create new person instances, and 3) transform datasets that link to the DOs, but do not alter them. The steps shown in Fig. 3 are:

The place transformation processes convert three source CSV16 ¹⁶

Comma-separated values format.

files and one source XML file into RDF, along with the cemetery photograph metadata.

The Casualties transformation process transforms the CSV file into RDF death records, and links them to the DOs of military ranks, military units, occupations, places, and people [31]. The death records are matched to already existing person instances using probabilistic record linkage [14], with a logistic regression based machine learning implementation. New person instances are created in the Persons DO for the death records that don’t match any existing person.

The Prisoners of War dataset transformation process [33] is similar to the previous step, and links to the same DOs.

The war and political events originate from OCR’d texts, which are then structured into CSV files. This step takes the CSV files as input, transforms them into RDF, and links entities to the DOs [18].

Photograph metadata is transformed from a CSV file into RDF, enriched by images using the data provider’s API, and linked to the DOs of military units, people, and places.

The resulting WarSampo KG consists of 14 300 000 triples, separated into multiple DOs and MDSs. The URIs minted in the data transformation pipeline are stable over consecutive runs. For example, the person registers contain a column containing a local identifier for each person record, used to mint the WarSampo URIs for the entities.

Data Publication. The KG is available on the Linked Data Finland (LDF) platform [22], providing a home page for the KG,17 ¹⁷

The home page of the KG: http://www.ldf.fi/dataset/warsa.

and a public SPARQL endpoint.18 ¹⁸

The public SPARQL endpoint: http://ldf.fi/warsa/sparql.

To support reuse, the home page provides additional information about the KG, such as, 1) schema documentation automatically generated by the platform, 2) example SPARQL queries, and 3) metadata as a SPARQL Service Description,19 ¹⁹

https://www.w3.org/TR/sparql11-service-description/

containing Vocabulary of Interlinked Datasets (VoID)20 ²⁰

https://www.w3.org/TR/void/

metadata.

The WarSampo SPARQL endpoint is hosted on an Apache Jena Fuseki21 ²¹

https://jena.apache.org/documentation/fuseki2/

SPARQL server. The whole KG and Fuseki are contained in a Docker image, that can be easily built and started when and where needed. The DOs and the transformation pipeline results are separated into individual data repositories, which are included in the image as Git submodules.

The platform provides dereferencing of URIs for both human users and machines, and a generic RDF browser for technical data users, where a user is redirected if a WarSampo URI is visited directly with a web browser. The WarSampo URIs are of the form http://ldf.fi/warsa/DATASET/ID where DATASET is the name of the MDS or DO. The ID is an identifier consisting of a prefix and a running number. For example, the URI of an entity in the photographs dataset is http://ldf.fi/warsa/photographs/sakuva_57717.

Given a URI, e.g., of the commander-in-chief Mannerheim (http://ldf.fi/warsa/actors/person_1), end users can use a set of URL templates22 ²²

http://www.ldf.fi/uri-data-services.html

to access 1) the underlying RDF data in Turtle format,23 ²³

http://ldf.fi/warsa/actors/person_1.ttl

2) to start browsing the data,24 ²⁴

http://ldf.fi/warsa/actors/person_1

or 3) to view the “home page” of the resource entity.25 ²⁵

https://www.sotasampo.fi/en/persons/person_1

The KG is also available in Zenodo, with an associated canonical citation [30]. The KG is licensed by the open Creative Commons Attribution 4.0 license.

5. Quality of data

The WarSampo KG is based on the heterogeneous source datasets that are considered being of high quality, since most datasets originate from established national authorities. The data has not been corrected or amended in any way, but only converted into RDF and linked as they are.

The KG adheres to the 5th star level of the 5-star LD publishing principles [1]. In addition, the LDF platform provides an explicit schema and an online documentation26

²⁶
http://ldf.fi/schema/warsa/
to extend the LD publication quality to the sixth star, as suggested in the proposed 7-star model [22]. The data has been validated syntactically by the transformation pipeline and the SPARQL Server. Some schema-based validations regarding selected datasets are underway as the first steps towards obtaining the 7th star; this would require proof that the data conforms to the published schemas. Also some semantic, knowledge-based validation tests were made using SPARQL queries. These tests found out some semantic errors present in the source datasets. For example, there are a few people recorded as being wounded after their death.

Quality of Vocabulary Use. The quality of vocabulary use is on the 4th star level of the five stars of vocabulary use [28]. The WarSampo metadata schema is dereferencable by humans (1 star), and machines (2 stars), it is linked to other vocabularies, e.g., CRM, DCT, and RDFS (3 stars), and it is annotated using DCT, SKOS, and OWL vocabularies (4 stars).

Quality of Entity Linking. The WarSampo entity linking consists of NEL, person record linkage, and a few manually created links.

The NEL of event descriptions to the DOs of people, military units, and places, is accomplished with $F_{1}$ scores of 0.88, 1.00, and 0.88, respectively [16]. The NEL of photograph metadata to the DOs of people, military units, and places, is accomplished with $F_{1}$ scores of 0.80, 1.00, and 0.77, respectively [16]. The NEL of magazine article metadata to the DOs of military units, and places, is accomplished with $F_{1}$ scores of 0.79 and 0.62, respectively [16].

The person record linkage of death records results in 620 death records linked to some of the 5600 pre-existing person instances, while for the remaining 94 100 death records, new person instances are created.

The person record linkage of prisoner records results in 1255 PRs linked to some of the 99 700 pre-existing person instances, while creating 2945 new person instances in the Persons DO.

The precision of the person record linkage of both the death records and prisoner records was manually evaluated to be 1.00, based on randomly selecting 150 links from the total of 620 links for death records, and 200 links from the total of 1260 links for the prisoner records. The information on the person records and the person instances was compared, and all of the records were interpreted to be depicting the same actual people with high confidence.

External Connectivity. Linkage from WarSampo to external resources has been provided to facilitate reuse. WarSampo is connected to the national Finnish ontology infrastructure, by a total of 6110 links, of which 5530 is to KOKO,27 ²⁷
KOKO is a collection of Finnish core ontologies, which are merged together: http://finto.fi/koko/en/.
a collection of national core ontologies, and the remaining 582 to YSA.28 ²⁸
YSA is a general thesaurus in Finnish, covering all fields of research and knowledge, containing common terms and geographical names for content description: https://finto.fi/ysa/en/.
The KOKO linkage contains 3380 keyword annotations of magazine articles and 2140 skos:relatedMatch links from AMMO occupation concepts. The YSA links are additional place annotations of the war events that are in geographical scope more global than the WarSampo place ontologies.

There are 3360 external links to the digitized Kansa Taisteli magazine service29 ²⁹
http://kansataisteli.sshs.fi/
hosted by the Association for Military History in Finland. There are also 26 400 of external links to the digitized war diaries30 ³⁰
http://digi.narc.fi/digi/dosearch.ka?atun=65.SARK
hosted at the National Archives of Finland.

Linkage to other datasets of the global LOD Cloud31 ³¹
https://lod-cloud.net/dataset/warsampo
consist of 311 links to DBpedia, 159 links to Wikidata, 147 links to Muninn World War I, and 2 links to CrossRef DOI Resolver. The military personnel and army units are linked to DBpedia and Wikidata, and the military ranks to Muninn World War I. Additionally, there are 2190 links to the Finnish version of DBpedia.
6. Stability of data

The KG is mature enough to be relatively static, with only minor error corrections predicted to happen in the near future. New DOs can be added to the ontology infrastructure, without affecting the existing DOs, as the DOs are separated into distinct components, which are handled separately in the transformation pipeline.

The URIs of the Casualties MDS have been revised after initial release, stemming from the MDS originating from a time before the WarSampo infrastructure, and it had URIs which were not in the WarSampo namespace. In 2018, the MDS was revised to be fully integrated into WarSampo: the namespace was changed, the schema was revised, and the used source dataset was updated. The Casualties transformation process (step 2 in Fig. 3) was revised to be fully repeatable and to use person record linkage that is able to adapt to changes in the pre-existing Persons DO. Currently, the used WarSampo URIs can be considered stable.

The KG is versioned using semantic versioning 2.0.0,32

³²
https://semver.org/spec/v2.0.0.html
and the KG version discussed in this article is the current 2.1.0 version. The full retrospective version history is given in Table 3.

Table 3
WarSampo KG major and minor version history

Version Published Description

1.0.0 Nov 2015 Initial public release

1.1.0 Nov 2017 War cemeteries addition

2.0.0 May 2018 URI namespace of Casualties MDS changed

2.1.0 Nov 2019 Prisoners of war addition

The Linked Data Finland platform, on which the KG is hosted, is actively maintained by the authors of this article and has been operational since 2014.
7. Usefulness

Version	Published	Description
1.0.0	Nov 2015	Initial public release
1.1.0	Nov 2017	War cemeteries addition
2.0.0	May 2018	URI namespace of Casualties MDS changed
2.1.0	Nov 2019	Prisoners of war addition

Semantic Portal. The WarSampo Portal provides end users with nine different WWW based perspectives to the underlying KG. Each perspective is a separate JavaScript application, designed to convey information related to a source dataset or a certain class, in an intuitive and user-friendly way [18]. The main entities, such as people, units, and places, have their “home pages” whose URLs are of the form https://www.sotasampo.fi/en/page?uri=URI, where URI is the identifier of the corresponding individual. This makes it easy for the application perspectives or any external application to make reference to WarSampo contents, which facilitates cross-application linking.

The WarSampo KG has been accessed and used by 690 000 end users through the WarSampo Portal, corresponding to more than $10 %$ of the population of Finland. We have received written feedback from over 400 end users, mostly through the portal’s feedback form. The majority of the feedback contain corrections to the personal information of a respondent’s relative. The corrections are stored and supplied to the data providers for further consideration. There is an active open Facebook group33

³³
https://www.facebook.com/groups/sotasampo/
for community discussions.

Based on the experiences of the National Archives of Finland, the main data provider for WarSampo, users of military history data portals can be divided into three groups: academic researchers, military history enthusiasts, and private citizens. The first group has the widest range of needs regarding the data, but they often have the best skills to handle and refine the data by themselves. The focus of academic research seems to be shifting from a macro level towards individual and social aspects of war [2,6]. In the future, end-user studies could be conducted to get a more complete understanding of the users, their motivations, and needs.

Third-party Use. The core part of KG, the Casualties MDS, has been used as a basis for another popular Finnish WW2 portal, Sotapolku,34 ³⁴
http://sotapolku.fi
a system aiming at crowdsourcing detailed wartime histories of the Finnish soldiers.

Wikidata has linked some Finnish person instances to WarSampo with a distinct WarSampo property, e.g., the commander-in-chief C. G. E. Mannerheim.35 ³⁵
https://www.wikidata.org/wiki/Q152306

Parts of the KG, especially the Places DO and historical maps have been reused in the Finnish historical place and map service Hipla36 ³⁶
http://hipla.fi
as geo-gazetteers [24] and in the popular NameSampo service37 ³⁷
http://nimisampo.fi
for toponomastic research [27].

Finally, the KG was used for enriching data in the external semantic web applications Norssi High School Alumni [20], and BiographySampo [21].

Known Shortcomings and Future Work. Event-based modeling is an effective approach to representing wars, enabling the harmonization of heterogeneous data, that can be used in spatio-temporal analytics and user interfaces without the need to adjust the queries for each source dataset separately. The downside of using an event-based model for all the datasets is its complexity and verbosity: photographs are, for example, modeled as an image and an event creating it, which can lead to complex and slow queries.

Another problem is data maintenance: data modeled with CRM is considerably difficult to edit directly, due to verbosity and high level of interlinking between resources. Our solution is to support maintenance of the source datasets, which can be repeatedly integrated into the KG using the data transformation pipeline.

The data transformation practices have evolved during the project, and only later datasets are integrated into the KG with repeatable processes. Also modeling conventions have improved, and there are slight variations in how information from different source datasets is modeled.

The transformation pipeline handles most change propagation scenarios, but in some rare cases the initial DOs need manual updates. For example, if the Places DO changes, the initial state of the Persons DO may need to adapt to the changes, as there are references to e.g., municipalities of birth.

In entity linking, disambiguating some entity types without much context information has been found difficult. For example, place names are usually unambiguous on the municipality level, but automatically disambiguating the names of villages, farms, and lakes can not be done reliably due to high amount of synonymy. Furthermore, place names are often used also as surnames, which is a source of confusion in NEL from free text.

The amount of open, structured, and digitized source datasets about the war is limited. In WarSampo, the focus is on the fallen soldiers, and not much is known about the soldiers who survived the war, except for the high ranking officers who can be considered public figures. In the future, plenty of new material will become available through digitization, raising privacy concerns regarding the people who might still be alive.
8. Conclusion

The WarSampo project has transformed a number of previously isolated source datasets into a harmonized KG. Besides the large number of links between entities, also whole new entities have been extracted from textual content, e.g., people from photograph descriptions, and military units from war diaries.

The WarSampo KG enables queries that were not possible before: for example fetching all WW2 data related to a specific place, or constructing the history of a single soldier based on corresponding military unit information. By publishing shared domain ontologies and data about WW2 for everybody to use in annotations, future interoperability problems can be prevented before they arise.

Footnotes

Acknowledgements

Our work has been funded by the Finnish Ministry of Education and Culture, the National Archives of Finland, the Association for Cherishing the Memory of the Dead of the War, the Memory Foundation for the Fallen, the Finnish Cultural Foundation, the Academy of Finland, and Teri-Säätiö.

Erkki Heino and Eetu Mäkelä have previously contributed to the data model and data transformation processes.

The authors wish to acknowledge CSC – IT Center for Science, Finland, for computational resources.

References

Berners-Lee, Linked data – design issues, 2006, http://w3.org/DesignIssues/LinkedData.html.

T.D.

Biddle and

R.M.

Citino, The role of military history in the contemporary academy, Foreign Policy Research Institute Footnotes (2015), 1–6, https://www.fpri.org/docs/society_for_mil_hist_whit_paper.pdf.

Bizer,

Heath and

Berners-Lee, Linked data – the story so far, International Journal on Semantic Web and Information Systems (IJSWIS)5(3) (2009), 1–22. doi:10.4018/jswis.2009081901.

Boukhelifa,

Bryant,

Bulatović,

Čukić,

J.-D.

Fekete,

Knežević,

Lehmann,

Stuart and

Thiel, The CENDARI infrastructure, Journal on Computing and Cultural Heritage11(2) (2018), 8. doi:10.1145/3092906.

Burdick,

Drucker,

Lunenfeld,

Presner and

Schnapp, Digital Humanities, The MIT Press, 2012.

R.M.

Citino, Military histories old and new: A reintroduction, The American Historical Review112(4) (2007), 1070–1090. doi:10.1086/ahr.112.4.1070.

Cito,

Ferme and

H.C.

Gall, Using docker containers to improve reproducibility in software and web engineering research, in: Web Engineering,

Bozzon,

Cudre-Maroux and

Pautasso, eds, Lecture Notes in Computer Science, Vol. 9671, Springer, Cham, 2016, pp. 609–612. doi:10.1007/978-3-319-38791-8_58.

Collins,

Mulholland and

Zdrahal, Semantic browsing of digital collections, in: The Semantic Web – ISWC 2005,

Gil,

Motta,

Richard Benjamins and

M.A.

Musen, eds, Lecture Notes in Computer Science, Vol. 3729, Springer, Berlin, Heidelberg, 2005, pp. 127–141. doi:10.1007/11574620_12.

de Boer,

Meroño-Peñuela and

C.J.

Ockeloen, Linked data for digital history: Lessons learned from three case studies, Anejos de la Revista de Historiografía4 (2016), 139–162.

10.

de Boer,

van Doornik,

Buitinck,

Marx and

Veken, Linking the kingdom: Enriched access to a historiographical text, in: Proceedings of the Seventh International Conference on Knowledge Capture (K-CAP 2013), Association for Computing Machinery, New York, NY, USA, 2013, pp. 17–24. doi:10.1145/2479832.2479849.

11.

Doerr, The CIDOC conceptual reference module: An ontological approach to semantic interoperability of metadata, AI Magazine24(3) (2003), 75–92. doi:10.1609/aimag.v24i3.1720.

12.

Gartner, Metadata. Shaping Knowledge from Antiquity to the Semantic Web, Springer, Cham, 2016. doi:10.1007/978-3-319-40893-4.

13.

Graham,

Milligan and

Weingart, Exploring Big Historical Data. The Historian’s Macroscope, Imperial College Press, London, UK, 2015. doi:10.1142/p981.

14.

Gu,

Baxter,

Vickers and

Rainsford, Record linkage: Current practice and future directions, Technical report, CSIRO Mathematical and Information Sciences, 2003.

15.

Hachey,

Radford,

Nothman,

Honnibal and

J.R.

Curran, Evaluating entity linking with Wikipedia, Artificial Intelligence194 (2013), 130–150. doi:10.1016/j.artint.2012.04.005.

16.

Heino,

Tamper,

Mäkelä,

Leskinen,

Ikkala,

Tuominen,

Koho and

Hyvönen, Named entity linking in a complex domain: Case Second World War history, in: Language, Data, and Knowledge: First International Conference, LDK 2017,

Gracia,

Bond,

J.P.

McCrae,

Buitelaar,

Chiarcos and

Hellmann, eds, Lecture Notes in Computer Science, Vol. 10318, Springer, Cham, 2017. doi:10.1007/978-3-319-59888-8_10.

17.

Hoekstra,

Meroño-Peñuela,

Dentler,

Rijpma,

Zijdeman and

Zandhuis, An ecosystem for linked humanities data, in: The Semantic Web, Lecture Notes in Computer Science, Vol. 9989, Springer, Cham, 2016, pp. 425–440. doi:10.1007/978-3-319-47602-5_54.

18.

Hyvönen,

Heino,

Leskinen,

Ikkala,

Koho,

Tamper,

Tuominen and

Mäkelä, WarSampo data service and semantic portal for publishing Linked Open Data about the Second World War history, in: The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016,

Sack,

Blomqvist,

d’Aquin,

Ghidini,

S.P.

Ponzetto and

Lange, eds, Lecture Notes in Computer Science, Vol. 9678, Springer, Cham, 2016, pp. 758–773. doi:10.1007/978-3-319-34129-3_46.

19.

Hyvönen,

Ikkala and

Tuominen, Linked data brokering service for historical places and maps, in: Proceedings of the 1st Workshop on Humanities in the Semantic Web,

Adamou,

Daga and

Isaksen, eds, CEUR Workshop Proceedings, Vol. 1608, 2016, pp. 39–52.

20.

Hyvönen,

Leskinen,

Heino,

Tuominen and

Sirola, Reassembling and enriching the life stories in printed biographical registers: Norssi high school alumni on the semantic web, in: Language, Data, and Knowledge: First International Conference, LDK 2017,

Gracia,

Bond,

J.P.

McCrae,

Buitelaar,

Chiarcos and

Hellmann, eds, Lecture Notes in Computer Science, Vol. 10318, Springer, Cham, 2017, pp. 113–119. doi:10.1007/978-3-319-59888-8_9.

21.

Hyvönen,

Leskinen,

Tamper,

Rantala,

Ikkala,

Tuominen and

Keravuori, BiographySampo – publishing and enriching biographies on the semantic web for digital humanities research, in: The Semantic Web 16th International Conference, ESWC 2019,

Hitzler,

Fernández,

Janowicz,

Zaveri,

A.J.G.

Gray,

Lopez,

Haller and

Hammar, eds, Lecture Notes in Computer Science, Vol. 11503, Springer, Cham, 2019. doi:10.1007/978-3-030-21348-0_37.

22.

Hyvönen,

Tuominen,

Alonen and

Mäkelä, Linked data Finland: A 7-star model and platform for publishing and re-using linked datasets, in: The Semantic Web: ESWC 2014 Satellite Events,

Presutti,

Blomqvist,

Troncy,

Sack,

Papadakis and

Tordai, eds, Lecture Notes in Computer Science, Vol. 8798, Springer, Cham, 2014, pp. 226–230. doi:10.1007/978-3-319-11955-7_24.

23.

Hyvönen,

Tuominen,

Mäkelä,

Dutruit,

Apajalahti,

Heino,

Leskinen and

Ikkala, Second World War on the semantic web: The WarSampo project and semantic portal, in: Proceedings of the ISWC 2015 Posters & Demonstrations Track,

Villata,

J.Z.

Pan and

Dragoni, eds, CEUR Workshop Proceedings, Vol. 1486, 2015.

24.

Ikkala,

Hyvönen and

Tuominen, An ontology of World War II places for linking and enriching heterogeneous historical data sources, in: 17th International Conference of Historical Geographers (ICHG 2018), Book of Abstracts, Vol. 194, 2018.

25.

Ikkala,

Hyvönen and

Tuominen, Geocoding, publishing, and using historical places and old maps in linked data applications, in: Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference,

Mäkelä,

Tolonen and

Tuominen, eds, CEUR Workshop Proceedings, Vol. 2084, 2018, pp. 228–234.

26.

Ikkala,

Koho,

Heino,

Leskinen,

Hyvönen and

Ahoranta, Prosopographical views to Finnish WW2 casualties through cemeteries and Linked Open Data, in: Proceedings of the Second Workshop on Humanities in the Semantic Web (WHiSe II),

Adamou,

Daga and

Isaksen, eds, CEUR Workshop Proceedings, Vol. 2014, 2017, pp. 45–56.

27.

Ikkala,

Tuominen,

Raunamaa,

Aalto,

Ainiala,

Uusitalo and

Hyvönen, NameSampo: A Linked Open Data infrastructure and workbench for toponomastic research, in: GeoHumanities’18: Proceedings of the 2nd ACM SIGSPATIAL Workshop on Geospatial Humanities,

Murrieta and

Martins, eds, Association for Computing Machinery, New York, NY, USA, 2018, pp. 1–9. doi:10.1145/3282933.3282936.

28.

Janowicz,

Hitzler,

Adams,

Kolas,

I.I.

Vardemanet al., Five stars of linked data vocabulary use, Semantic Web – Interoperability, Usability, Applicability5(3) (2014), 173–176. doi:10.3233/SW-140135.

29.

Koho,

Gasbarra,

Tuominen,

Rantala,

Jokipii and

Hyvönen, AMMO ontology of Finnish historical occupations, in: Proceedings of the First International Workshop on Open Data and Ontologies for Cultural Heritage,

Poggi, ed., CEUR Workshop Proceedings, Vol. 2375, 2019, pp. 91–96.

30.

Koho,

Heino,

Leskinen,

Ikkala,

Tamper,

Apajalahti,

Tuominen,

Mäkelä and

Hyvönen, WarSampo knowledge graph [Data set], Zenodo, 2019. doi:10.5281/zenodo.3431121.

31.

Koho,

Hyvönen,

Heino,

Tuominen,

Leskinen and

Mäkelä, Linked death – representing, publishing, and using Second World War death records as Linked Open Data, in: The Semantic Web: ESWC 2017 Satellite Events,

Blomqvist,

Hose,

Paulheim,

Ławrynowicz,

Ciravegna and

Hartig, eds, Lecture Notes in Computer Science, Vol. 10577, Springer, Cham, 2017, pp. 369–383. doi:10.1007/978-3-319-70407-4_45.

32.

Koho,

Ikkala,

Heino and

Hyvönen, Maintaining a linked data cloud and data service for Second World War history, in: Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection. 7th International Conference, EuroMed 2018, Lecture Notes in Computer Science, Vol. 11196, Springer, Cham, 2018. doi:10.1007/978-3-030-01762-0_12.

33.

Koho,

Ikkala and

Hyvönen, Reassembling the lives of Finnish prisoners of the Second World War on the semantic web, in: Proceedings of the Third Conference on Biographical Data in the Digital Age (BD 2019), CEUR Workshop Proceedings, 2020, in press.

34.

Leskinen,

Koho,

Heino,

Tamper,

Ikkala,

Tuominen,

Mäkelä and

Hyvönen, Modeling and using an actor ontology of Second World War military units and personnel, in: The Semantic Web – ISWC 2017: 16th International Semantic Web Conference,

d’Amato,

Fernandez,

Tamma,

Lecue,

Cudré-Mauroux,

Sequeda,

Lange and

Heflin, eds, Lecture Notes in Computer Science, Vol. 10588, Springer, Cham, 2017, pp. 280–296. doi:10.1007/978-3-319-68204-4_27.

35.

Mäkelä,

Törnroos,

Lindquist and

Hyvönen, WW1LOD – an application of CIDOC-CRM to World War 1 linked data, International Journal on Digital Libraries (2016). doi:10.1007/s00799-016-0186-2.

36.

Meroño-Peñuela,

Ashkpour,

Van Erp,

Mandemakers,

Breure,

Scharnhorst,

Schlobach and

van Harmelen, Semantic technologies for historical research: A survey, Semantic Web – Interoperability, Usability, Applicability6(6) (2015), 539–564. doi:10.3233/SW-140158.

37.

Nagypál,

Deswarte and

Oosthoek, Applying the semantic web: The VICODI experience in creating visual contextualization for history, Literary and Linguistic Computing20(3) (2005), 327–349. doi:10.1093/llc/fqi037.

38.

Raimond,

S.A.

Abdallah,

M.B.

Sandler and

Giasson, The music ontology, in: ISMIR 2007: Proceedings of the 8th International Conference on Music Information Retrieval,

Dixon,

Bainbridge and

Typke, eds, Austrian Computer Society, Wien, 2007.

39.

Rovera, A knowledge-based framework for events representation and reuse from historical archives, in: The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016,

Sack,

Blomqvist,

d’Aquin,

Ghidini,

S.P.

Ponzetto and

Lange, eds, Lecture Notes in Computer Science, Vol. 9678, Springer, Cham, 2016, pp. 845–852. doi:10.1007/978-3-319-34129-3_53.

40.

Scherp,

Franz,

Saathoff and

Staab, F – a model of events based on the foundational ontology Dolce+DnS ultralight, in: K-CAP’09: Proceedings of the Fifth International Conference on Knowledge Capture, Association for Computing Machinery, New York, NY, USA, 2009, pp. 137–144. doi:10.1145/1597735.1597760.

41.

Shaw,

Troncy and

Hardman, LODE: Linking open descriptions of events, in: The Semantic Web. Fourth Asian Conference, ASWC 2009,

Gómez-Pérez,

Yu and

Ding, eds, Lecture Notes in Computer Science, Vol. 5926, Springer, Berlin, Heidelberg, 2009, pp. 153–167. doi:10.1007/978-3-642-10871-6_11.

42.

Sprugnoli,

Moretti and

Tonelli, LOD navigator: Tracing movements of Italian shoah victims, Umanistica Digitale3(4) (2019). doi:10.6092/issn.2532-8816/9050.

43.

Stojanovic,

Maedche,

Motik and

Stojanovic, User-driven ontology evolution management, in: Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web 13th International Conference, EKAW 2002,

Gómez-Pérez and

V.R.

Benjamins, eds, Lecture Notes in Computer Science, Vol. 2473, Springer, Berlin, Heidelberg, 2002, pp. 285–300. doi:10.1007/3-540-45810-7_27.

44.

W.R.

van Hage,

Malaisé,

Segers,

Hollink and

Schreiber, Design and use of the simple event model (SEM), Journal of Web Semantics9(2) (2011), 128–136. doi:10.1016/j.websem.2011.03.003.

45.

M.H.D.

Van Leeuwen,

Maas and

Miles, HISCO: Historical International Standard Classification of Occupations, Leuven University Press, 2002.

46.

van Nispen and

Jongma, Holocaust and World War Two Linked Open Data developments in the Netherlands, Umanistica Digitale3(4) (2019). doi:10.6092/issn.2532-8816/9048.

47.

M.L.

Zeng and

Qin, Metadata, 2nd edn, Facet Publishing, London, UK, 2016.

WarSampo knowledge graph: Finland in the Second World War as Linked Open Data

Abstract

Keywords

1. Introduction: Military history as linked data

3. Data model

Footnotes

Acknowledgements

References