Abstract
Creative influence is responsible for a considerable part of the creative process of an artist and can largely be associated with their social circle. It has been observed that the type and amount of relationships with other fellow artists correlates with the success of an artist. Most of the recent literature has focused on using artefact similarity as a proxy for creative influence between two artists. However, this approach neglects the significance of an artist’s social network. In this work, we rely on an ontology that comprehensively model the relationship between individuals as a Knowledge Graph and we design an explainable method based on graph theory to predict the influences of an artist given their social network. We evaluate our method on a dataset of relationships between Jazz musicians and achieve accurate results when compared to baselines that rely on the distribution of the data. Our results are aligned with relevant works from the socio-cognitive and psychology fields. We show that our method generalises to resources where information on influence is not directly available and can be used to enrich existing Knowledge Graphs. The code and the ontology developed is shared at https://github.com/n28div/influence_prediction under CC-BY license.
Introduction
Identifying, capturing and hypothesising the influences of an artist is a fundamental aspect that is considered when analysing an artefact or, in general, an artist [11]. A unique and precise definition of creative influence is yet to be defined. The main difficulty lies in the subjective nature of the problem. Identifying the influence of an artist on another artist requires a profound knowledge of both entities, their geographical location, the socio-cultural context in which they lived, the technicalities of their artefacts, the artistic field and so on. Indeed, an unambiguous definition of creative influence has been identified as a problematic aspect to explore [10].
Nonetheless, understanding who are the artists who influenced other artists is an important aspect that plays a pivotal role in the creative process. One of the major aspects that correlates with artistic influence is the amount of success of an artist. Mitali and Ingram [19] analyses the work of 90 pioneers in the abstract art movement. The results provide very strong evidence that the success and fame of an artist are mostly related to their social relationships. While it is true that creativity fosters those relationships, the more artists get deeper into a clique formed by other meaningful artists, the more their work is acclaimed by critics. For instance, the success of the band
To understand the influence on an artist, it is hence important to take into account the social relationships of the artist well. Most recent approaches, however, neglect this aspect and only rely on perceptual features of an artefact - i.e. relying on similarity as a proxy - for creative influence Abe et al. [1], Elgammal and Saleh [7], O’Toole and Horvát [21], Park et al. [22], Saleh et al. [28]. Even though perceptual features yield promising results, they neglect the incidence of the social circle. This hinders the possibility of hypothesising influence between artists whose stylistic genres are completely different. For instance, the personal relationship between
In this work, we propose a method to predict the influence of musical artists by taking into account their social networks. We design an ontology, aligned to the Polifonia Ontology Network [6], that models social relationships between artists as complex situations involving different agents and concepts. We refactor the data from the Linked Jazz project [23]
1
, a Knowledge Graph encoding curated relationship between Jazz musicians (among which creative influence), to comply with the ontology and rely on it as a ground truth to perform influence prediction. We frame the influence prediction task as a classification problem where one has to identify and rank artists according to their likelihood to be influential for a given artist. Our approach is based on techniques from graph theory, namely the
Our contributions can be summarised as follows: an ontology to model the relationships between human agents, with a particular focus on artists; an explainable method to predict creative influence between artists.
The paper is structured as follows: in Section 2 we provide a review of related works addressing the prediction and identification of influence between artists. In Section 3 we present and analyse the data used in the experiments and we describe the developed ontology. In Section 4 we describe the method used to predict creative influence between artists. In Section 5 we describe the experimental setup while in Section 6 we present the obtained results. Section 7 summarises the results of previous sections and highlights potential extensions and future work.
Related works
Evaluating the influences of an artist, and in particular of a composer or a musician, is mostly considered a subjective task. Usually, experts analyse the compositions of an artist in a critical way to relate them to other important artists. One approach to detecting creative influence is to directly analyse explicit influence connections, curated by human annotators. Smith and Georges [30], for example, analyses the influences identified in the Classical Music Navigator (CMS) to better understand the influence of the composers in the dataset. A similar approach is taken by Georges and Seckin [9], where the data on creative influence is used to investigate the similarity of musical compositions.
Relying on similarity as a proxy for creative influence is a popular approach that has been explored using different techniques. Abe et al. [1] define a framework where influence can be modelled using a graph structure. Edges are added to the graph by taking into account the similarity between the two artworks. Several works have explored this approach in the visual art domain with promising results [7, 28] and in the musical domain. O’Toole and Horvát [21] models the influence of musical composition as the probability of success of a composition given its similarity to other popular compositions. In Park et al. [22], the influence of a composer on another composer is measured as the degree of similarity between their compositions. A composer is classified as influential when musical features of its compositions are re-used by other composers.
Relying on artefact similarity, however, can be a problematic approach in art. Influence can affect an artist in a negative way, in the sense that the influenced artist deliberately abstains from his influence [10]. These kinds of artists are sometimes defined as deviant artists [31, 34]. Mauskapf et al. [17] investigates similarity with respect to socio-cultural indicators, such as geographical and temporal location or organisational system in which the artist lives. Findings suggest that highly embedded individuals, i.e. individuals with a dense social network, are more likely to produce novel artefacts that can be influential to other artists. Borowiecki [5] analyses influence of the
Differently from the described approaches, we investigate the importance of social relationships without taking into account any information on the creative artefact. Our approach can be easily integrated with other methods that use similarity, to yield a more general method for uncovering hidden relations when perceptual similarity is the only measure taken into account.
Data
This section provides a detailed description and analysis of the data used in our method. In Section 3.1 we propose our ontology to model complex personal relationships. In Section 3.2 we describe how the Linked Jazz [23] Knowledge Graph is refactored to comply with the developed ontology.
Relationship ontology
Our proposed ontology is built upon the concept of
The ontology takes into account only pairwise relationships. This is intended as the two agents (
Relationships modelled in the ontology
Relationships modelled in the ontology
Figure 1 visually represents the ontology. The class

Ontology in Graffoo syntax. A pairwise relationship involving two agents is reified as a
In Fig. 2 an example of how the ontology can be used to define the friendship relationship of Table 1 is described. In order to define a friendship relationship between the musicians

Example of the friendship relationship using the ontology. The
The implemented reification allows us to represent the relationship as a whole rather than flattening it into a binary relation, resulting in a richer characterisation of the relationship and a high degree of control in further refining it. For example, in Fig. 2 the dashed properties represent refinement operations over the initial definition. It is possible to classify the relationship as both a friendship and mentorship relationship while adding documents that act as references to back up the assertion.
We rely on the data from the Linked Jazz project [23]. Linked Jazz is a Knowledge Graph containing information about famous jazz musicians and their social connections to other musicians. Data is semi-automatically annotated from the transcription of artists’ interviews using crowd-sourced annotations. While some relationship types are objective (e.g.
We align the Linked Jazz KG to our ontology (described in Section 3.1) through the use of a SPARQL construct query.
The refactored KG contains 1 431 artists with 63 relations on average between each artist. Figure 3 shows the distribution of the relationship between the entities in the KG. The most frequent relationship is the acquaintance one. This is not surprising, as the extraction of Linked Jazz has been performed from interviews. Whenever an artist mentions another artist, if the context and the information available are insufficient to hypothesise a more specific relationship, the annotators have been instructed to rely on the most generic relationship [23]. Given the distribution of Fig. 3 and the total of artists (1 431), it follows that many artists have more than one relationship type with the same artist. This happens, for instance, with

Relation distribution in the Linked Jazz KG.
Figure 4 shows the co-occurrence of different relationship types between the same two artists. As could be easily hypothesised from Fig. 3, the

Co-occurence of pairwise relationships in the Linked Jazz KG.
The Linked Jazz Knowledge Graph contains explicit information on the influence between two artists. This allows its use as a dataset for influence prediction since the ground truth is manually extracted. However, the distribution of the dataset is limited to jazz artists. This results in a narrow evaluation of our proposed method, which is not guaranteed to accurately predict influence relations between artists in other musical domains, such as pop, classical or rock music. To check the generality of our proposed method, we rely on the MEETUPS Knowledge Graph [32]. The MEETUPS KG is built by applying a complex knowledge extraction pipeline on Wikipedia pages to extract entities participating in a historical meetup. A historical meetup is identified from an artist’s Wikipedia biography when it mentions at least one or more participants and places. Moreover, the time when the meetup took place and the purpose of the encounter is extracted. The possible encounter purposes are business career; personal life; coincidence; education; public celebration; and music making.
While such purposes do not exactly match the interpersonal relationship we are interested in (Table 1), it is possible to extend MEETUPS and align it to the ontology of Section 3.1. In principle, all the historical meetups in the KG imply that the artists that take part in it are acquaintances. A more fine-grained alignment can be obtained by posing additional assumptions, described in Table 2.
MEETUPS alignment to our proposed ontology
MEETUPS alignment to our proposed ontology
Based on these alignments we refactored the MEEETUPS KG to comply with the ontology presented in Section 3.1. The refactoring was performed by a script automatising the retrieval of all the meetups from the KG. For each meetup, the script retains the participants and the purpose of the meetup. Then, for each pair of participants in a meetup,
Moreover, if
where
This results in a Knowledge Graph containing more than 2 million triples, with a total of 93 394 unique artists and 537 303 asserted relationships. Figure 5 shows the distribution of the relationship between the entities in the KG. Since we fall back to the acquaintanceship relation when the meetup purpose is not alignable to the ontology of Section 3.1, it is also the most frequent relationship in the KG. Compared to Linked Jazz, the distribution of the relationship is more balanced. For instance, the fellowship and friendship relationships are much more prominent than the ones of Fig. 3. Moreover, the bandmate relationship is less prominent. This is a direct result of the information contained in Wikipedia biographies, which is often incomplete when compared to more authoritative resources [12].

Relation distribution in the aligned MEETUPS Knowledge Graph.
Figure 6 shows the co-occurrence of different relationship types between the same two artists is shown. Similarly to Linked Jazz (Fig. 4) the

Co-occurence of pairwise relationships in the aligned MEETUPS KG.
Unlike Linked Jazz, MEETUPS is not limited to jazz music but spans different music genres, since there is not any constraint on the biographies used to extract information. For this very reason, MEETUPS does not assert any influence relationship, as they can not be extracted from historical meetups events. To overcome this issue, we extract influence relationships by relying on the AllMusic 5 website, which states a list of known influences for each artist. The information obtained on AllMusic are humanly curated through crowd-sourcing. Hence they are representative of the most important influences of an artist. We collect 12 580 influence relationships involving 3 723 artists in MEETUPS KG. Note that, compared to the 242 199 artists contained in the MEETUPS KG, influence coverage is still relatively low. Moreover, the provenance and accuracy of the information in AllMusic is explicitly clear. We will use the extracted influences as a benchmark to assess the accuracy of our proposed method as an unsupervised influence detection method.
This section provides a detailed description of our method. In Section 4.1 we describe in detail the algorithm used to compute the influences between entities in the KG while in Section 4.2 we describe the procedure designed to learn the weight of each relationship type.
f -communicability as influence indicator
Once relations are represented using the ontology described in Section 3.1, the resulting Knowledge Graph is a formally defined social network of artists. By only relying on the binary projections of the reified relationships we can extract a directed graph

Example of social graph. Intuitively, the communicability between
Our approach is based on the
Figure 7 shows an example of a social network among artists. Even though close connections should be considered more important, connections that are not too far still conceive a lot of interesting information when the creative process is being considered.
In our setting, the
The
To maintain the semantics of each predicate as formalised in
The
The weight

Examples of equation 6 computed on the example from Fig. 7. The labels
In Equation 2 we defined the adjacency matrix by relying on the weighting function
A more flexible approach is to learn the weights assigned to
While aggregating relations using a sum, as done in Equation 1 is a reasonable approach, it is also reasonable to suppose that the joint presence of two relationships, e.g.
The computational complexity of the method described so far is non-trivial. The time complexity of the method is dependant on the matrix multiplication algorithm at hand, and is proportional to the degree
Experiments
In this section, we describe the experiment that we perform on the method proposed in Section 4. In Section 5.1 we describe the experimental setup and the results obtained using the Linked Jazz KG.
Linked jazz
We experiment with the methods of Section 4 on the Linked Jazz KG, described in Section 3.2. Given the relationships of Fig. 3, we need to learn the specific weight of each relationship type.
In order to learn the weights of the function
We evaluate each model using Mean Reciprocal Rank (MRR), Mean Average Precision (MAP) and Discounted Cumulative Gain (DCG). All the listed metrics measure how high are ranked appropriate values, e.g. how high are ranked actual influential artists with respect to a reference artist. MRR can be interpreted as how far is the first influential artist in the ordered list. MAP is the average of the number of relevant entries within the first
We compare each model with a straightforward baseline inspired by the arguments of Simonton [29]. The author concludes that the relationships that mostly influence the creativity of an artist are those with friends, mentors, pupils and fellows. We hence predict a relation of creative influence whenever one of these relationship is asserted between two nodes in the Knowledge Graph. This baseline serve as a quantitative evaluation of the importance of indirect realations, since the baseline only takes into account direct relationships.
Results
This section described the results of the experiments of Section 5. In particular, in Section 6.1 we describe the results on the Linked Jazz dataset, while on Section 6.2 we show how the method trained on the Linked Jazz dataset can be used in an unsupervised fashion to discover influence relationships on MEETUPS.
Linked Jazz result
Figure 9 shows the importance of the degree

Aggregate value combining all methods as a function of the variable
This result is consistent among each different method, as can also be seen in Fig. 10, where the importance of the degree

Mean of MAP, DCG and MRR metrics compared among different methods with different degree
Table 3 reports the results obtained from the experiments of Section 5 with
Result from the experiment described in Section 5. Each value is reported alongside its standard deviation
In Table 4 the weights learned by the best method of Table 3 are described. Judging from the mean and median values, the most important relationships are the
Statistics on the learned weights from the best model of Table 3
In Table 5, the Pearson correlation between the metric results and the weight placed on each relationship is described. While the negative importance of the acquaintance relationship is confirmed again. Interestingly, the friendship relationship is the one that most positively correlates with correct influence detection. This aligns with several different studies that highlights how friendship has a big influence on the creative process of an artist [19, 29].
Correlation between the weight of a relationship and the results in the best method of Table 3
In Fig. 11 the precision-recall curve obtained by using the learned weights that yields the best performances on Linked Jazz is reported. We consider as ground-truth the influences reported on the AllMusic website. Note that the ground truth should not be seen as an oracle, but rather as an highly incomplete set of influences. In this setting, it is clear that a higher recall is preferable to a higher precision.

Precision-recall curve on the influences detected in MEETUPS as a function of the threshold.
The graph of Table 11 shows how reducing the threshold (i.e. making more conservative predictions) results in a much higher threshold with a negligible precision drop. Indeed, the break-even point (i.e. the point in which the precision and recall curves intersect, ≈0.4) suggests a conservative use of the prediction method.
In this work, we present a novel method, described in Section 4, to detect creative influence between artists using techniques based on graph theory and complex network science. Our method takes into account the individuality of a relationship type through the use of an ontology illustrated in Section 3.1. By framing the influence prediction task as a classification task we are able to obtain an interpretable model that performs better than robust baselines. The results described in Section 6 highlight how a straightforward combination of the different graph planes identified by the different relationship types results in accurate results. Moreover, the weights assigned to each relationship type are in line with relevant socio-cognitive and psychological findings, thus additionally validating the results. Our attempt to increase the accuracy of our results through the use of a machine learning approach led to less accurate predictions. Nonetheless, it is difficult to objectively rule out the possibility of combining machine learning techniques with our methods given the limited amount of training data publicly available.
We show that our method can be used to support the enhancement and integration of additional information on existing Knowledge Graphs that encode interpersonal relationship. By relying on MEETUPS, which is automatically extracted using a complex NLP pipeline, and on our method it is possible to automatically detect relationships from textual content and formulate hypotheses on creative influence.
Future works include extending the training dataset available since the main problem with the experiments relying on the neural network can be re-conducted to the limited amount of training data for the model. Possible approaches includes employing data augmentation techniques, in order to exploit the data as much as possible and reduce the chances of overfitting the model. Additionally, more complex neural network architectures can be used to approximate and integrate the
Finally, clustering methods are an interesting approach to explore, in order to detect communities of artists within the
Footnotes
Acknowledgments
The authors would like to thank Daniele Marini for his exploratory work on the subject and Alba Morales Tirado for the integration of MEETUPS. This project has received funding from the FAIR – Future Artificial Intelligence Research foundation as part of the grant agreement MUR n. 341 and from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 101004746.
An information object in DOLCE.
