Abstract
The Internet of Things (IoT) has recently been applied in the domain of cultural exhibition enabling the cultural sites to provide more personal and proactive experiences to their visitors. To come up with valuable services, several solutions to analyze the spatio-temporal trajectories of visitors have been put forward. However, they neither consider the inherent uncertainty of the underlying indoor positioning technologies – Bluetooth Low Energy (BLE), RFID, etc. – nor other visitors’ features apart from the spatio-temporal ones (e.g. the level of interaction with the museum displays). For that reason, the present work introduces RECITE, a framework to classify trajectories representing visitors’ actions that copes with the aforementioned limitations of existing solutions. Firstly, RECITE states a novel mapping process for a BLE-based indoor positioning system to accurately detect the visitors’ locations. On top of this mechanism, RECITE includes an ensemble of fuzzy rule classifiers able to tag the visitors’ ongoing trajectories in real time considering both spatio-temporal and other behavioural factors. Finally, the framework has been evaluated in a case of use scenario showing quite promising results.
Keywords
Introduction
The advent of the Internet of Things (IoT) has come with the development of several indoor location solutions based on different wireless technologies like WiFi [37], RFID [3,18] or Bluetooth Low Energy (BLE) [17]. One of the most prominent applications of these location technologies has been the development of location-based services (LBSs) able to adapt to users’ locations inside buildings [6], provide users with ambient intelligence [19] or detect incidents in road infrastructures [4]. In this context, several proposals have recently arisen in the cultural environment in order to profit from such IoT advances. Some examples are recommendation systems to proactively display personalized content to enhance the visitors’ experience in museums [2] or the active involvement of visitors in public exhibitions [29].
An important consequence of this widespread deployment of LBSs is that, now, cultural institutions have access to an unprecedented amount of visitors’ movement data. As a result, several solutions have successfully applied different data mining techniques over such indoor trajectories so as to come up with innovative cultural services [28]. For instance, several works have made use of clustering techniques to group trajectories sharing certain features [23,24,32] whereas pattern mining techniques have been applied to uncover general flows or trends [38,39]. However, we have observed meaningful limitations when it comes to perform classification analysis of such movement data.
On the one hand, although some classification algorithms have been proposed to particularly deal with indoor trajectories in cultural spaces [22], they relied on trajectory models based on sequences of Points of Interest (POIs) or building elements (e.g rooms, corridors) that individuals have gone through during their visits [25]. This high-level modelling actually hampers the realization of profound analyses in order to deeply understand how visitors behave in a particular cultural space, due to a lack of mobility details. For instance, using a POI-based trajectory representation, we may know that a visitor has been close to a certain spot but not how he has actually moved around that spot. Furthermore, this representation strongly depends on the current spatial distribution POIs. The same applies when using trajectories based on building elements.
On the other hand, indoor trajectories are usually noisy and imprecise due to multiple factors [10]. This makes it difficult to establish crisp boundaries defining the actual movement of users in this type of environments. Existing solutions generally do not take into account such problems during the classification process. They will have an impact on the classification accuracy though.
In this context, the work at hand introduces RECITE, a f
Furthermore, an ad-hoc indoor positioning system based on low-cost BLE beacons has been developed. This system is able to accurately locate users handling a guidance device (e.g. audio-guide) withing a 2-D Cartesian space. Besides, each trajectory is enriched with the interactions that the target visitor made with his device (e.g. play some audio content) during his stay. This way, this type of valuable data can be integrated in the classification process. For the sake of clarity, Fig. 1 shows the difference between our trajectory model and the POI and building-based ones.

Examples of POI-based (shown in red), building-element-based (blue) and RECITE (green) indoor trajectories for the same visitor’s displacement in a exhibition scenario with two rooms and one corridor housing five different POIs. The blue icons represent the locations of BLE beacons in the scenario. The red points represent the coordinates in the Cartesian space detected by the our solution.
All in all, to the best of the authors knowledge, this work constitutes one of the first efforts to apply fuzzy rules to classify semantically-enriched indoor trajectories focused on the cultural domain.
The remainder of the paper is structured as follows. Section 2 is devoted to describe in detail the logic structure and the processing stages of RECITE. Then, Section 3 discusses the main results of the performed experiments. Next, an overview about indoor trajectory analysis in the cultural domain is put forward in Section 4. Finally, the main conclusions and the future work are summed up in Section 5.
Figure 2 outlines the main steps of RECITE to come up with the proposed FRCs. The first step focuses on collecting the individual trajectories of the museum visitors in terms of spatio-temporal displacements along with the multimedia content displayed by visitors in their guidance devices during such movements. Then, in stage 2, the collected trajectories are split into a pre-defined number of segments (4 in the example of the figure). The third step extracts the overall features of each trajectory segment. In the fourth step, several fuzzy clustering tasks are applied to the data set, but taking into account only a segment and a class at a time. After that, for every clustering model their resulting clusters are projected to their space axis (Cartesian coordinates,

Approach overview. As an illustrative example, in the first stage, four different trajectories are collected from visitors where each musical note indicates the location where the user displayed some multimedia content in his handheld device. In stage 2, these input trajectories are split into four segments
RECITE relies on an underlying operational architecture that enables the collection of the required visitors’ behavioural data. Figure 3 shows such an architecture. Bearing this figure in mind, we can derive the following general assumptions for the framework,

General operational architecture for RECITE.
The museum includes, as part of its own infrastructure, a set of BLE beacons installed through all its exhibition areas (e.g. galleries, corridors and the like). Furthermore, each beacon is identified by means of a unique tag.
A visitor handles a guidance device during all her/his stay at the museum. The goal of this device is twofold. To begin with, it acts as a Bluetooth-based location system because it is able to periodically detect its closest BLE beacons along with the distance to them. Secondly, it allows the visitor to access the multimedia content related to any exhibition item at the museum.
The multimedia content is actually stored in a dedicated back-end Content Server accessed by the visitors’ devices on demand. This content might be video, audio or text files providing information about the different museum pieces.
The objective of this work is to obtain an automatic mechanism in such a way that, every given trajectory is classified as one among a set of p predefined labels or classes. Let it be
Individual trajectories extraction
The first stage of the framework pipeline focuses on collecting the individual trajectories of visitors using the BLE beacons installed in the museum as Fig. 2 shows. In that sense, a visitor’s trajectory can be defined as follows,
A trajectory
As we can see, RECITE defines a trajectory comprising two types of actions that a visitor can make during his stay,
Movement actions indicating physical displacements in the museum space.
Content actions representing interactions with the multimedia content accessible from the guidance device.
In the following sections we describe how these two types of actions are collected.
To capture these actions, the guidance device of a visitor v regularly sends to the Location Server (LS) a frame
In order to process this information, the LS makes use of the museum’s floor-plan image, that is a 2-D Cartesian space where the coordinates of the visitor’s location will be defined. Therefore, the LS performs a mapping process to translate the set of beacon’s distances

Illustrative example of the map-based algorithm steps in an scenario with four deployed BLE beacons. The left image shows the data set generation, where the scanner samples the beacons’ signals for certain locations. Only location frames 1 to 3 (
Lastly, in certain situations the guidance devices are not able to capture the signal of a nearby beacon
These actions are sent from the visitor’s device to the LS as well. In this case, the device just sends the time-stamp at which the user displayed certain content, together with its associated content identifier.
Each new movement or content action from a visitor is appended to its ongoing trajectory by the LS. In that sense, a trajectory is considered finished when the LS does not receive any new update during a certain time threshold
Lastly, each trajectory is labelled, manually or semi-automatically, with one of the classes in Ω. As a result, a set
Trajectory segmentation
One of the problems of the previous gathering process is that the sequences of actions defining the trajectories may have varying lengths. Hence, the second step of the framework pipeline is to apply a segmentation process to all the trajectories in
In that sense, trajectory segmentation is a well-known technique in the trajectory data mining field that divides a trajectory into fragments by several criteria, like time intervals or semantic meaning, for further processing [42].
In our setting, trajectories are split into slices of a predefined time length
A segment of a visitor trajectory
We can see that
Once the different trajectory segments have been composed, they are distributed into different sets
For example, let us consider a trajectory
Once the trajectory segments have been generated, the next step is to extract a set of descriptive features of each of them. In that sense, depending on the type of action (movement or content display) under consideration, a different type of feature is calculated. As a result, each trajectory segment is compressed into a 3-dimension vector
A trajectory segment feature
Besides, all the trajectory segment features are collected in their corresponding set
Going back to our previous illustrative trajectory
The fourth step of the framework pipeline is to identify the general action trends per segment by means of the generation of different sets of clusters. Every cluster will become a fuzzy rule, and the fuzzy rules will be combined into fuzzy classifiers (see Fig. 2).
The approach is based on the generation of a set of FRCs for every
In this phase, the fuzzy c-means (FCM) clustering algorithm [8] is applied. Before explaining that step, let us give some details about the clustering settings:
Cluster shape: we decide to obtain hyper-spherical clusters given that they are more suitable to be projected into one dimension fuzzy sets to form fuzzy it-then rules. For more information look for the so-called decomposition error in [5]. This shape is possible when we fix the norm matrix A which induces the measurement dissimilarity between data points as Fuzziness: We need to decide the level of overlapping of the fuzzy clusters to be found. This parameter is called m. Number of clusters to be identified in every space Termination parameter: We need to decide the tolerance threshold ϵ for the FCM algorithm.
The two last parameters should be defined before launching the clustering process. Although their values are context-dependent there are some works arguing what would be quite reasonable values for them [31].
Our instance of the FCM algorithm generates
For the sake of completeness, the pseudo-code of the FCM algorithm is included in Algorithm 1 where l indicates the iteration number, i the i-th cluster and k the k-th trajectory segment feature for a particular trajectory segment.

Fuzzy c-means (FCM) algorithm
To obtain the fuzzy sets involved in a fuzzy classifier for class
At this point we should recall that we carried out clustering tasks based on data of every particular class
Fuzzy rule classifier
Fuzzy rule classifier
The first step is to calculate the firing strength
then, the partial output of every rule,
and the partial outputs are combined to generate the final output
We should recall that the ultimate goal of the classifier is to label the ongoing trajectories of visitors during their stay at a museum. Due to the time-based segmentation described in Section 2.4, these target trajectories will have an increasing number of segments as time proceeds and visitors move across the museum.
In order to cope with this situation, instead of a monolithic FRC, we generate a palette of FRCs with different number fuzzy sets in their antecedents
All in all, the classification mechanism to label the visitors’ ongoing trajectories involves the following steps,
Each time a new tuple
Then, the
Finally,
Figure 5 shows an illustrative example of the aforementioned mechanism where a trajectory covers different time-based segments and, thus, feeds different FRCs. As the figure depicts, the target trajectory is labelled with different classes

Example of the online classification procedure.
The proposed framework has been evaluated in a test-bed exhibition area. This is located in Murcia, Spain and it usually hosts small exhibitions of local artists.
Place infrastructure
The exhibition place comprises two main rooms distributed along a corridor. In the rooms and the corridor, 15 iBeacons Wellcore W902 with chip TICC25411
Figure 6 depicts the floor-plan of the test-bed along with the spatial distribution of the beacons. Such area covered roughly 500 m2 and the beacons are spatially distributed every 6 meters approximately. The size of this floor-plan image is 500 × 177 pixels and each pixel represents 0.1 m2.
In order to perform the experiment in the test-bed exhibition, we installed the ad-hoc mobile application infoArt in the different guidance devices of the exhibition area.2
Firstly, the app connects to a Content Server to provide visitors with the multimedia content available at the target museum.
In addition to that, the application generates the BLE frames f every 3 seconds and sends them to the Location Server (LS) (see Fig. 3).

Use case floor plan. The red dots indicate the location of the BLE beacons. The black line depicts the ideal route to visit the exhibition area.
For this use case, the different algorithms were implemented using Python 3.6 as programming language. Besides, we profited from the implementation of the FCM method in the skfuzzy library3
As an alternative to the algorithm to detect the movement actions described in Section 2.3.1, we developed a simple mapping approach along with a calibration mechanism to improve its accuracy in our testbed. This was coined as the MinMax location algorithm. In the next two subsections, we state the main findings of our study with respect the algorithm (Section 3.2.1) and the calibration technique (Section 3.2.2).
MinMax location algorithm
This alternative algorithm profited from the fact that we know the euclidean distance to each beacon for every coordinate in our 2-D Cartesian map image. Therefore, for each incoming frame
Nevertheless, this approach suffered from the fact that our handheld guidance devices usually did not sample the same signal strength at the same distance to the transmitter. Consequently, we decided to include a calibration model for each beacon. The goal of this model was to reduce as much as possible the difference between the distance calculated by a guidance device to a beacon and the physical distance between the device and the beacon. This calibration mechanism in described in the next subsection.
Calibration of the beacons
In order to perform the required calibration of the guidance devices, as pointed out above, we took the following steps,
First of all, we gathered the signal intensity of each of the 15 deployed beacons in the setting with each guidance device at different displacements covering 1, 2.5, 5, 7.5, 10 and 15 meters. For each of those displacements, we gathered a total of 20 samples.
Next, a third-party BLE library,4
We averaged the resulting distances per beacon after 5, 10 and 20 samples. This was done because the signal intensity tends to stabilize after several samples [30].
On the basis of the previously-calculated distances, a polynomial regression model is generated. This model was used by the application to estimate more precisely the distance to each beacon.
Table 2 shows an example of this calibration process for beacon
Example of the mean distance calculated for beacon

Interpolation curve for beacon
In Fig. 7 we plot the measurements of Table 2 as 3 curves (one for each column). Whilst the x-axis represents the measured distance by the application, the y-axis indicates the real distance between the beacon and the device. From these 3 series we generated a fourth-grade polynomial regression model. This model was used by the application to more accurately detect the distance between the guidance system and beacon
In order to compare the two indoor location algorithms, we performed a visual analysis of the resulting trajectories with each approach. In that sense, given the ideal route of the use case depicted in Fig. 6, the combination of the MinMax algorithm and the calibration mechanism, produced the trajectory depicted in Fig. 8(a). Alternatively, the Map-Based algorithm generated the trajectory shown in Fig. 8(b). We can clearly see that this second trace is much more similar to the actual route followed by the visitor than the one obtained by the MinMax algorithm.
Parameters settings for the FRCs generation
Parameters settings for the FRCs generation

Examples of two captured trajectories. The trace of each one is depicted as a black line. The BLE beacons are shown as small red squares.
The main problem of the MinMax approach was that the distances that the BLE scanner library5
All in all, the MinMax algorithm achieved poorer results in terms of trajectory perception, because of the low reliability of the distances measured from the beacons. Even though we tried to tackle this problem with a calibration mechanism, we still did not achieve suitable results. Therefore, we eventually use our initial Map-based approach to capture the visitors’ movement actions for the rest of the use case analysis.
Once the indoor positioning system was calibrated, the FRCs for the collected trajectories were generated and tested.
Target classes
In the present setting, the goal of the classifier is to detect four types of visitors in the test-bed. In particular, the site operators come up with four different interesting behaviours. As a result, the target classes composing Ω are the following,
Collection of visitors’ trajectories
We collected 64 different trajectories during a 1-week period ranging from 19/04/17 to 25/04/17. Each one was manually labeled by the exhibition operators with one of the four classes previously mentioned. This gave rise to 16 trajectories of each class.
Figure 9 shows the heat map of every captured trajectories composing the

Heat map of all the collected trajectories. Pixels with color green have been more frequented by users, followed by yellow tones and finally in red we see unexplored places.
In Fig. 8, we can see two examples of collected trajectories. From these traces, we clearly observe the noisy and imprecise nature of such trajectories. This justifies the trajectory segmentation and feature extraction process to normalize the trajectories representation and the fuzzy logic approach to classify them.
Table 3 shows the key parameters used to generate the ensemble of FRCs. We can see that

Average time length of trajectories in
In order to set the number of clusters
This way, we executed FCM multiple times with different numbers of clusters and calculated the FPC associated to each resulting partition keeping the one with the highest coefficient per segment number and class. Table 4 sums up the obtained number of clusters. For example, the suitable number of clusters for the second segment of trajectories with label
Number of clusters per segment number and output class
Number of clusters per segment number and output class
From this table we can see that most of the trajectory segments from observer, interested or hurry visitors could be aggregated into a single cluster. However, trajectories segments from lost visitors were mostly clustered in two clusters instead of one. This is compatible with the fact that this type of visitors tended to roam around the museum and thus, their movements were generally more diverse.
Once the FRCs were generated on the basis of the aforementioned parameters, we studied its accuracy. For that goal, we used the F1 score as measurement. This score is calculated following the next formula:
Table 5 shows the recall, precision and F1 scores of RECITE. In order to evaluate the suitability of the segmentation approach, the table also shows the results of a FRC (
Accuracy scores of the palette of FRCs of RECITE with and without velocity features and a monolithic FRC without including a trajectory segmentation step
Accuracy scores of the palette of FRCs of RECITE with and without velocity features and a monolithic FRC without including a trajectory segmentation step
From these results we can see that the trajectory segmentation step clearly helps to improve the classification accuracy. This is because RECITE generates a FRC for each possible sub-trajectory achieving a higher level of specialization and, thus, accuracy.
Furthermore, Table 6 shows the confusion matrix of the evaluation. In that sense, this matrix includes all the intermediate trajectories that users generate during their visits. In general, the main errors occur in adjacent labels. However, 4% of hurry sub-trajectories are classified as lost ones when these two labels are not very similar. In that sense, we have observed that, sometimes, some hurry visitors return to places already seen in their attempt to quickly cross the exhibition area. This makes such visitors mimic the behaviour of lost visitors.
Confusion matrix of the RECITE FRCs. A cell in row R and column C with R ≠ C, contains the sheer number and percentage of sub-trajectories whose real label was R but it was wrongly classified as C
The next aspect we evaluated was the impact of the length of an ongoing trajectory on the classifier. The results of this evaluation are shown in Fig. 11.

F1 score of the classifier per trajectory number of segments. Each bar depicts the F1 score of the FRCs for trajectories with number of segments equal to its x-axis value.
According to this figure, we can see the F1 score of the proposal was above 0.9 regardless of the number of segments of the target trajectory. However, the accuracy of our classifier was slightly higher for trajectories with segments ranging between 3 and 9. This was because this range of segments comprised most of the trajectories whereas the number of trajectories comprising more than 10 segments was much lower. This resulted in an accuracy drop of the solution.
As it is put forward in Section 2.5, the present version of the framework extracts only three different features from each segment, namely the average x-y coordinates and the average number of content displays. Consequently, we also evaluated the effect in RECITE of adding new segment features as part of the framework, namely, the average segment speed and the average segment bearing.6
In the present setting, a trajectory’s segment bearing is regarded as the clockwise angle in degrees between north and the segment’s vector.
Bearing in mind the definition 3, each trajectory segment feature
As Table 5 shows, including these features in the solution did not actually improve the accuracy of RECITE with respect to the baseline approach. This way, the F1 score was slightly lower (0.92 vs 0.94). Regarding the confusion matrix depicted in Table 7, we can see the velocity features slightly improved the detection of lost trajectories as their classification rate increased from 91.9% (see Table 6) to 93.7%. This type of trajectories are defined by many changes of direction due to the roaming behaviour of the visitors. As a result, the bearing features help to better identify these types of movements. Nevertheless, this rate actually dropped for the other three types of trajectories.
Confusion matrix of the RECITE FRCs enriched with velocity features from the trajectories. A cell in row R and column C with R ≠ C, contains the sheer number and percentage of sub-trajectories whose real label was R but it was wrongly classified as C
The reason of this lack of actual improvement of RECITE is due to the well-known curse of dimensionality problem [27]. Adding more features to the target input space makes the trajectories segments more distant among them. As a result, the approach generated much more clusters and, thus, fuzzy rules as we can see in the Table 8. According to this table, the system generated, for example, 8 different clusters given the first segments of the trajectories labelled as
Number of clusters per segment number and output class with velocity features
Finally, FRCs are easily interpretable as they are composed of IF-THEN sentences. In this context, a drawback of enlarging the framework with new input features is that the complexity of the FRC increases and the aforementioned descriptive capability of this type of fuzzy systems is lost.
When it comes to deal with indoor trajectories in cultural spaces, it can be established three different characteristics to catalogue the existing literature: the type of enabling indoor positioning system, the modelling solution to represent the visitors movements and the type and purpose of the trajectory analysis over the collected data. Table 9 makes a review on each of these characteristics.
Key features of existing indoor trajectory mining approaches in the cultural domain
Key features of existing indoor trajectory mining approaches in the cultural domain
To begin with, RFID readers have been widely used in this context so as to locate visitors holding different types of devices [41]. In that sense, approaches can be distinguished depending on the requirement of an explicit user check-in [22] or not [23,24].
Some works have also proposed WiFi access points as indoor positioning enablers. For example, [14] combines the signal strengths of WiFi routers and the outcome of an image-processing engine able to detect the presence of visitors to come up with a precise indoor positioning solution. Ambitrack [9] also proposed tracking through cameras and image-processing. However, this approach is more expensive to install and maintain than the inexpensive BLE beacons. An interesting approach to locate visitors in an art gallery is introduced in [12] where a combination of BLE beacons along with image recognition mechanism integrated in a wearable device allows to detect whether a visitor is in front of any art keypoint of a gallery. However, in most cases BLE is used to detect presence of visitors at much more large scale (e.g. galleries or corridors) [13,26,32,38,39].
Our work also makes use of an ad-hoc indoor positioning system relying on BLE. However, unlike other solutions we consider a careful calibration process performed at device and global level based on polynomial regression. Thus, we are able to accurately locate the visitor making use of affordable beacons as described in Section 2.3.
Trajectory modelling
Regarding a building-based approach, many solutions just perform a simple mapping step considering the concrete building space where the sensors is actually located to construct the visitors’ paths. Examples of this are [38,39] that model trajectories as sequences of visited galleries. However, [15] relies on a museum graph model that accurately represents its premises, like doors, rooms or vitrines. Then, a mapping approach is performed to convert the collected spatio-temporal data from users to the graph-based representation. However, the proposal does not define any particular indoor positioning system to capture the visitors’ movement. A similar approach is followed by [1,26] where a semantic indoor trajectory model is proposed or [14] with a more holistic ontology.
POI-based approaches have also been widely used in the literature [12,22,23]. In some cases, this type of models are enriched with additional information. For example, [22] merges the visited POIs along with the digital content consulted by the user in the museum displays to compose the set of features that defines the user’s movement.
Furthermore, it is interesting to mention the work in [13] where the building bricks to compose user trajectories are the named coverage areas of the Bluetooth beacons. Thus, it provides an intermediate solution between POIs and building elements to represent the visitors’ trajectories. Similarly, [32] considers both POIs (artworks) and building elements (rooms) to compose the trajectories.
Our work is enclosed on an alternative way to model trajectories based on a two-dimensional Cartesian space. In this domain, only a few works have actually relied on this representation. For instance, [24] makes use of this technology to calculate the probability of a visitor’s position in a map image representing a gallery floor. The RECITE model leverages the accuracy of the underlying positioning system. As a result, we have been able to perform certain computations from the outdoor trajectory mining field like the segmentation step discussed in Section 2.4. Moreover, our work enriches this model by labelling certain segments of the trajectory with the content displayed by the user in his guidance device.
Trajectory analysis
Concerning the analysis of the collected trajectories, there is a wide range of solutions considering both the applied methods and their purpose. To begin with, some examples can be found where trajectory pattern mining has been carried out by means of simple frequency counting methods to discover the most frequent sequences of visited galleries [39]. In [13], authors pursue the same goal but, in this case, sequence alignment methods are applied over trajectory data. In [38] trajectories are characterized using the random walk model.
As far as trajectory clustering is concerned, many different algorithms has been tested. For instance, [24,32] make use of the k-Means clustering algorithm to aggregate visitors’ trajectories and, thus, uncover general movement trends. However, [24] performs a spatial partitioning by means of the state chain model before applying the clustering algorithm. In [23], authors perform a clustering task over the set of trajectories and map each cluster to a set of predefined visitor profiles. Later on, two sequence miners are applied to each group in order to reveal interesting patterns defining visitor behaviors.
Other works have used optimization algorithms like
In addition to that, some works just provide a preliminary design of the trajectory analysis that might be carried out. For example, [15] proposes to identify stay-points of visitors’ routes and then performs an incremental clustering over such points using time windows to detect behaviour changes during consecutive time intervals. Another worth mentioning work is [26]. It provides a semantic reasoning mechanism that, by means of a set of domain-dependent predicates, it is able to infer the purpose of visitors’ movements (e.g. visit a particular gallery or buy something at the gift shop).
In the trajectory classification field, we have noticed a scarcity of proposals within the cultural domain. Thus, [22] makes use of two well-established classification algorithms like logistic regression and multi-layer perceptron to rate the suitability of an unseen POI recommendation based on the previous movement of a user. For that goal, some profiling information of the visitor like age or gender is considered.
In this context, the work at hand provides a set of FRCs that smoothly merges the spatio-temporal data from visitors displacements and data related to the content displayed in the guidance devices to timely detect the visitors behaviour. To do so, it does not rely on any sensitive profiling data from users.
We should also mention that our work share some similarities with the proposal stated in [11]. In that work, a FRC-based mechanism to classify trajectories based on Volunteer Geographic Information (VGI) is described. However, VGI provides a more coarse-grained representation of the target users’ movement than in our setting. Consequently, a segmentation step is not necessary in that work. Furthermore, in such a work, only the spatio-temporal features of the trajectories are considered for the classification task whereas RECITE also considers other factors like the usage of the multimedia content made by visitors.
Conclusions
The cultural domain is endlessly taking advantage of IoT-related technologies. This has generated a huge amount of visitors’ data so as to come up with innovative services in museums and exhibition areas.
In this context, the present work introduces the RECITE framework. On the basis of a BLE infrastructure acting as an indoor positioning system, this framework is able to detect the movement of visitors in a cultural site and track the usage they make of the multimedia content available at their guidance devices.
On top of the collected indoor trajectory data, an ensemble of fuzzy rules classifiers is developed. Such classifiers are able to tag in real time the behaviour of visitors based on their movements and their usage of the multimedia content. To do so, we have followed a data-driven methodology that combines algorithms and techniques from different fields. To begin with, a segmentation method from the trajectory data mining field has been included so as to provide an uniform representation of the heterogeneous trajectories. Secondly, the fuzzy clustering algorithm FCM has been applied to uncover similarities in the movement behaviour of visitors and eventually compose the final fuzzy rules classifiers.
Finally, we have evaluated the framework in a test-bed scenario. In that setting, we have compared two different approaches to provide an indoor positioning system using BLE as enabling technology. The comparison shown that the mechanism based on a pre-defined map of locations provide a more accurate solution in terms of trajectory perception. Regarding the classification features of RECITE, results showed that the FRC approach allowed to accurately classify visitors as they move around the site and access the multimedia data provided by the museum.
Footnotes
Acknowledgements
This work has been supported by the Fundación Séneca del Centro de Coordinación de la Investigación de la Región de Murcia under Project 20813/PI/18, by the Spanish Ministry of Science, Innovation and Universities under grant RTC-2017-6389-5, by the PERSEIDES project TIN2017-86885-R and also co-financed with ERDF funds.
Conflict of interest
The authors have no conflict of interest to report.
