Abstract
Geostatistics has been successfully used to analyse and characterize the spatial variability of environmental properties. Besides providing estimated values at unsampled locations, geostatistics measures the accuracy of the estimate, which is a significant advantage over traditional methods used to assess pollution. This work uses universal block kriging to model and map the spatial distribution of salinity measurements gathered by an Autonomous Underwater Vehicle in a sea outfall monitoring campaign. The aim is to distinguish the effluent plume from the receiving waters, characterizing its spatial variability in the vicinity of the discharge and estimating dilution. The results demonstrate that geostatistical methodology can provide good estimates of the dispersion of effluents, which are valuable in assessing the environmental impact and managing sea outfalls. Moreover, since accurate measurements of the plume's dilution are rare, these studies may be very helpful in the future to validate dispersion models.
Keywords
1. Introduction
The physical and biological coastal processes that determine the values of environmental variables are complex and still poorly understood. Usually the observations are so unpredictable that the spatial distribution of these variables appears to be random. This randomness makes deterministic solutions adequate to describe the simplest components only [1]. The measurements of environmental variables are usually obtained in very restricted areas and constitute a sample from a continuum that cannot be recorded everywhere. However, people aim to predict the spatial distribution of these variables from a more or less sparse data set. The maps made using the classical methods of spatial interpolation usually display the spatial variation poorly [1]. The interpolators of these methods also fail to provide any estimates of the error, which are desirable for prediction. In this line of thought, geostatistical prediction is the logical solution in the sense that it provides a quantitative description of how the environmental variable differs spatially and a prediction for the places that have not been sampled. Additionally, geostatistics also provides estimates of the errors in these predictions and therefore it is possible to judge how confident we can be about them [1, 2].
Typically, the behaviour of a sea outfall discharge is a process that can be described as follows. The wastewater is usually ejected as an array of turbulent buoyant jets from ports spaced along the outfall diffuser. These turbulent buoyant jets mix with the ambient seawater, usually resulting in rapid reductions of contaminant concentrations. The seawater around coastal outfalls is often density-stratified with density increasing with depth. The discharge, whose density is close to fresh water, is lighter than the surrounding ambient seawater and the plumes rise due to buoyant forces until they reach a level of neutral buoyancy where the effluent spreads laterally, creating a horizontal spreading layer. Depending on the strength of seawater stratification, currents and other variables, the horizontal spreading layer may be submerged and will not be visible on the water surface. If the receiving waters are homogeneous or weakly stratified, the plumes will reach the surface and spread horizontally away from the source [3]. Studying the effluent mixing process in coastal waters in situ is still a complex problem. Several monitoring campaigns using natural tracers have been conducted to detect and map sewage plumes under different oceanic conditions [4-6]. The results show, however, very complex and patchy structures both in vertical and horizontal sections that are difficult to identify with the classic picture of the buoyant plume. As advanced by several investigations, the observed plume patchiness can occur due to one or a combination of factors which include: (1) variations in currents and stratification during time intervals (which can be hours in some cases), separating the beginning and the end of the field tests and resulting in different equilibrium depths or even distinct plume behaviours, (2) internal tides that can result in a given effluent concentration surface undergoing significant vertical excursions as it advects from the outfall and (3) limitations of sampling in terms of time resolution of time and space scales; in reality, field measurements are not truly synoptic - a transverse of several kilometres can last a couple of hours. Because of their extensive preparation and potentially negative impacts on the environment, detection methods using introduced tracer substances, either at the treatment plant or at the level of the diffusers in the waters, are not practical for routine monitoring of sewage effluents. However, this technique is still the most effective because it makes plume detection easier and usually allows for a quantitative estimation of dilution far away from the diffuser [7-8]. Rapid sampling is expected to reduce time and space variability during and between transects. Because of their easier field logistics, reduced cost per study, increased spatial resolution, reduced spatial aliasing effects and adaptive sampling capabilities, Autonomous Underwater Vehicles (AUVs) are especially valuable in detecting and mapping sewage plumes [9-13].
2. Geostatistical Prediction
If a stochastic approach is used for the spatial prediction problem, then z(
2.1 Spatial Correlation
The relationship between two variables (characteristics) can be evaluated using their covariance. For n pairs of observations (z1,i,z2,i), i = 1, …, n of two variables z1 and z2, the covariance is given by [1]:
where ẑ1 and ẑ2 are the means of z1 and z2, respectively. It is possible to extend this definition to relate any two random variables Z(
where μ(
and that (2) the covariance of Z between any two locations
for all
Furthermore, the author replaced the covariances with the variances of differences such as spatial characterization measurements, which, like the covariance, depend on the lag and not on the absolute position [17]:
for all
Under the intrinsic hypothesis the reverse is not true because the covariance function does not exist. The validity of the variogram in a wider range of applications (due to the weaker form of stationarity required for its existence) allow it to be more widely used in geostatistics than the covariance function. However, it has been demonstrated that in practice this is often of no relevance [16]. For a data set |z(
where N(
Usually, the n observations used in prediction are at a distance from
2.2 Ordinary Kriging
Ordinary kriging (OK) is by far the most common type of kriging used in practice. It is based on the assumption that the unknown mean μ is constant across the region of interest, or within the local neighbourhood of the prediction location (which is usually the case) [1, 16].
The predictor must be unbiased and thus the prediction error must have an expected value of zero [15]:
The kriging (or prediction) variance can be computed in terms of the variogram if condition (10) is used. After some algebra, it is possible to obtain [15]
The values of λ1, …, λn that minimize expression (11) while satisfying constraint (10) are obtained using the Lagrange multipliers method. The conditions for the minimization are given by the OK system of n + 1 equations with n + 1 unknowns [1, 15, 16]:
where ψ is the Lagrange multiplier. Its solution provides the weights λ1, …, λn and ψ is the kriging variance given by [1, 15, 16]:
In matrix notation the OK system is written as Ax = b, where A is the n + 1 by n + 1 matrix of semi-variances [1, 15, 16]:
x are the OK weights and b are semi-variances for the observations for the prediction location:
The OK weights are obtained by x = A–1 b and the kriging variance is given by
then vector b is
where
The block kriging variance is given by:
where
An equivalent procedure, which can be more computationally expensive than block kriging is obtaining the block estimate by averaging the N kriged point estimates within the block [1, 14].
2.3 Kriging in the Presence of Trend
If the variogram increases faster than |
where R(
in which βk are unknown coefficients and fk(
The estimator is unbiased if:
Assuming that R(
where γ(
Once the weights and the Lagrange multipliers are known, the kriging variance is given by [1, 15, 16]:
In matrix notation, the KT system is written as Ax = b, where
The KT weights are obtained by x = A–1 b and the kriging variance is given by
2.4 Cross Validation
Cross-validation is a procedure used to compare the performance of several competing models. It starts by splitting the data set into two sets: a modelling set and a validation set. Then, the modelling set is used for variogram modelling and kriging in the locations of the validation set. Finally, the measurements of the validation set are compared with their predictions [18]. If the average of the cross-validation errors (or Mean Error, ME) is close to 0
one may say that apparently the estimates are unbiased (z(
The RMSE value should be as small as possible, indicating that estimates are close to measurements. The kriging standard deviation represents the error predicted by the estimation method. Dividing the cross-validation error by the corresponding kriging standard deviation makes it possible to compare the magnitudes of both the actual and predicted error [1]. Therefore, the average of the standardized squared cross-validation errors (or Mean Standardized Squared Error, MSSE)
should be approximately one, indicating that the model is accurate. A scatterplot of true versus predicted values provides additional evidence on how well an estimation method has performed. Typically the intention is for the set of points to come as close as possible to the line y = x, a 45-degree line passing through the origin of the scatterplot. The coefficient of determination R2 is a good index for summarizing how close the points on the scatterplot come to falling on the 45-degree line passing through the origin [2]. R2 should be close to one.
3. MARES AUV
MARES (Modular Autonomous Robot for Environment Sampling) AUV has been successfully used to monitor sea outfall discharges (see Figure 1).

AUV MARES.
MARES is 1.5m long, it has an 8-inch diameter and weighs about 40kg in air [12, 13]. It features a plastic hull with a dry mid body (for electronics and batteries) and additional rings to accommodate sensors and actuators. Its modular structure simplifies the system's development (the case of adding sensors, for example). It is propelled by two horizontal thrusters located at the rear and two vertical thrusters, one in the front and the other in the rear. This configuration allows for small operational speeds and high manoeuvrability, including pure vertical motions. It is equipped with an omnidirectional acoustic transducer and an electronic system that allows for long baseline navigation. The vehicle can be programmed to follow predefined trajectories while collecting relevant data using the on-board sensors. A Sea-Bird Electronics 49 FastCAT CTD had already been installed on-board the MARES AUV to measure conductivity, temperature and depth. MARES's missions for environmental monitoring of wastewater discharges are conducted using GUI software that fully automates the operational procedures of the campaign. By providing visual and audio information, this software guides the user through a series of steps which include: (1) real time data acquisition from CTD and ADCP sensors, (2) effluent plume parameter modelling using the CTD and ADCP data collected, (3) automatic path creation using the plume model parameters, (4) acoustic buoys and vehicle deployment, (5) automatic acoustic network setup and (6) real time tracking of the AUV mission [13].
4. Results
4.1 Study Site
The study site is shown in Figure 2. The S. Jacinto outfall is located off the Portuguese west coast near the Aveiro estuary [9, 10]. The total length of the outfall, including the diffuser, is 3378m (the first 3135m section has a diameter of 1600mm and the last 243m section has a diameter of 1200mm). The diffuser, which consists of 72 ports alternating on each side, nominally 0.175m in diameter, is 332.5m long. Currently, only the last 20 of the 72 ports are working on a length of 98.2m. The ports are discharging upwards at an angle of 30° to the horizontal axis; the port height is about 1.3m. The outfall has a true bearing direction of 290° and is discharging at a depth varying between approximately 14 and 17m. The sea floor near the diffuser is moderately sloped, with a sandy bottom and isobaths that are parallel to the coastline. In that area the coastline itself runs at about a 200° angle with regard to true north. Flow variation through the outfall in question is not typical of WWTPs since the effluent is mainly of industrial origin. Effluent flow rate ranges most frequently between 0.6 and 0.8m3/s. During the campaign, the discharge remained fairly constant with an average flow rate of ˜0.61 m3/s. Figures 2 and 3 show a plan view of the AUV's position estimate during the plume tracking survey. A rectangular area of approximately 200 × 100m2 was covered starting 20m downstream from the middle point of the outfall diffuser. Salinity measurements were obtained at 2 and 4m depths where the effluent plume was predicted to be horizontally dispersing. In each horizontal trajectory, the vehicle described six parallel transects that were perpendicular to the direction of the current, 200m in length and at 20m intervals. When performing horizontal trajectories, vertical oscillations of the AUV were less than 1m (up and down) in the 2m survey and less than 1m down and less than 1.5m up in the 4m survey. In the 2m trajectory, the average depth of the AUV was 2.0m with a standard deviation of 0.20m. In the 4m trajectory, the average depth was 4.0m with a standard deviation of 0.33m. During the mission the vehicle moved at a fairly constant velocity of 1m/s (2 knots). Salinity data were recorded at a rate of 2.4Hz. The geostatistical analysis was conducted using R statistical software and the Gstat package of R [19, 20].

Plan view of the AUV's position estimate during the plume tracking survey (left). Map of the study site (right).

Plan view of the AUV's position estimate together with the salinity measurements at 2m (left) and 4m (right) depths.
4.2 Exploratory Analysis
A statistical analysis was conducted in order to obtain elementary knowledge on the conventional salinity data sets (see Table 1 and Figure 4). At a 2m depth the salinity ranged only from 35.152psu to 35.607psu. The mean value of the data set was 35.453psu, which was very close to the median value, which was 35.464psu. The skewness coefficient is relatively low (−0.57), indicating that the distribution is approximately symmetric. The very low value of the variation coefficient (0.0019) reflects the fact that the histogram does not have a tail of high values. At a 4m depth the salinity ranged from 35.097psu to 35.712psu. The mean value of the data set was 35.569psu, which was almost equal to the median value, which was 35.571psu. The skewness coefficient is not very high (−1.32), indicating that the histogram is only slightly asymmetric. The variation coefficient is quite low (0.0015).

Histograms of salinity measurements at 2m (left) and 4m (right) depths.
Summary statistics of salinity measured at 2 and 4m depths
The kriging methods work better if the distribution of the data values is close to a normal distribution. Therefore, it is interesting to see how close the distribution of the data values comes to being normal. Figure 4 shows the plots of the normal distribution adjusted to the histograms of the salinity measured at 2 and 4m depths. Apart from some erratic high values, it can be seen that the histograms are reasonably close to the normal distribution. The scatter map at a 2m depth (Figure 3) showed that there might be a trend across the field. That trend is not so evident on the scatter map at a 4m depth. To explore this, the relation between salinity and the Euclidean distance (3D) to the middle point of the diffuser was studied. Figure 5 shows salinity versus the distance at 2 and 4m depths fitted by a linear regression model. It can be seen that salinity increases with the distance and that is more evident at the 2m depth. The Pearson correlation coefficient and the Spearman correlation coefficient between the two variables at the 2m depth are 0.58 and 0.56, respectively. At the 4m depth these coefficients are 0.26 and 0.22, respectively. Therefore, it was decided to assume this trend in subsequent analyses.

Salinity versus the distance to the middle point of the diffuser at 2m (left) and 4m (right) depths, fitted by a linear model.
4.3 Variogram Modelling
For the purpose of the cross-validation analysis, the salinity measurements were divided into a modelling set (comprising 75% of the samples) and a validation set (comprising 25% of the samples). Modelling and validation sets were then compared in terms of their salinity measurements using the Student's t-test. The aim was to confirm that they provided unbiased sub-sets of the original data. The experimental variograms of both modelling sets (2 and 4m depth) computed using equation (8) are shown as the plotted points in Figure 6 (a) and Figure 7 (a). To explore the trend, experimental variograms of the OLS residuals from the linear model of the Euclidean distance (3D) to the middle point of the diffuser were also computed (see the plotted points in Figure 6 (b) and Figure 7 (b)). The estimation of semi-variance was conducted using a lag distance of 2m and a maximum distance of 120m. Anisotropy was investigated by calculating directional variograms. However, no anisotropy effect could be shown. All experimental variograms were fitted by Matern models (for several shape parameters) using the weighted least squares (WLS) estimation. The parameters of the fitted models are presented in Table 2. At a 2m depth, accounting for the distance to the diffuser has lowered the total variability (as expected from the results of the linear modelling), but it has also reduced the range of spatial dependence. The residual variograms at a 2m depth clearly have a substantially lowered sill and reduced range. At a 4m depth only the range has reduced slightly. All variograms have low nugget values and low nugget/sill ratios. These results indicate that local variations could be captured due to the high sampling rate and the fact that the variable under study has strong spatial dependence.

Variograms of salinity at 2m depth fitted by Matern models: (a) experimental variogram computed from the raw data using equation (8); (b) experimental variogram of the OLS residuals from a linear trend of the distance to the middle point of the diffuser.

Variograms of salinity at 4m depth fitted by Matern models: (a) experimental variogram computed from the raw data using equation (8); (b) experimental variogram of the OLS residuals from a linear trend of the distance to the middle point of the diffuser.
Parameters of the fitted variogram models for salinity at 2 and 4m depths
4.4 Cross-Validation
The block kriging method was preferred since it produced smaller prediction errors and smoother maps in comparison with the point kriging. Using the 75% modelling sets of the two depths and the variograms of the raw data, a two-dimensional Ordinary Block Kriging (BOK), with 10×10 m2 blocks, was applied to estimate salinity at the locations of 25% of the validation sets. Using the 75% modelling sets of the two depths and the variograms of the OLS residuals from a linear trend, a two-dimensional Block Kriging with Trend (BKT) with 10×10m2 blocks was applied to estimate salinity at the locations of 25% of the validation sets. The measurements of the validation sets were then compared to their predictions. The cross-validation errors computed using Equations 25-27 are shown in Table 3. For all shape parameters studied, at a 2m depth the salinity was best estimated by the BKT, while at a 4m depth the salinity was best estimated by the OBK. As expected, the BKT performed better than the OBK where the trend was more significant. At a 2m depth, for the best model (v = 0.3), the ME was −0.000997, the RMSE was 0.025023 and the R2 value was 0.8621. At a 4m depth, for the best model (v = 1.0), the ME was 0.000004, the RMSE was 0.024242 and the R2 value was 0.7706. Both MSSE values are relatively high, probably due to the smoothing effect of the block kriging. Figure 8 shows the scatterplots of true versus estimated values for the most satisfactory models. These plots show once again that observed and predicted values are highly positively correlated. Figure 8 shows the scatterplots of true versus estimated values for the most satisfactory models. These plots show once again that observed and predicted values are highly positively correlated.

Predicted versus observed salinities at 2m (left) and 4m (right) depths using the preferred models.
Cross-validation results for the salinity measurements at 2 and 4m depths.
The preferred model.
4.5 Mapping
Figure 9 shows the prediction map of salinity distribution at a 2m depth on a 2×2m2 grid, using the BKT with the preferred model. Figure 10 shows the prediction map of salinity distribution at a 4m depth on a 2×2m2 grid using the BOK with the preferred model. In the 2m kriged map the average value is 35.447psu and the standard deviation is 0.0610psu, which is in accordance with the salinity measurements (the average value is 35.453psu and the standard deviation is 0.0666psu). In the 4m kriged map the average value is 35.565psu and the standard deviation is 0.0360psu, which is in accordance with the salinity measurements (the average value is 35.569psu and the standard deviation is 0.0523psu). Figure 11 shows the maps of the kriging variance, that is

Prediction map of salinity (psu) distribution at 2m depth.

Prediction map of salinity (psu) distribution at 4m depth.

Maps of kriging variance at 2m (left) and 4m (right) depths.
4.6 Dilution Estimation
Environmental effects are all related to concentration C of a particular contaminant X. Defining Ca as the background concentration of substance X in ambient water and C0 as the concentration of X in the effluent discharge, the local dilution is as follows [21]:
In the case of the variability of the background concentration of substance X in ambient water, the local dilution is given by:
where Ca0 is the background concentration of substance X in ambient water at the discharge depth. This expression (29) can be arranged to give

Dilution map at 2m (up) and 4m (down) depths.
4. Conclusions
Through a geostatistical analysis of salinity obtained by an AUV at 2 and 4m depths in an ocean outfall monitoring campaign, it was possible to obtain kriged maps of the sewage dispersion in the field. Experimental variograms of the raw data and of the OLS residuals, from the linear model of the Euclidean distance (3D) to the middle point of diffuser for both depths, were computed and fitted by Matern models for several parameter shapes. The performance of each competing estimator/model was compared using a split-sample approach. The block kriging method was preferred since it produced smaller prediction errors and smoother maps in comparison to point kriging. For all shape parameters studied, at a 2m depth the salinity was best estimated by the BKT, while at a 4m depth the salinity was best estimated by the OBK. As expected, the BKT performed better than the OBK where the trend was more significant. Kriged maps at 2 and 4m depths show the spatial variation of salinity in the area studied and it is possible ability to identify the effluent plume that appears as a region of lower salinity when compared to the surrounding waters. Using the prediction maps of salinity distribution, it was possible to assess the performance of the outfall diffuser by estimating dilution, which in both depths was greater than the minimum admissible value imposed by Portuguese legislation.
Footnotes
5. Acknowledgments
This work was supported by project WWECO-Environmental Assessment and Modelling of Wastewater Discharges using Autonomous Underwater Vehicles Biooptical Observations, funded by the Foundation for Science and Technology (FCT) under the Program for Research Projects in all scientific areas (Programa de Projectos de Investigação em todos os domínos científicos) (ref. PTDC/MAR/74059/2006).
