We compare a recently proposed multivariate spline based on mixed partial derivatives with two other standard splines for the scattered data smoothing problem. The splines are defined as the minimisers of a penalised least squares functional. The penalties are based on partial differential operators, and are integrated using the finite element method. We apply the three methods to two problems: removing a mixture of Gaussian and impulsive noise from an image, and recovering a continuous function from a set of noisy observations.
We begin by outlining the scattered data problem. Consider a set of scattered points $X = \{x_1, \dots, x_N\}$ in a domain $\Omega \subset \mathbb{R}^d$ with $d \ge 1$, and a set of noisy observations $\{z_1, \dots, z_N\}$ at those points. We want to reconstruct an unknown function $u$ to approximate the given data. Assuming that the underlying data set is corrupted with Gaussian noise, we can assume that the unknown function $u$ satisfies
$$ z_i = u(x_i) + \epsilon_i, \quad i = 1, \dots, N, $$
where $\{\epsilon_i\}_{i=1}^N$ is a set of normally distributed random variables with mean 0 and variance $\sigma^2$.
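To make this data model concrete, the following Python sketch generates synthetic scattered observations according to it; the test function u, the domain and N here are illustrative choices, not values taken from the experiments below.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical smooth test function; any sufficiently smooth u will do.
def u(x, y):
    return np.sin(np.pi * x) * np.cos(np.pi * y)

N = 500                                      # illustrative number of scattered points
pts = rng.uniform(0.0, 1.0, size=(N, 2))     # scattered points x_i in Omega = (0, 1)^2
sigma = np.sqrt(0.05)                        # noise std. dev. (variance 0.05, as used later)
z = u(pts[:, 0], pts[:, 1]) + rng.normal(0.0, sigma, size=N)   # z_i = u(x_i) + eps_i
```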
To recover the unknown function $u$, we will use an approach based on the multivariate $L$-spline. That is, we will search for a function $u$ that minimises the following least squares functional
$$ J(u) = \sum_{i=1}^{N} \big( u(x_i) - z_i \big)^2 + \lambda \int_\Omega |Lu|^2 \, dx \tag{1} $$
over a Sobolev space $V$, where $L$ is a partial differential operator and $\lambda$ is a positive smoothing parameter.
We use the standard notation $H^k(\Omega)$ for Sobolev spaces on $\Omega$.1–3 We consider three different choices for $L$. The first choice is to take $Lu$ as the gradient of $u$. Then we need to have $V = H^1(\Omega)$. However, the continuous problem is not well-posed with this choice of $V$, because the point value of a function is not defined in $H^1(\Omega)$ when $d > 1$. The second choice is to take $Lu$ as the Laplacian of $u$. Then we need to have $V = H^2(\Omega)$. Again, the continuous problem is not well-posed for $d > 3$, because the point value of a function is not defined in $H^2(\Omega)$ when $d > 3$. The third choice is to add the mixed partial derivatives of $u$ to the gradient penalty to construct $Lu$.4 Unlike the other two choices, the resulting spline is well defined for any dimension $d \ge 1$.
This is the first time that computational results for the newly proposed multivariate spline4 are presented and compared with existing techniques. Moreover, we observe the instability of the gradient penalty approach in our numerical experiments, which is another novelty of this contribution.
We will apply these methods to two problems. The first problem is to recover an image that has been corrupted with both Gaussian and impulsive noise. We apply a finite element method to compute the solution of the above minimisation problem. Finite element methods have recently become popular in different areas of image processing.5–9 Finite element methods are applied in Lamichhane10 and Lamichhane11 to remove the mixture of Gaussian and impulsive noise using the gradient penalty and total variation penalty, respectively.
The second problem is to recover a continuous function from a set of noisy observations. We consider observations that have been corrupted with Gaussian noise. In this example, we see spurious spikes in the solution obtained with the gradient penalty. This is due to the fact that the gradient penalty does not control the point-wise values of the function. Numerical results show that we can increase the mesh-size to reduce the height of the spikes, but they cannot be removed entirely.
This paper is organised as follows. In the next section, we present the gradient penalty smoothing technique. In the third section, we present the smoothing technique based on the minimisation of a functional involving mixed partial derivatives. In the fourth section, we compare the three finite element methods in denoising images and recovering continuous functions. We discuss these results in the last section.
Multivariate spline with gradient penalty
The multivariate spline with the gradient penalty is given by the following minimisation problem
$$ \min_{u \in V} \; \sum_{i=1}^{N} \big( u(x_i) - z_i \big)^2 + \lambda \int_\Omega |\nabla u|^2 \, dx. \tag{2} $$
Due to the choice of the minimisation functional it is natural to take $V = H^1(\Omega)$, for which the problem will not be well-posed when $d > 1$.
Now we consider a finite element discretisation of the spline. Let $C^0(\bar\Omega)$ be the space of continuous functions on $\bar\Omega$ and $\mathcal{T}_h$ a finite element triangulation of $\Omega$. Note that $\mathcal{T}_h$ is a set of triangles or rectangles. Then let
$$ V_h = \{ v \in C^0(\bar\Omega) : v|_T \in P(T), \; T \in \mathcal{T}_h \} $$
be a finite element space, where $P(T)$ is the linear polynomial space if $T$ is a triangle, and the bilinear polynomial space on $T$ if $T$ is a rectangle.12 The minimisation problem leads to the variational problem of finding $u_h \in V_h$ such that
$$ a(u_h, v_h) = \ell(v_h) \quad \text{for all } v_h \in V_h, $$
where the bilinear form $a(\cdot,\cdot)$ and the linear form $\ell(\cdot)$ are given by
$$ a(u, v) = \sum_{i=1}^{N} u(x_i)\, v(x_i) + \lambda \int_\Omega \nabla u \cdot \nabla v \, dx, \qquad \ell(v) = \sum_{i=1}^{N} z_i\, v(x_i). $$
It is easy to show that the above problem has a unique solution under the assumption that the set of scattered points is non-empty.10
Since $a(\cdot,\cdot)$ is positive definite, we can define the energy norm on $V_h$ as
$$ \|u\|_E = \sqrt{a(u, u)} \quad \text{for all } u \in V_h. $$
The following lemmas show that the discrete multivariate spline with the gradient penalty is well-posed for d = 1 but not well-posed for d > 1: the point value of a function is not controlled by the gradient of the function when d > 1. This behaviour is exhibited in the stability result first proved in Garcke and Hegland.13 For completeness we state these results in Lemmas 1, 2 and 3, which are taken from Garcke and Hegland.13
Lemma 1. (Discrete Sobolev inequality). There exists a constant $c_d > 0$ such that for all $u_h \in V_h$
$$ \|u_h\|_{L^\infty(\Omega)} \le c_1 \|u_h\|_{H^1(\Omega)} \quad \text{for } d = 1, $$
$$ \|u_h\|_{L^\infty(\Omega)} \le c_2 \sqrt{|\log h|}\, \|u_h\|_{H^1(\Omega)} \quad \text{for } d = 2, $$
$$ \|u_h\|_{L^\infty(\Omega)} \le c_d\, h^{1 - d/2}\, \|u_h\|_{H^1(\Omega)} \quad \text{for } d > 2. $$
The constant $c_d$ is independent of the mesh-size $h$ but depends on $d$. These bounds are tight, and for $d > 2$ the factor $h^{1-d/2}$ grows without bound as $h \to 0$.
Lemma 2. (Discrete Poincaré inequality). Let $u_h \in V_h$ and $\bar{u}_h = \frac{1}{N} \sum_{i=1}^{N} u_h(x_i)$ for $h > 0$. Then there exist constants $c_d > 0$ such that
$$ \|u_h\|_{L^2(\Omega)} \le c_1 \big( |u_h|_{H^1(\Omega)} + |\bar{u}_h| \big) \quad \text{for } d = 1, $$
$$ \|u_h\|_{L^2(\Omega)} \le c_2 \sqrt{|\log h|} \big( |u_h|_{H^1(\Omega)} + |\bar{u}_h| \big) \quad \text{for } d = 2, $$
$$ \|u_h\|_{L^2(\Omega)} \le c_d\, h^{1 - d/2} \big( |u_h|_{H^1(\Omega)} + |\bar{u}_h| \big) \quad \text{for } d > 2. $$
Lemma 3. (Discrete V-ellipticity). There exist constants $c_d$ and $C_d$ such that the energy norm on $V_h$ satisfies
$$ c_d \|u_h\|_{H^1(\Omega)}^2 \le \|u_h\|_E^2 \le C_d \|u_h\|_{H^1(\Omega)}^2 $$
for all $u_h \in V_h$, where $c_d$ and $C_d$ are given by
$$ c_1 = O(1) \;\text{ and }\; C_1 = O(1) \quad \text{for } d = 1, $$
$$ c_2 = O(|\log h|^{-1}) \;\text{ and }\; C_2 = O(|\log h|) \quad \text{for } d = 2, $$
$$ c_d = O(h^{d-2}) \;\text{ and }\; C_d = O(h^{2-d}) \quad \text{for } d > 2. $$
The above results imply that for the solution $u_h$ of the spline with the gradient penalty we have
$$ \|u_h\|_E \le \|\ell\|, $$
where $\|\ell\|$ is the norm of the linear functional $\ell$ with respect to the energy norm. Combined with Lemma 3, this shows that the resulting bound on $\|u_h\|_{H^1(\Omega)}$ degrades as $h \to 0$ for $d > 1$.
Remark 4. We can see that the ill-posedness is exhibited in the stability constant not being independent of the mesh-size h. There is no easy way to remove this dependency.
Multivariate spline with Laplacian penalty
The multivariate spline with the Laplacian penalty is the solution of the following minimisation problem
$$ \min_{u \in V} \; \sum_{i=1}^{N} \big( u(x_i) - z_i \big)^2 + \lambda \int_\Omega |\Delta u|^2 \, dx, \tag{3} $$
where $V = H^2(\Omega)$. The minimisation problem is well-posed for $d \le 3$ since $H^2(\Omega) \subset C^0(\bar\Omega)$ when $d \le 3$. More details of this spline can be found in Ramsay.14 This spline will be called the Laplacian spline in the following.
Remark 5. The Laplacian penalty imposes higher smoothness on the solution. Since we look for a solution whose second derivatives are square integrable, point values of the solution are well-defined, in contrast to the gradient penalty spline. However, a finite element approximation of the Laplacian spline is more expensive than that of the gradient penalty spline.
While we can use a direct finite element approximation with a low order finite element space $V_h$ to approximate the gradient penalty spline, we cannot directly use this space to approximate the Laplacian spline, as $\Delta u_h$ is not well-defined for $u_h \in V_h$. We use a mixed finite element method proposed by Lamichhane.15 We first introduce a new variable $\sigma = \Delta u$ in the minimisation formulation (3) and then write a weak equation as
$$ \int_\Omega \sigma \mu \, dx = -\int_\Omega \nabla u \cdot \nabla \mu \, dx. \tag{4} $$
Now choosing $u \in H^1(\Omega)$ and $\mu \in H^1(\Omega)$, we have a well-defined formulation for which we can use the space $V_h$ to discretise $u$ and $\mu$, whereas we use a discontinuous piecewise polynomial space $W_h$ to discretise $\sigma$. The basis functions of $W_h$ and $V_h$ satisfy a biorthogonality relationship15 so that the matrix associated with the $L^2$-inner product in (4) is diagonal. In this way, we arrive at a very efficient finite element method to approximate the solution of the minimisation problem (3). The discrete problem is then to compute
$$ \min_{(u_h, \sigma_h) \in V_h \times W_h} \; \sum_{i=1}^{N} \big( u_h(x_i) - z_i \big)^2 + \lambda \|\sigma_h\|_{L^2(\Omega)}^2 \tag{5} $$
subject to the constraint
$$ \int_\Omega \sigma_h \mu_h \, dx = -\int_\Omega \nabla u_h \cdot \nabla \mu_h \, dx \quad \text{for all } \mu_h \in V_h. $$
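To illustrate why the diagonal coupling matrix matters, here is a minimal linear-algebra sketch of the elimination of σ. The matrix names (K, Mw, D) and the exact form of the reduced system are our illustrative assumptions, not the precise assembly of Lamichhane;15 the point is only that a diagonal D makes the elimination entry-wise.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6                                           # illustrative number of degrees of freedom
K = rng.standard_normal((n, n)); K = K + K.T    # stand-in for the stiffness matrix
Mw = np.eye(n)                                  # stand-in for the mass matrix on Wh
d = rng.uniform(1.0, 2.0, n)                    # diagonal of the L2-coupling matrix D

# The discrete constraint reads D @ sigma = -K @ u.  Because biorthogonality
# makes D diagonal, sigma is recovered entry-wise, with no global solve:
def eliminate_sigma(u_coeff):
    return -(K @ u_coeff) / d

# Substituting sigma back, the penalty ||sigma_h||^2 contributes the matrix
B = K.T @ np.diag(1.0 / d) @ Mw @ np.diag(1.0 / d) @ K
# and the spline reduces to a single system in u alone: (A^T A + lam * B) u = A^T z.
```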
Since $H^1(\Omega) \not\subset C^0(\bar\Omega)$ when $d > 1$, the gradient penalty spline formulation does not provide a well-posed problem; it is well-posed only in one dimension. Similarly, $H^2(\Omega) \subset C^0(\bar\Omega)$ for $d \le 3$ but $H^2(\Omega) \not\subset C^0(\bar\Omega)$ when $d > 3$. This motivates us to find a spline formulation that is well-posed for any $d \ge 1$, which is given in the next section.
New multivariate spline with mixed derivative penalty
In order to define the new multivariate spline, we define the associated Sobolev space. Let $Q = \{0, 1\}^d \setminus \{\mathbf{0}\}$, where $\mathbf{0}$ is the zero vector. We use the standard multi-index notation with $\boldsymbol\alpha = (\alpha_1, \dots, \alpha_d)$ so that a mixed derivative of a sufficiently smooth function $u$ is denoted by
$$ D^{\boldsymbol\alpha} u = \frac{\partial^{|\boldsymbol\alpha|} u}{\partial x_1^{\alpha_1} \cdots \partial x_d^{\alpha_d}}, \qquad |\boldsymbol\alpha| = \sum_{i=1}^{d} \alpha_i, $$
where we use the usual Cartesian coordinate system with $x = (x_1, \dots, x_d)$.
We now define our Sobolev space for the multivariate spline problem as
$$ V = H^1_{\mathrm{mix}}(\Omega) = \{ u \in L^2(\Omega) : D^{\boldsymbol\alpha} u \in L^2(\Omega) \text{ for all } \boldsymbol\alpha \in \{0,1\}^d \}, $$
which is equipped with the norm
$$ \|u\|_{H^1_{\mathrm{mix}}(\Omega)} = \Big( \sum_{\boldsymbol\alpha \in \{0,1\}^d} \|D^{\boldsymbol\alpha} u\|_{L^2(\Omega)}^2 \Big)^{1/2} $$
and the semi-norm
$$ |u|_{H^1_{\mathrm{mix}}(\Omega)} = \Big( \sum_{\boldsymbol\alpha \in Q} \|D^{\boldsymbol\alpha} u\|_{L^2(\Omega)}^2 \Big)^{1/2}. $$
The semi-norm for $d = 2$ is simply
$$ |u|_{H^1_{\mathrm{mix}}(\Omega)}^2 = \int_\Omega \Big( |\nabla u|^2 + \Big| \frac{\partial^2 u}{\partial x \partial y} \Big|^2 \Big) dx, $$
where $|\nabla u|$ is the Euclidean norm of the vector $\nabla u$.
We note that the space $H^1_{\mathrm{mix}}(\Omega)$ is a Hilbert space, and $H^1_{\mathrm{mix}}(\Omega) \subset C^0(\bar\Omega)$.16 The new multivariate spline is then obtained as a solution of the minimisation problem
$$ \min_{u \in V} \; \sum_{i=1}^{N} \big( u(x_i) - z_i \big)^2 + \lambda |u|_{H^1_{\mathrm{mix}}(\Omega)}^2. $$
The associated inner-product for the semi-norm on the $H^1_{\mathrm{mix}}(\Omega)$-space for $d = 2$ is
$$ b(u, v) = \int_\Omega \Big( \nabla u \cdot \nabla v + \frac{\partial^2 u}{\partial x \partial y} \frac{\partial^2 v}{\partial x \partial y} \Big) dx. $$
We can define a bilinear form $a(\cdot,\cdot)$ and a linear form $\ell(\cdot)$ as
$$ a(u, v) = (P u)^T (P v) + \lambda\, b(u, v), \qquad \ell(v) = \mathbf{z}^T P v, $$
where
$$ P u = \big( u(x_1), \dots, u(x_N) \big)^T $$
is a column vector of the function values of $u$ at the scattered points $\{x_i\}_{i=1}^N$, and $\mathbf{z}$ is a column vector with $i$th component $z_i$. Then the multivariate spline problem is to find $u \in V$ such that
$$ a(u, v) = \ell(v) \quad \text{for all } v \in V. \tag{6} $$
Let $\Omega$ be a rectangle in $\mathbb{R}^2$ and $\mathcal{T}_h$ a tensor product partition of the domain with mesh size $h$, such that each element is a rectangle. Then we define a finite element space $V_h$ as
$$ V_h = \{ v \in C^0(\bar\Omega) : v|_T \in Q_1(T), \; T \in \mathcal{T}_h \}, $$
where $Q_1(T)$ is the space of bilinear polynomials on $T$. We can now write our discrete multivariate spline problem as
$$ \min_{u_h \in V_h} \; \sum_{i=1}^{N} \big( u_h(x_i) - z_i \big)^2 + \lambda |u_h|_{H^1_{\mathrm{mix}}(\Omega)}^2. $$
That is, the discrete problem is to find $u_h \in V_h$ such that
$$ a(u_h, v_h) = \ell(v_h) \quad \text{for all } v_h \in V_h. \tag{7} $$
The discrete problem is shown to be well-posed in Lamichhane.11 Here we recall some of the important results. We first show that the bilinear form is positive definite on Vh.
Lemma 6. Let $\lambda > 0$ and let the set of scattered points $X$ be non-empty. Then the bilinear form $a(\cdot,\cdot)$ is positive definite on the vector space $V_h$.
Proof. If $u_h = 0$, then clearly $a(u_h, u_h) = 0$. Conversely, let $a(u_h, u_h) = 0$. Then
$$ (P u_h)^T (P u_h) = 0 \quad \text{and} \quad |u_h|_{H^1_{\mathrm{mix}}(\Omega)} = 0. $$
Since $u_h$ is a continuous function, $|u_h|_{H^1_{\mathrm{mix}}(\Omega)} = 0$ gives that $u_h$ is a constant function in $\Omega$. Further, since $X$ is non-empty and $P u_h = 0$, we have that $u_h = 0$.
Since $a(\cdot,\cdot)$ is positive definite, we can define the energy norm on $V_h$ as
$$ \|u_h\|_E = \sqrt{a(u_h, u_h)} \quad \text{for all } u_h \in V_h. $$
Since $a(\cdot,\cdot)$ and $\ell(\cdot)$ satisfy the conditions of the Lax–Milgram lemma,2,3 the unique minimiser is the solution of the discrete problem (7). In addition, the following holds.
Lemma 7. Let $\lambda > 0$ and let the set of scattered points $X$ be non-empty. Then the discrete problem (7) admits a unique solution, which depends continuously on the data with respect to the energy norm.
Proof. By the Cauchy–Schwarz inequality we have that
$$ |a(u_h, v_h)| \le \|u_h\|_E \|v_h\|_E \quad \text{and} \quad |\ell(v_h)| \le |\mathbf{z}|\, \|v_h\|_E $$
for all $u_h, v_h \in V_h$. Hence $a(\cdot,\cdot)$ and $\ell(\cdot)$ are continuous on $V_h$. We also have that
$$ a(u_h, u_h) = \|u_h\|_E^2 $$
for all $u_h \in V_h$. Hence $a(\cdot,\cdot)$ is coercive on $V_h$. By the Lax–Milgram lemma,2,3 there exists a unique solution $u_h$ of the discrete problem (7). Additionally, the solution depends continuously on the data $\mathbf{z}$.
In addition, a direct application of the Céa lemma provides an optimal a priori estimate of the discrete solution.
Lemma 8. Let $u$ be the solution to the continuous problem (6), and let $u_h$ be the solution to the discrete problem (7). Then
$$ \|u - u_h\|_E \le \inf_{v_h \in V_h} \|u - v_h\|_E. $$
Each finite element basis function is associated with a node of the tensor product partition $\mathcal{T}_h$. Assuming there are $mn$ nodes, we have $mn$ basis functions. Let $\{\phi_1, \dots, \phi_{mn}\}$ be the set of finite element basis functions, which span $V_h$. Then we can write our solution as a linear combination of these basis functions, namely
$$ u_h = \sum_{j=1}^{mn} c_j \phi_j. $$
Let $\mathbf{c} = (c_1, \dots, c_{mn})^T$ and let $K$ be the finite element stiffness matrix, where $K_{ij} = \int_\Omega \nabla \phi_j \cdot \nabla \phi_i \, dx$. Let $M$ be the mixed partial derivative matrix, where $M_{ij} = \int_\Omega \frac{\partial^2 \phi_j}{\partial x \partial y} \frac{\partial^2 \phi_i}{\partial x \partial y} \, dx$. Then the finite element problem leads to the linear system
$$ \big( A^T A + \lambda (K + M) \big)\, \mathbf{c} = A^T \mathbf{z}, $$
where $A$ is a matrix of size $N \times mn$ with entries $A_{ij} = \phi_j(x_i)$.
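In implementation terms, computing the spline is a single sparse symmetric positive definite solve. A minimal sketch, assuming $A$, $K$ and $M$ have been assembled as SciPy sparse matrices (the function name is ours):

```python
import scipy.sparse.linalg as spla

def solve_spline(A, K, M, z, lam):
    """Solve (A^T A + lam (K + M)) c = A^T z for the spline coefficients c."""
    S = (A.T @ A + lam * (K + M)).tocsc()   # sparse, symmetric positive definite
    return spla.spsolve(S, A.T @ z)
```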
Computation of K and M
We will use a reference element to construct the matrices $K$ and $M$. For rectangles, we choose our reference element $\hat{T}$ to be the unit square with vertices $(0,0)$, $(1,0)$, $(1,1)$ and $(0,1)$. We then construct local basis functions at these vertices. These are
$$ \hat\phi_1(\xi, \eta) = (1 - \xi)(1 - \eta), \quad \hat\phi_2(\xi, \eta) = \xi (1 - \eta), \quad \hat\phi_3(\xi, \eta) = \xi \eta, \quad \hat\phi_4(\xi, \eta) = (1 - \xi)\, \eta. $$
Now consider the rectangular element $T$ with vertices $(x_1, y_1)$, $(x_2, y_1)$, $(x_2, y_2)$ and $(x_1, y_2)$. We construct global basis functions at these vertices. When restricted to the element, these are given by
$$ \phi_1(x, y) = \frac{(x_2 - x)(y_2 - y)}{(x_2 - x_1)(y_2 - y_1)}, \quad \phi_2(x, y) = \frac{(x - x_1)(y_2 - y)}{(x_2 - x_1)(y_2 - y_1)}, $$
$$ \phi_3(x, y) = \frac{(x - x_1)(y - y_1)}{(x_2 - x_1)(y_2 - y_1)}, \quad \phi_4(x, y) = \frac{(x_2 - x)(y - y_1)}{(x_2 - x_1)(y_2 - y_1)}. $$
Let $F_T : \hat{T} \to T$ be the bijective map from the reference element to $T$. This mapping is given by
$$ F_T(\xi, \eta) = \begin{pmatrix} x_1 \\ y_1 \end{pmatrix} + \begin{pmatrix} h_x & 0 \\ 0 & h_y \end{pmatrix} \begin{pmatrix} \xi \\ \eta \end{pmatrix}, \qquad h_x = x_2 - x_1, \quad h_y = y_2 - y_1. $$
Let the matrix in this transformation be denoted $B_T$. We note that $\phi_i = \hat\phi_i \circ F_T^{-1}$.
We will use the chain rule to calculate the derivatives. We have
$$ \frac{\partial \hat\phi_i}{\partial \xi} = h_x \frac{\partial \phi_i}{\partial x} \quad \text{and} \quad \frac{\partial \hat\phi_i}{\partial \eta} = h_y \frac{\partial \phi_i}{\partial y}, $$
where $\hat\phi_i = \phi_i \circ F_T$ and $(x, y) = F_T(\xi, \eta)$. Applying the chain rule again we obtain
$$ \frac{\partial^2 \phi_i}{\partial x \partial y} = \frac{1}{h_x h_y} \frac{\partial^2 \hat\phi_i}{\partial \xi \partial \eta}. $$
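These transformations are straightforward to code; a sketch (node ordering as on the reference square, all function names ours):

```python
import numpy as np

# Reference bilinear basis on the unit square, nodes (0,0), (1,0), (1,1), (0,1).
def phi_hat(xi, eta):
    return np.array([(1 - xi) * (1 - eta), xi * (1 - eta), xi * eta, (1 - xi) * eta])

def dphi_hat(xi, eta):
    # rows: basis functions; columns: d/dxi, d/deta
    return np.array([[-(1 - eta), -(1 - xi)],
                     [  1 - eta,  -xi      ],
                     [  eta,       xi      ],
                     [ -eta,       1 - xi  ]])

def dphi(xi, eta, hx, hy):
    # Chain rule on an hx-by-hy rectangle: d/dx = (1/hx) d/dxi, d/dy = (1/hy) d/deta.
    return dphi_hat(xi, eta) / np.array([hx, hy])

def d2phi_xy(hx, hy):
    # The mixed reference derivative of each basis function is +-1, so the
    # mixed derivative d^2 phi / dx dy is constant on the element.
    return np.array([1.0, -1.0, 1.0, -1.0]) / (hx * hy)
```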
Now we calculate $K$. Let $K_T$ be the local stiffness matrix associated with the element $T$, that is, $(K_T)_{ij} = \int_T \nabla \phi_j \cdot \nabla \phi_i \, dx$. Transforming to the reference element,
$$ \int_T \frac{\partial \phi_i}{\partial x} \frac{\partial \phi_j}{\partial x} \, dx = \frac{h_y}{h_x} \int_{\hat{T}} \frac{\partial \hat\phi_i}{\partial \xi} \frac{\partial \hat\phi_j}{\partial \xi} \, d\xi \, d\eta. $$
Similarly
$$ \int_T \frac{\partial \phi_i}{\partial y} \frac{\partial \phi_j}{\partial y} \, dx = \frac{h_x}{h_y} \int_{\hat{T}} \frac{\partial \hat\phi_i}{\partial \eta} \frac{\partial \hat\phi_j}{\partial \eta} \, d\xi \, d\eta. $$
Hence we have
$$ K_T = \frac{h_y}{6 h_x} \begin{pmatrix} 2 & -2 & -1 & 1 \\ -2 & 2 & 1 & -1 \\ -1 & 1 & 2 & -2 \\ 1 & -1 & -2 & 2 \end{pmatrix} + \frac{h_x}{6 h_y} \begin{pmatrix} 2 & 1 & -1 & -2 \\ 1 & 2 & -2 & -1 \\ -1 & -2 & 2 & 1 \\ -2 & -1 & 1 & 2 \end{pmatrix}. $$
We then assemble each $K_T$ into the global stiffness matrix by relating the local nodal numbering to the global numbering. Let the global numbering of the vertices of $T$ be $\{i, j, k, l\}$. Then the local matrix $K_T$ is stored in the corresponding submatrix of the global matrix $K$. (Note here that the submatrix is formed by keeping the $i$th, $j$th, $k$th and $l$th rows and columns of the matrix $K$.) The global matrix is obtained by adding all the contributions from the local matrices.
We now calculate the mixed partial derivative matrix $M$. Let $M_T$ be the local matrix associated with the element $T$. Then
$$ (M_T)_{ij} = \int_T \frac{\partial^2 \phi_j}{\partial x \partial y} \frac{\partial^2 \phi_i}{\partial x \partial y} \, dx = \frac{1}{h_x h_y} \int_{\hat{T}} \frac{\partial^2 \hat\phi_j}{\partial \xi \partial \eta} \frac{\partial^2 \hat\phi_i}{\partial \xi \partial \eta} \, d\xi \, d\eta. $$
Hence we have
$$ M_T = \frac{1}{h_x h_y} \begin{pmatrix} 1 & -1 & 1 & -1 \\ -1 & 1 & -1 & 1 \\ 1 & -1 & 1 & -1 \\ -1 & 1 & -1 & 1 \end{pmatrix}. $$
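The two local matrices can be coded directly from these closed forms; a sketch:

```python
import numpy as np

S1 = np.array([[ 2, -2, -1,  1], [-2,  2,  1, -1], [-1,  1,  2, -2], [ 1, -1, -2,  2]], float)
S2 = np.array([[ 2,  1, -1, -2], [ 1,  2, -2, -1], [-1, -2,  2,  1], [-2, -1,  1,  2]], float)
R  = np.outer([1, -1, 1, -1], [1, -1, 1, -1]).astype(float)

def local_matrices(hx, hy):
    """Local stiffness K_T and mixed-derivative matrix M_T on an hx-by-hy rectangle."""
    K_T = (hy / (6.0 * hx)) * S1 + (hx / (6.0 * hy)) * S2
    M_T = R / (hx * hy)
    return K_T, M_T
```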
We then assemble each MT into the global matrix M in the same way that the stiffness matrix was assembled.
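A sketch of this assembly loop, assuming a uniform mesh and the `local_matrices` helper from the previous sketch; the scatter-add relies on `coo_matrix` summing duplicate entries on conversion.

```python
import scipy.sparse as sp

def assemble(elements, n_dofs, hx, hy):
    """Scatter-add local matrices into global sparse K and M.

    `elements` is an iterable of 4-tuples (i, j, k, l) of global node numbers,
    ordered to match the vertices of the reference square."""
    K_T, M_T = local_matrices(hx, hy)
    rows, cols, kvals, mvals = [], [], [], []
    for conn in elements:
        for a in range(4):
            for b in range(4):
                rows.append(conn[a]); cols.append(conn[b])
                kvals.append(K_T[a, b]); mvals.append(M_T[a, b])
    K = sp.coo_matrix((kvals, (rows, cols)), shape=(n_dofs, n_dofs)).tocsr()
    M = sp.coo_matrix((mvals, (rows, cols)), shape=(n_dofs, n_dofs)).tocsr()
    return K, M
```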
Numerical results
Real life images
We would like to recover some real life images. Consider an image of size $m \times n$. We define a tensor product partition of the square using the collection of points $\{(x_i, y_j) : 1 \le i \le m, \; 1 \le j \le n\}$, with uniformly spaced $x_i$ and $y_j$. Each pixel of the image is then associated with a grid point of the partition.
Since we know the images before the noise has been applied to them, we will use the peak signal-to-noise ratio (PSNR) to compare the results. Let the original image be given by $I$, and the recovered image by $\tilde{I}$. Then the PSNR is given by
$$ \mathrm{PSNR} = 10 \log_{10} \left( \frac{I_{\max}^2}{\mathrm{MSE}} \right), $$
where $I_{\max}$ is the maximum pixel value of the image, and MSE is the mean square error, given by
$$ \mathrm{MSE} = \frac{1}{mn} \sum_{i=1}^{m} \sum_{j=1}^{n} \big( I_{ij} - \tilde{I}_{ij} \big)^2. $$
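For reference, a direct implementation of this measure (a sketch; the function name is ours):

```python
import numpy as np

def psnr(I, I_rec):
    """Peak signal-to-noise ratio in dB; I and I_rec are arrays of equal shape."""
    mse = np.mean((I.astype(float) - I_rec.astype(float)) ** 2)
    i_max = float(I.max())               # maximum pixel value of the original image
    return 10.0 * np.log10(i_max ** 2 / mse)
```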
We now consider two test images. These images are the Lena image and Baboon image (see Figure 1). We will apply both Gaussian and impulsive noise to these images. The Gaussian noise has zero mean and variances 0.05 and 0.1, and the salt and pepper noise has densities from 30% through to 80%.
Lena image (a) and Baboon image (b).
We will now use the three different splines to reconstruct the images. As an example, we will first consider the images corrupted with Gaussian noise with variance 0.05, and impulsive noise of density 60%. In the first image of Figure 2, we show the noisy Lena image. The next three images show the reconstructed images obtained by the three splines. The results for the Baboon image are shown in Figure 3.
Noisy Lena image ($\sigma^2 = 0.05$, 60% salt and pepper noise density) (a), recovered image using gradient penalty spline (b), recovered image using mixed derivative spline (c), recovered image using Laplacian spline (d).
Noisy Baboon image ($\sigma^2 = 0.05$, 60% salt and pepper noise density) (a), recovered image using gradient penalty spline (b), recovered image using mixed derivative spline (c), recovered image using Laplacian spline (d).
We will now show the PSNR for the reconstructed images in Tables 1 to 4.
Table 1. Lena image PSNR for Gaussian (variance 0.05) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 22.13 | 22.01 | 22.22 | 21.90 | 21.42 | 20.91 |
| Mixed | 22.29 | 22.28 | 22.42 | 21.93 | 21.59 | 20.80 |
| Biharm. | 22.98 | 22.95 | 22.30 | 22.09 | 21.74 | 21.48 |
Table 2. Baboon image PSNR for Gaussian (variance 0.05) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 19.14 | 19.09 | 18.81 | 18.48 | 18.47 | 18.37 |
| Mixed | 19.14 | 19.08 | 18.74 | 18.48 | 18.35 | 18.32 |
| Biharm. | 18.89 | 18.71 | 18.47 | 18.12 | 17.84 | 17.83 |
Table 3. Lena image PSNR for Gaussian (variance 0.1) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 21.45 | 21.40 | 20.81 | 20.34 | 20.21 | 19.82 |
| Mixed | 21.77 | 21.69 | 20.80 | 20.44 | 20.21 | 19.85 |
| Biharm. | 22.20 | 22.09 | 21.56 | 21.10 | 20.48 | 20.14 |
Table 4. Baboon image PSNR for Gaussian (variance 0.1) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 18.24 | 18.20 | 17.93 | 18.09 | 17.89 | 17.65 |
| Mixed | 18.18 | 18.17 | 17.89 | 18.06 | 17.86 | 17.56 |
| Biharm. | 18.06 | 17.85 | 17.66 | 17.54 | 17.40 | 17.06 |
Note that we have chosen our parameter λ using generalised cross validation17 and the stochastic trace estimator proposed by Hutchinson.18 We note that this gives a good estimate of the optimal parameter. In Figure 4 we have plotted the PSNR and the generalised cross validation function versus λ; the validation function has been scaled for visualisation purposes. For both plots, the Lena image has been corrupted with Gaussian noise with variance 0.05 and recovered with the mixed derivative spline. In the left plot the impulsive noise density is 30%, while in the right plot it is 40%.
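A sketch of this parameter selection, assuming the matrices of the previous sections; the probe count and function names are our choices. The influence matrix H(λ) = A(AᵀA + λ(K + M))⁻¹Aᵀ is never formed explicitly; its trace is estimated from a few Rademacher probe vectors, à la Hutchinson.18

```python
import numpy as np
import scipy.sparse.linalg as spla

def gcv(lam, A, K, M, z, n_probe=10, seed=0):
    """Generalised cross validation score with a stochastic trace estimate."""
    rng = np.random.default_rng(seed)
    N = len(z)
    S = (A.T @ A + lam * (K + M)).tocsc()
    solve = spla.factorized(S)              # factor once, reuse for every solve

    def H(v):                               # apply the influence matrix to a vector
        return A @ solve(A.T @ v)

    resid = z - H(z)
    tr = np.mean([v @ H(v) for v in rng.choice([-1.0, 1.0], size=(n_probe, N))])
    return (np.sum(resid ** 2) / N) / ((1.0 - tr / N) ** 2)
```

Minimising this score over a grid of λ values reproduces the kind of curve shown in Figure 4.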
Generalised cross validation function and PSNR versus λ for Gaussian noise with variance 0.05 and impulsive noise with densities 30% (a) and 40% (b).
Binary image
We will now apply the same methods to a binary test image (see Figure 5). As an example, consider the image corrupted with Gaussian noise of variance 0.05 and impulsive noise of density 60%. We show the noisy image and the reconstructed images in Figure 6.
Binary image.
Noisy binary image ($\sigma^2 = 0.05$, 60% salt and pepper noise density) (a), recovered image using gradient penalty spline (b), recovered image using mixed derivative spline (c), recovered image using Laplacian spline (d).
We will now show the PSNR for the reconstructed images in Tables 5 and 6.
Table 5. Binary image PSNR for Gaussian (variance 0.05) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 14.77 | 14.66 | 14.37 | 14.19 | 13.77 | 13.47 |
| Mixed | 15.37 | 15.21 | 15.03 | 14.84 | 14.43 | 14.00 |
| Biharm. | 15.17 | 14.93 | 14.86 | 14.49 | 14.39 | 13.72 |
Table 6. Binary image PSNR for Gaussian (variance 0.1) and different impulsive noise densities.

| Noise density | 30% | 40% | 50% | 60% | 70% | 80% |
| --- | --- | --- | --- | --- | --- | --- |
| Grad. | 13.24 | 13.05 | 13.01 | 12.61 | 12.49 | 12.20 |
| Mixed | 13.58 | 13.38 | 13.27 | 13.03 | 12.86 | 12.50 |
| Biharm. | 13.36 | 13.10 | 12.23 | 12.88 | 12.89 | 12.56 |
Continuous functions
We would now like to recover continuous functions. We define a tensor product partition of the square using a uniform grid of points in each coordinate direction. We sample the function value at each point of the partition and then add Gaussian noise of variance 0.05. We then refine the partition several times, halving the mesh size h at each iteration. We now consider the first test function f, which is shown restricted to the domain in Figure 7.
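A sketch of this experiment; the domain and the helper steps are illustrative assumptions (with 20 points per direction on a side of length 2, the initial mesh size is h = 2/19, matching Table 7), and `f` stands for the test function.

```python
import numpy as np

rng = np.random.default_rng(7)

n0 = 20                                   # illustrative initial grid: 20 x 20 points
xs = np.linspace(-1.0, 1.0, n0)           # illustrative domain [-1, 1]^2
X, Y = np.meshgrid(xs, xs)
Z = f(X, Y) + rng.normal(0.0, np.sqrt(0.05), size=X.shape)   # noisy observations

for i in range(6):                        # refine the FE mesh; the data stay fixed
    h = (2.0 / (n0 - 1)) / 2 ** i         # h = 2/19, 1/19, 1/38, ...
    # ... assemble A, K, M on the refined mesh, solve, and record the PSNR
```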
The test function f restricted to the domain (a), function recovered using gradient penalty spline (b), function recovered using mixed derivative spline (c), function recovered using Laplacian spline (d).
We compare the PSNR values for the recovered function and the original function at different steps of refinement in Table 7, where each refinement step i is identified by its mesh size h.
Table 7. PSNR for f using different penalty terms.

| i | h | Grad. | Mixed | Biharm. |
| --- | --- | --- | --- | --- |
| 0 | 2/19 | 19.85 | 24.31 | 25.54 |
| 1 | 1/19 | 20.01 | 24.48 | 26.44 |
| 2 | 1/38 | 19.59 | 24.61 | 26.61 |
| 3 | 1/76 | 19.16 | 24.67 | 26.69 |
| 4 | 1/152 | 18.51 | 24.70 | 26.72 |
| 5 | 1/304 | 17.95 | 24.71 | 26.73 |
We can see that the PSNR values do not deteriorate under refinement for the spline with the mixed derivative penalty and the spline with the Laplacian penalty, whereas the PSNR values decrease for the spline with the gradient penalty. This is due to the fact that the stability constant depends on the mesh-size h for the spline with the gradient penalty.
We show the functions recovered after the fifth iteration in Figure 7. We can see that the gradient penalty spline produces a recovered function that overfits the noisy data. On the other hand, both the mixed derivative and the Laplacian splines produce smoother recovered functions.
We want to see the effect of the mesh-size on the spurious spikes in the recovered function. In Figure 8, we show the functions recovered using the gradient penalty spline with the coarser mesh-sizes h = 2/19 and h = 1/19. These pictures show that the spurious spikes are still present, although they are slightly smaller in the coarser mesh results.
Function recovered using gradient penalty spline with h = 2/19 (a), function recovered using gradient penalty spline with h = 1/19 (b).
We will now consider a second test function g, which is shown restricted to the domain in Figure 9. We have tabulated the PSNR values for the different splines at different levels of refinement in Table 8. The results are similar to the first example, but the spurious spikes in the function recovered with the gradient penalty formulation have not affected the PSNR values much in this example.
The test function g restricted to the domain (a), function recovered using gradient penalty spline (b), function recovered using mixed derivative spline (c), function recovered using Laplacian spline (d).
Table 8. PSNR for g using different penalty terms.

| i | h | Grad. | Mixed | Biharm. |
| --- | --- | --- | --- | --- |
| 0 | 2/19 | 19.49 | 21.24 | 22.19 |
| 1 | 1/19 | 19.01 | 21.17 | 22.29 |
| 2 | 1/38 | 18.76 | 21.15 | 22.33 |
| 3 | 1/76 | 18.33 | 21.13 | 22.31 |
| 4 | 1/152 | 18.16 | 21.12 | 22.30 |
| 5 | 1/304 | 18.82 | 21.11 | 22.29 |
We show the functions recovered after the fifth iteration in Figure 9. Again, we see that the gradient penalty spline overfits the data.
Discussion
We compared three different bivariate L-spline approaches for removing the mixture of Gaussian and impulsive noise from images. We found that for the Lena image, the Laplacian penalty produced recovered images with the best PSNR. However, we found that the Laplacian penalty performed the worst when recovering the Baboon image. The gradient and mixed derivative penalties performed very similarly to each other when recovering the two real life images. For the Binary image, we found that the mixed derivative penalty performed the best, followed by the Laplacian penalty and then the gradient penalty.
We then applied the same approaches to recover two continuous functions from a set of noisy observations. We found that for both functions, the Laplacian penalty produced the best recovered functions, closely followed by the mixed derivative penalty. The gradient penalty produced recovered functions that overfitted the data.
The overfitting occurred because the gradient penalty formulation is not well-posed in the continuous setting. For dimensions $d > 1$, we have that $H^1(\Omega) \not\subset C^0(\bar\Omega)$ (by the Sobolev embedding theorem). This ill-posedness exhibits itself when the mesh size goes to zero.19 The other formulations, however, are well-posed in the continuous setting. This is because $H^1_{\mathrm{mix}}(\Omega) \subset C^0(\bar\Omega)$ for any dimension d,16 and $H^2(\Omega) \subset C^0(\bar\Omega)$ for dimensions $d \le 3$.
Overall, the gradient penalty spline was the simplest to implement and the most computationally efficient. However, as it is not well-posed for d > 1, it often produces spurious results. The computational cost of the spline with the mixed derivative penalty is very close to that of the gradient penalty spline, and it is well-posed for all dimensions.11 The Laplacian penalty was the least simple to implement and the least computationally efficient. Moreover, the spline with the Laplacian penalty is also not well-posed when d > 3. Therefore, we find that the spline with the mixed derivative penalty is the best choice among the three splines presented.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
ORCID iD
Bishnu P Lamichhane
References

1. Adams R. Sobolev spaces. New York: Academic Press, 1975.
2. Brenner S, Scott L. The mathematical theory of finite element methods. New York: Springer-Verlag, 1994.
3. Ciarlet P. The finite element method for elliptic problems. Amsterdam: North Holland, 1978.
4. Lamichhane B, Roberts S, Hegland M. A new multivariate spline based on mixed partial derivatives and its finite element approximation. Appl Math Lett 2014; 35: 82–85.
5. Preußer T, Rumpf M. An adaptive finite element method for large scale image processing. J Visual Commun Image Represent 2000; 11: 183–195.
6. Ferrant M, Nabavi A, Macq B, et al. Registration of 3-D intraoperative MR images of the brain using a finite-element biomechanical model. IEEE Trans Med Imaging 2001; 20: 1384–1397.
7. Besdok E. Impulsive noise suppression from images by using Anfis interpolant and Lillietest. EURASIP J Appl Signal Process 2004; 2423–2433.
8. Wang Z, Qi F, Zhou F. A discontinuous finite element method for image denoising. In: Image analysis and recognition, lecture notes in computer science, vol. 4141. Berlin/Heidelberg: Springer, 2006, pp. 116–125.
9. Demaret L, Iske A. Adaptive image approximation by linear splines over locally optimal Delaunay triangulations. IEEE Signal Process Lett 2006; 13: 281–284.
10. Lamichhane B. Finite element techniques for removing the mixture of Gaussian and impulsive noise. IEEE Trans Signal Process 2009; 57: 2538–2547.
11. Lamichhane BP. Removing a mixture of Gaussian and impulsive noise using the total variation functional and split Bregman iterative method. In: Sharples J and Bunder J (eds) Proceedings of the 17th biennial computational techniques and applications conference, CTAC-2014, ANZIAM J, vol. 56, 2015, pp. C52–C67, http://journal.austms.org.au/ojs/index.php/ANZIAMJ/article/view/9316 (accessed 27 October 2015).
12. Quarteroni A, Valli A. Numerical approximation of partial differential equations. Berlin: Springer-Verlag, 1994.
13. Garcke J, Hegland M. Fitting multidimensional data using gradient penalties and the sparse grid combination technique. Computing 2009; 84: 1–25.
14. Ramsay T. Spline smoothing over difficult regions. J R Stat Soc Series B 2002; 64: 307–319.
15. Lamichhane B. A mixed finite element method for the biharmonic problem using biorthogonal or quasi-biorthogonal systems. J Sci Comput 2011; 46: 379–396.
16. Schmeisser HJ, Triebel H. Topics in Fourier analysis and function spaces. 1st ed. Chichester: Wiley, 1987.
17. Wahba G. Spline models for observational data. Series in applied mathematics, vol. 59. 1st ed. Philadelphia: SIAM, 1990.
18. Hutchinson M. A stochastic estimator of the trace of the influence matrix for Laplacian smoothing splines. Commun Stat Simul Comput 1989; 18: 1059–1076.