In emerging mobile aggregation applications (e.g., large-scale mobile surveys), individual privacy is a crucial factor in determining effectiveness, and the noise-addition method (i.e., adding a random noise value to the true value) is a simple yet powerful way to protect it. However, improper additive noise can bias the aggregate result, which calls for an optimal noise distribution that limits the deviation. In this paper, we develop a mathematical framework to derive the optimal noise distribution that provides privacy protection under the constraint of a limited value deviation. Specifically, we first derive a generic system dynamic function that the optimal noise distribution must satisfy and then investigate two special cases for the distribution of the original value (i.e., Gaussian and truncated Gaussian). Our theoretical and numerical analysis suggests that the Gaussian distribution is the optimal solution for Gaussian input and the asymptotically optimal solution for truncated Gaussian input.
1. Introduction
With the advance of the information age, data aggregation has become widely used in daily life and commercial applications; some companies, such as Canalys, even make a living from providing statistics. In aggregation applications, the server wishes to distill valuable aggregate statistics from a mass of individual data. For example, CarTel [1] learns traffic conditions from road information collected by mobile phones. BikeNet [2] measures air and road conditions to guide cyclists, where all the data is contributed by users' devices.
However, individual privacy may be violated during aggregation. The server is able to obtain the individual data of participants from their inputs. Much of this information, such as health condition and income, is private to individuals, which matters especially in the presence of a curious server or data abuse. In fact, the server only needs the aggregate result, not the individual data. Thus, in aggregation applications, calculating aggregate statistics without compromising individual privacy is an important challenge.
Secure Multiparty Computation (SMC) is one way to solve this problem. It usually relies on cryptographic methods, operating in the ciphertext domain. However, it has several limitations. Firstly, because of its huge overhead, SMC is not suitable for large-scale systems. Secondly, most SMC methods require collaboration among parties, which is impractical in some circumstances (e.g., in a wireless network a node may not be connected to others at all times). Thirdly, both encryption/decryption and communication are power-hungry operations, which limits SMC deployment on energy-sensitive devices (e.g., sensors or phones). Therefore, SMC is not suitable for large-scale energy-constrained environments such as large-scale mobile survey applications. Furthermore, brute-force attacks are effective against cryptographic methods when the plaintext space is small; some methods such as [3] even compute the aggregate result based on this property. In contrast, noise addition, which prevents the adversary from obtaining the accurate individual data values, is a simple but effective method. Compared to SMC, it is much simpler and more efficient, especially in this environment: without any collaboration, each participant independently adds noise to his data before uploading. However, choosing the noise distribution is a difficult question. Improper additive noise can bias the aggregate result. What is needed is an optimal noise distribution that provides the best protection of individual privacy while keeping the bias of the aggregate result tolerable. This optimal noise distribution is not evident; usually a noise distribution (typically homogeneous or Gaussian) is simply proposed without justification.
In aggregation applications, the accuracy of the result and the privacy of individuals are the two main concerns. Our goal is to find the optimal noise distribution for the noise-addition method, under which individual privacy is best protected subject to a given accuracy requirement. The main contributions of this paper are as follows.
We formulate the accuracy and privacy metrics by mean and variance and mutual information, respectively, which are the foundations for choosing proper noise distribution.
Based on the accuracy and privacy metrics, we develop a mathematical framework to derive the optimal noise distribution.
We get the generic system dynamic function that the optimal noise distribution must satisfy, where the input is the distribution of original individual data.
We solve the problem for two input cases. For Gaussian input, we obtain the theoretically optimal solution. For truncated Gaussian input, we first identify when it can be approximated by a Gaussian distribution, so that the Gaussian-input solution can be employed directly; then, for arbitrary truncated Gaussian input, we show that the Gaussian distribution is the asymptotically optimal solution.
The rest of the paper is organized as follows. Related work is introduced in Section 2. We formulate the problem in Section 3. In Section 4 we give the general solution and investigate the Gaussian input. In Section 5 the truncated Gaussian input is analyzed in detail. In Section 6 we numerically verify the conclusions and compare the privacy-preserving capability of three proposed noise distributions. Finally, the paper is concluded in Section 7.
2. Related Work
SMC enables parties to calculate a result collaboratively from their own data without compromising each other's privacy. However, it has many limitations. In [4] a secure sum protocol was described, where the summation is calculated serially, which is too slow for large-scale systems. Another protocol [3] allows an untrusted server to calculate the summation. It requires that the sum of the parties' keys is . If one of the parties leaves during the process, which is common in large-scale systems, the summation cannot be calculated. Jung et al. [5] proposed a linear-time protocol without a secure channel, but it still requires substantial communication among parties. Moreover, in these methods each party has to communicate with the others and perform many mathematical operations, both of which are power-hungry. Although CPDA [6] reduces the communication overhead of SMC for wireless sensor networks, it is still much more complex than other methods. So SMC is not suitable for energy-constrained devices.
In ad hoc networks, other methods have been explored to protect privacy during data aggregation while reducing energy cost. A cryptography-based aggregation approach proposed in [7] leverages a simple, secure, additively homomorphic stream cipher, but it requires all nodes to share their keys with the sink node so that the sink can decrypt the encrypted aggregate result. In SMART [6], the original data is sliced into several pieces and recombined randomly. This method calculates the summation securely, but the communication cost increases severalfold. GP2S [8] is based on data generalization: it replaces the original data with an integer range, from which the data collector plots a histogram without knowing the accurate original values. However, the summation calculated from the histogram is not accurate.
Noise addition has been studied for many years in secure data mining [9]. It prevents the adversary from obtaining accurate individual data values. Plenty of schemes have been proposed to preserve the privacy of individual records. Most of them, such as [10, 11], do not claim whether their methods are optimal. Furthermore, they utilize the covariance of the data in the database, which requires the party adding the noise to know global information about the data. In some schemes the noise is added without considering the covariance of the data, but a uniform or Gaussian distribution is simply declared [12, 13]. In [14] the authors considered the optimal randomization given the bias of the results, but they did not solve for it. Meanwhile, some researchers [12, 15] found that the original data distribution can be reconstructed from perturbed values, although individual privacy is still not violated. To the best of our knowledge, no prior work focuses entirely on optimizing the noise-addition scheme.
There are several different measures of privacy. In [12] privacy is measured by a “confidence interval”: if the data of concern x lies in the interval with at least a certain probability , the length of the interval is treated as a privacy measure. However, this measure is not accurate. Mutual information or differential entropy from Shannon's information theory is another, much more popular, privacy metric [15]; it indicates the average privacy and is supported by mathematical theory. Rényi entropy (an extension of Shannon entropy) has also been used to measure privacy [16], but it is complex and lacks an obvious physical meaning.
In recent years differential privacy [17] has become a popular noise-addition technique for protecting individual privacy in data mining. It guarantees the accuracy of a statistical result while avoiding the disclosure of individual records. Ghosh et al. [18] found the optimal noise distribution that provides the most accurate result under a given privacy requirement. However, differential privacy defends against an adversary who infers individual records from multiple statistical results. In our setting, the adversary can obtain the individual records directly, and we focus on a single aggregation process.
3. Problem Formulation
In this section, firstly we introduce some aggregation applications where the violation of individual privacy potentially exists and noise addition method is appropriate. Then we quantify the accuracy and privacy requirements. Finally, based on the measurements the optimization problem is presented.
3.1. Applications
The individual privacy is potentially threatened in statistics aggregation applications. There are many examples, including
sensor network aggregation; in sensor network applications, many energy-constrained sensors are widely deployed to monitor the surrounding environment and send data to a central server for aggregation. However, the data from individual sensors may contain privacy-sensitive information, especially if the sensors are deployed in personal spaces, confidential institutions, or across multiple companies. So energy-efficient privacy protection in aggregation is an important issue;
mobile survey applications; in these applications, tens of thousands of participants exist and the phones are energy-constrained. The overall results are distilled from a large amount of individual information collected by mobile phones. However, the individual privacy may be violated during information collection.
In these large-scale energy-constrained applications, the server needs to know the aggregate results, which are distilled from the information of individuals, yet individual privacy may be violated during collection. Noise addition, which protects individual privacy by adding noise to the individual data, is a simple but efficient method here: it can be employed independently by individual devices without collaboration, and its operations are energy-efficient compared with SMC. To describe the problem more precisely, we now formulate it mathematically.
3.2. Accuracy and Privacy Measurement
Suppose that there are n users with values , , and a server calculating aggregate statistics. In this paper, we focus on a simple but common statistic: summation. The server computes the aggregation function . There are, of course, other aggregation types; besides summation, Popa et al. [19] list classes such as average, standard deviation, and count. All of them can be constructed from summation, as outlined in Table 1.
Table 1: Aggregation function list.
Aggregation function: Construction with summation
Count: the value of each individual is
Average:
Standard deviation:
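The reductions in Table 1 can be sketched as follows. This is an illustrative sketch; the function names are ours, not the paper's.

```python
import math

def summation(values):
    """The base aggregate: the server sums all contributed values."""
    return sum(values)

def count(values):
    """Count: each individual contributes the constant 1."""
    return summation([1] * len(values))

def average(values):
    """Average: sum of the values divided by the count."""
    return summation(values) / count(values)

def std_dev(values):
    """Standard deviation from two summations: sum(x) and sum(x^2)."""
    n = count(values)
    mean = summation(values) / n
    mean_sq = summation([v * v for v in values]) / n
    return math.sqrt(mean_sq - mean * mean)
```

Each derived statistic needs only one or two summation queries, so a privacy-preserving summation primitive covers all of them.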
Two parties threaten individual privacy. One is the server, who could obtain the individual data through aggregation. The other is an eavesdropper, who could capture the packets sent from the participants to the server. Both of them (referred to as the attacker) can obtain the individual value, which is regarded as the individual privacy.
To protect individual privacy during the aggregation, user adds random noise to his/her true value. Instead of , contributes the perturbed value to the server. At most, the attacker knows all the perturbed values and the scheme by which the noise is generated. So we suppose the attacker knows and the distributions of and , and tries to recover from this information. The aim of the noise is to prevent the attacker from obtaining the accurate true value.
Obviously different noise distributions have different privacy protection capability. To protect the true value, how to choose a good noise distribution is the key issue. Noise is a random variable with the probability density function (pdf) . To meet the requirements of accuracy and privacy, should satisfy
accuracy requirement; the difference of and is small;
privacy requirement; the confusion of the true value is evident.
The first requirement guarantees that the aggregate result does not deviate from the true result too much. The second one guarantees the individual privacy is not violated. If the attacker gets the user's value, he still doubts it because of the existence of noise.
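A minimal sketch of this mechanism: each user independently perturbs before reporting, and the server sees only perturbed values. Gaussian noise is used here purely for illustration; which noise distribution to use is exactly the question this paper answers.

```python
import random

def perturb(true_value, noise_std):
    """User-side: add independent zero-mean noise before reporting."""
    return true_value + random.gauss(0.0, noise_std)

def aggregate(reported_values):
    """Server-side: sum the perturbed reports. Independent zero-mean
    noises tend to cancel, so the sum stays near the true sum."""
    return sum(reported_values)

random.seed(42)
true_values = [random.uniform(0, 10) for _ in range(10000)]
reports = [perturb(v, noise_std=2.0) for v in true_values]
error = aggregate(reports) - sum(true_values)
# error is a sum of 10000 independent N(0, 4) noises: std about 200
```

The aggregate error grows like the square root of n while the true sum grows like n, which is why the perturbed aggregate remains useful even though each individual report is obscured.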
3.2.1. Accuracy Measurement
For accuracy requirement, we define the difference
where n is the number of participants. Ideally is constantly equal to zero, but that is impossible. Since are random variables, is also a random variable, with expectation and variance . are independent, where has expectation and variance , respectively. If they satisfy Lindeberg's condition [20], follows a Gaussian distribution regardless of the distributions of the individual noise; it is determined only by the expectation and the variance, that is, and . We wish to keep small with high probability, which requires that and be small. Therefore, we quantify the accuracy requirement as
with an additional condition . It measures the average deviation tolerance of the perturbed result from the true result. Suppose that are independent and identically distributed random variables with the expectation and the variance . The accuracy requirement U is simplified as
with .
U measures the average deviation tolerance of the perturbed result from the true result via the variance of the noise distribution. If two zero-mean noise distributions and satisfy , then guarantees the accuracy of the result better.
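By the central limit theorem argument above, the per-user deviation D = (1/n) sum(z_i) is approximately Gaussian with mean mu_Z and variance sigma_Z^2 / n, whatever the individual noise distribution. A quick empirical check, using uniform noise on [-w, w] (variance w^2/3) as an illustrative choice:

```python
import random
import statistics

def mean_deviation(n_users, half_width):
    """One sample of D = (1/n) * sum(z_i), uniform noise on [-w, w]."""
    return sum(random.uniform(-half_width, half_width)
               for _ in range(n_users)) / n_users

random.seed(7)
n, w, trials = 2000, 3.0, 500
samples = [mean_deviation(n, w) for _ in range(trials)]

# Predicted by the CLT: mean 0, variance (w^2 / 3) / n = 0.0015
emp_mean = statistics.mean(samples)
emp_var = statistics.pvariance(samples)
```

The empirical mean and variance of D match the prediction, so controlling the noise variance is enough to control the deviation of the aggregate.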
3.2.2. Privacy Measurement
Consider
where Y, X, and Z are random variables denoting the perturbed value, the true value, and the noise, respectively. Z is independent of X. Suppose that the adversary knows the distribution of Z; this is reasonable, since any user, including a malicious one, must know it to generate noise. Because of the perturbation by the noise z, the adversary is uncertain about x when he obtains y. We use Shannon's information entropy to measure this uncertainty. Suppose that the adversary obtains ; the uncertainty of X is then measured by . The larger is, the better the privacy protection provided at .
For different y, is different. We use the average to quantify the privacy protection strength (denoted by V) of the noise; that is,
denotes the average uncertainty of the true value when the perturbed value is captured. The larger V is, the higher the average uncertainty is.
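Since Z is independent of X, the privacy metric can be computed as I(X;Y) = h(Y) - h(Z) for Y = X + Z. The sketch below evaluates this numerically on a grid (the grid and discretization are our own assumptions, not from the paper); for Gaussian X and Z of equal variance, the analytic value is (1/2) ln 2, about 0.347 nats.

```python
import numpy as np

def entropy(p, dx):
    """Differential entropy (nats) of a density sampled on a uniform grid."""
    p = np.clip(p, 1e-300, None)
    return -np.sum(p * np.log(p)) * dx

def mutual_info_additive(px, pz, dx):
    """I(X; X+Z) = h(Y) - h(Z) for independent X and Z."""
    py = np.convolve(px, pz) * dx   # density of Y = X + Z on a wider grid
    return entropy(py, dx) - entropy(pz, dx)

x = np.linspace(-12, 12, 4801)
dx = x[1] - x[0]
gauss = np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)    # N(0, 1)
mi = mutual_info_additive(gauss, gauss, dx)       # close to 0.5 * ln 2
```

The convolution gives the density of Y directly from the independence of X and Z, so the only approximation is the discretization of the two densities.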
Generally speaking, for noise addition, accuracy and privacy are in tension: high accuracy leads to low privacy protection strength, and vice versa. However, for a given accuracy level, different usually provide different privacy protection capability. Thus, finding the noise distribution that provides the best privacy protection under the accuracy constraint is the key problem.
3.3. Optimization Problem Formulation
For convenience, in the following we consider the continuous distributions. The discrete distribution can be regarded as the approximation of the corresponding continuous distribution. Consider the formulation , where X and Z are random variables with pdf and pdf , respectively. We will find the optimal providing the best privacy protection while guaranteeing that the result has an acceptable deviation; that is, . Consider the optimization problem
where is the accuracy requirement bound required by applications.
Consider
Since is deterministic, is a constant. Thus the optimization problem is translated to
4. Problem Solution
To solve the problem posed in the previous section, we first derive the general solution. Then, for the special case where the original data follows a Gaussian distribution, a further result is shown.
4.1. General Solution
To solve problem (8), firstly we consider a more general problem
where . If Z is independent of X, we have
The constraints become
This problem then reduces to problem (8); in other words, problem (8) is a special case of problem (9).
The mutual information is
where .
We use the method of Lagrange multipliers to find the solution. Set up the functional
Differentiating with respect to , we have
Setting and ,
Thus
Here we get the expression of . In problem (8), and Z is independent of X. From (16),
Since Z and X are independent, is the same for any x; can be calculated by fixing x (e.g., ). So simplifies to , where λ, , and are constants for fixed . For convenience, and are abbreviated as ω and C. Therefore, we have the following theorem.
Theorem 1.
Given the accuracy requirement , the noise providing the best privacy protection has the pdf
where , and λ, ω, and C are related to the constraints , , and , respectively.
Based on the theorem the corresponding system diagram is constructed in Figure 1, where is the input and is the output. The system contains two operations. One is convolution of the input and the output. The other is multiplication of the convolution result and the factor “.”
The system diagram.
From the theorem and system diagram, the optimal noise distribution is determined by the distribution of the original value. Therefore, for different aggregation application, the optimal noise distribution may be different.
is a convex function of in problem (9) [21], so problem (8) is also a convex optimization problem. The constraints satisfy the sufficient conditions of the KKT approach (the inequality constraint is a continuously differentiable convex function; the equality constraints are affine [22]). Hence, if we find one satisfying (18), it is the global optimal solution. So, given the distribution of the original value , we only need to find a solution of (18); that is the optimal noise distribution.
4.2. Gaussian Distribution Input
Generally for different input , the output is different. We consider a special but popular case that X follows Gaussian distribution.
When , it is easy to check that is a solution of (18), where
Thus is the solution of problem (8), where . Therefore we have the following theorem.
Theorem 2.
When X obeys Gaussian distribution, the noise which obeys Gaussian distribution with the expectation and the variance protects the individual privacy best.
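Under Theorem 2, with Gaussian X of variance sigma_X^2 and Gaussian noise of variance sigma_Z^2, the residual leakage has the classic Gaussian-channel closed form I(X;Y) = (1/2) ln(1 + sigma_X^2 / sigma_Z^2), so the attacker's information falls as the allowed noise variance grows. A small sketch (values in nats):

```python
import math

def gaussian_leakage(var_x, var_z):
    """I(X; X+Z) in nats when both X and the noise Z are Gaussian."""
    return 0.5 * math.log(1.0 + var_x / var_z)

# More noise (a looser accuracy bound) means less leakage:
low_noise = gaussian_leakage(1.0, 0.5)
high_noise = gaussian_leakage(1.0, 4.0)
```

This closed form makes the accuracy-privacy trade-off explicit: halving the leakage target tells you directly how much noise variance, and hence aggregate deviation, must be accepted.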
5. Truncated Gaussian Distribution Input
In practice, X usually has maximum and minimum bounds. For example, a person's height has a maximum bound and is not less than . An examination score usually lies in . So we consider the truncated Gaussian distribution on the range . In Section 5.1 we investigate the condition under which the truncated Gaussian distribution can be approximated by a Gaussian distribution, so that Theorem 2 can be applied directly. In Section 5.2 we revise the condition to make it more accurate. However, not every truncated Gaussian distribution can be approximated by a Gaussian distribution; in Section 5.3, for an arbitrary truncated Gaussian distribution, we show that the Gaussian distribution is still a nearly optimal noise distribution.
5.1. Approximation Condition
We use the metric [15] to measure the difference of two distributions, where
This difference metric measures the overlap of the two distributions, which lies in the interval . The smaller is, the more overlap the two distributions have, and the more similar they are. implies that the two distributions are exactly the same, while means there is no overlap between them.
Suppose . is a pdf of the truncated Gaussian distribution over ; that is,
where is the cumulative distribution function of the standard normal distribution.
When , can be approximated by , where
This metric is equivalent to Kullback-Leibler divergence , where
is an increasing function of , so the two metrics are equivalent.
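For intuition on the difference metric, consider the simplest case, where the truncated Gaussian keeps the same mu and sigma as the parent N(mu, sigma^2) and is cut at mu +/- c*sigma. This is an illustrative special case of ours, not the paper's general setting. On the support, the two densities differ only by the truncated mass, so the KL divergence collapses to -ln P(|X - mu| <= c*sigma), which vanishes quickly as the range widens:

```python
import math

def std_normal_cdf(t):
    """Phi(t), the standard normal CDF, via the error function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def kl_truncated_vs_full(c):
    """KL( N(0,1) truncated to [-c, c] || N(0,1) ) = -ln(2*Phi(c) - 1)."""
    mass = 2.0 * std_normal_cdf(c) - 1.0
    return -math.log(mass)

narrow = kl_truncated_vs_full(1.0)   # about 0.38 nats: poor approximation
wide = kl_truncated_vs_full(5.0)     # under 1e-6: nearly indistinguishable
```

This matches the qualitative condition of this section: once the truncation range covers several standard deviations, treating the input as Gaussian costs almost nothing.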
Since , we have and . So satisfies
Given , there are lower and upper bounds for , that is, < . When , . Thus
When , . So
Therefore, given the bound of the difference metric , . For (24) and (26), the range satisfies
If , the length of is
For a fixed η, from , we get the minimum as
where
In this case . The expectation and the variance of are
Since , .
Therefore, given the difference bound , if the range of follows the condition (27), can be approximated by . The corresponding optimal noise distribution is Gaussian distribution. In particular, if , the condition can be simplified as (30).
By the analysis in this section, for an aggregation application where the original value obeys the truncated Gaussian distribution, if the distribution satisfies condition (27) or (30), it can be approximated by Gaussian distribution, where Theorem 2 can be used directly.
5.2. Approximation Amendment
From the analysis of Section 5.1, when reaches the minimum, . Thus the exact (calculated by (24)) is
Equation (29) gives the approximate , from which . Figure 2 shows the relationship between the minimum length of and η. The smaller η is, the larger is. However, the figure shows a bias between the approximate method and the exact one. The reason is that (29) is based on (26), which contains the approximation that . In practice the difference is not large enough, so a bias remains.
The ratio of minimum length of to for difference bound η before amendment.
In order to reduce the bias, we revise the condition. Based on (32) without the above approximation, from (25) we have
Here, and are the exact value and the approximate value, respectively. Consider (which is reasonable: if η is too large, the approximation by a normal distribution is useless, while if η is too small, it is hard to find such a in practice); then is in the range . Thus is in the range . Thus (29) can be revised as
The effect of revision is illustrated in Figure 3, where approximate result is almost the same as the exact one.
The ratio of minimum length of to for difference bound η after amendment.
5.3. Arbitrary Truncated Gaussian Distribution Input
In the above analysis, when the truncated Gaussian distribution can be approximated by a Gaussian distribution, Theorem 2 can be used directly. However, not every truncated Gaussian distribution admits such an approximation. Truncated Gaussian distributions with variance and expectation take quite a variety of shapes, decided by the range , where . In particular, when converges to infinity, the truncated Gaussian distribution becomes the Gaussian distribution; when equals , it becomes the homogeneous distribution. If the truncated Gaussian distribution cannot be approximated by a Gaussian distribution, does remain the optimal noise? Unfortunately, the Gaussian distribution is not a solution of (18); that is, it is not the optimal noise distribution. However, if makes so small that the difference from the minimum is small enough, it is still a good choice.
Theorem 3.
Suppose that X obeys the truncated Gaussian distribution over with the variance and the expectation . in problem (8) is only determined by and .
Proof.
Suppose that two random variables and follow the truncated Gaussian distributions and , respectively, where is over with variance and expectation , is over with variance and expectation , and . The expectation only decides the position of the function in the coordinate system; it has no effect on entropy or mutual information. For convenience, we suppose that the expectations of the two functions are 0 (i.e., ), so . Next, we will prove , where and are random variables with variances and (k is a positive number)
Set . On one hand, for any , there is . On the other hand, for any , there is . Thus a one-to-one mapping exists between and .
Consider the two functions (37) and (38). Denominators are equal to each other denoted by M. For any in and ,
Thus ,
So .
For any , the pdf of which is with the expectation and the variance , we can find with pdf which satisfies . has the expectation and the variance . There is a one-to-one mapping between and . Thus for any for , we have corresponding for .
The entropy satisfies
Supposing that with pdf and with pdf ,
Similarly,
Since Z and X are independent,
Suppose that minimizes ; the corresponding minimizes too; otherwise we can find another that makes smaller. In the above analysis, we only have the constraint that . In problem (8) actually is the variance of optimal . In this proof we have an implied condition that when satisfies the variance condition for (), satisfies it too for (). Since , . Therefore when and are fixed, the minimum of is a constant.
From Theorem 3, the optimal solution is determined only by , , and . To see how well Gaussian noise performs, the difference between and is shown in Figure 4, where obeys a Gaussian distribution. From the figure, we find that although the Gaussian distribution is not the optimal noise distribution, the deviation from the optimal solution is small. In particular, when is larger than , the bias is very close to . For example, when is , and are and , and , and and . That is because in this case is much like a Gaussian distribution. When is close to , is approximately the homogeneous distribution, and the results remain good. For example, when is , and are and , and , and and . Thus, although the Gaussian distribution is not the optimal noise distribution when X follows a truncated Gaussian distribution, it is a nearly optimal noise distribution.
The difference between and where obeys Gaussian distribution. and are the variances of X and Z, respectively.
6. Numerical Simulation
From Theorem 2, when the original value obeys a Gaussian distribution, the optimal noise distribution is Gaussian too. Besides the Gaussian distribution, the homogeneous distribution (e.g., [12]) and the Laplace distribution (e.g., [17]) are also used in the noise-addition method. Figure 5 shows the privacy-preserving capabilities of these three noise distributions, where is a Gaussian distribution. From the figure, we find that the mutual information, which measures the privacy protection strength, is smallest with Gaussian noise; by adding Gaussian noise, the attacker gets the least information about the true value from the perturbed value. Meanwhile, less information leaks as the accuracy requirement bound increases.
Compare the privacy-preserving capabilities of Gaussian distribution, homogeneous distribution, and Laplace distribution, where is Gaussian distribution with the variance .
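The comparison behind Figure 5 can be checked numerically. The sketch below (grid parameters and variances are our own illustrative choices) computes I(X; X+Z) for a Gaussian input and three zero-mean noise families matched to the same variance; Gaussian noise yields the smallest mutual information, consistent with Theorem 2.

```python
import numpy as np

def entropy(p, dx):
    """Differential entropy (nats) of a density sampled on a uniform grid."""
    p = np.clip(p, 1e-300, None)
    return -np.sum(p * np.log(p)) * dx

def mi(px, pz, dx):
    """I(X; X+Z) = h(Y) - h(Z) for independent X, Z on a grid."""
    return entropy(np.convolve(px, pz) * dx, dx) - entropy(pz, dx)

x = np.linspace(-15, 15, 6001)
dx = x[1] - x[0]
var = 1.0                                    # common noise variance
gauss = np.exp(-x**2 / (2 * var)) / np.sqrt(2 * np.pi * var)
w = np.sqrt(3 * var)                         # uniform on [-w, w]: var w^2/3
uniform = np.where(np.abs(x) <= w, 1 / (2 * w), 0.0)
b = np.sqrt(var / 2)                         # Laplace(b): var 2*b^2
laplace = np.exp(-np.abs(x) / b) / (2 * b)

px = gauss                                   # Gaussian input, variance 1
results = {name: mi(px, pz, dx)
           for name, pz in [("gaussian", gauss),
                            ("uniform", uniform),
                            ("laplace", laplace)]}
```

All three noises allow the same aggregate deviation (equal variance), yet the information leaked to the attacker differs, which is the point of optimizing the noise distribution rather than only its variance.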
Figure 6 illustrates the privacy-preserving capabilities of Gaussian distribution, homogeneous distribution, and Laplace distribution when is a truncated Gaussian distribution. Here we choose as an example, which is in the range of . From the figure, Gaussian noise is still the best one to protect the individual privacy. The difference to the optimal noise distribution has been shown in Figure 4, from which we find when the original value obeys the truncated Gaussian distribution, Gaussian distribution is still a good noise distribution.
Compare the privacy-preserving capabilities of Gaussian distribution, homogeneous distribution, and Laplace distribution, where is the truncated Gaussian distribution .
7. Conclusion
In this paper, we quantify the accuracy of the result and the privacy of individuals. Based on these metrics, we pose the optimization problem of finding the optimal noise distribution that provides the best privacy protection while maintaining an acceptable deviation from the accurate result. For the special cases where the original data of individuals follows a Gaussian distribution or a truncated Gaussian distribution, the Gaussian distribution is the optimal noise distribution and the asymptotically optimal one, respectively.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Acknowledgments
This work was supported in part by National Natural Science Foundation of China under Grant no. 61371192, National Natural Science Foundation of China under Grant no. 61271271, 100 Talents Program of Chinese Academy of Science, the Strategic Priority Research Program of the Chinese Academy of Sciences under Grant XDA06030601, and the Funding of Science and Technology on Information Assurance Laboratory under Grant KJ-13-003.
References
1. B. Hull, V. Bychkovsky, Y. Zhang et al., "CarTel: a distributed mobile sensor computing system," in Proceedings of the 4th ACM International Conference on Embedded Networked Sensor Systems, November 2006.
2. S. B. Eisenman, E. Miluzzo, N. D. Lane, R. A. Peterson, G.-S. Ahn, and A. T. Campbell, "BikeNet: a mobile sensing system for cyclist experience mapping," ACM Transactions on Sensor Networks, vol. 6, no. 1, article 6, 2009.
3. E. Shi, T.-H. H. Chan, E. Rieffel, R. Chow, and D. Song, "Privacy-preserving aggregation of time-series data," in Proceedings of the Network and Distributed System Security Symposium (NDSS '11), February 2011.
4. C. Clifton, M. Kantarcioglu, J. Vaidya, X. Lin, and M. Y. Zhu, "Tools for privacy preserving distributed data mining," ACM SIGKDD Explorations Newsletter, vol. 4, no. 2, pp. 28-34, 2002.
5. T. Jung, X. Mao, X.-Y. Li, S.-J. Tang, W. Gong, and L. Zhang, "Privacy-preserving data aggregation without secure channel: multivariate polynomial evaluation," in Proceedings of the IEEE Conference on Computer Communications (INFOCOM '13), April 2013, pp. 2634-2642.
6. W. He, X. Liu, H. Nguyen, K. Nahrstedt, and T. Abdelzaher, "PDA: privacy-preserving data aggregation in wireless sensor networks," in Proceedings of the 26th IEEE International Conference on Computer Communications (INFOCOM '07), May 2007, pp. 2045-2053.
7. C. Castelluccia, E. Mykletun, and G. Tsudik, "Efficient aggregation of encrypted data in wireless sensor networks," in Proceedings of the 2nd Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services (MobiQuitous '05), July 2005, pp. 109-117.
8. W. Zhang, C. Wang, and T. Feng, "GP2S: generic privacy-preservation solutions for approximate aggregation of sensor data," in Proceedings of the 6th IEEE Annual International Conference on Pervasive Computing and Communications (PerCom '08), March 2008, pp. 179-184.
9. N. R. Adam and J. C. Wortmann, "Security-control methods for statistical databases: a comparative study," ACM Computing Surveys, vol. 21, no. 4, pp. 515-556, 1989.
10. S. R. M. Oliveira and O. R. Zaiane, "Privacy preserving clustering by data transformation," in Proceedings of the 18th Brazilian Symposium on Databases, 2003.
11. C. Su, F. Bao, J. Zhou, T. Takagi, and K. Sakurai, "A new scheme for distributed density estimation based privacy-preserving clustering," in Proceedings of the 3rd International Conference on Availability, Reliability and Security (ARES '08), March 2008, pp. 112-119.
12. R. Agrawal and R. Srikant, "Privacy-preserving data mining," in Proceedings of the ACM International Conference on Management of Data (SIGMOD '00), 2000, pp. 439-450.
13. J. Domingo-Ferrer, F. Sebé, and J. Castellà-Roca, "On the security of noise addition for privacy in statistical databases," in Privacy in Statistical Databases, vol. 3050 of Lecture Notes in Computer Science, pp. 149-161, Springer, Berlin, Germany, 2004.
14. Y. Zhu and L. Liu, "Optimal randomization for privacy preserving data mining," in Proceedings of the International Conference on Knowledge Discovery and Data Mining (KDD '04), August 2004.
15. D. Agrawal and C. C. Aggarwal, "On the design and quantification of privacy preserving data mining algorithms," in Proceedings of the 20th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, May 2001, pp. 247-255.
16. Y. Rachlin, K. Probst, and R. Ghani, "Maximizing privacy under data distortion constraints in noise perturbation methods," in Privacy, Security, and Trust in KDD, vol. 5456 of Lecture Notes in Computer Science, pp. 92-110, Springer, Berlin, Germany, 2009.
17. C. Dwork, "Differential privacy," in Proceedings of the 33rd International Conference on Automata, Languages and Programming (ICALP '06), Part II, 2006, pp. 1-12.
18. A. Ghosh, T. Roughgarden, and M. Sundararajan, "Universally utility-maximizing privacy mechanisms," in Proceedings of the 41st ACM Annual Symposium on Theory of Computing (STOC '09), June 2009, pp. 351-359.
19. R. A. Popa, A. J. Blumberg, H. Balakrishnan, and F. H. Li, "Privacy and accountability for location-based aggregate statistics," in Proceedings of the 18th ACM Conference on Computer and Communications Security (CCS '11), October 2011, pp. 653-665.