Color image steganalysis based on channel gradient correlation

Abstract

It is one of the potential threats to the Internet of Things to reveal confidential messages by color image steganography. The existing color image steganalysis algorithm based on channel geometric transformation measures owns higher accuracy than the others, but it fails to utilize the correlation between the gradient amplitudes of different color channels. Therefore, this article points out that the color image steganography weakens the correlation between the gradient amplitudes of different color channels and proposes a color image steganalysis algorithm based on channel gradient correlation. The proposed algorithm extracts the co-occurrence matrix feature from the gradient amplitude residuals among different color channels and then combines it with the existing color image steganalysis features to train the ensemble classifier for color image steganalysis. The experimental results show that, for WOW and S-UNIWARD steganography, compared with the existing algorithms, the proposed algorithm outperforms the existing algorithms.

Keywords

Color image steganalysis gradient steganography texture

Introduction

With the development and popularization of network technology, various image acquisition networks have been widely deployed in many fields, which have become one of the important forms of Internet of Things. Because the digital image is the body of the data in image acquisition networks, the transmitted image data will not be intercepted by the security protection systems. Therefore, it is possible to maliciously use the steganography to hide sensitive messages into digital images and transmit the images over the Internet, which results in information leakage. As the counterpart of steganography, steganalysis aims to judge if a digital image carries secret messages, further locates the payload, and finally extracts the secret messages. At present, the research of digital image steganalysis mainly focuses on steganography detection of a single-color channel. Either for the traditional steganography algorithms,^1–4 or for the content-adaptive image steganography algorithms developing in recent years,^5–8 researchers have proposed a series of effective steganalysis algorithms for the single-color channel,^9–12 and some can even locate or extract the secret messages in the special cases.¹³

In reality, color images are widely used, which consist of multiple color channels. When the steganographer embeds messages into multiple color channels, often the single-channel steganalysis algorithm is applied to detect each color channel and then to determine if the color image contains secret messages according to the detection results of each channel. However, compared with embedding the same length of message in a single-color channel, the length of messages embedded in each color channel is much shorter, and thus it is more difficult to implement effective detection.^14,15 Therefore, reliable steganalysis of color image steganography is important for practical applications of steganalysis technology. For steganography with color images, based on the characteristic that steganography will increase the number of color or similar color pairs, Fridrich et al.¹⁶ used the ratio of similar color pairs in the color pairs as steganalysis features. To detect the color stego image of least significant bit (LSB) steganography, Su et al.¹⁷ embedded fixed ratio of random information into the investigated image and then extracted the increased numbers of different colors and similar color pairs as features. Abdulrahman et al.¹⁸ calculated the co-occurrence matrices from gradient amplitudes of each channel and their derivatives and then combined them as features to realize color image steganalysis. Goljan et al.¹⁹ extracted the co-occurrence matrices between the residuals of three channels and the Rich Model features of each channel, respectively, and then merged them into the color image steganography detection features—SCRMQ1 (Spatio-Color Rich Model with quantization step $q = 1$ ). Goljan and Fridrich²⁰ divided the image pixels into blocks according to the color filter array (CFA) characteristics from the imaging principle of camera and then computed the co-occurrence matrices of residuals between different channels of each block for steganalysis. According to the characteristics that content-adaptive steganography embeds information into complex textural regions, Liao et al.²¹ first obtained the complex texture regions in all channels and in each channel, respectively, and then calculated the co-occurrence matrices of residuals in each channel of the two types of regions as steganalysis features. Lyu and Farid²² calculated logarithmic prediction errors from the correlations among wavelet subband coefficients of different scales and different color channels in horizontal, vertical, and diagonal directions, respectively, and extracted their statistic features for steganalysis, such as mean, variance, skewness, and kurtosis, which achieves the pure blind detection of color image steganography. To improve the detection performance of least significant bit matching (LSBM) on color images, Liu et al.²³ measured the correlation coefficients among the LSB planes of different color channels and the correlation coefficients between the prediction errors of each channel to capture the influence of steganography on the correlation among different channels. Li et al.²⁴ calculated the prediction errors of channel Y to other channels and extracted the statistical features to realize the detection of the color JPEG image steganography. Abdulrahman et al.^14,15 depicted the texture direction consistency of different channels by the cosine and sine of angles between the gradients of different channels and further extracted steganalysis features based on the gradient direction consistency to improve the detection accuracy of color stego image.

Compared with the separate detection of each color channel, the above steganalysis algorithms effectively decrease the steganalysis error. Particularly, the detection error rates of the algorithm in Abdulrahman et al.¹⁵ for S-UNIWARD (Spatial UNIversal WAvelet Relative Distortion) and WOW (Wavelet Obtained Weights) steganography are lower than those of the algorithm based on gradient amplitude and derivative,¹⁸ the algorithm based on the co-occurrence matrix between channel residuals,¹⁹ and the detection algorithm based on CFA sensitivity.²⁰ However, the steganalysis algorithm in Abdulrahman et al.¹⁵ only considers that steganography will reduce the texture direction consistency of different color channels, while steganography will also reduce the consistency of their texture variation intensity. Therefore, we try to further improve the steganalysis performance by exploiting the two types of consistencies simultaneously. The main contributions of this article are as follows:

It points out that steganography will weaken the correlations between gradient amplitudes of different color channels.

A steganalysis feature extraction method based on channel gradient amplitude correlation is then proposed.

A steganalysis algorithm based on channel gradient correlation is proposed by combining the proposed features with existing color image steganalysis features in Abdulrahman et al.¹⁵ and Goljan et al.¹⁹ The experimental results show that when the information is embedded into each channel by WOW and S-UNIWARD algorithms, the proposed algorithm reduces the average detection error rate.

Related works

Color image steganography

When embedding messages into a spatial color image composed of three color channels: red (R), green (G), and blue (B), the messages to be embedded will be divided into three components, which can be embedded into three channels correspondingly by steganography algorithms such as HUGO (Highly Undetectable steGO),⁵ WOW,⁶ S-UNIWARD,⁷ and HILL (High-pass, Low-pass, and Low-pass).⁸ And then the three stego channels will be merged to compose the stego color image, as shown in Figure 1.

Figure 1.

Procedure of color image steganography.

This steganography method seems to have great ability to resist detection. First, the secret messages to be embedded are split into three components and embedded into each color channel separately. Compared with embedding the whole message into a grayscale image, the payload of each color channel is obviously reduced. Furthermore, when detecting the color image steganography, if the detection is implemented separately in each color channel, the detection result of different channels will interfere with each other, which decreases the detection performance.

It is worth noting that the three color channels are not independent of each other. The correlation among the channels should not be ignored.²⁵ As shown in Figure 2(b) and (c), the content and texture of each color channel are very similar. Sangwine and Horne²⁶ calculated the correlation coefficients between three color channels of natural color images: $r_{B - R} \approx 0.78$ , $r_{R - G} \approx 0.98$ , and $r_{G - B} \approx 0.94$ . Embedding secret messages in color images will reduce these correlation coefficients. For example, when embedding the pseudo-random information with the payload of 0.5 bpc (bits per channel pixel) into the R (Figure 2(b)) and G (Figure 2(c)) channels by LSB replacement steganography, the absolute difference between the least two planes of R and G is calculated, which will be changed from Figure 2(d) to Figure 2(e). It can be seen from Figure 2(d) and (e) that color image steganography will reduce the correlation between different color channels significantly, which could be exploited to improve the detection of color image steganography.

Figure 2.

Influence of steganography on correlations between different color channels: (a) color image, (b) channel R (shown in grayscale), (c) channel G (shown in grayscale), (d) difference of channel R and channel G before steganography (shown in grayscale), and (e) difference of channel R and channel G after steganography (shown in grayscale).

Gradient vector in digital images

The gradient is a vector, the direction of which is the gradient direction. In mathematics, the function changes fastest and achieves the maximum rate of variation along the gradient direction. This maximum value is the amplitude of the gradient vector. If this theory is applied to digital images, the texture characteristics of images can be expressed by gradient vectors. The gradient direction of a point in the image represents the texture direction of the point, and the gradient amplitude of the point indicates the intensity of the texture variation at that point.

Represent the digital image as a function $f (x, y)$ . The horizontal and vertical components of the gradient vector $\nabla f (x, y)$ at the point $(x, y)$ of the image are the partial derivative of the function along the x and y directions, respectively

\nabla f (x, y) = (\frac{\partial f}{\partial x}, \frac{\partial f}{\partial y})

(1)

For digital images, differentiation can be replaced by difference. That is, the horizontal and vertical components of the gradient vector at the position $(x, y)$ in an image can be obtained by difference in different directions between the pixels, respectively

\begin{matrix} \nabla f (i, j) = (\nabla f_{x}, \nabla f_{y}) = (f (i, j) - f (i, j - 1), f (i, j) \\ - f (i - 1, j)) \end{matrix}

(2)

As shown in Figure 3, a point in the image is taken as an example (“86” in Figure 3). Similar to equation (2), the horizontal component of the gradient vector at this point can be calculated as “86 – 125 = –39,” and its direction is horizontal to the right, as shown by the arrow. Similarly, the vertical component of the gradient vector at this point can be calculated as “86 – 171 = –85,” and its direction is straight up. And the direction of the gradient vector is determined by these two arrows.

Figure 3.

Gradient vector in digital images.

It is worth noting that the difference between adjacent pixels can enhance the signal-to-noise ratio of steganographic signals and suppress the interference of the image content to outstand steganographic noises. Therefore, while depicting the image texture features, the gradient vector of the image highlights the horizontal and vertical steganographic signals.

Color image steganalysis based on channel gradient direction consistency

According to the above section, there is a strong correlation between the color image channels, the texture direction of which also has strong consistency. For example, in each channel, the edge regions of the color image are also the edge regions along the same or the similar direction, and the smooth regions of the color image are also the smooth regions of each channel. In cover images, the angle between different channel gradient vectors is relatively small due to the consistency of texture directions between color channels. But in stego images, the angle between the gradient vectors is likely to increase by embedding changes. In order to capture the influence of steganography on the consistency of texture direction between channels, the cosine and sine values of the angle between channel gradients are calculated to depict the consistency of different channel texture directions. Abdulrahman et al.¹⁵ extracted steganalysis features based on channel gradient direction consistency and proposed a color image steganalysis algorithm based on channel geometric transformation meatures called RGB-CGTM.

The steganalysis algorithm has 24,157-dimensional features, which consists of 6000-dimensional steganalysis feature based on channel gradient direction consistency and 18,157-dimensional SCRMQ1 feature. The extraction process of 6000-dimensional steganalysis feature based on channel gradient direction consistency is as follows:

1. Calculate the gradient vector of three color channels red, green, and blue by equation (2), which is recorded as $\nabla R_{i, j} = (R_{i, j, \leftarrow}, R_{i, j, ↑})$ , $\nabla G_{i, j} = (G_{i, j, \leftarrow}, G_{i, j, ↑})$ , and $\nabla B_{i, j} = (B_{i, j, \leftarrow}, B_{i, j, ↑})$ , where $R_{i, j, \leftarrow}$ , $G_{i, j, \leftarrow}$ , and $B_{i, j, \leftarrow}$ represent the horizontal components of the gradient vector at the position $(i, j)$ of channels R, G, and B, respectively. $R_{i, j, ↑}$ , $G_{i, j, ↑}$ , and $B_{i, j, ↑}$ represent the vertical components of the gradient vector at the position $(i, j)$ of channels R, G, and B, respectively.

2. The cosine values of the angle between gradients of R and G channels and the cosine values of the angle between gradients of R and B channels are obtained as follows

\cos_{RG} = \frac{\nabla R_{i, j} \cdot \nabla G_{i, j}}{| \nabla R_{i, j} | | \nabla G_{i, j} |} = \frac{R_{i, j, \leftarrow} G_{i, j, \leftarrow} + R_{i, j, ↑} G_{i, j, ↑}}{| \nabla R_{i, j} | | \nabla G_{i, j} |}

(3)

\cos_{RB} = \frac{\nabla R_{i, j} \cdot \nabla B_{i, j}}{| \nabla R_{i, j} | | \nabla B_{i, j} |} = \frac{R_{i, j, \leftarrow} B_{i, j, \leftarrow} + R_{i, j, ↑} B_{i, j, ↑}}{| \nabla R_{i, j} | | \nabla B_{i, j} |}

(4)

where the dot product of two vectors is a scalar, which reflects the projection of one gradient vector in the direction of another gradient vector. Therefore, the dot product of two unit gradient vectors can be used to calculate the cosine of the angle between them. The smaller the cosine value is, the larger the angle between them is. If the cosine value is equal to 1, their directions are completely consistent, and the cosine value equal to −1 indicates that they are in the opposite direction.

3. Since the cosine value only reflects the absolute value of the angle between the two vectors, it fails to reflect the direction of deviation between them. Because the sine function is an odd function, it is more sensitive to the positive and negative of the angle than the cosine function (even function). Therefore, the author introduces the sine value of the angle between the two gradients to express the directional characteristic of the angle between them

\sin_{RG} = \frac{\nabla R_{i, j} \times \nabla G_{i, j}}{| \nabla R_{i, j} | | \nabla G_{i, j} |} = \frac{R_{i, j, \leftarrow} G_{i, j, ↑} - R_{i, j, ↑} G_{i, j, \leftarrow}}{| \nabla R_{i, j} | | \nabla G_{i, j} |}

(5)

\sin_{RB} = \frac{\nabla R_{i, j} \times \nabla B_{i, j}}{| \nabla R_{i, j} | | \nabla B_{i, j} |} = \frac{R_{i, j, \leftarrow} B_{i, j, ↑} - R_{i, j, ↑} B_{i, j, \leftarrow}}{| \nabla R_{i, j} | | \nabla B_{i, j} |}

(6)

where the cross product of two vectors is a vector, whose norm reflects the area of the parallelogram formed by the two vectors at the same starting point, and the direction satisfies the right-hand grip rule. Therefore, the cross product of two unit gradient vectors can be used to calculate the sine value of the angle between them.

4. Calculate residuals and extract co-occurrence matrices from four images $\underset{RG}{\cos}$ , $\underset{RB}{\cos}$ , $\underset{RG}{\sin}$ , and $\underset{RB}{\sin}$ by Rich Model,¹⁰ respectively, where the truncation threshold T is 1, and the quantization step q takes the values 0.1, 0.3, 0.5, 0.7, 0.9, and 1, respectively, thus obtaining totally 6000-dimensional feature.

Finally, the 6000-dimensional steganalysis feature based on channel gradient direction consistency is combined with the 18,157-dimensional SCRMQ1 features in Goljan et al.,¹⁹ and the ensemble classifier is trained to detect color images.

Color image steganalysis feature based on channel gradient amplitude correlation

Abdulrahman et al.¹⁵ used the sine and cosine of angle to measure the value and direction of gradient angle between two color channels. These two measurements describe the consistency of the texture direction and reflect the correlation between channels, which improves the performance of color image steganalysis. However, the algorithm in Abdulrahman et al.¹⁵ only utilizes the effect of color image steganography on the consistency of the texture direction between channels, but does not consider the influence of color image steganography on the correlation between texture amplitudes of different color channels. The application of the gradient vector in digital images shows that the gradient direction is the direction along which pixel values change fastest, and the gradient amplitude expresses the intensity of texture variation. The following section will point out that there are also correlations between the gradient amplitudes of different color channels, which will be decreased by steganography. According to this characteristic, this section extracts the steganalysis features based on channel gradient amplitude correlation.

Influence of steganography on correlations between gradient amplitudes of different color channels

Because of the strong correlation between the textures of different color channels, the intensities of texture variation at the same location in different channels should be close. Therefore, the gradient amplitudes of each color channel should also be correlated. As shown in Figure 4, the gradient amplitudes of each color channel of the color image shown in Figure 4(a) are given in the form of a grayscale image. It can be seen that the textures of the gradient amplitude image of each color channel are very similar, which indicates that they are correlated. Besides, this section scaled 10,000 color BOSSbase images downloaded from http://agents.fel.cvut.cz/stegodata/RAWs/ to color images in “tiff” format with a size of $512 \times 512$ and then calculated the correlation coefficients between the gradient amplitudes of red and green channels and the correlation coefficients between the gradient amplitudes of red and blue channels, respectively. The statistical results of the correlation coefficients are shown in Figure 5. The upper and lower quartiles in each statistical result constitute a “box.” The horizontal line in the “box” represents the median, and the two horizontal lines outside the “box,” respectively, are the upper and lower thresholds of the outliers. In other words, the values larger than the upper horizontal line and smaller than the lower horizontal line are regarded as abnormal values. In Figure 5, the values of the upper and lower horizontal lines “out of the box” are taken as 1 and 0.7, respectively. It can be seen from Figure 5 that there are 75% color images, for which the correlation coefficients between the gradient amplitudes of red and green channels are larger than 0.9565, and only 0.58% color images, for which the correlation coefficients between the gradient amplitudes of red and green channels are less than 0.7. Although the correlation between the gradient amplitudes of red and blue channels is lower, there are still 75% color images, for which the correlation coefficients between the gradient amplitudes of red and blue channels are greater than 0.8484, and only 5.69% color images, for which the correlation coefficients between the gradient amplitudes of red and blue channels are less than 0.7. These results indicate a strong positive correlation between the gradient amplitudes of different color channels.

Figure 4.

Color image and gradient amplitude images of each channel: (a) color image, (b) gradient amplitude image of channel R, (c) gradient amplitude image of channel G, and (d) gradient amplitude image of channel B.

Figure 5.

Correlation coefficient of gradient amplitude image of different channels.

When embedding messages in each color channel, the correlation between the gradient amplitudes of different color channels is likely to be weakened due to the randomness of embedding messages, changing pixels, or the change value. Taking WOW steganography algorithm as an example, when the pseudo-random secret information is embedded into a color image, although the changed pixels will be concentrated with a large probability on positions where the distortion values are small, the change values to these pixels will be pseudo-random +1 or −1. As a result, it is very possible that the WOW steganography algorithm will weaken the correlation between the gradient amplitudes of different color channels. In this section, pseudo-random messages with a payload of 0.4 bpc were embedded, respectively, in the three color channels of above 10,000 color cover images to generate the corresponding 10,000 stego images. Then the correlation coefficient between gradient amplitudes of different color channels of each stego image is subtracted from that of the corresponding cover image as shown in equations (7) and (8)

dcorr (RG) = corr (cRG) - corr (sRG)

(7)

dcorr (RB) = corr (cRB) - corr (sRB)

(8)

where $corr (cRG)$ and $corr (cRB)$ , respectively, represent the correlation coefficient between gradient amplitudes of R and G channels and the correlation coefficient between gradient amplitudes of R and B channels in the cover images, while $corr (sRG)$ and $corr (sRB)$ , respectively, represent the correlation coefficient between gradient amplitudes of R and G channels and the correlation coefficient between gradient amplitudes of R and B channels in the stego images. If the value from equation (7) (or equation (8)) is greater than 0, it should be demonstrated that the WOW steganography will weaken the correlation between gradient amplitudes of red and green (or blue) channels. As shown in Figure 6, among 10,000 color BOSSbase images, there are 9989 images, for which the values from equation (7) are greater than 0, and there are 9936 images, for which the values from equation (8) are greater than 0. That is, there are more than 99% of the images for which the correlation between gradient amplitudes of different channels will be weakened by the WOW steganography.

Figure 6.

Correlation coefficient difference of different channel gradient amplitude images: (a) $dcorr (RG)$ and (b) $dcorr (RB)$ .

The above statistical results show that in most color images there are strong positive correlations between the gradient amplitudes of different channels, which will be weakened by steganography. If the statistical features which can capture such changes are extracted and used for steganalysis, the detection performance of color image steganography should be improved.

Extraction of steganalysis feature based on channel gradient amplitude correlation

It can be seen from Figure 4(b)–(d) that not only the gradient amplitudes in the same position of different color channels are correlated, but also the gradient amplitudes inside each color channel are strongly correlated. Therefore, this section first calculates the gradient amplitude images of three color channels to describe the intensity of texture variation. Second, 7 spam high-pass filters and 24 min–max high-pass filters are used to calculate the residuals of gradient amplitudes of each channel. The correlations between the gradient amplitudes inside each channel are used to suppress the information of image content. Then the co-occurrence matrices over the gradient amplitude residuals of different channels are calculated. The differences between the co-occurrence matrices before and after steganography reflect the influence of steganography on the correlations between gradient amplitudes of different channels. The specific calculation process is as follows (shown in Figure 7):

For a color image with a size of $M \times N$ , obtain gradient vectors of each color channel by filtering in the horizontal and vertical directions, respectively, according to equation (2)

{\begin{matrix} \nabla R_{i, j} = (R_{i, j, \leftarrow}, R_{i, j, ↑}) \\ \nabla G_{i, j} = (G_{i, j, \leftarrow}, G_{i, j, ↑}) \\ \nabla B_{i, j} = (B_{i, j, \leftarrow}, B_{i, j, ↑}) \end{matrix}

(9)

For every gradient vector in each channel, calculate the gradient amplitudes in each position of the three color channels as follows

{\begin{matrix} | \nabla R_{i, j} | = \sqrt{R_{i, j, \leftarrow}^{2} + R_{i, j, ↑}^{2}} \\ | \nabla G_{i, j} | = \sqrt{G_{i, j, \leftarrow}^{2} + G_{i, j, ↑}^{2}} \\ | \nabla B_{i, j} | = \sqrt{B_{i, j, \leftarrow}^{2} + B_{i, j, ↑}^{2}} \end{matrix}

(10)

where $| \nabla R_{i, j} |$ , $| \nabla G_{i, j} |$ , and $| \nabla B_{i, j} |$ represent the gradient amplitudes in position $(i, j)$ of red, green, and blue channels, respectively.

Round and quantize each residual, and truncate the residuals greater than the truncation threshold T to T, the residuals less than $- T$ to $- T$ , obtaining 31 residual images with step $q = 1$ to construct the set of residual images for each gradient amplitude image as follows

{\begin{matrix} d_{| \nabla R |} = {d_{| \nabla R |, 1}, d_{| \nabla R |, 2}, \dots, d_{| \nabla R |, 31}} \\ d_{| \nabla G |} = {d_{| \nabla G |, 1}, d_{| \nabla G |, 2}, \dots, d_{| \nabla G |, 31}} \\ d_{| \nabla B |} = {d_{| \nabla B |, 1}, d_{| \nabla B |, 2}, \dots, d_{| \nabla B |, 31}} \end{matrix}

(11)

For three gradient amplitude residual images of each filter $d_{| \nabla R |, t}$ , $d_{| \nabla G |, t}$ , and $d_{| \nabla B |, t}$ , $1 \leq t \leq 31$ , calculate co-occurrence matrices C from the triplets $(d_{| \nabla R |, t} (i, j), d_{| \nabla G |, t} (i, j), d_{| \nabla B |, t} (i, j))$ as follows

\begin{matrix} C (p_{1}, p_{2}, p_{3}) = \frac{1}{Z} | {(d_{| \nabla R |, t} (i, j), d_{| \nabla G |, t} (i, j) \\ d_{| \nabla B |, t} (i, j)) | d_{| \nabla R |, t} (i, j) = p_{1}, \\ d_{| \nabla G |, t} (i, j) = p_{2}, d_{| \nabla B |, t} (i, j) = p_{3}} | \end{matrix}

(12)

where $p_{1}, p_{2}, p_{3} \in {- T, \dots, T}$ , Z is the number of triplets $(d_{| \nabla R |, t} (i, j), d_{| \nabla G |, t} (i, j), d_{| \nabla B |, t} (i, j))$ , and |•| represents the amount of elements in the set •.

The rules of symbolic symmetry and direction symmetry given in Goljan et al.¹⁹ are applied to merge the co-occurrence matrices of 7 spam residual images and 24 min–max residual images to obtain 7 × 100-dimensional spam residual image co-occurrence matrix feature and 24 × 196-dimensional min–max residual image co-occurrence matrix feature, which constitute 5404-dimensional steganalysis feature based on channel gradient amplitude correlation.

Figure 7.

Correlation coefficient of gradient amplitude image of different channels.

Steganalysis algorithm based on channel gradient correlation

This section presents a color image steganalysis algorithm based on channel gradient correlation which combines the steganalysis feature based on channel gradient amplitude correlation with the steganalysis feature based on channel gradient direction consistency and the SCRMQ1 feature. Because the supervised learning technique usually outperforms the unsupervised learning technique in both accuracy and efficiency,²⁷ the supervised learning technique is more commonly used in steganalysis. With the increase of dimension of steganalysis feature, ensemble classifier has become a popular learning tool for steganalysis. Therefore, it is also used in the proposed color image steganalysis algorithm. The algorithm is composed of steganalyzer training and stego image detection, the detailed procedures of which are described in Algorithms 1 and 2.

Algorithm 1. Training color steganalyzer based on channel gradient correlation.
Input: Cover images training set and stego images training set. Output: Trained steganalyzer. Steps: 1. Steganalysis feature extraction. 29,561-dimensional steganalysis feature is extracted from each training image as follows. I. Extract the SCRMQ1 feature and the features based on channel gradient direction consistency. For each training image, the feature extraction method in Abdulrahman et al.¹⁵ is used to extract the 24,157-dimensional SCRMQ1 feature and the 6000-dimensional feature based on channel gradient direction consistency. II. Calculate gradient amplitudes of each color channel. The gradient amplitude image of each color channel, $\| \nabla R \|$ , $\| \nabla G \|$ , or $\| \nabla B \|$ , is obtained by calculating the module of gradient vector at each position. III. Extract the steganalysis feature based on channel gradient amplitude correlation. The method shown in the above section is used to extract the 5404-dimensional steganalysis feature based on channel gradient amplitude correlation, in which the truncation threshold of the gradient amplitude residuals is set at 3, namely, $T = 3$ . IV. Merge the steganalysis features extracted in I and III into 29,561-dimesional color image steganalysis feature set. 2. Train ensemble classifier. For each training image, if it is a cover training image, the label is set as −1 and, if it is a stego training image, the label is set as +1. The group of label and the corresponding steganalysis feature of each training image is taken as a training sample. Training by the method of ensemble learning to obtain the steganalyzer.

Algorithm 1. Training color steganalyzer based on channel gradient correlation.

Input: Cover images training set and stego images training set.
Output: Trained steganalyzer.
Steps:
1. Steganalysis feature extraction. 29,561-dimensional steganalysis feature is extracted from each training image as follows.
I. Extract the SCRMQ1 feature and the features based on channel gradient direction consistency. For each training image, the feature extraction method in Abdulrahman et al.¹⁵ is used to extract the 24,157-dimensional SCRMQ1 feature and the 6000-dimensional feature based on channel gradient direction consistency.
II. Calculate gradient amplitudes of each color channel. The gradient amplitude image of each color channel,

| \nabla R |

| \nabla G |

, or

| \nabla B |

, is obtained by calculating the module of gradient vector at each position.
III. Extract the steganalysis feature based on channel gradient amplitude correlation. The method shown in the above section is used to extract the 5404-dimensional steganalysis feature based on channel gradient amplitude correlation, in which the truncation threshold of the gradient amplitude residuals is set at 3, namely,

T = 3

.
IV. Merge the steganalysis features extracted in I and III into 29,561-dimesional color image steganalysis feature set.
2. Train ensemble classifier. For each training image, if it is a cover training image, the label is set as −1 and, if it is a stego training image, the label is set as +1. The group of label and the corresponding steganalysis feature of each training image is taken as a training sample. Training by the method of ensemble learning to obtain the steganalyzer.

Algorithm 2. Detecting color image based on channel gradient correlation.
Input: The given color image and trained steganalyzer. Output: The label which reflects whether the given image is a stego image. When the given image is classified as a stego image, the returned label is +1; otherwise, the returned label is −1. Steps: 1. Extract steganalysis feature. The method of step 1 in Algorithm 1 is used to extract the 29,561-dimensianl steganalysis feature of the image to be detected. 2. Classify the cover and stego image. Take the steganalysis feature extracted from the given image as input, and use the steganalyzer trained by Algorithm 1 to distinguish whether the given image is a stego image. If it is classified as a stego image, the label output is set as +1; otherwise, the label output is set as −1.

Algorithm 2. Detecting color image based on channel gradient correlation.

Input: The given color image and trained steganalyzer.
Output: The label which reflects whether the given image is a stego image. When the given image is classified as a stego image, the returned label is +1; otherwise, the returned label is −1.
Steps:
1. Extract steganalysis feature. The method of step 1 in Algorithm 1 is used to extract the 29,561-dimensianl steganalysis feature of the image to be detected.
2. Classify the cover and stego image. Take the steganalysis feature extracted from the given image as input, and use the steganalyzer trained by Algorithm 1 to distinguish whether the given image is a stego image. If it is classified as a stego image, the label output is set as +1; otherwise, the label output is set as −1.

Experimental results and analysis

Experimental settings

In this section, 10,000 color images in “tiff” format generated in the above section are used to test the performance of the proposed algorithm and existing color image steganalysis algorithms. Two typical adaptive steganography algorithms WOW⁶ and S-UNIWARD⁷ with the payloads of 0.05, 0.1, 0.2, 0.3, and 0.4 bpc are used. In total, 100,000 color stego images were obtained.

In the experiment, 5000 images in 10,000 cover images are randomly selected as training cover images, and the corresponding 5000 stego images are used as training stego images. The remaining 5000 cover images and 5000 stego images are used as the test cover and stego images, respectively. As a result, there are 10 groups of training image set and test image set, each group containing 5000 cover images and 5000 stego images. Ensemble classifier²⁸ is used as the steganalyzer, and the average testing errors under equal priors are taken to evaluate the steganalysis performance that

P_{E} = \min_{P_{FA} \in [0, 1]} \frac{1}{2} (P_{FA} + P_{MD} (P_{FA}))

(13)

where $P_{FA}$ and $P_{MD}$ are the false alarm probability and the missed detection probability, respectively. For each test, 10 experimental units were carried out. And the median of the average testing errors under equal priors of 10 experimental units is taken to measure detection performance.

Performance of steganalysis features based on channel gradient amplitude correlation

For contrast experiment, the algorithms proposed in Abdulrahman et al.^15,18 are tested. Tables 1 and 2 show the average testing errors of three steganalysis algorithms for the detection of WOW and S-UNIWARD steganography with different payloads. Figures 8 and 9 show the receiver operating characteristic (ROC) curves of three steganalysis algorithms for WOW and S-UNIWARD with the payloads of 0.05, 0.1, and 0.3 bpc, respectively. It can be seen that, for the detection of both WOW and S-UNIWARD steganography, the proposed steganalysis algorithm outperforms the steganalysis algorithms in Abdulrahman et al.^15,18 And with the increasing payload, the difficulty of detection will decrease, which makes the improvement more difficult and the crossings of curves move forward. In addition, the steganalysis reliability measurements²⁹ have been calculated and are shown in Figures 8 and 9, which can describe the area under the curve. The bigger the value, the better the steganalysis performance. For WOW steganography, the maximum decreasing amplitude of average test errors reaches 1.02% at the payload of 0.05 bpc compared to the method of Abdulrahman et al.¹⁸ and 0.85% at the payload of 0.1 bpc compared to the method of Abdulrahman et al.¹⁵ Regardless of the payload, the steganalysis reliability measurement of the proposed algorithm is the maximum of the three. For S-UNIWARD steganography, the maximum decreasing amplitudes of average test errors reaches 0.95% at the payload of 0.05 bpc compared to the method of Abdulrahman et al.¹⁸ and 0.67% at the payload of 0.1 bpc compared to the method of Abdulrahman et al.¹⁵ The steganalysis reliability measurement of the proposed algorithm is also the largest of the three.

Table 1.

Average test errors of different steganalysis algorithms for WOW.

Payload	0.05	0.1	0.2	0.3	0.4
Method of Abdulrahman et al.¹⁸	0.3872	0.2759	0.1606	0.1036	0.0701
Method of Abdulrahman et al.¹⁵	0.3830	0.2785	0.1562	0.1019	0.0714
Proposed method	0.3770	0.2700	0.1516	0.0977	0.0676

WOW: Wavelet Obtained Weights. Values in last line of Tables 1 are the experimental results of the algorithm proposed in this paper, which show the best detection performance. They are bold to be more highlighted and recognizable.

Table 2.

Average test errors of different steganalysis algorithms for S-UNIWARD.

Payload	0.05	0.1	0.2	0.3	0.4
Method of Abdulrahman et al.¹⁸	0.3786	0.2666	0.1559	0.0968	0.0642
Method of Abdulrahman et al.¹⁵	0.3704	0.2702	0.1557	0.0956	0.0640
Proposed method	0.3691	0.2665	0.1490	0.0951	0.0622

S-UNIWARD: Spatial UNIversal WAvelet Relative Distortion. Values in last line of Tables 2 are the experimental results of the algorithm proposed in this paper, which show the best detection performance. They are bold to be more highlighted and recognizable.

Figure 8.

ROC curves of different steganalysis algorithms for WOW at the payloads of (a) 0.05 bpc, (b) 0.1 bpc, and (c) 0.3 bpc.

Figure 9.

ROC curves of different steganalysis algorithms for S-UNIWARD at the payloads of (a) 0.05 bpc, (b) 0.1 bpc, and (c) 0.3 bpc.

In Abdulrahman et al.,¹⁸ the co-occurrence matrix features are extracted from the three color channels and then combined to detect steganography without considering the integrity of the color image. In Abdulrahman et al.,¹⁵ the gradient vector directions of each channel are used to describe the direction of texture change in each channel, and the angle between channels is represented by the angle between their gradient vectors. Compared with Abdulrahman et al.,¹⁸ the correlation between channels is used in Abdulrahman et al.¹⁵ However, it only considers the consistency of the texture direction of each channel. This article depicts the consistency of the texture variation intensity between channels by the distribution of gradient amplitudes of each channel. It captures the changes of correlation between different channels from direction and intensity, which could further improve the steganalysis performance.

Conclusion

The current research on digital image steganalysis mainly focuses on the single-color channel. Among the existing color image steganalysis algorithms, the detection algorithm based on channel geometric transformation measures has higher detection accuracy than the other detection algorithms. But it fails to utilize the correlation between the gradient amplitudes of different color channels and still has a large detection error rate. Thus, this article points out that color image steganography will weaken the correlation between the gradient amplitudes of different color channels and proposes a color image steganalysis algorithm based on channel gradient correlation. The experimental results show that, compared with the existing algorithms, the proposed algorithm reduces the average detection error rate of the existing algorithms for WOW and S-UNIWARD steganography, and the maximum decreasing amplitudes reach 1.02% and 0.95%.

The main work of this article focuses on extracting the features that can describe the correlation of the different channels, but the dimensionality of features is always high, which will take up lots of time and space when being extracted and saved. If the feature dimensionality can be reduced³⁰ and the algorithm in classification stage can be improved,³¹ the performance of detection can be more effective. In addition, the image is just one of the types of cover which may be maliciously used, while text, voice, protocol packets, and other types of data may also be utilized to transit secret confidential information.^32–34 Therefore, how to reliably detect the hidden secret information of various multimedia data transmitted in the Internet of Things should be solved for ensuring the security of the Internet of Things.

Footnotes

Handling Editor: Giancarlo Fortino

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China (Nos 61772549, 61872448, U1736214, 61602508, and 61601517).

ORCID iD

Chunfang Yang

References

Provos

Defending against statistical steganalysis. In: 10th USENIX security symposium, vol. 10, Washington, DC, 13–17 August 2001, pp.323–336. Berkeley, CA: USENIX Association.

Westfeld

F5-A steganographic algorithm: high capacity despite better steganalysis. In: 4th international workshop on information hiding, Vol. 2137, Pittsburgh, PA, 25–27 April 2001, pp.289–302. Berlin: Springer.

Sallee

. Model-based steganography. In: 2nd international workshop on digital watermarking, Seoul, Korea, 20–22 October 2003, pp.154–167. Berlin: Springer.

Sallee

Model-based methods for steganography and steganalysis. Int J Imag Graph 2005; 5(1): 167–190.

Filler

Fridrich

Gibbs construction in steganography. IEEE T Inf Foren Sec 2010; 5(4): 705–720.

Holub

Fridrich

Designing steganographic distortion using directional filters. In: IEEE international workshop on information forensics and security, Tenerife, 2–5 December 2012, pp.234–239. New York: IEEE.

Holub

Fridrich

Digital image steganography using universal distortion. In: 1st ACM workshop on information hiding and multimedia security, Montpellier, 17–19 June 2013, pp.59–68. New York: ACM.

Denemark

Fridrich

Improving steganographic security by synchronizing the selection channel. In: 3rd ACM workshop on information hiding and multimedia security, Portland, OR, 17–19 June 2015, pp.5–14. New York: ACM.

Pevný

Bas

Fridrich

Steganalysis by subtractive pixel adjacency matrix. IEEE T Inf Foren Sec 2010; 5(2): 215–224.

10.

Fridrich

Kodovský

Rich models for steganalysis of digital images. IEEE T Inf Foren Sec 2012; 7(3): 868–882.

11.

Holub

Fridrich

Random projections of residuals for digital image steganalysis. IEEE T Inf Foren Sec 2013; 8(12): 1996–2006.

12.

Luo

et al . Selection of rich model steganalysis features based on decision rough set α-positive region reduction. IEEE T Circ Syst Vid 2019; 29(2): 336–350.

13.

Yang

Luo

et al . Extracting hidden messages of MLSB steganography based on optimal stego subset. Sci China Inform Sci 2018; 61(11): 109113:1–109113:3.

14.

Abdulrahman

Chaumont

Montesinos

et al . Color image stegananalysis using correlations between RGB channels. In 10th international conference on availability, reliability and security, Toulouse, 24–27 August 2015, pp.448–454. New York: ACM.

15.

Abdulrahman

Chaumont

Montesinos

et al . Color images steganalysis using rgb channel geometric transformation measures. Secur Commun Netw 2016; 9(15): 2945–2956.

16.

Fridrich

Long

Steganalysis of LSB encoding in color images. In: IEEE international conference on multi-media and expo, Vol. 3, New York, 30 July–2 August 2000, pp.1279–1282. New York: IEEE.

17.

Han

Huang

et al . A steganalysis algorithm based on statistic characteristics of the color images. In: International conference on computer science and network technology, vol. 4, Harbin, China, 24–26 December 2011, pp.2294–2297. New York: IEEE.

18.

Abdulrahman

Chaumont

Montesinos

et al . Color image steganalysis based on steerable gaussian filters bank. In: 4th ACM workshop on information hiding and multimedia security, Vigo, 20–22 June 2016, pp.109–114. New York: ACM.

19.

Goljan

Fridrich

Cogranne

Rich model for steganalysis of color images. In: IEEE international workshop on information forensics and security, Atlanta, GA, 3–5 December 2014, pp.185–190. New York: IEEE.

20.

Goljan

Fridrich

CFA-aware features for steganalysis of color images. In: Media watermarking, security, and forensics, San Francisco, CA, 9–11 February 2015, pp.94090V:01–94090V:13. Paris: SPIE.

21.

Liao

Chen

Yin

Content-adaptive steganalysis for color images. Secur Commun Netw 2016; 9(18): 5756–5763.

22.

Lyu

Farid

Steganalysis using color wavelet statistics and one-class support vector machines. In: Security, steganography, and watermarking of multimedia contents VI, vol. 5306, San Jose, CA, 18–22 January 2004, pp.35–45. Paris: SPIE.

23.

Liu

Sung

et al . Image complexity and feature extraction for steganalysis of LSB matching steganography. In: 18th international conference on pattern recognition, Hong Kong, China, 20–24 August 2006, pp.267–270. New York: IEEE.

24.

Zhang

Steganalysis for color JPEG images based on ensemble proportion training. J Electron Inf Technol 2014; 36(1): 114–120.

25.

Rambabu

Chakrabarti

An efficient immersion-based watershed transform method and its prototype architecture. J Syst Architect 2007; 53(4): 210–226.

26.

Sangwine

Horne

RE.

The colour image processing handbook. Berlin: Springer Science & Business Media, 1998.

27.

Xiang

Zhao

et al . TUMK-ELM: a fast unsupervised heterogeneous data learning approach. IEEE Access 2018; 6: 35305–35315.

28.

Kodovský

Fridrich

Holub

Ensemble classifiers for steganalysis of digital media. IEEE T Inf Foren Sec 2012; 7(2): 432–444.

29.

Fridrich

Feature-based steganalysis for JPEG images and its implications for future design of steganographic schemes. In: 6th international workshop on information hiding, LNCS, vol. 3200, Toronto, Canada, 23–25 May 2004, pp.67–81. Berlin: Springer.

30.

Nie

Chang

et al . Beyond trace ratio: weighted harmonic mean of trace ratios for multiclass discriminant analysis. IEEE T Knowl Data En 2017; 29(10): 2100–2110.

31.

Chang

Nie

Wang

et al . Compound rank-projections for bilinear analysis. IEEE T Neur Net Lear 2016; 27(7): 1502–1513.

32.

Xiang

Hao

et al . Reversible natural language watermarking using synonym substitution and arithmetic coding. Comput Mater Con 2018; 55(3): 541–559.

33.

De Fuentes

Als

Ferreres

AIG

et al . Applying information hiding in vanets to covertly report misbehaving vehicles. Int J Distrib Sens N 2014; 10(2): 120626.

34.

Beugnon

Puech

et al . Rethinking the high capacity 3d steganography: increasing its resistance to steganalysis. In IEEE international conference on image processing, Beijing, China, 17–20 September 2017, pp.510–414. New York: IEEE.