Abstract
Super-resolution image reconstruction can achieve favorable feature extraction and image analysis. This study first investigated the image’s self-similarity and constructed high-resolution and low-resolution learning dictionaries; then, based on sparse representation and reconstruction algorithm in compressed sensing theory, super-resolution reconstruction (SRSR) of a single image was realized. The proposed algorithm adopted improved K-SVD algorithm for sample training and learning dictionary construction; additionally, the matching pursuit algorithm was improved for achieving single-image SRSR based on image’s self-similarity and compressed sensing. The experimental results reveal that the proposed reconstruction algorithm shows better visual effect and image quality than the degraded low-resolution image; moreover, compared with the reconstructed images using bilinear interpolation and sparse-representation-based algorithms, the reconstructed image using the proposed algorithm has a higher PSNR value and thus exhibits more favorable super-resolution image reconstruction performance.
Introduction
Super-resolution image reconstruction technology aims to acquire enhanced high-resolution images with better visual effects based on the estimation of the complementary information among images, and thus provides more favorable feature extraction and image analysis results. Super-resolution image reconstruction technology mainly includes two types—reconfiguration-based super-resolution reconstruction (SRSR) and learning-based SRSR.
Using reconfiguration-based SRSR, high-resolution images were mainly acquired based on the established degradation model. A high-resolution image can be degraded to a low-resolution image through a series of operations mainly including the addition of noise, geometrical movement, degraded sampling and optical blurring. Then, using an inverse operation, the low-resolution image can be reconstructed to a high-resolution. Figure 1(a) shows the degradation process from a high-resolution image to a low-resolution image, and Figure 1(b) shows the comparison between the degraded image and the original image.

Illustration of an image’s degradation analysis. (a) Degradation process of an image from high resolution to low resolution. (b) Comparison between the original image and the image after degradation.
The degradation model can be described as:
1
As stated above, the additive noise, geometrical movements, down sampling and optical blurring were considered in this degradation model (as described in equation (1)). Next, by taking into account the atmospheric disturbance (denoted as
Although super-resolution image reconstruction gained a lot of useful results, it needs to be further investigated and improved in many aspects such as the algorithm’s temporal and spatial efficiency, the establishment of degradation model, blind SRSR and the construction of learning model.
First, the degradation model plays a decisive role in super-resolution image reconstruction and directly affects noise estimation and fuzzy estimation. The current established degradation models are generally nonlinear; however, simple linear degeneration models were always used in practical reconstruction processes.
Second, since part of blurring process is unknown in SRSR, fuzzy identification should be introduced for the realization of blind SRSR. Meanwhile, since only partial information was given, it is also difficult to estimate point spread function. The statistical model in degradation model generally assumed that linear space was invariable. On the other hand, in practical application, the point spread function was variable in linear space and the blind SRSR algorithm based on nonlinear degradation model needs further improvement.
Third, the established learning model significantly affects the completeness and effectiveness of knowledge acquisition. Markov random field (MRF), image pyramid model and neutral network model are commonly used in the establishment of learning model; however, the established learning models still lacked of all priori knowledge in super-resolution image reconstruction and thus showed relatively poor reconstruction performances. In addition, the realization of learning with advanced features should also be further investigated.
Fourth, SRSR always needs many frames of images for achieving favorable reconstruction performance. 2 On the one hand, the images always have large sizes, and massive data should be processed in real-time when using multiple images for reconstruction; on the other hand, in some cases, only one low-resolution image was acquired, which should be reconstructed as the corresponding high-resolution image. Therefore, it will be of great significance to investigate the SRSR for a single image.
In early stage, super-resolution single-image reconstruction mainly adopted interpolation reconstruction algorithm. To be specific, for a pixel point, the pixel value was estimated by the pixel information at around this point in image’s spatial domain or frequency domain. The commonly used interpolation reconstruction algorithms include linear interpolation, bilinear interpolation, neighborhood interpolation and bicubic interpolation. Using interpolation, the pixel values of high-resolution image were estimated using the image’s pixel information, while the image’s information was not taken into account, thus easily bringing about a series of questions such as blocking effect, edge blurring and the lost of detailed information.
In order to effectively enrich the reconstructed high-resolution image’s information and acquire better reconstruction effect, the learning-dictionary-based super-reconstruction single-image reconstruction method was developed recently. The dictionaries of low-resolution and high-resolution images were firstly acquired through a great number of sample database construction and sample training; then, based on image degradation model, the corresponding relation between low-resolution dictionary and high-resolution dictionary was established for achieving single-image reconstruction. Yang et al. proposed a multiple-geometric-dictionaries-based clustered sparse coding scheme for SISR. 3 Pan et al. propose a single image SR method by learning local self-similarities from the original image itself. 4
This study first examined the image’s self-similarity and constructed high-resolution and low-resolution learning dictionaries; then, using sparse representation and reconstruction algorithm in compressed sensing, the SRSR for a single image was realized.
Image’s self-similarity
There may exist similar structures on a same scale or different scales in an image. This is thus referred to as the image’s multi-scale structural self-similarity. Figure 2 shows two images with multi-scale structural self-similarity; specifically, Figure 2(a) includes many image blocks of aircrafts and Figure 2(b) includes many image blocks of buildings. These image blocks show self-similarity on different scales. The images with self-similarity are common, and the self-similarity information can provide much additional information for image’s SRSR.

Images with multi-scale structural similarity. (a) includes many image blocks of aircrafts. (b) includes many image blocks of buildings.
For the images with self-similarity, the image with self-similarity on a same scale can serve as the low-resolution image for realizing the SRSR of local image blocks. Figure 3(a) illustrates the image reconstruction process using the image blocks from multiple images with self-similarity on a same scale and Figure 3(b) illustrates the image reconstruction process using the image blocks from a single image with self-similarity on a same scale.

Super-resolution image reconstruction using the image blocks (from multiple images (a) and a single image (b)) with self-similarity on a same scale, in which LR denotes low-resolution image and HR denotes high-resolution image.
Image’s SRSR can be achieved using the auxiliary information provided by the self-similarity image blocks with sub-pixel displacement in the low-resolution image.
For the image blocks with self-similarity on different scales, the low-resolution image can be scaled to different extents using pyramid method; then, with the use of K-SVD dictionary learning algorithm, the information of the image blocks with similarity on different scales were added to the reconstructed image,
5
and the image’s SRSR was achieved based on the auxiliary information provided by the similar image blocks on different scales. Figure 4 illustrates the principle of super-resolution image reconstruction using the image blocks with self-similarity on different scales,
6
in which LR denote the low-resolution image, HR denotes the high-resolution image,

Super-resolution image reconstruction using the image blocks (from multiple images (a) and a single image (b)) with self-similarity on different scales, in which LR denotes low-resolution image and HR denotes high-resolution image.
Establishment of the learning dictionary
The establishment of the learning dictionary aims to acquire the image with a higher resolution based on the priori knowledge of training samples and to supplement the low-resolution image so as to effectively enhancing image’s super-resolution effect. Images often have a variety of geometric structures, including edges, corners, contours, and textures. Images with different structures can use different transforms to accomplish representation. Several characteristic dictionaries in this paper are applied to the corresponding feature regions in the image respectively. The aim is to improve the effect of low resolution image reconstruction. Figure 5 shows the training process of the learning dictionary.

Training of the learning dictionary.
In this study, the samples were trained using the improved K-SVD algorithm (the implementation of the algorithm refer to reference
7
), and the high- and low-resolution dictionary pair was constructed by means of sparse representation. Algorithm 1 describes the establishment procedure of the learning dictionary in detail. Algorithm 1. Detailed establishment procedure of the learning dictionary Step 1: Select the characteristic samples from the training samples and conduct pre-processing on the selected characteristic image blocks (the characteristic image block was set as 16 pixel * 16 pixel); Step 2: Classify the selected characteristic image blocks and perform classification and training on the image blocks with self-similarity according to various characteristics including uniformity, detail, texture and complex edge; Step 3: Rearrange the characteristic image blocks after classification and construct the high-resolution and low-resolution dictionaries using the improved K-SVD algorithm.
Single-image super-resolution reconstruction based on compressed sensing and learning dictionary
After the construction of high-resolution and low-resolution learning dictionaries, this study then conducted SRSR on a single image used on the theoretical framework in compressed sensing and the related reconstruction algorithms.
The observation model of super-resolution image reconstruction model can be written as:
In order to reduce the correlation between the measurement matrix and wavelet basis, a low-pass filter (
In order to guarantee
Due to the scrambling of
Based on SRSR observation model and dictionary learning, this study also made appropriate improvements on matching pursuit algorithm so as to achieve the single-image SRSR on the basis of compressed sensing and learning dictionary, as the specific procedure described in Algorithm 2.
11
The improved algorithm is projected to the low resolution image on the basis of the degraded model, and obtained the high resolution image by iterative solution. Algorithm 2. Detailed single-image super-resolution reconstruction procedures based on compressed sensing and learning dictionary Step 1: Input high-resolution and low-resolution dictionary pair Step 2: Initiate the iteration number (n) and the reconstructed image as 1 and 0, respectively (i.e. make the initiation: n = 1 and Step 3: Based on the degradation model ( Step 4: Do the iteration and solve Step 5: Calculate the sparse representation coefficient Step 6: Update the reconstructed image subset Step 7: Calculate the residual of the observed signal Step 8: Repeat Step 3–Step 7 and finish the iteration-based reconstruction until
Experimental analysis
Currently, super-resolution image reconstruction results are mainly evaluated subjectively or objectively, which can also be inversely evaluated in the practical applications by analyzing the reconstructed images through feature extraction, target detection and identification.
Subjective evaluation refers to experts’ subjective judgment on super-resolution image reconstruction through direct observation or grading. Subjective evaluation lacks of objective and accurate evaluation parameters; moreover, different experts would give different evaluation results, which were greatly affected by the experts’ visual perception.
Objective evaluation refers to the quantitative analysis of the reconstructed super-resolution image. Since the reconstructed image has no fixed reference image for comparison, some traditional parameters including mean-square error and cross entropy are no longer applicable. Objective evaluation mainly focuses on the quantitative analysis of the reconstructed image, with the common evaluation parameters including peak signal-to-noise ratio (SRNR), information entropy and mean gradient. In this study, PSNR was used as the evaluation parameter.
In the dictionary training experiment, aiming at the different characteristics of the image, the dictionary training experiment of grassland image, road texture image, self similar image and complex scene image is carried out. The training results of the grassland feature image dictionary are shown in Figure 6. Figure 6(a) is a meadow image block and Figure 6(b) is a dictionary of grassland image training. The road texture image dictionary training results are shown in Figure 7. Figure 7(a) is a road image block and Figure 7(b) is a dictionary for the training of road texture images. The self similarity image dictionary training results are shown in Figure 8. Figure 8(a) is a self similar image block and Figure 8(b) is a self similarity image training dictionary. The result of the image dictionary training for complex scenes is shown in Figure 9. Figure 9(a) is a complex scene image block and Figure 9(b) is a complex scene image training dictionary.

Dictionary training for feature image of grassland. (a) Grassland image. (b) Grassland image training dictionary.

Road texture image dictionary training. (a) Road image. (b) Road texture image training dictionary.

Self similarity image dictionary training. (a) Self similar image. (b) Self similarity image training dictionary.

Image dictionary training for complex scenes. (a) Complex scene image. (b) Complex scene image training dictionary.
In the present study, an image with a size of 256 pixels * 256 pixels was selected and degraded to the image with a size of 128 pixels * 128 pixels. Then, the image was reconstructed using bilinear interpolation, sparse-representation-based SRSR and the proposed method, as the results shown in Figure 10.

Comparison of the super-resolution reconstructed images using different methods. (a) Degraded image (128 pixels * 128 pixels). (b) High-resolution reference image (256 pixels * 256 pixels). (c) Reconstructed image using bilinear interpolation (256 pixels * 256 pixels). (d) Reconstructed image using SRSR method (256 pixels * 256 pixels). (e) Reconstructed image using the proposed method (256 pixels * 256 pixels).
Figure 10(a) shows the degraded image with a size of 128 pixels * 128 pixels, Figure 10(b) shows the original high-resolution image for reference, and Figure 10(c)–(e) displays the reconstructed images using bilinear interpolation, SRSR and the proposed algorithm, respectively.
It can be observed that the reconstructed high-resolution images using SRSR and the proposed algorithm show better visual effect and image quality than the degraded low-resolution image. Figure 11 shows the local enlarged results of the reconstructed images using different algorithms.

Local enlarged results of the reconstructed images. (a) Original high-resolution image for reference. (b) Reconstructed image using bilinear interpolation. (c) Reconstructed image using SRSR method. (d) Reconstructed image using the proposed method.
Meanwhile, the PSNR values of the reconstructed images using different algorithm were calculated for evaluation. The PSNR values of the reconstructed images using bilinear interpolation algorithm, SRSR algorithm and the proposed method are 26.32, 29.57 and 29.81, respectively. In order to further verify the reconstruction effect using the proposed algorithm, the degraded image was magnified by two times, four times and eight times, respectively, and the magnified images were then reconstructed using different super-resolution image reconstruction algorithms, as the results listed in Table 1.
Comparison of the PSNR values of the reconstructed images using different super-resolution reconstruction algorithms, during which the degraded image was magnified by two times, four times and eight times, respectively.
SRSR: sparse-representation-based super-resolution reconstruction.
As listed in Table 1, the greater the magnification times of the degraded image, the smaller the value of PSNR of the reconstructed image, suggesting a poorer reconstruction performance. Through comparison, the reconstructed image using the proposed single-image SRSR algorithm based on compressed sensing and learning dictionary shows a largest PSNR value and most favorable SRSR performance.
Conclusions
By introducing compressed sensing theory that was widely applied in image processing into super-resolution image reconstruction, this study achieved favorable SRSR for a single image based on compressed sensing and learning dictionary. In order to reduce the correlation between measurement matrix and wavelet basis, low-pass filter was introduced in SRSR; meanwhile, local Hadamard matrix with fast reconstruction and low requirements on storage space, was selected as the measurement matrix for ensuring the image’ reconstruction satisfies RIP in compressed sensing theory; finally, a single image’s SRSR was achieved through iteration-based reconstruction algorithm. According to the experimental results, the proposed algorithm is superior to bilinear interpolation algorithm and SRSR algorithm in reconstruction performance.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was financially supported by the Scientific Research Program of Sichuan Province, China (No. 17ZA0453), and the Scientific Research Program of Yibin University, Sichuan, China (No. 2016QD12).
