Side Information Generation for Distributed Video Coding Using Spatiotemporal Joint Bilinear Upsampling

Abstract

Distributed video coding presents a viable solution for power-constrained multimedia communication. However, its relatively low coding efficiency compared to the conventional video coding schemes remains a challenging issue. The rate-distortion performance of distributed video coding is highly dependent on the quality of side information generated at the decoder and various techniques have been proposed to improve the side information quality in block-based and frame-based distributed video coding architectures. In this paper, a robust spatiotemporal joint bilateral upsampling based side information generation method is proposed. The proposed side information generation method is based on a block-based low-complexity distributed video coding architecture with adaptive block coding mode classification. A partially reconstructed Wyner-Ziv (WZ) frame with skip and key blocks is downsampled and spatiotemporal error concealment and joint bilateral upsampling are used to generate the side information. Simulation results show that the proposed method improves the quality of side information significantly while keeping low computational complexity.

1. Introduction

Video coding technology has played a key role in the explosion of the current multimedia society. The success of the widespread deployment of digital video applications and services is largely built on the predictive video coding paradigm where the encoder exploits the video redundancy and irrelevancy. This type of video coding is well suited for broadcasting or one-to-many video transmission systems where video is encoded once and decoded many times. However, in resource-constrained environments, a low-complexity encoder is necessary at the expense of a high-complexity decoder while still maintaining a high coding efficiency.

Distributed video coding (DVC) has emerged as a new video compression paradigm for video applications with resource-constrained devices because it enables low-complexity encoding and is naturally robust against transmission errors. Over the past decade, several practical implementations of DVC have been proposed including the Stanford codec [1], PRISM codec [2], and DISCOVER codec [3]. However, current DVC architectures still have several technical limitations that prevent their widespread use in real-world applications. In particular, there is still a significant gap in terms of compression efficiency between the current DVC solutions and conventional predictive video coding techniques.

Since the coding efficiency is highly affected by the quality of SI in DVC, lots of efforts have been made to improve the quality of SI [4–18]. Popular SI generation techniques exploit spatial correlation within the same frame and/or temporal correlation between the consecutive frames [4–6]. Recently, optical flow based methods [7, 8], hash information generated at the encoder [9–11], or multiresolution based techniques [12–18] have been introduced to improve the SI quality. However, most of these methods have high complexity and long decoding time due to a feedback-based architecture.

In this paper, we propose a novel SI generation scheme based on spatiotemporal joint bilateral upsampling (STJBU), which is simple and applicable to any block-based DVC architecture. The proposed method consists of three steps: (1) downsampling of a partially reconstructed WZ frame, (2) SI generation for the WZ blocks in the downsampled WZ frame, and (3) upsampling of the WZ frame using the proposed STJBU algorithm.

The rest of the paper is organized as follows. We review related work on SI generation in DVC in Section 2. In Section 3, the low-complexity DVC (LC-DVC) [19] architecture is briefly introduced which is the basis for the proposed SI generation technique. Then the proposed STJBU based SI generation method is explained in Section 4. Simulation results are presented in Section 5 and the conclusion of the paper is given in Section 6.

2. Related Work

In the past few years, various approaches have been proposed to improve the performance of DVC. The main issues restricting the use of current DVC architectures in practical applications are its low coding efficiency, high decoding latency, and the presence of a feedback channel. In particular, since the coding efficiency is highly affected by the SI quality in DVC, extensive research has been performed to improve the quality of SI.

A multiple motion hypotheses pixel-based temporal interpolation method is proposed in [4], where global and local motion estimation is incorporated. This work has been extended to an adaptive pixel-based temporal interpolation scheme [5] which can adaptively switch between spatial interpolation and forward/backward temporal extrapolation for SI generation. Similarly, a mode decision scheme is presented in [6] to determine the interpolation mode for each block by combining forward and backward motion vectors. Recently, the optical flow algorithm has been exploited for SI generation to compensate for the weaknesses of block-based methods. An optical flow based SI generation algorithm is proposed in [7] which improves the SI quality by obtaining more accurate motion vectors. A similar method proposed in [8] uses optical flow to improve the SI quality and block clustering to increase local adaptivity in the noise modeling. In general, the complex motion estimation process used in these methods incurs high computational complexity and long decoding time.

In the SI generation method proposed in [9], seed blocks are selected first and these blocks are used for motion estimation of the other blocks. Extra information for WZ blocks was transmitted in [10] to help the block matching process at the decoder. Another method called frame-hash uses a highly compressed WZ frame with zero motion vectors to improve the quality of SI [11]. However, the performance of the hash-based DVC schemes is highly dependent on the accuracy of the rate allocation mechanism. An alternative method is to use multiple resolutions in encoding WZ frames. Recently, several SI generation methods based on a mixed-resolution (MR) DVC architecture have been proposed [12–18]. In the MR-DVC architecture, the SI quality is improved by exploiting the spatial relationship between the original frames and the scaled ones.

Spatial low-pass filtering is used in image processing to replace a pixel by a uniform or weighted average of its neighboring pixels. An edge preserving bilateral filter was originally proposed in [20] to alleviate the drawback of spatial low-pass filtering when it is performed over discontinuous regions. It takes into account both the geometric closeness of pixels and their photometric similarity. This noniterative filter smooths images while preserving edges by means of a nonlinear combination of nearby pixel values. Joint bilateral filter proposed in [21] extends the bilateral filter to two correlated images. It filters one image with weights generated using the other image. An alternative joint edge-preserving filter, the guided filter, has been proposed in [22] where the guided filter is derived from a local linear model and can perform filtering in constant time. In [23], the joint bilateral filter has been further extended on image pairs with different resolutions, namely, the joint bilateral upsampling. In [24], a multiresolution bilateral filtering is proposed where the bilateral filter is combined with wavelet thresholding to provide an image denoising framework. The joint bilateral filtering has been successfully applied in a variety of image processing and computer vision applications such as photo enhancement and stereo matching [25].

3. Architecture of Low-Complexity DVC

A simple and unidirectional LC-DVC architecture is proposed in [19]. In the encoder of LC-DVC, an incoming frame is adaptively classified as a key or a WZ frame. The key frame is encoded using the H.264/AVC encoder in intramode. The WZ frames are divided into 4 × 4 nonoverlapping blocks and the blocks are further classified into skip, key, and WZ blocks. The classification map resulting from the block classification process is compressed using arithmetic coding and sent to the decoder. The skip blocks are not transmitted and can be reconstructed at the decoder with help of the previous frame. The key blocks are encoded using H.264/AVC in intramode. The WZ blocks are transformed, quantized, and the bit planes are extracted and encoded using BCH codes.

At the decoder, the key frames are decoded using the H.264/AVC decoder. For a WZ frame, the key blocks are decoded first and then the skip blocks are copied from colocated blocks in the previous frame according to the classification map. As it is shown in Figure 1, a partially reconstructed WZ frame which contains the key and skips blocks is generated. Then, the SI for the WZ blocks is generated by using the proposed method which can be applied to any block-based DVC architecture.

Figure 1

An example of a partially reconstructed WZ frame.

4. Proposed SI Generation Algorithm

The procedure of the proposed SI generation method is shown in Figure 2 and can be divided into 3 steps: (1) downsampling of a partially reconstructed WZ frame, (2) SI generation in the downsampled partially reconstructed WZ frame, and (3) upsampling of the error-concealed WZ frame using the proposed STJBU algorithm.

Figure 2

Flowchart of the proposed SI generation scheme.

4.1. Downsampling of the Partially Reconstructed WZ Frame

In order to reduce the computational complexity in spatiotemporal SI generation methods, the partially reconstructed WZ frame is first downsampled. Downsampling has been used in various image or video compression applications to improve the compression efficiency while reducing the computational complexity [26–30]. The simplest downsampling method is to retain only every Mth sample to create a lower resolution signal in downsampling by a factor of M. However, this simple downsampling method causes aliasing in the resulting downsampled signal. In this paper, four different downsampling methods are used.

4.1.1. Nearest Neighbor Downsampling

The intensity of a pixel in the downsampled image is the intensity of the nearest pixel in the original image as shown in (1):

\begin{matrix} K_{N N} (x) = {\begin{cases} 1; & if | x | < 0.5 \\ 0; & otherwise. \end{cases} \end{matrix}

(1)

4.1.2. Bilinear Downsampling

Bilinear downsampling considers the closest $2 \times 2$ neighborhood of known pixel values surrounding the unknown pixel. It can be implemented by the triangle kernel given in the following:

\begin{matrix} K_{B L} (x) = {\begin{cases} 1 - | x |; & | x | < 1 \\ 0; & otherwise. \end{cases} \end{matrix}

(2)

4.1.3. Bicubic Downsampling

The output pixel value after bicubic downsampling is a weighted sum of the pixels in the nearest $4 \times 4$ neighborhood as shown in the following:

\begin{matrix} K_{B C} (x) = {\begin{cases} 1.5 {| x |}^{3} - 2.5 {| x |}^{2} + 1; & if x \leq 1 \\ - 0.5 {| x |}^{3} + 2.5 {| x |}^{2} - 4 | x | + 2; & if 1 < x \leq 2 \\ 0; & otherwise. \end{cases} \end{matrix}

(3)

4.1.4. Lanczos Downsampling

The output pixel value of the downsampled image is obtained by using a convolution kernel given in the following:

\begin{matrix} K_{L Z} (x) = {\begin{cases} \sin (x) \sin c (\frac{x}{a}); & if | x | < 3 \\ 0; & otherwise. \end{cases} \end{matrix}

(4)

4.2. SI Generation at a Lower Resolution

After the partially reconstructed WZ frame is downsampled, SI is generated for the WZ blocks by exploiting the spatial and temporal correlation. Within a low-delay DVC, the decoder cannot wait for the future frame to arrive before starting the SI generation process and so it must use only the previously reconstructed frame for temporal information. Since the proposed DVC method is block-based and it uses a unique block classification scheme, the decoder is ensured that every WZ block is surrounded by either a key or a skip block in its adjacent 4 neighbors. In this paper, we consider two different methods which are bilinear error concealment and inpainting for SI generation at a lower resolution.

4.2.1. Bilinear Interpolation

SI generation at decoder can be regarded as error concealment (EC) process where the WZ blocks have to be estimated using EC techniques. Among various spatial error concealment techniques [31–34], bilinear error concealment [31] is chosen to estimate the WZ blocks because it is simple but highly efficient.

Bilinear interpolation is a spatial error concealment method which uses the spatially adjacent blocks to recreate the missing pixels by a weighted averaging procedure. Let x and y represent the vertical and horizontal coordinates of the WZ block, where $0 \leq x \leq Q - 1$ and $0 \leq y \leq Q - 1$ (Q is the WZ block size). Let $T (y)$ and $B (y)$ be the pixels to the top and bottom of the WZ block and let $L (x)$ and $R (x)$ be the pixels to the left and right of the WZ block. If P is the estimated pixel, it can be calculated by (5). The weights are defined in (6) so that they are inversely proportional to the distance of the neighboring pixels from the estimated pixel:

\begin{array}{l} p \\ = \frac{T (y) w_{T} (x) + B (y) w_{B} (x) + L (x) w_{L} (y) + R (x) w_{R} (y)}{w_{T} (x) + w_{B} (x) + w_{L} (y) + w_{R} (y)} \end{array}

(5)

\begin{array}{l} w_{T} (x) = Q - x \\ w_{B} (x) = x + 1 \\ w_{L} (y) = Q - y \\ w_{R} (y) = y + 1 . \end{array}

(6)

4.2.2. Region-Filling Inpainting

EC at the lower resolution frame can also be regarded as a hole-filling problem. Region-filling inpainting technique proposed in [35–37] fills holes within the image by propagating linear structure (also called isophotes) into the target region by diffusion. This interactive processing includes 3 steps, namely, patch priorities computation, texture and structure information propagation, and confidence value updating. The initial setting includes target region $(Ω)$ specification, source region $(Φ)$ definition by subtracting the target region from the entire image, and the specification of template window size $(Ψ)$ which is usually set to be slightly larger than the largest distinguishable texture element in the region Φ. Once the parameters are determined, the iterative inpainting process starts automatically until all pixels have been filled. In general, region-filling inpainting incurs high computational complexity, but the processing time can be reduced in the proposed method since inpainting is performed at the lower-resolution WZ frame.

4.3. Spatiotemporal Joint Bilateral Upsampling

After applying EC, the error concealed frame is upsampled using the proposed STJBU method. STJBU is an extension of joint bilateral upsampling (JBU) [24]. JBU is an extension of bilateral filtering [23] and it uses both a domain filter and a range filter to adaptively combine pixels based on both their geometric closeness and their photometric similarity. The difference between JBU and bilateral filtering is that the range filter in JBU is applied to a second guidance image.

In the proposed method, JBU cannot be applied directly because the target reference pixels used for the range filter are not available. In order to solve this problem, the temporal correlation between the consecutive frames is considered. The information in the previous frame is exploited to be used as the second guide image for the range filter. The collocated block in the previous frame is found by boundary matching and it is used as the reference block for the range filter. The scheme of STJBU is shown in Figure 3.

Figure 3

Proposed spatiotemporal joint bilateral upsampling (STJUB) scheme.

Given a previously decoded frame at high resolution ${\tilde{I}}_{p}$ and a low resolution input $S_{\bar{q}}$ , which is the error concealed downsampled WZ frame, a spatial filter is applied to the low resolution input $S_{\bar{q}}$ , while the range filter is jointly applied on the previous high resolution frame ${\tilde{I}}_{p}$ . The upsampled WZ frame ${\tilde{S}}_{p}$ is obtained using the following:

\begin{matrix} {\tilde{S}}_{p} = \frac{1}{k_{p}} \sum_{\bar{q}} S_{\bar{q}} f (∥ \bar{p} - \bar{q} ∥) g (∥ {\tilde{I}}_{p} - {\tilde{I}}_{q} ∥), \end{matrix}

(7)

where p and q denote integer positions in the high resolution frame grid.

\bar{p}

and

\bar{q}

denote the corresponding coordinates in the lower resolution frame grid, f is the domain filter centered over

\bar{p}

, g is the range filter centered at the image value at p, and the normalization term

k_{p}

is the sum of the

f \cdot g

filter weights which ensures that the weights for all the pixels add up to one.

5. Simulation Results

To evaluate the performance of the proposed SI generation technique, we conducted experiments using four standard test sequences, Hall Monitor, Akiyo, Mother and Daughter, and Foreman of QCIF size ( $176 \times 144$ ) sampled at 15 frames per second. The luminance component of key and WZ frames and the classification map are taken into consideration for the bitrate computation. The GOP size is adaptive and the maximum GOP size is 5. Only the DC band and first two AC bands of the WZ blocks are refined using the BCH code.

5.1. Comparison of Different Downsampling Methods

First, we compare the performance of four different downsampling methods introduced in Section 4.1. For each test sequence, the first 50 frames are used for simulation. For the experiments, the frames are downsampled using different downsampling methods and then upsampled using the proposed STJBU. The resulting frames are compared to the original frame to calculate the peak-signal-to-noise ratio (PSNR). Table 1 shows the average PSNR value of the four test sequences when different downsampling methods are applied.

Table 1

Comparison of different downsampling methods.

	Bilinear (dB)	Bicubic (dB)	Nearest (dB)	Lanczos (dB)
Hall Monitor	34.495	34.807	33.290	35.362
Akiyo	42.910	43.313	41.591	44.057
Mother Daughter	38.703	39.984	37.256	40.178
Foreman	32.795	33.194	31.137	33.826

As shown in Table 1, the nearest neighbor downsampling algorithm has the lowest computational complexity but it produces the lowest quality. The Lanczos algorithm is much more complex than the other methods but gives the best quality. The processing time of the Lanczos algorithm is almost 10 times higher than the other methods. Bilinear and bicubic downsampling algorithms have lower computational complexity with acceptable output quality. By considering the trade-off between the performance and the processing speed, bilinear downsampling is chosen to downsample the partially reconstructed WZ frames.

5.2. Comparison of the Reconstructed WZ Frame Quality

This section compares the visual quality of the SI generated by the proposed method with that of the hybrid spatiotemporal error concealment [19]. Akiyo and Hall Monitor sequences were encoded and decoded using the LC-DVC architecture setting the QPISlice value to 30. For the experiments, we use two different EC techniques along with STJBU. In the following sections, we refer to different techniques as defined in Table 2.

Table 2

Different SI generation methods being compared.

Proposed 1	Inpainting as EC and upsampled by STJBU
Proposed 2	BI as EC and upsampled by STJBU
Hybrid EC [19]	Hybrid spatiotemporal EC

The simulation results shown in Figure 4 illustrate the visual quality of WZ frames obtained by different methods for the Akiyo sequence. As can be seen in Figure 4, the proposed methods produce WZ frames with higher PSNR compared to the ones obtained by the hybrid EC [19]. Specifically, Proposed 1 (inpainting + STJBU) achieves better performance than Proposed 2 (BI + STJBU) because image inpainting is more effective than simple BI in error concealment, while it increases the computational complexity. However, since image inpainting is applied to a lower resolution image, the proposed method maintains low computational complexity.

Figure 4

Comparison of visual quality of the SI generated by different methods: (a) partially reconstructed 12th frame (WZ frame) of Akiyo sequence, (b) hybrid EC; PSNR = 43.259 dB. (c) Proposed 2; PSNR = 42.726 dB. (d) Proposed 1; PSNR = 44.602 dB.

5.3. Comparison of the Rate-Distortion Performance in SI Generation

Next, we encode the first 150 frames of Akiyo, Mother and Daughter, Hall Monitor, and Foreman sequences at various bitrates and compare the RD performances of the proposed methods with that of DISCOVER [38], H.264/AVC intramode, LC-DVC [19], and a recent proposed selective data pruning (SDP)-DVC [39]. It should be noted that DISCOVER uses bidirectional motion estimation for SI generation and uses a feedback channel. Therefore, DISCOVER achieves higher rate-distortion performance than block-based DVC without a feedback channel for high motion sequence but incurs prohibitively long delay.

Rate-distortion (RD) performances of four test sequences are shown in Figures 5, 6, 7, and 8, respectively. Taking the Hall Monitor sequence as an example, it can be seen in Figure 7 that Proposed 1 gives the best RD performance, even better than the SDP-DVC [39]. However, the RD performance of Proposed 2 is lower than that of LC-DVC and DISCOVER. Both Proposed 1 and Proposed 2 perform better than H.264/AVC in intramode with an extremely simple encoder. BI based error concealment used in Proposed 2 enables a very simple encoder, but it reduces SI quality and RD performance.

Figure 5

RD performance comparison for Akiyo sequence.

Figure 6

RD performance comparison for Mother and Daughter sequence.

Figure 7

RD performance comparison for Hall Monitor sequence.

Figure 8

RD performance comparison of Foreman sequence.

As shown in Figure 8, the proposed method performs worse for higher motion sequences such as the Foreman sequence. However, it should be noted that DISCOVER uses bidirectional motion estimation for SI generation and uses a feedback channel. Therefore, the DISCOVER codec incurs extremely long decoding time and system delay. Since the proposed method is very simple, has low system delay, and does not require a feedback channel while producing a comparable rate-distortion performance to state-of-the-art DVC methods, it can be a promising solution for video applications in resource-limited environments.

6. Conclusion

In this paper, we present a robust STJBU-based SI generation method. The proposed method consists of 3 steps: (1) downsampling of a partially reconstructed WZ frame, (2) SI generation for the WZ blocks in the downsampled WZ frame, and (3) upsampling of the WZ frame using the proposed STJBU algorithm. Results show that the proposed method improves the visual quality of the SI by preserving the edges and improves the RD performance by more than 1 dB in comparison to other DVC architectures. The proposed SI generation method is simple and can be implemented into any exiting block-based DVC architecture. Moreover, with its low complexity and low latency, the proposed method can be a promising solution for video applications in resource-limited environments with a tight delay bound.

Footnotes

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This work was supported by the Technology Development Program for Commercializing System Semiconductor funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea). (No. 10041126, title: International Collaborative R&BD Project for System Semiconductor).

References

Aaron

Rane

Setton

Girod

Transform-domain wyner-ziv codec for video

5308

Visual Communications and Image Processing

January 2004

520 528 Proceedings of SPIE

2-s2.0-10444281537

10.1117/12.527204

Puri

Ramchandran

PRISM: a new robust video coding architecture based on distributed compression principles

Proceedings of the Allerton Conference on Communication, Control, and Computing

2002

Artigas

Ascenso

Dalai

Klomp

Kubasov

Ouaret

The DISCOVER codec: architecture, techniques and evaluation

Proceedings of the Picture Coding Symposium (PCS ′07)

2007

Hänsel

Richter

Müller

Incorporating feature point-based motion hypotheses in distributed video coding

Proceedings of the 3rd International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT ′11)

October 2011

2-s2.0-84856157217

Hansel

Muller

Improved adaptive temporal inter/extrapolation schemes for distributed video coding

Proceedings of the International Conference on Picture Coding Symposium (PCS ′12)

2012

213 216

Park

S. U.

Lee

Y. Y.

Kim

C. S.

Lee

S. U.

Efficient side information generation using assistant pixels for distributed video coding

Proceedings of the International Conference on Picture Coding Symposium (PCS ′12)

2012

161 164

Ren

Shi

Luo

Liu

A new scheme for side information generation in DVC by using optical flow algorithm

Proceedings of the 2nd International Conference on Multimedia Technology (ICMT ′11)

July 2011

2852 2856

2-s2.0-80052959726

10.1109/ICMT.2011.6003053

Luong

H. V.

Raket

L. L.

Huang

Forchhammer

Side information and noise learning for distributed video coding using optical flow and clustering

Transactions on Image Processing 2012 21 12 4782 4796

Kim

D. Y.

Jun

D. S.

Park

H. W.

An efficient side information generation using seed blocks for distributed video coding

Proceedings of the 28th Picture Coding Symposium (PCS ′10)

December 2010

86 89

2-s2.0-79951793371

10.1109/PCS.2010.5702585

10.

Aaron

Rane

Girod

Wyner-Ziv video coding with hash-based motion compensation at the receiver

Proceedings of the International Conference on Image Processing (ICIP ′04)

October 2004

3097 3100

2-s2.0-20444500087

11.

Martinian

Vetro

Yedidia

J. S.

Ascenso

Khisti

Malioutov

Hybrid distributed video coding using SCA codes

Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing (MMSP ′06)

October 2006

258 261

2-s2.0-34250788036

10.1109/MMSP.2006.285309

12.

Macchiavello

De Queiroz

R. L.

Mukherjee

Motion-based side-information generation for a scalable Wyner-Ziv video coder

Proceedings of the 14th IEEE International Conference on Image Processing (ICIP ′07)

September 2007

413 416

2-s2.0-48149099601

10.1109/ICIP.2007.4379609

13.

MacChiavello

Brandi

Peixoto

De Queiroz

R. L.

Mukherjee

Side-information generation for temporally and spatially scalable Wyner-Ziv codecs

Eurasip Journal on Image and Video Processing 2009 2009

2-s2.0-63749125131

10.1155/2009/171257

171257

14.

Macchiavello

Mukherjee

de Queiroz

R. L.

Iterative side-information generation in a mixed resolution wyner-ziv framework

IEEE Transactions on Circuits and Systems for Video Technology 2009 19 10 1409 1423

2-s2.0-70350053007

10.1109/TCSVT.2009.2026820

15.

Mukherjee

A robust reversed-complexity Wyner-Ziv video codec introducing sign-modulated codes

2006 HPL-2006-80

HP Lab.

16.

Mukherjee

Macchiavello

De Queiroz

R. L.

A simple reversed-complexity Wyner-Ziv video coding mode based on a spatial reduction framework

6508

Visual Communications and Image Processing

February 2007

65081Y1 65081Y12 Proceedings of SPIE

2-s2.0-35148854448

17.

Phan

T. T.

Tanaka

Hasegawa

Kato

Mixed-resolution Wyner-Ziv video coding based on selective data pruning

Proceedings of the 3rd IEEE International Workshop on Multimedia Signal Processing (MMSP ′11)

November 2011

1 5

2-s2.0-84055177002

10.1109/MMSP.2011.6093784

18.

Zhang

Zhao

Zhang

Xiong

Gao

Interpolation-dependent image downsampling

IEEE Transactions on Image Processing 2011 20 11 3291 3296

2-s2.0-80054811987

10.1109/TIP.2011.2158226

19.

Vijayanagar

K. R.

Kim

Dynamic GOP size control for low-delay distributed video coding

Proceedings of the 18th IEEE International Conference on Image Processing (ICIP ′11)

September 2011

157 160

2-s2.0-84856265595

10.1109/ICIP.2011.6115750

20.

Tomasi

Manduchi

Bilateral filtering for gray and color images

Proceedings of the 1998 IEEE 6th International Conference on Computer Vision

January 1998

839 846

2-s2.0-0032319446

21.

Yoon

K. J.

Kweon

I. S.

Adaptive support-weight approach for correspondence search

IEEE Transactions on Pattern Analysis and Machine Intelligence 2006 28 4 650 656

2-s2.0-33144482417

10.1109/TPAMI.2006.70

22.

Sun

Tang

Guided image filtering

Proceedings of the European Conference on Computer Vision (ECCV ′11)

2010

1 14

23.

Kopf

Cohen

Lischinski

Uyttendaele

Joint bilateral upsampling

IEEE Transactions on Graphics SIGGRAPH 2007 26 96 100

24.

Zhang

Gunturk

B. K.

Multiresolution bilateral filtering for image denoising

IEEE Transactions on Image Processing 2008 17 12 2324 2333

2-s2.0-57049096977

10.1109/TIP.2008.2006658

25.

Petschnigg

Szeliski

Agrawala

Cohen

Hoppe

Toyama

Digital photography with flash and no-flash image pairs

Proceedings of the ACM Transactions on Graphics (SIGGRAPH ′04)

August 2004

664 672

2-s2.0-12844252390

10.1145/1015706.1015777

26.

Bruckstein

A. M.

Elad

Kimmel

Down-scaling for better transform compression

IEEE Transactions on Image Processing 2003 12 9 1132 1144

2-s2.0-0042428991

10.1109/TIP.2003.816023

27.

Tsaig

Elad

Milanfar

Golub

G. H.

Variable projection for near-optimal filtering in low bit-rate block coders

IEEE Transactions on Circuits and Systems for Video Technology 2005 15 1 154 160

2-s2.0-12344333328

10.1109/TCSVT.2004.839980

28.

Lin

Dong

Adaptive downsampling to improve image compression at low bit rates

IEEE Transactions on Image Processing 2006 15 9 2513 2521

2-s2.0-33747703676

10.1109/TIP.2006.877415

29.

Schwarz

Marpe

Wiegand

Overview of the scalable video coding extension of the H.264/AVC standard

IEEE Transactions on Circuits and Systems for Video Technology 2007 17 9 1103 1120

2-s2.0-34748835762

10.1109/TCSVT.2007.905532

30.

Shangguan

J. T.

Y. L.

Wang

Y. G.

H. L.

Fast algorithm of modified cubic convolution interpolation

Proceedings of the 4th International Congress on Image and Signal Processing (CISP ′11)

October 2011

1072 1075

2-s2.0-84855608116

10.1109/CISP.2011.6100267

31.

Liu

L.-J.

Zhang

Chen

Bilinear interpolation of geomagnetic field

Proceedings of the International Conference on Computer Application and System Modeling (ICCASM ′10)

October 2010

V2665 V2668

2-s2.0-78649561307

10.1109/ICCASM.2010.5620517

32.

Ben-Yue

Min

Adaptive algorithm for image interpolation based on blending osculatory rational interpolants

Computer Engineering and Applications 2010 46 1 196 199

33.

Han

J. K.

Kim

H. M.

Modified cubic convolution scaler for multiformat conversion in a transcoder

Optical Engineering 2004 43 7 1596 1608

2-s2.0-4143071399

10.1117/1.1758732

34.

Shi

Reichenbach

S. E.

Image interpolation by two-dimensional parametric cubic convolution

IEEE Transactions on Image Processing 2006 15 7 1857 1870

2-s2.0-33745599642

10.1109/TIP.2006.873429

35.

Criminisi

Perez

Toyama

Object removal by exemplar-based inpainting

Proceedings of the International Conference on Computer Vision and Pattern Recog (CVPR ′03)

2003

II-721 II-728

36.

Lei

T. C. W.

Chern

S. J.

Partial boundary matching algorithm and spatio-temporal texture synthesis in distributed video coding

Proceedings of the 4th International Conference on Innovative Computing, Information and Control (ICICIC ′09)

December 2009

353 356

2-s2.0-77951485194

10.1109/ICICIC.2009.290

37.

Hänsel

Müller

Error locating for plausible Wyner-Ziv video coding using turbo codes

Proceedings of the IEEE International Workshop on Multimedia Signal Processing (MMSP ′09)

October 2009

1 6

2-s2.0-74349113764

10.1109/MMSP.2009.5293291

38.

http://www.img.lx.it.pt/~discover/rd_performance.html

39.

Tuan

P. T.

Tanaka

Hasegawa

Kato

Mixed-resolution Wyner-Ziv video coding based on selective data pruning

Proceedings of the 3rd IEEE International Workshop on Multimedia Signal Processing (MMSP ′11)

November 2011

1 5

2-s2.0-84055177002

10.1109/MMSP.2011.6093784