Estimation-based intra prediction algorithm for H.264/AVC

Chun-Su Park; Sung-Jea Ko

doi:10.1117/1.3101375

1 March 2009 Estimation-based intra prediction algorithm for H.264/AVC

Chun-Su Park, Sung-Jea Ko

Author Affiliations +

Optical Engineering, Vol. 48, Issue 3, 030506 (March 2009). https://doi.org/10.1117/1.3101375

Abstract

We propose a new intra prediction algorithm that can improve the coding efficiency of the H.264/AVC standard. In the proposed algorithm, the prediction signal for each macroblock (MB) is formed by using all the previously reconstructed pixels of the current frame. Experimental results show that the proposed algorithm can reduce the bit rate by 1.36% to 9.07%, as compared with the conventional intra prediction, while the average PSNR is not decreased.

1. Introduction

Compared with the other existing video coding standards, the H.264/AVC standard achieves a significant improvement in compression performance.¹ Its high coding efficiency is made possible by new advanced coding tools such as variable block size motion estimation (ME), multiple reference frames, quarter-pixel accuracy ME, and intra prediction. Among these, we focus on the intra prediction in the H.264/AVC standard. The basic concept of the intra prediction is to reduce the number of coded bits by exploiting the spatial correlation between adjacent macroblocks (MBs). When an MB is coded in the intra mode, a prediction signal is formed by using neighboring pixels in the upper and/or left MBs. Intra prediction is used to prevent error propagation, provide random access, and support a real-time streaming service.² However, it is well known that the number of bits generated by intra prediction is much larger than that generated by inter prediction. In this letter, in order to enhance the coding efficiency of intra prediction, we propose an improved intra prediction algorithm. The proposed algorithm searches for the prediction signal for each MB by using all the previously reconstructed pixels of the current frame. The experimental results show that the proposed algorithm can reduce the bit rate by 1.36% to 9.07% depending on the video sequence.

2. Proposed Intra Prediction Algorithm

In this letter, we focus on the intra prediction algorithm adopted in the latest video coding standard H.264/AVC.³ The proposed algorithm can be easily applied to other video coding standards, including H.263 and MPEG-2. The H.264/AVC standards offers a rich set of prediction patterns for intra prediction, i.e., nine prediction modes for $4 \times 4$ blocks and four prediction modes for $16 \times 16 MBs$ . In addition, nine prediction modes for $8 \times 8$ blocks have been added as part of the fidelity range extension (FRExt) of the standard.

Let $B_{i, j}$ be the current MB to be coded, where $i$ and $j$ are the coordinates in horizontal and vertical directions, respectively. Figure 1 shows $B_{i, j}$ and its neighboring MBs. In H.264/AVC intra prediction, only a small number of adjacent pixels are used to construct the prediction signal (see Fig. 1). Thus, in general, the prediction signal generated by intra prediction is not well matched to the original signal, and a large number of bits are required for encoding the difference between the prediction and original signals. Note that, at the time when the current MB is encoded, all pixels of the preceding MBs in the raster scan order have been already reconstructed. Thus, the current MB can be more precisely predicted by utilizing the reconstructed pixels of the preceding MBs. We propose an estimation-based intra prediction algorithm to generate the prediction signal for intra MB. In the proposed algorithm, the best matching position of the current MB is searched in the previously reconstructed part of the current frame. Our proposed algorithm provides a displacement vector (DV) indicating the best matching position with the minimum prediction error. Figure 2 shows the proposed intra prediction algorithm using the DV.

Fig. 1

$B_{i, j}$ and its neighboring MBs.

Fig. 2

Proposed intra prediction algorithm.

The use of the DV can reduce the number of bits required for the prediction error, but it causes additional overhead.⁴ The trade-off between the rate and distortion can be optimized using the Lagrangian method.⁵ The optimal DV $m_{i, j}$ for $B_{i, j}$ is selected by minimizing the Lagrangian functional

Eq. 1

m_{i, j} = \underset{m ∊ M}{argmin} [D_{DV} (B_{i, j}, m) + λ_{DV} \cdot R_{DV} (B_{i, j}, m)],

where

M

is the DV search range, and

λ_{DV}

is the Lagrange parameter. The rate

R_{DV} (B_{i, j}, m)

specifies the number of bits required for encoding the DV. Let

m_{x}

and

m_{y}

be the components of the DV

m_{i, j}

along the

x

and

y

axes, respectively. Then, corresponding to

m_{i, j} = (m_{x}, m_{y})

, the distortion

D_{DV} (B_{i, j}, m)

is calculated as

Eq. 2

D_{DV} (B_{i, j}, m) = \sum_{y = 0}^{15} \sum_{x = 0}^{15} {∣ B_{i, j} (x, y) - B_{i, j} (x - m_{x}, y - m_{y}) ∣}^{p},

where

p = 1

for the sum of absolute difference (SAD), and

p = 2

for the sum of squared differnece (SSD). Note that in Eq. 2, the computation complexity of the proposed method is almost the same as that of the motion estimation for a

16 \times 16 MB

.

From our simulation, we found that the DV of the current MB is correlated with those of neighboring MBs. Thus, instead of the original DV, we encode the difference between the original DV and the predicted one. Let $m_{i, j}^{p}$ be the predicted DV for $m_{i, j}$ . In the proposed method, $m_{i, j}^{p}$ is set to the median value of the neighboring DVs as follows:

Eq. 3

m_{i, j}^{p} = median (m_{i - 1, j}, m_{i, j - 1}, m_{i + 1, j - 1}) .

3. Implementation

The proposed method requires minor modification of the syntax of the H.264/AVC standard. When an MB is coded in intra mode, we add to the syntax a flag $DV ̱ flag$ indicating whether the DV is coded $(D V ̱ flag = 1)$ or not $(D V ̱ flag = 0)$ . At the decoder, the following parsing process is performed:

• If $D V ̱ flag = 0$ , the decoder skips the parsing process for the DV. In this case, the MB is reconstructed based on the conventional intra prediction in H.264/AVC.
• If $D V ̱ flag = 1$ , the decoder parses the DV and constructs a prediction signal by using the proposed algorithm. The decoded residual data is added to the resultant prediction signal.

In order to improve the coding efficiency further, the proposed algorithm estimates the DV in the search range, including the unreconstructed MBs, $B_{i, j}$ and $B_{i + 1, j}$ , as well as the previously reconstructed MBs (see Fig. 2). In the proposed algorithm, since there is no modification in the syntax of the standard except the DV and its corresponding flag, the intra prediction mode is always encoded regardless of the usage of the proposed algorithm. Thus, if $D V ̱ flag = 1$ , the decoder interpolates the pixels in $B_{i, j}$ used for the DV estimation based on the coded intra prediction mode. As shown in Fig. 2, the pixels in $B_{i + 1, j}$ are obtained by simply copying the pixels in the last row of $B_{i + 1, j - 1}$ .

4. Experimental Results

For our experiments, we used Joint Scalable Video Model (JSVM) 8.7 reference software. We used three test sequences, Foreman, Crew, and Table, with CIF at $30 fps$ . These test sequences have different characteristics. The DV search range is set to 32, and all frames were coded in the intra mode. We calculated the average bit rate and PSNR for the video sequences of 200 frames. The CAVLC entropy coding method was used in our experiments.

We compare the proposed intra prediction algorithm with the conventional one in the H.264/AVC using several quantization parameters (QPs). In Table 1, the Foreman sequence produces a higher bit saving than Crew and Table since the number of bits required for encoding the prediction error is smaller than that of the other sequences. The bit rate saving of Foreman is up to 9.07%, with $QP = 44$ . It can be seen that the proposed intra prediction algorithm shows better performance at low bit rates with negligible visual degradation. To show the results clearly, the rate-distortion curves for the Foreman sequence are presented in Fig. 3. As shown in Fig. 3, the proposed algorithm outperforms the conventional one in terms of the rate-distortion sense.

Fig. 3

Rate-distortion curves for the Foreman sequence.

Table 1

Performance comparison of the conventional and proposed algorithms.

QP	Method	Foreman			Crew			Table
QP	Method	Bit rate(kbits/s)	PSNR(dB)	Saving(%)	Bit rate(kbits/s)	PSNR(dB)	Saving(%)	Bit rate(kbits/s)	PSNR(dB)	Saving(%)
32	Conventional	1278.29	35.43	1.56	1336.01	35.43	1.36	1886.43	33.83	1.75
	Proposed	1258.31	35.43		1307.10	35.43		1853.39	33.81
38	Conventional	695.77	31.92	4.66	673.02	31.93	2.16	960.45	30.43	2.32
	Proposed	663.34	31.90		648.51	31.89		938.13	30.41
44	Conventional	401.34	28.39	9.07	340.81	28.86	3.64	533.99	27.21	2.82
	Proposed	364.95	28.42		324.51	28.84		518.94	27.19

5. Conclusion

In this letter, we proposed a new intra prediction algorithm that can improve the coding efficiency of existing video coding standards including H.264/AVC. For each MB, our proposed algorithm provides the DV that indicates the best matching position in the previously reconstructed part of the current frame. The proposed algorithm can be easily implemented by adding a single flag to the syntax of existing video coding standards.

Acknowledgments

This research was supported by Seoul Future Contents Convergence (SFCC) Cluster established by Seoul R&BD Program.

References

1.

T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, “Overview of the H.264/AVC video coding standard,” IEEE Trans. Circuits Syst. Video Technol., 13 560 –576 (2003). https://doi.org/10.1109/TCSVT.2003.815165 1051-8215 Google Scholar

2.

Y. L. Lee, K. H. Han, and G. J. Sullivan, “Improved lossless intra coding for H.264/MPEG-4 AVC,” IEEE Trans. Image Process., 15 2610 –2616 (2006). 1057-7149 Google Scholar

3.

, “Advanced video coding for generic audio-visual services,” (2005) Google Scholar

4.

C. S. Park, C. K. Park, and S. J. Ko, “Generalization of interlayer intra prediction for scalable video coding,” Electron. Lett., 44 337 –338 (2008). https://doi.org/10.1049/el:20083114 0013-5194 Google Scholar

5.

G. J. Sullivan and T. Wiegand, “Rate-distortion optimization for video compression,” IEEE Signal Process. Mag., 15 74 –90 (1998). https://doi.org/10.1109/79.733497 1053-5888 Google Scholar

Citation Download Citation

Chun-Su Park and Sung-Jea Ko "Estimation-based intra prediction algorithm for H.264/AVC," Optical Engineering 48(3), 030506 (1 March 2009). https://doi.org/10.1117/1.3101375

Published: 1 March 2009

Access the abstract

JOURNAL ARTICLE
3 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

CITATIONS

Cited by 3 scholarly publications and 3 patents.

Explore citations on Lens.org

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Reconstruction algorithms

Video coding

Computer programming

Video

Distortion

Motion estimation

Optical engineering

1.

Introduction

2.

Proposed Intra Prediction Algorithm

Fig. 1

Fig. 2

Eq. 1

Eq. 2

Eq. 3

3.

Implementation

4.

Experimental Results

Fig. 3

Table 1

5.

Conclusion

Acknowledgments

References

Show All Keywords

Keywords/Phrases

Search In:

Publication Years