24 June 2021 Adaptive coding unit size convolutional neural network for fast 3D-HEVC depth map intracoding
Hua Zhang, Wangze Yao, Hongfei Huang, Yifan Wu, Guojun Dai
Abstract

The advanced three-dimensional extension of high-efficiency video coding (3D-HEVC) is the latest coding standard for 3D video. Coding the depth map in 3D-HEVC is very time-consuming. With the development of deep learning, it has become feasible to employ convolutional neural networks (CNNs) to predict the coding unit (CU) division of the depth map. However, CUs come in three sizes (64, 32, and 16), which makes it difficult to handle them with a single unified model. In addition, the features of the depth map are very different from those of the texture map. To address these problems, we propose an adaptive CU size CNN for fast 3D-HEVC depth map intracoding. We first employ spatial pyramid pooling to fully extract the features of the three types of CUs. Then, we apply the nonlocal self-attention mechanism to adapt the network to depth maps. Compared with the 3D-HEVC reference algorithm, the proposed network reduces the coding time by an average of 35.7%, while the quality degradation of the synthesized virtual view is negligible.
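To illustrate why spatial pyramid pooling lets a single model accept all three CU sizes, the following is a minimal sketch and not the authors' network. The pyramid levels (1, 2, 4), the helper names `spp_max` and `toy_cu`, and the use of raw sample values in place of convolutional feature maps are all assumptions made for illustration; the abstract does not specify any of them.

```python
import math


def spp_max(feature_map, levels=(1, 2, 4)):
    """Spatial pyramid max pooling over a square 2D map (list of lists).

    For each level n, the map is split into an n x n grid of bins and the
    maximum of each bin is kept. The output length is sum(n * n for n in
    levels), independent of the input size.
    The levels (1, 2, 4) are an assumed choice, not taken from the paper.
    """
    size = len(feature_map)
    pooled = []
    for n in levels:
        for i in range(n):
            # Row range of bin i; floor/ceil bounds keep every bin non-empty.
            r0 = (i * size) // n
            r1 = math.ceil((i + 1) * size / n)
            for j in range(n):
                c0 = (j * size) // n
                c1 = math.ceil((j + 1) * size / n)
                pooled.append(
                    max(
                        feature_map[r][c]
                        for r in range(r0, r1)
                        for c in range(c0, c1)
                    )
                )
    return pooled


def toy_cu(size):
    """Hypothetical stand-in for a size x size depth-map block."""
    return [[(r * size + c) % 251 for c in range(size)] for r in range(size)]


# The three CU sizes named in the abstract all map to the same vector length.
lengths = {size: len(spp_max(toy_cu(size))) for size in (64, 32, 16)}
print(lengths)
```

Because every input size yields a vector of the same length (1 + 4 + 16 = 21 values per channel under the assumed levels), the layers that follow the pooling stage can be shared across 64, 32, and 16 CUs.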

© 2021 SPIE and IS&T 1017-9909/2021/$28.00
Hua Zhang, Wangze Yao, Hongfei Huang, Yifan Wu, and Guojun Dai "Adaptive coding unit size convolutional neural network for fast 3D-HEVC depth map intracoding," Journal of Electronic Imaging 30(4), 041405 (24 June 2021). https://doi.org/10.1117/1.JEI.30.4.041405
Received: 13 January 2021; Accepted: 29 March 2021; Published: 24 June 2021
KEYWORDS
Copper

Volume rendering

Computer programming

Video

Video coding

3D modeling

Lithium
