24 February 2022 SUNIT: multimodal unsupervised image-to-image translation with shared encoder
Liyuan Lin, Shulin Ji, Yuan Zhou, Shun Zhang
Author Affiliations +
Abstract

Image-to-image translation that refers to the synthesis task of transferring images from the source domain to the target domain has gained significant progress in recent times. Multimodal image-to-image translation aims to generate images with multiple styles of target domains. However, the existing multimodal image-to-image translation network architectures are incapable of accurately transferring the style of a specified image. Moreover, they require an additional deep encoder network to extract the image style code, which increases the network parameters. To address this problem, we propose Sunit, a multimodal unsupervised image-to-image translation with a shared encoder. Sunit shares an encoder network between the discriminator and style encoder. This method reduces the number of network parameters and uses the information from the discriminator to extract the style. Furthermore, we design a training strategy in which the style encoder solely uses the style reconstruction loss and does not follow the generator to train. In this manner, the target of the style encoder becomes clearer. Finally, extensive experimental validations are carried out on the AFHQ and Celeb-HQ datasets. The results demonstrate that our approach outperforms the state-of-the-art methods in the task of reference-guided image translation and transfers the style of the specified image more accurately.

© 2022 SPIE and IS&T 1017-9909/2022/$28.00 © 2022 SPIE and IS&T
Liyuan Lin, Shulin Ji, Yuan Zhou, and Shun Zhang "SUNIT: multimodal unsupervised image-to-image translation with shared encoder," Journal of Electronic Imaging 31(1), 013033 (24 February 2022). https://doi.org/10.1117/1.JEI.31.1.013033
Received: 23 October 2021; Accepted: 3 February 2022; Published: 24 February 2022
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Network architectures

Image quality

Associative arrays

Gallium nitride

Visualization

Ear

RELATED CONTENT


Back to Top