      Is Open Access

      Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation

      Preprint


          Abstract

          In this paper, we investigate the problem of unpaired video-to-video translation. Given a video in the source domain, we aim to learn the conditional distribution of the corresponding video in the target domain, without seeing any pairs of corresponding videos. While significant progress has been made in the unpaired translation of images, directly applying these methods to an input video leads to low visual quality due to the additional time dimension. In particular, previous methods suffer from semantic inconsistency (i.e., semantic label flipping) and temporal flickering artifacts. To alleviate these issues, we propose a new framework that is composed of carefully-designed generators and discriminators, coupled with two core objective functions: 1) content preserving loss and 2) temporal consistency loss. Extensive qualitative and quantitative evaluations demonstrate the superior performance of the proposed method against previous approaches. We further apply our framework to a domain adaptation task and achieve favorable results.
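The abstract names two core objectives, a content preserving loss and a temporal consistency loss, but does not reproduce their formulations on this page. As a rough illustration only, the sketch below shows what such losses typically look like: the function names and the NumPy formulation are assumptions for illustration, not the authors' definitions.

```python
import numpy as np

def temporal_consistency_loss(frame_t, warped_frame_prev):
    # L1 distance between the frame generated at time t and the previous
    # generated frame warped into time t (e.g. by optical flow); a small
    # value means little frame-to-frame flickering.
    return np.mean(np.abs(frame_t - warped_frame_prev))

def content_preserving_loss(probs_source, probs_translated, eps=1e-8):
    # Cross-entropy between per-pixel class probabilities predicted on the
    # source frame (used as soft targets) and on the translated frame;
    # this penalises semantic label flipping during translation.
    ce = -(probs_source * np.log(probs_translated + eps)).sum(axis=-1)
    return ce.mean()

# When the warp is perfect the temporal term vanishes:
frame = np.random.rand(4, 4, 3)
print(temporal_consistency_loss(frame, frame))  # 0.0
```

In this reading, the temporal term ties consecutive generated frames together through a motion-compensated reconstruction error, while the content term anchors the translated frame to the semantics of the input; the actual losses in the paper may differ in form and weighting.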


Most cited references (15)

• Pyramid Scene Parsing Network
• High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
• A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

                Author and article information

Journal
Date: 20 August 2019
Article type: Preprint
DOI: 10.1145/3343031.3350864
arXiv: 1908.07683
Record ID: a9422042-6cc4-4040-9134-cdcd00832139

License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
Accepted by ACM Multimedia (ACM MM) 2019
Categories: cs.CV, cs.MM

                Computer vision & Pattern recognition,Graphics & Multimedia design
