RelightVid: Temporal-Consistent Diffusion Model for Video Relighting
Abstract
Diffusion models have demonstrated remarkable success in image generation and editing, with recent advancements enabling albedo-preserving image relighting. However, applying these models to video relighting remains challenging due to the lack of paired video relighting datasets and the high demands for output fidelity and temporal consistency, further complicated by the inherent randomness of diffusion models. To address these challenges, we introduce RelightVid, a flexible framework for video relighting that can accept background video, text prompts, or environment maps as relighting conditions. Trained on in-the-wild videos with carefully designed illumination augmentations and rendered videos under extreme dynamic lighting, RelightVid achieves arbitrary video relighting with high temporal consistency, without intrinsic decomposition, while preserving the illumination priors of its image backbone.
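The sketch below illustrates how a multi-condition video relighting pipeline of this kind might be invoked with one of the three condition types the abstract mentions (background video, text prompt, or environment map). All class and function names are illustrative assumptions for exposition, not the authors' released API.

```python
# Hypothetical sketch of a RelightVid-style conditioning interface (assumed names).
from dataclasses import dataclass
from typing import Optional

import numpy as np


@dataclass
class RelightCondition:
    """Exactly one of the three condition types should be provided per call."""
    background_video: Optional[np.ndarray] = None   # (T, H, W, 3) per-frame background clip
    text_prompt: Optional[str] = None               # e.g. "warm golden-hour sunlight"
    environment_map: Optional[np.ndarray] = None    # (He, We, 3) equirectangular HDR map

    def validate(self) -> None:
        provided = [c is not None for c in
                    (self.background_video, self.text_prompt, self.environment_map)]
        if sum(provided) != 1:
            raise ValueError("Provide exactly one of background_video, "
                             "text_prompt, or environment_map.")


def relight_video(frames: np.ndarray, condition: RelightCondition) -> np.ndarray:
    """Placeholder for the diffusion-based relighting step: validates inputs and
    returns the frames unchanged; a real pipeline would run temporally
    consistent diffusion sampling conditioned on the chosen signal."""
    condition.validate()
    assert frames.ndim == 4 and frames.shape[-1] == 3, "expected (T, H, W, 3) video"
    return frames


if __name__ == "__main__":
    video = np.zeros((16, 256, 256, 3), dtype=np.float32)   # 16-frame dummy clip
    cond = RelightCondition(text_prompt="soft blue studio lighting")
    out = relight_video(video, cond)
    print(out.shape)  # (16, 256, 256, 3)
```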