Paper Image

Deep visual and audio watermarking for AI video editing forensics

Published on:

25 April 2024

Primary Category:

Computer Vision and Pattern Recognition

Paper Authors:

Xuanyu Zhang,

Youmin Xu,

Runyi Li,

Jiwen Yu,

Weiqi Li,

Zhipei Xu,

Jian Zhang

Bullets

Key Details

Combines fragile video steganography and robust watermarking for versatility

Enables precise tamper localization and copyright protection

Introduces temporal alignment and fusion module for accuracy

Uses cross-modal extraction between video and audio

Achieves high localization precision without retraining

AI generated summary

Deep visual and audio watermarking for AI video editing forensics

This paper proposes V2A-Mark, a method to embed invisible visual and audio watermarks into videos to enable precise tamper localization and copyright protection. It addresses limitations of current forensic methods like poor generalizability and singular functionality. V2A-Mark combines fragile video steganography with robust watermarking, using temporal alignment and cross-modal extraction between video and audio. Experiments show it achieves superior localization precision and copyright accuracy compared to other methods, without needing retraining for new tamper types.

Answers from this paper

Comments

No comments yet, be the first to start the conversation...

Sign up to comment on this paper

Sign Up