Paper Title:
V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection
Published on:
25 April 2024
Primary Category:
Computer Vision and Pattern Recognition
Paper Authors:
Xuanyu Zhang,
Youmin Xu,
Runyi Li,
Jiwen Yu,
Weiqi Li,
Zhipei Xu,
Jian Zhang
Combines fragile video steganography and robust watermarking for versatility
Enables precise tamper localization and copyright protection
Introduces temporal alignment and fusion module for accuracy
Uses cross-modal extraction between video and audio
Achieves high localization precision without retraining
Deep visual and audio watermarking for AI video editing forensics
This paper proposes V2A-Mark, a method to embed invisible visual and audio watermarks into videos to enable precise tamper localization and copyright protection. It addresses limitations of current forensic methods like poor generalizability and singular functionality. V2A-Mark combines fragile video steganography with robust watermarking, using temporal alignment and cross-modal extraction between video and audio. Experiments show it achieves superior localization precision and copyright accuracy compared to other methods, without needing retraining for new tamper types.
No comments yet, be the first to start the conversation...
Sign up to comment on this paper