Deep visual and audio watermarking for AI video editing forensics

Paper Title: V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

Published on: 25 April 2024

Primary Category: Computer Vision and Pattern Recognition

Paper Authors: Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

Key Details

• Combines fragile video steganography and robust watermarking for versatility

• Enables precise tamper localization and copyright protection

• Introduces a temporal alignment and fusion module for accuracy

• Uses cross-modal extraction between video and audio

• Achieves high localization precision without retraining for new tamper types

Topics: audio video alignment, copyright protection, fragile video watermarking, media steganography, video forensic methods

AI generated summary

This paper proposes V2A-Mark, a method that embeds invisible visual and audio watermarks into videos to enable precise tamper localization and copyright protection. It addresses limitations of current forensic methods, such as poor generalizability and singular functionality. V2A-Mark combines fragile video steganography with robust watermarking, and uses temporal alignment and cross-modal extraction between video and audio. According to the reported experiments, it achieves higher localization precision and copyright accuracy than the compared methods, without needing retraining for new tamper types.
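To make the idea of tamper localization with a fragile watermark concrete, here is a minimal toy sketch. It is not the V2A-Mark method: the paper uses deep networks, whereas this example assumes a simple least-significant-bit (LSB) scheme on a single grayscale frame. The checkerboard pattern, the block size, and the function names `pattern_bit`, `embed`, and `localize` are all hypothetical choices made for illustration.

```python
BLOCK = 4  # hypothetical block size used when reporting tampered areas


def pattern_bit(y, x):
    """Known fragile pattern: a checkerboard (toy stand-in for a learned pattern)."""
    return (x + y) % 2


def embed(frame):
    """Write the pattern into the least significant bit of every pixel."""
    return [[(p & ~1) | pattern_bit(y, x) for x, p in enumerate(row)]
            for y, row in enumerate(frame)]


def localize(frame):
    """Return the (block_row, block_col) blocks whose LSBs no longer match the pattern."""
    flagged = set()
    for y, row in enumerate(frame):
        for x, p in enumerate(row):
            if (p & 1) != pattern_bit(y, x):
                flagged.add((y // BLOCK, x // BLOCK))
    return flagged


# An 8x8 synthetic grayscale frame (values 0..255), then its watermarked copy.
original = [[(7 * x + 13 * y) % 256 for x in range(8)] for y in range(8)]
marked = embed(original)

# Simulate an edit: overwrite the top-right 4x4 region with a constant value.
tampered = [row[:] for row in marked]
for y in range(0, 4):
    for x in range(4, 8):
        tampered[y][x] = 200

print(localize(marked))    # set()
print(localize(tampered))  # {(0, 1)}
```

The watermark changes each pixel by at most one intensity level, so it stays visually negligible, and any edit that overwrites pixels destroys the pattern in that region. That fragility is what lets the detector point at the edited block without having seen that kind of edit before. A robust copyright watermark, by contrast, must survive such edits and is not covered by this sketch.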