Centrala begrepp
Efficiently embedding dual watermarks in large language models to enhance detection efficiency and text quality.
Sammanfattning
This article introduces Duwak, a dual watermarking approach for large language models. It focuses on improving watermark detection efficiency and text quality by embedding two independent secret patterns in token probability distribution and sampling schemes. The article discusses the theoretical framework, evaluation results, related studies, limitations, and impact statements of Duwak.
Abstract:
- Discusses the importance of auditing large language models (LLMs) and proposes Duwak for efficient watermarking.
- Introduces the concept of embedding dual secret patterns in token probability distribution and sampling schemes.
- Evaluates Duwak's performance against existing watermark techniques under various post-editing attacks.
Introduction:
- Highlights the risks associated with misusing LLMs for generating incorrect content.
- Emphasizes the need for watermarking LLM content to govern applications and prevent misuse.
Data Extraction:
Quotations:
- "Existing watermark techniques are shown effective in embedding single human-imperceptible and machine-detectable patterns without significantly affecting generated text quality." - Author
Statistik
Our anonymous code is available at https://anonymous.4open.science/r/Duwak-BDE5.
Citat
"Existing watermark techniques are shown effective in embedding single human-imperceptible and machine-detectable patterns without significantly affecting generated text quality." - Author