Author
Listed:
- Sumanth Dathathri
(Google DeepMind)
- Abigail See
(Google DeepMind)
- Sumedh Ghaisas
(Google DeepMind)
- Po-Sen Huang
(Google DeepMind)
- Rob McAdam
(Google)
- Johannes Welbl
(Google DeepMind)
- Vandana Bachani
(Google DeepMind)
- Alex Kaskasoli
(Google DeepMind)
- Robert Stanforth
(Google DeepMind)
- Tatiana Matejovicova
(Google DeepMind)
- Jamie Hayes
(Google DeepMind)
- Nidhi Vyas
(Google)
- Majd Al Merey
(Google)
- Jonah Brown-Cohen
(Google DeepMind)
- Rudy Bunel
(Google DeepMind)
- Borja Balle
(Google DeepMind)
- Taylan Cemgil
(Google DeepMind)
- Zahra Ahmed
(Google DeepMind)
- Kitty Stacpoole
(Google DeepMind)
- Ilia Shumailov
(Google DeepMind)
- Ciprian Baetu
(Google)
- Sven Gowal
(Google DeepMind)
- Demis Hassabis
(Google DeepMind)
- Pushmeet Kohli
(Google DeepMind)
Abstract
Large language models (LLMs) have enabled the generation of high-quality synthetic text, often indistinguishable from human-written content, at a scale that can markedly affect the nature of the information ecosystem1–3. Watermarking can help identify synthetic text and limit accidental or deliberate misuse4, but has not been adopted in production systems owing to stringent quality, detectability and computational efficiency requirements. Here we describe SynthID-Text, a production-ready text watermarking scheme that preserves text quality and enables high detection accuracy, with minimal latency overhead. SynthID-Text does not affect LLM training and modifies only the sampling procedure; watermark detection is computationally efficient, without using the underlying LLM. To enable watermarking at scale, we develop an algorithm integrating watermarking with speculative sampling, an efficiency technique frequently used in production systems5. Evaluations across multiple LLMs empirically show that SynthID-Text provides improved detectability over comparable methods, and standard benchmarks and human side-by-side ratings indicate no change in LLM capabilities. To demonstrate the feasibility of watermarking in large-scale-production systems, we conducted a live experiment that assessed feedback from nearly 20 million Gemini6 responses, again confirming the preservation of text quality. We hope that the availability of SynthID-Text7 will facilitate further development of watermarking and responsible use of LLM systems.
Suggested Citation
Sumanth Dathathri & Abigail See & Sumedh Ghaisas & Po-Sen Huang & Rob McAdam & Johannes Welbl & Vandana Bachani & Alex Kaskasoli & Robert Stanforth & Tatiana Matejovicova & Jamie Hayes & Nidhi Vyas & , 2024.
"Scalable watermarking for identifying large language model outputs,"
Nature, Nature, vol. 634(8035), pages 818-823, October.
Handle:
RePEc:nat:nature:v:634:y:2024:i:8035:d:10.1038_s41586-024-08025-4
DOI: 10.1038/s41586-024-08025-4
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:634:y:2024:i:8035:d:10.1038_s41586-024-08025-4. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.