Large Language Models (LLMs) produce increasingly natural text that can be difficult to distinguish from human-written text. One way to make LLM-generated text identifiable is to apply a watermark during generation. This is often done by introducing some form of pseudo-random noise into the logits produced by the LLM before applying the softmax and sampling from the resulting token distribution. A detector function can then identify the text by checking for this embedded pseudo-random signal, which is unlikely to occur in human-written text.
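As a concrete illustration of biasing logits before the softmax, the following is a minimal sketch in the style of a Green-list watermark. The function name, parameter names (`gamma` for the green-list fraction, `delta` for the logit bias), and the seeding scheme are my own illustrative assumptions, not the exact construction of any published scheme.

```python
import numpy as np

def watermarked_sample(logits, prev_token, vocab_size, gamma=0.5, delta=2.0, seed=42):
    """Illustrative Green-list-style watermarked sampling (all names hypothetical).

    A PRNG keyed on the previous token marks a fraction `gamma` of the
    vocabulary as "green"; green logits are boosted by `delta` before the
    softmax, so green tokens are sampled more often than chance.
    """
    # Pseudo-random green-list mask, derived deterministically from context
    rng = np.random.default_rng(seed * vocab_size + prev_token)
    green = rng.random(vocab_size) < gamma
    # Bias green tokens' logits, then apply a numerically stable softmax
    biased = np.asarray(logits, dtype=float) + delta * green
    probs = np.exp(biased - biased.max())
    probs /= probs.sum()
    # Sample the next token from the biased distribution
    return rng.choice(vocab_size, p=probs), green
```

Because the mask is derived only from the previous token and a shared key, a detector can reconstruct it later without access to the model.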
While watermarks are an important step toward text provenance, they face limitations that make them difficult to apply effectively in practice: detector functions are prone to misidentifying both human- and LLM-generated text; watermarked text that has been even slightly modified after generation may be difficult to identify; and applying a watermark may degrade the quality of the LLM's output. Here, I analyze and compare several watermarks, including the Green-list, Gumbel-max, and SynthID schemes, to see how each counteracts these problems. Using the Llama 3 LLM, I generated text with and without watermarks across diverse prompts and varied token limits, and used this data to compute statistics such as perplexity and detection accuracy to evaluate each watermark's effectiveness.
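To make the detection side concrete, here is a minimal sketch of a green-fraction detector. It assumes a hypothetical Green-list-style scheme in which each position's green list is re-derivable from the preceding token and a shared seed; the function name, seeding, and thresholding convention are illustrative assumptions, not a published detector.

```python
import math
import numpy as np

def detect_green_fraction(tokens, vocab_size, gamma=0.5, seed=42):
    """Illustrative watermark detector (all names hypothetical).

    Re-derives each position's green list from the preceding token and
    performs a one-proportion z-test of the observed green-token fraction
    against the chance rate `gamma`. A large positive z-score suggests
    the text was generated with the watermark.
    """
    hits = 0
    for prev, tok in zip(tokens, tokens[1:]):
        # Reconstruct the same pseudo-random green list the generator would use
        rng = np.random.default_rng(seed * vocab_size + prev)
        green = rng.random(vocab_size) < gamma
        hits += bool(green[tok])
    n = len(tokens) - 1
    # z-score of observed green count vs. Binomial(n, gamma) expectation
    return (hits - gamma * n) / math.sqrt(gamma * (1 - gamma) * n)
```

This also illustrates why light human edits weaken detection: each substituted token can flip a green hit to a miss, shrinking the z-score toward the unwatermarked baseline.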