Presenter: Adriana Caraeni
Group Members: Romaisa Fatima
Faculty Sponsor: James Allan
School: UMass Amherst
Research Area: Computer Science
ABSTRACT
This project investigates the internal representations of information retrieval (IR) features within the RankLLaMA 7B language model through layer-wise probing analysis. We extract neuron activations from all 32 layers of RankLLaMA as it processes query-document pairs from the MS MARCO dataset and train Ridge regression models to predict around two dozen distinct IR features from these activations. Our feature set spans traditional retrieval metrics (BM25, TF-IDF cosine similarity, KL and JS divergence), term frequency features (min TF, normalized min TF, stream length), position-based features (proximity score, position bias, order preservation, term clustering), and advanced features (co-occurrence score, phrase matching, rare term score, query type score, semantic coverage, title boost, document length normalization, and TF saturation). By computing R^2 scores across layers, we identify where specific IR features are most strongly represented in the model's internal activations. Our results provide insights into how neural ranking models encode and process retrieval-relevant information across their layers, contributing to the mechanistic interpretability of language models for information retrieval tasks.
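The layer-wise probing procedure described above can be sketched as follows. This is a minimal illustrative example, not the project's actual pipeline: the activations and the target IR feature below are synthetic stand-ins (small dimensions for brevity, with the feature linearly recoverable from one layer by construction), and the Ridge/R^2 setup uses standard scikit-learn components.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score

rng = np.random.default_rng(0)

# Hypothetical stand-in data: activations for 1000 query-document pairs
# across 4 layers (RankLLaMA 7B has 32), hidden size 64 for brevity.
n_pairs, n_layers, hidden = 1000, 4, 64
activations = rng.normal(size=(n_layers, n_pairs, hidden))

# Synthetic target: one IR feature (e.g., a BM25-like score), constructed
# to be linearly decodable from layer 2's activations plus small noise.
w = rng.normal(size=hidden)
feature = activations[2] @ w + rng.normal(scale=0.1, size=n_pairs)

# Layer-wise probing: fit one Ridge regressor per layer on held-out splits
# and record the test-set R^2 for each layer.
scores = []
for layer in range(n_layers):
    X_tr, X_te, y_tr, y_te = train_test_split(
        activations[layer], feature, test_size=0.2, random_state=0
    )
    probe = Ridge(alpha=1.0).fit(X_tr, y_tr)
    scores.append(r2_score(y_te, probe.predict(X_te)))

# The layer with the highest R^2 is where the feature is most strongly
# (linearly) represented.
best = int(np.argmax(scores))
print(best)
```

In the full study this loop runs over all 32 layers and each of the roughly two dozen IR features, yielding a per-feature R^2 profile across depth.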