Mark Crovella's Publications

[Franco et al., 2026]

Gabriel Franco, Carson Loughridge and Mark Crovella (2026).
Singular Vectors of Attention Heads Align with Features.
In: Proceedings of ICML. Also appeared in Mechanistic Interpretability Workshop at ICML 2026. doi:

[Franco et al., 2026]

Gabriel Franco, Lucas M. Tassis, Azalea Rohr and Mark Crovella (2026).
Finding Highly Interpretable Prompt-Specific Circuits in Language Models.
Technical Report. Also appeared in Mechanistic Interpretability Workshop at ICML 2026. doi:10.48550/arXiv.2602.13483

[Franco and Crovella, 2025]

Gabriel Franco and Mark Crovella (2025).
Pinpointing Attention-Causal Communication in Language Models.
In: Proceedings of NeurIPS. San Diego, CA. Also appeared in Mechanistic Interpretability Workshop at NeurIPS 2025. doi:TBD

[Franco and Crovella, 2024]