Picture for Ruben Härle

Ruben Härle

Measuring and Guiding Monosemanticity

Add code
Jun 24, 2025
Viaarxiv icon

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Add code
Nov 11, 2024
Viaarxiv icon