Quite impressive work on LLM interpretability, with applications.https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html
QOTO: Question Others to Teach Ourselves An inclusive, Academic Freedom, instance All cultures welcome. Hate speech and harassment strictly forbidden.