Shea Cardozo
Transformer Circuits: Decomposing Small Language Models
Can we understand what’s going on in Large Language Models by dissecting small ones?
Jan 9, 2023