Invited talk at the LT3 Language and Translation Technology Team seminar (UGent): “Understanding LLMs for Science: From Benchmarks to Mechanisms.”