Scaling interpretability
Published: 2024-06-13 16:06:20
Summary
Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering progress, and discuss the technical challenges they encountered in scaling our interpretability research to much larger AI models.
Read more: https://anthropic.com/research/engineering-challenges-interpretability