What is interpretability?

发布时间 2024-06-03 18:22:57 来源

Episode 设置

摘要

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the Interpretability team strives to change that — to understand these models to better plan for a future of safe AI. Find out more: https://www.anthropic.com/research

GPT-4正在为你翻译摘要中......

What is interpretability?

摘要

中英文字稿