Memory and Continual Learning: Engram's Dan Biderman and Jessy Lin

发布时间    来源
Episode 设置




摘要

Dan Biderman and Jessy Lin, co-founders of Engram, are building a neolab around memory and continual learning, which they call two sides of the same coin. Their contrarian premise: instead of stuffing ever-larger prompts into the context window or bolting on RAG, bake a team's knowledge directly into the model's weights, so it knows your company the way an employee of several years does. The payoff: matching or beating frontier models while consuming up to 100x fewer tokens. Working with partners like Microsoft, Notion, and Harvey, the team draws on roots in computational neuroscience and state-space architectures to attack what they see as the real bottleneck in AI — not raw intelligence, but memory and continual learning. In contrast to the frontier labs' race toward one ever-bigger model and AGI, Dan and Jessy imagine a world where everyone has their own model — privately trained, always learning, and good at the things you actually care about. The real ChatGPT moment for memory, they argue, is the day your model feels like an intern that genuinely got smarter overnight. Hosted by Sonya Huang and Shaun Maguire, Sequoia Capital 00:00 Introduction 00:59 Always Training Explained 01:51 Beyond Context Windows 03:29 Ngram Product Overview 04:34 Adapters And Training Signals 05:32 Internalize Vs Externalize 06:49 Compute And Token Savings 08:19 Teams First Then Individuals 08:51 Memorization Vs Understanding 12:47 Dreams And Offline Digestion 14:08 Training Beats Curation 15:19 Why Everyone Needs A Model 21:44 Bitter Lesson And Architecture 24:44 RAG Killer And KV Cache 31:38 Future Of Memory And Models

GPT-4正在为你翻译摘要中......

中英文字稿