Memory and Continual Learning: Engram's Dan Biderman and Jessy Lin
发布时间 来源
Episode 设置
摘要
Dan Biderman and Jessy Lin, co-founders of Engram, are building a neolab around memory and continual learning, which they call two sides of the same coin. Their contrarian premise: instead of stuffing ever-larger prompts into the context window or bolting on RAG, bake a team's knowledge directly into the model's weights, so it knows your company the way an employee of several years does.
The payoff: matching or beating frontier models while consuming up to 100x fewer tokens. Working with partners like Microsoft, Notion, and Harvey, the team draws on roots in computational neuroscience and state-space architectures to attack what they see as the real bottleneck in AI — not raw intelligence, but memory and continual learning. In contrast to the frontier labs' race toward one ever-bigger model and AGI, Dan and Jessy imagine a world where everyone has their own model — privately trained, always learning, and good at the things you actually care about. The real ChatGPT moment for memory, they argue, is the day your model feels like an intern that genuinely got smarter overnight.
Hosted by Sonya Huang and Shaun Maguire, Sequoia Capital
00:00 Introduction
00:59 Always Training Explained
01:51 Beyond Context Windows
03:29 Ngram Product Overview
04:34 Adapters And Training Signals
05:32 Internalize Vs Externalize
06:49 Compute And Token Savings
08:19 Teams First Then Individuals
08:51 Memorization Vs Understanding
12:47 Dreams And Offline Digestion
14:08 Training Beats Curation
15:19 Why Everyone Needs A Model
21:44 Bitter Lesson And Architecture
24:44 RAG Killer And KV Cache
31:38 Future Of Memory And Models
GPT-4正在为你翻译摘要中......
