Why NVIDIA Is Giving Away AI Models | Bryan Catanzaro

发布时间    来源
Episode 设置




摘要

NVIDIA is a chip company. So why does it put hundreds of researchers on building AI models — and then give them away for free? Bryan Catanzaro is VP of Applied Deep Learning Research at NVIDIA and one of the people whose work quietly underpins modern AI: he helped create cuDNN (NVIDIA's first deep learning product), co-invented DLSS, and named and built Megatron, the framework behind how much of the industry trains large models. Today he leads Nemotron, NVIDIA's family of open models — and Nemotron 3 Ultra, released just weeks ago, is one of the strongest open-weights models to come out of the US. Matt Turck sits down with Bryan for a genuinely deep conversation: the real business logic behind a chip company building its own models, the state of open vs. closed AI, and whether the US is falling behind China in open models. Then they go inside Nemotron itself — four-bit (NVFP4) pretraining, hybrid Mamba-Transformer architecture, mixture-of-experts, multi-token prediction, and multi-teacher distillation — all explained in plain language. Plus a rare look at how a modern AI research org actually runs, what it was like working alongside Andrew Ng and Dario Amodei at Baidu, why Bryan doesn't believe in the singularity, and his contrarian case that open AI is safer than closed. A reference conversation for anyone trying to understand where AI is really headed. Bryan Catanzaro LinkedIn - https://www.linkedin.com/in/bryancatanzaro X/Twitter - https://x.com/ctnzr NVIDIA Website - https://www.nvidia.com X/Twitter - https://x.com/nvidia Matt Turck (Managing Director) Blog - https://mattturck.com LinkedIn - https://www.linkedin.com/in/turck/ X/Twitter - https://x.com/mattturck FirstMark Website - https://firstmark.com X/Twitter - https://x.com/FirstMarkCap Listen on: Spotify - https://open.spotify.com/show/7yLATDSaFvgJG80ACcRJtq Apple - https://podcasts.apple.com/us/podcast/the-mad-podcast-with-matt-turck/id168623872 00:00 — Cold open & Intro 01:33 — Is open source AI catching the frontier? 05:29 — Do closed labs blocking distillation slow open source down? 07:42 — Is the US falling behind China? 10:30 — Why companies actually choose open models 12:39 — A "crazy" 2008 bet: machine learning on GPUs 15:33 — Working with Andrew Ng and Dario Amodei at Baidu 17:41 — Coming back to NVIDIA: DLSS and the birth of Megatron 21:55 — The real reason NVIDIA builds its own models 24:28 — Is Moore's Law really dead? 33:37 — The Nemotron family: Nano, Super, Ultra 35:09 — Built for agents: why NVIDIA bets on speed 36:02 — How you train a 550B model in 4 bits 39:25 — Hybrid Mamba-Transformer, explained simply 42:31 — Mixture of experts — and why NVIDIA built NVL72 around it 47:26 — Why a 1-million-token context window matters 49:26 — Multi-token prediction: how the model predicts 5 tokens at once 52:47 — Multi-teacher distillation: teaching one model from many 58:01 — Where reinforcement learning goes next 01:00:16 — Inside NVIDIA's research org: "the mission is the boss" 01:04:03 — How NVIDIA decides who gets the GPUs 01:10:53 — Why NVIDIA still feels entrepreneurial after 33 years 01:12:58 — Why Bryan doesn't believe in the singularity 01:17:50 — The AI backlash 01:19:18 — The controversial case: open AI is safer than closed

GPT-4正在为你翻译摘要中......

中英文字稿