This A16Z podcast episode features Alexander Wang, founder and CEO of Scale AI, discussing the evolution of AI, the importance of data, and his leadership philosophy. Wang and A16Z General Partner David George delve into the three pillars of AI progress: compute, algorithms, and data. Wang positions Scale AI as a key player in fueling AI advancements through data production.
Wang characterizes the current state of language models as closing in on the end of "phase two," which is the scaling phase after the early pure research. He believes that the industry is entering a phase where research directions will diverge significantly, with breakthroughs occurring at various times across different labs. One crucial element highlighted is the shift from raw execution to innovation-powered cycles, particularly in addressing the limitations of existing data.
The conversation stresses the importance of data production as the next frontier. With easily accessible data like Common Crawl exhausted, labs are hitting a "data wall." Wang proposes a focus on data abundance, moving towards "frontier data" to enable more complex capabilities in AI. He points to the current inadequacy of AI agents, noting the lack of high-quality agent data for training models to perform tool composition tasks like humans naturally do. The solution lies in capturing more of what humans do, investing in synthetic and hybrid data, and creating data foundries to generate massive amounts of high-quality data.
Wang discusses the advantage that big tech companies have with their internal data, but also addresses potential regulatory issues in utilizing this data, especially in Europe. He highlights the financial advantages of big tech companies, allowing them to heavily invest in AI efforts with the potential for significant returns.
The discussion shifts to the market structure of the model layer. The cost of model inference has dramatically decreased, suggesting that intelligence may be becoming a commodity. Wang doubts that renting models will be a lucrative long-term business. He notes the value of businesses that provide the underlying hardware (NVIDIA) or the necessary cloud infrastructure. He notes higher quality businesses exist above the model layer with application, and a good example of this is ChatGPT.
Turning to enterprise adoption of AI, Wang observes excitement and experimentation but notes that fewer proof-of-concepts are making it to production than expected. Instead of a complete transformation, AI has so far mostly delivered efficiency gains and improvements to support functions, so in marginal areas. Wang encourages enterprises to focus on AI implementations that meaningfully boost stock prices through cost savings, efficiency gains, and better customer experiences.
He highlights the potential value of enterprise data, which is used by AI to transform current business operations. Data is valuable, and it’s hyper valuable, however there are challenges around how companies organize and leverage that data.
The conversation pivots to Wang's leadership philosophy and lessons learned from the hiring boom of 2020 and 2021. He acknowledges the mistake of assuming that more people equate to better results. He discovered that a high-performing team is delicate and difficult to scale dramatically without sacrificing quality and culture. He suggests that startups should prioritize keeping high-performing teams intact. He also advises that when hiring executives, it's important to integrate them into the existing culture and operations before making sweeping changes.
Wang discusses Scale AI's implementation of "MEI" (merit, excellence, and intelligence), emphasizing that they will hire the best candidate for each position regardless of demographics.
Finally, Wang offers his definition of Artificial General Intelligence (AGI) as when AI can accomplish 80% or more of jobs that people do purely on computers. He expects that to be a minimum of four years away.