The AI Daily Brief: Artificial Intelligence News and Analysis - How the AI Price Wars Will Impact You

Starting point is 00:00:00 Today on the AI Daily Brief, the AI Price Wars heat up and we're talking about what it means for you. Before that in the headlines, Apple confirms its AI strategy is a complete mess. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. One of the substories we've been following for some time is just what the heck is going on with Apple when it comes to AI. This is a company that you really would think would be excellently positioned to understand exactly how to bring some version of AI to consumers in a way that actually met their needs, didn't require them to buy into a bunch of AI hype, and yet they have just done absolutely nothing.

Starting point is 00:00:47 Underwhelming concept after underwhelming concept followed by delay after delay, and now its Siri crisis seems to be deepening as a leaked all-hands meeting speaks to disarray in the AI division. Now, of course, at this point, the troubles behind the AI versions of Syria have been well reported, including on this show, but comments from the company have been relatively slim. Last week, Apple confirmed for the first time that Siri was facing major delays and seemed to tacitly acknowledge that they've been advertising features that don't exist. Remember, they pulled ads from YouTube.

Starting point is 00:01:16 Bloomberg's Apple Insider, Mark German, reports that Apple's senior director, Robbie Walker recently addressed the team. German wrote, Walker suggested that employees on his team may be feeling angry, disappointed, burnt out, and embarrassed after the features were postponed. The company had been racing to get the technology ready for this spring, but now the features aren't expected until next year at the earliest. You'll remember that AI Siri was first shown off at the Worldwide Developer Conference last June. Bloomberg has reported that at the time, there was barely a working prototype, and the unveiling

Starting point is 00:01:43 relied entirely on a video mock-up. Walker said that the delays were especially ugly and embarrassing due to the advertising push stating, this was not one of those situations where we get to show people our plan after it's done. We showed people before. He acknowledged that the marketing department had, quote, made matters worse by wanting to promote the enhancements even though they weren't ready. And touching on the true state of the delays, Walker even raised doubt about having AI Siri ready to ship in next year's iOS 19. He said that even though the feature is earmarked for release in the new version of the operating system, it, quote, doesn't mean we're shipping then.

Starting point is 00:02:15 Confirming what we already knew, Walker stated that the delays are due to quality issues with Siri's new features, quote, that resulted in them not working properly up to a third of the time. Citing new hardware and software initiatives, the AI team is working on simultaneously, Walter commented that, we have other commitments across Apple to other projects. We want to keep our commitments to those, and we understand those are now potentially more timeline urgent than the features that have been deferred. He said the decisions will be made on a case-by-case basis, implying that AI Siri is not the number

Starting point is 00:02:41 one priority. Finally, touching on responsibility for the botched rollout, Walker insisted that there was, quote, intense personal accountability, shared by his boss, John Gionandria, the head of AI at Apple, as well as software chief, Craig Frederigi and other executives. German wrote, as a Friday, Apple doesn't plan to immediately fire any top executives over the AI crisis, according to people with knowledge of the matter. That decision could theoretically change in any time. In any case, the company is poised to make management adjustments.

Starting point is 00:03:07 It is discussed moving more senior executives under Gianondria to assist with a turnaround effort. In a sign of just how far Apple has fallen, the leading Apple blog for the last 20 years at this point, Daring Fireball, published on Wednesday a piece called Something is Rotten in the state of Cupertino. He said, in the two decades I've been on this racket, I've never been angrier at myself for missing a story than I am about Apple's announcement on Friday that the more personalized Siri features of Apple Intelligence scheduled to appear between now and WWDC would be delayed until the coming year. I should have had my head examined. Now, John Gruber, the author of this, is raising red flags that something has gone deeply wrong at the company. He referenced a famous

Starting point is 00:03:42 meeting from 2008 when Steve Jobs berated engineers on why an early iPhone email client doesn't work properly. Direct quotes from jobs included, you've tarnished Apple's reputation, and you should hate each other for having let each other down. Gruber wrote, Tim Cook should have already held a meeting like that to address and rectify the Syrian Apple Intelligence debacle. If such a meeting hasn't yet occurred or doesn't happen soon, then I fear that's all she wrote. The ride is over. When mediocrity, excuses and BS take root, they take over. A culture of excellence, accountability, and integrity cannot abide the acceptance of any of those things, and will quickly collapse upon itself with the acceptance of all three. Temay S writes, I've been reading Gruber's blog for

Starting point is 00:04:20 over a decade and always saw him defend every obvious Apple flaw, a true unapologetic fanboy. seeing this is a complete shock. If even this doesn't prompt drastic leadership changes, I don't know what will. Meanwhile, in another big tech house, Google is taking the plunge and replacing the Google Assistant with AI. On Friday, Google announced that over the coming months, mobile users will be switched over to Gemini Assistant, and the classic assistant will no longer be accessible. They added, additionally, we'll be upgrading tablets, cars, and devices that connect to your phone, such as headphones and watches, to Gemini.

Starting point is 00:04:50 We're also bringing a new experience powered by Gemini to home devices like speakers, displays, and TVs. Google notes there, quote, continuing to focus on improving the quality of the day-to-day Gemini experience, especially for those who have come to rely on Google Assistant. Now, in some ways, this was inevitable, but it's still noteworthy relative to the other companies in the space. Neither Amazon or Apple have even shipped an AI-driven version of their assistant, let alone have the confidence to remove a legacy product from customers. David Barnard writes, as a recovering Apple apologist, I get the Apple has high standards and won't ship LLM Siri until its ready argument, but I got to call BS on that.

Starting point is 00:05:23 Siri is a terrible product that we've all been complaining about for over a decade. The bar is low. Google is making the right choice here, pushing to the future even though Gemini isn't perfect. It will get better quickly with hundreds of millions of people using it daily. And for a bunch of things, it's already quite good. It would be funny to see Googling into how far ahead they are with IMAX style ads making fun of how dumb Siri is compared to Gemini. So there you have it. It's a mess in Cupertino. Maybe a little bit better in Mountain View. And all of us are just watching it play out.

Starting point is 00:05:50 For now that that's going to do it for the AI Daily Brief Headlines edition, next up the main episode. Today's episode is brought to you by Vanta. Trust isn't just earned, it's demanded. Whether you're a startup founder navigating your first audit or a seasoned security professional scaling your GRC program, proving your commitment to security has never been more critical or more complex. That's where Vanta comes in. Businesses use Vanta to establish trust by automating compliance needs across over 35 frameworks like SOC2 and ISO-2701.

Starting point is 00:06:23 Centralized security workflows, complete questionnaires up to 5X faster, and proactively manage vendor risk. Vanta can help you start or scale up your security program by connecting you with auditors and experts to conduct your audit and set up your security program quickly. Plus, with automation and AI throughout the platform, Vanta gives you time back, so you can focus on building your company. Join over 9,000 global companies like Atlassian, Kora, and Factory who use Vanta to manage risk and prove security in real time. For a limited time, this audience gets $1,000 off Vanta at vanta.com slash NLW. That's V-A-N-T-A dot com slash N-L-W for $1,000 off. We talk a lot about agents on this show. But if you've ever thought to yourself, I don't want to talk about agents anymore.

Starting point is 00:07:09 I just want to actually build and deploy something. I'm really excited to share something special with you today. We've partnered with Lindy to offer companies that just want to dive into the deep end of agents a way to get their feet wet, a way to move fast and build something meaningful without breaking the budget. The first five companies that email me NLW at Bsuper.a.I with Lindy and the title will have access to work with Lindy to build an actual functional agent serving their specific needs for under $20,000. Some of the agents you can build include a customer support agent, maybe automating responses on your website. You could build an

Starting point is 00:07:47 SDR for generating or qualifying sales leads, or you could build an agent that's perfectly suited for your internal communications needs, be it note-taking, scheduling, or something else. Not only is Lindy's structured to integrate with all of the places that you already keep data and information, it's also a full extensible platform, which means as you hire more and more agent employees and really build out your digital workforce, Lindy's going to enable those agents to be interoperable and basically be able to work together in a seamless way. So again, And if you are interested in diving in all the way to agents, in a matter of weeks, not months, not years, email me, nLW at Bsuper.a.i, put Lindy in the title. And let's get your first digital

Starting point is 00:08:27 employee online. Hey, listeners, are you tasked with the safe deployment and use of trustworthy AI? KPMG has a first of its kind AI Risk and Controls Guide, which provides a structured approach for organizations to begin identifying AI risks and design controls to mitigate threats. What makes KPMG's AI Risks and Controls Guide different is that it outlines practical control considerations to help businesses manage risks and accelerate value. To learn more, go to www.kpmG.org.us slash AI Guide. That's www.kpmg.org slash AI Guide. Today's topic is one that on first glance feels very day-regor for the moment we're in.

Starting point is 00:09:11 Specifically, China's Baidu has released two new AI models, which, as always, they claim, to have performance as high as or at least close to the big American model, specifically GPT, and to do so for cheap. Except the cheap in this case is really cheap. We're talking about a claim that Bidu's new Ernie model matches or exceeds the performance of GPT 4.5 at about 1% of the price, $55 per million input tokens as opposed to $75 per million input tokens for GPT4.5. The other model that Bidu released Ernie X1, which is their reasoning model,

Starting point is 00:09:46 model is priced at even 50% of deepseeks, already low price. And so the sense that many people have is that this is a serious ratcheting up of the AI price war. Today we're going to talk about how this has been evolving and what it potentially means for a number of different groups. Now, there is a sense, broadly speaking, that the price of AI was going to come down precipitously. In fact, even holding aside the onslaught of Chinese models, intelligence has been getting cheaper at a rate that far exceeds, for example, Moore's Law, which was the previous way that the technology industry thought about the speed at which technology became less expensive. There's a phrase that Sam Altman has been fond of,

Starting point is 00:10:23 intelligence too cheap to meter. In July, when OpenAI introduced GPT 4.0 Mini, he pointed out that just two years earlier, the best model in the world was not only much, much worse than the current models, but also a hundred times as much. And of course, we've also seen, even within the big tech companies, price is a major competitive feature. Amazon, which still hasn't exactly gotten its feet under it when it comes to its own proprietary models, introduced its Nova Foundation family in December, and it was very clear that part of the strategy was to compete on price. Google has also been trying to use price as a competitive advantage. In February, when Google released its Gemini 2.0 Flash and Flashlight, again,

Starting point is 00:11:00 the major highlight was how much less expensive they were. Indeed, broadly speaking, the price of LLMs and the intelligence they represent has been just absolutely collapsing. But of course, all of this took on a new dimension when Deepseek launched, claiming that their model, which had very similar performance over comparative open AI models, was trained for less than $6 million. That revelation, unconfirmed as though it may be, has rocked the markets and hasn't really let them go. When Deepseek R1 came out, people started calling it the Sputnik moment, launching a global race for ever cheaper AI. So profound was the psychological mark that Deepseek left in terms of recalibrating how people thought

Starting point is 00:11:39 about where China was in the AI competition, that everything subsequent to that has been, is this the next deep seek moment? Last week we talked about the AI agent Manus, which many people called China's second deep seek moment. And once again, while Manus hadn't innovated at the foundation model level, it had created a consumer product that just seemed to beat everything that was available here in the U.S. Now once again, we have people calling this Baidu drop another deep seek moment. Byu writes, we've just unveiled Ernie 4.5 in Xxieck. As a deep-thinking reasoning model with multimodal capabilities, Ernie X1 delivers performance on par with Deep-Seek R1 at only half the price. Meanwhile, Ernie 4.5 is our latest foundation model and

Starting point is 00:12:19 new generation native multimodal model. Now, in terms of capabilities, these models have everything you'd expect. They can analyze and summarize documents. They can solve complex problems. But price is really what people are talking about. Now, some have pointed out that while Ernie's X1 model is about half the cost of Deepseek's R1 reasoning model, deepseek's V's 3 non-reasoning model is still about half as much as Ernie 4.5, although both of those are, of course, dramatically cheaper than both GBT40 and GPT4.5. In response to this, the memes were flying quickly. With a video of a multi-car pile-up Jeffrey Townsend writes, a real-time view of investors in Gen. AI Foundation models. Low-cost deep-seek, shocked open AI, now Baidu has released super low-cost

Starting point is 00:13:02 Ernie 4.5. China is driving the cost of AI way down. It's brutal. San DiModan writes, China's AI firms are not only building fundamentally better models, e.g. Deepseek, they're building fundamentally cheaper models, e.g. Baidu's. America cannot compete with this if it continues. This is radical innovation, state-funded or otherwise. So what really are the implications of this? Well, first of all, there's the stock market. And obviously we've seen that the emergence of these models, and the idea that they might use much less compute to be able to get this sort of performance threatens the narrative of companies like Nvidia, which have driven the rally for a couple of years now. I think it's important to have a couple of caveats here. First of all, there's a lot more

Starting point is 00:13:41 going on than DeepSeek when it comes to the stock market woes. Right now, we're in a period of extreme volatility and unpredictable futures, and markets are not just dealing with China and AI, but also with tariffs, geopolitical realignment. So trying to parse out how much tech stock underperformance has to do with that versus just a correction after two years of basically unfettered up into the right is a little bit harder parsed than people might be making it seem. The other question, of course, more structurally, when it comes to compute, is cost of inference. The Wall Street narrative is still kind of stuck on the idea that the only use of compute is to train new models, rather than to deliver those models in practice. The counter argument,

Starting point is 00:14:20 the one that companies like Nvidia have been making, is that the cheaper the models become, the more people use them, the more people use the models, the more inference costs they incur. And so the burden of the compute shifts to a different part of the stack, but still remains. Regardless, it makes things look like a less clear bet, and that could have implications for downstream funding as well. Now, when it comes to startup business models, it's a mixed bag. In the short term, there's a lot that's amazing about this. Downward price pressure means that all the startups out there can do and offer a lot more for a lot cheaper. The more intelligence becomes available in a cost-effective way, the more startups are going to find

Starting point is 00:14:55 ways to use it. And that's a very good thing. In the long term, it could be a little bit more challenging. If the price compression continues to be as severe as it looks, it could constrain and limit the band of prices that startups can actually offer. One place that this will specifically come to bear is with regard to agents. Right now, there are lots of different pricing models when it comes to agents. People are experimenting with outcome-based pricing and generally trying to think about things in new ways outside of the traditional SaaS model. But I would say that by and large, they're still benchmarking it against the comparative human labor. If you have a sales or SDR agent, the promise of that agent is that you're going to pay less than the equivalent human time would have

Starting point is 00:15:34 cost. However, in terms of how much less, companies are still benchmarking it against the human that would have done the job before. And that still makes them pretty expensive. It seems highly likely to me that someone is going to try to reverse this flow, and instead of pricing it on the basis of what the comparative human time would have been, they're going to price it on the basis of the cost of goods and have a radically cheaper price that undercuts the entire premise of that other model. Now, once again, the countervailing pressure here is that if I'm right and in the future, we don't just hire one agent to do the thing that a person used to do, but a thousand agents, to do it in a totally different type of way. Maybe that all ends up in a wash. But still,

Starting point is 00:16:09 the point is that the price war will have impacts on both the startup side in terms of what they can offer, as well as the enterprise side in terms of what they expect to buy. And then, of course, there's the geopolitical dimension of all of this. One of the questions is, how much is this intentional price warfare, is this China and Chinese companies doing something unsustainable, in fact, and un-economical in order to cause harm to American competitors? The answer is probably that it doesn't matter, as long as the companies engaging in the price war have deep enough pockets to keep it going. Certainly the American companies aren't loving it. In OpenAI's proposal for the USAI Action Plan, they basically argue that

Starting point is 00:16:46 Deep Seek and Chinese AI should be banned, which of course is one way to limit competition. There are other strategies, though. Robert Scobel, for example, wrote over the weekend, if I were Mark Zuckerberg, I'd release a badass AI model for free and end this price for once and for all. Why? Because the model that wins will collect more real-time data from all of its 3 billion users, which will make its glasses and services better and more profitable.

Starting point is 00:17:07 And indeed, there is a sense that perhaps this is the path. Mark Jeffrey writes, AI smartness goes to infinity, AI price goes to free. Open source-style vendors of add-on services and verticals built around AI win. embodiment of AI in the world of atoms wins and becomes geometrically more valuable with intelligence increases. There is a lot up in the air right now when it comes to the future of the business model of AI. At this point, all that seems clear is that the business side of this is going to change nearly as fast as the technology side. I hope if I've convinced you of anything today, it's that there

Starting point is 00:17:37 are actually meaningful implications of this to the AI you interact with, what price you pay for it, and the opportunities that it creates for you. And so I will, of course, keep track of how these things evolve and change over time. For now that, that's going to do it for today's AI Daily Brief. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - How the AI Price Wars Will Impact You

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.