The AI Daily Brief: Artificial Intelligence News and Analysis - How the AI Price Wars Will Impact You
Episode Date: March 18, 2025China’s Baidu dropped new AI models at prices far below competitors like OpenAI and DeepSeek. Baidu's Ernie model matches GPT-4.5 at only 1% of its cost, kicking off a significant AI price war. ...Before that in the Headlines, Apple's AI strategy is officially a dumpster fire. SPECIAL OFFERTo get your ready-to-go agent from https://www.lindy.ai/ email nlw@besuper.ai with the word "LINDY" in the titleBrought to you by:KPMG – Go to https://kpmg.com/ai to learn more about how KPMG can help you drive value with our AI solutions.Vanta - Simplify compliance - https://vanta.com/nlwThe Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdown
Transcript
Discussion (0)
Today on the AI Daily Brief, the AI Price Wars heat up and we're talking about what it means for you.
Before that in the headlines, Apple confirms its AI strategy is a complete mess.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
To join the conversation, follow the Discord link in our show notes.
One of the substories we've been following for some time is just what the heck is going on with Apple when it comes to AI.
This is a company that you really would think would be excellently positioned to understand exactly how to bring
some version of AI to consumers in a way that actually met their needs, didn't require them to
buy into a bunch of AI hype, and yet they have just done absolutely nothing.
Underwhelming concept after underwhelming concept followed by delay after delay, and now
its Siri crisis seems to be deepening as a leaked all-hands meeting speaks to disarray in the
AI division.
Now, of course, at this point, the troubles behind the AI versions of Syria have been well reported,
including on this show, but comments from the company have been relatively slim.
Last week, Apple confirmed for the first time that Siri was facing major delays
and seemed to tacitly acknowledge that they've been advertising features that don't exist.
Remember, they pulled ads from YouTube.
Bloomberg's Apple Insider, Mark German, reports that Apple's senior director, Robbie Walker
recently addressed the team.
German wrote, Walker suggested that employees on his team may be feeling angry,
disappointed, burnt out, and embarrassed after the features were postponed.
The company had been racing to get the technology ready for this spring,
but now the features aren't expected until next year at the earliest.
You'll remember that AI Siri was first shown off at the Worldwide Developer Conference last June.
Bloomberg has reported that at the time, there was barely a working prototype, and the unveiling
relied entirely on a video mock-up. Walker said that the delays were especially ugly and embarrassing
due to the advertising push stating, this was not one of those situations where we get to show people
our plan after it's done. We showed people before. He acknowledged that the marketing department
had, quote, made matters worse by wanting to promote the enhancements even though they weren't ready.
And touching on the true state of the delays, Walker even raised doubt about having AI Siri
ready to ship in next year's iOS 19.
He said that even though the feature is earmarked for release in the new version of the operating
system, it, quote, doesn't mean we're shipping then.
Confirming what we already knew, Walker stated that the delays are due to quality issues
with Siri's new features, quote, that resulted in them not working properly up to a third
of the time.
Citing new hardware and software initiatives, the AI team is working on simultaneously,
Walter commented that, we have other commitments across Apple to other projects.
We want to keep our commitments to those, and we understand those are now potentially more timeline
urgent than the features that have been deferred.
He said the decisions will be made on a case-by-case basis, implying that AI Siri is not the number
one priority.
Finally, touching on responsibility for the botched rollout, Walker insisted that there was, quote,
intense personal accountability, shared by his boss, John Gionandria, the head of AI
at Apple, as well as software chief, Craig Frederigi and other executives.
German wrote, as a Friday, Apple doesn't plan to immediately fire any top executives over
the AI crisis, according to people with knowledge of the matter.
That decision could theoretically change in any time.
In any case, the company is poised to make management adjustments.
It is discussed moving more senior executives under Gianondria to assist with a turnaround effort.
In a sign of just how far Apple has fallen, the leading Apple blog for the last 20 years at this point,
Daring Fireball, published on Wednesday a piece called Something is Rotten in the state of Cupertino.
He said, in the two decades I've been on this racket, I've never been angrier at myself
for missing a story than I am about Apple's announcement on Friday that the more personalized
Siri features of Apple Intelligence scheduled to appear between now and WWDC would be delayed
until the coming year. I should have had my head examined. Now, John Gruber, the author of this,
is raising red flags that something has gone deeply wrong at the company. He referenced a famous
meeting from 2008 when Steve Jobs berated engineers on why an early iPhone email client doesn't
work properly. Direct quotes from jobs included, you've tarnished Apple's reputation, and you
should hate each other for having let each other down. Gruber wrote, Tim Cook should have already
held a meeting like that to address and rectify the Syrian Apple Intelligence debacle. If such a meeting
hasn't yet occurred or doesn't happen soon, then I fear that's all she wrote. The ride is over.
When mediocrity, excuses and BS take root, they take over. A culture of excellence, accountability,
and integrity cannot abide the acceptance of any of those things, and will quickly collapse
upon itself with the acceptance of all three. Temay S writes, I've been reading Gruber's blog for
over a decade and always saw him defend every obvious Apple flaw, a true unapologetic fanboy.
seeing this is a complete shock.
If even this doesn't prompt drastic leadership changes, I don't know what will.
Meanwhile, in another big tech house, Google is taking the plunge and replacing the Google Assistant with AI.
On Friday, Google announced that over the coming months, mobile users will be switched over to Gemini Assistant,
and the classic assistant will no longer be accessible.
They added, additionally, we'll be upgrading tablets, cars, and devices that connect to your phone,
such as headphones and watches, to Gemini.
We're also bringing a new experience powered by Gemini to home devices like speakers, displays, and TVs.
Google notes there, quote, continuing to focus on improving the quality of the day-to-day Gemini
experience, especially for those who have come to rely on Google Assistant.
Now, in some ways, this was inevitable, but it's still noteworthy relative to the other
companies in the space. Neither Amazon or Apple have even shipped an AI-driven version of their
assistant, let alone have the confidence to remove a legacy product from customers.
David Barnard writes, as a recovering Apple apologist, I get the Apple has high standards and
won't ship LLM Siri until its ready argument, but I got to call BS on that.
Siri is a terrible product that we've all been complaining about for over a decade.
The bar is low. Google is making the right choice here, pushing to the future even though Gemini isn't perfect.
It will get better quickly with hundreds of millions of people using it daily.
And for a bunch of things, it's already quite good.
It would be funny to see Googling into how far ahead they are with IMAX style ads making fun of how dumb Siri is compared to Gemini.
So there you have it. It's a mess in Cupertino.
Maybe a little bit better in Mountain View.
And all of us are just watching it play out.
For now that that's going to do it for the AI Daily Brief Headlines edition, next up the main episode.
Today's episode is brought to you by Vanta.
Trust isn't just earned, it's demanded.
Whether you're a startup founder navigating your first audit or a seasoned security professional scaling your GRC program,
proving your commitment to security has never been more critical or more complex.
That's where Vanta comes in.
Businesses use Vanta to establish trust by automating compliance needs across over 35 frameworks like SOC2
and ISO-2701.
Centralized security workflows, complete questionnaires up to 5X faster, and proactively manage vendor risk.
Vanta can help you start or scale up your security program by connecting you with auditors and experts to conduct your audit and set up your security program quickly.
Plus, with automation and AI throughout the platform, Vanta gives you time back, so you can focus on building your company.
Join over 9,000 global companies like Atlassian, Kora, and Factory who use Vanta to manage risk and prove security in real time.
For a limited time, this audience gets $1,000 off Vanta at vanta.com slash NLW.
That's V-A-N-T-A dot com slash N-L-W for $1,000 off.
We talk a lot about agents on this show.
But if you've ever thought to yourself, I don't want to talk about agents anymore.
I just want to actually build and deploy something.
I'm really excited to share something special with you today.
We've partnered with Lindy to offer companies that just want to dive
into the deep end of agents a way to get their feet wet, a way to move fast and build something
meaningful without breaking the budget. The first five companies that email me NLW at Bsuper.a.I
with Lindy and the title will have access to work with Lindy to build an actual functional
agent serving their specific needs for under $20,000. Some of the agents you can build
include a customer support agent, maybe automating responses on your website. You could build an
SDR for generating or qualifying sales leads, or you could build an agent that's perfectly suited
for your internal communications needs, be it note-taking, scheduling, or something else.
Not only is Lindy's structured to integrate with all of the places that you already keep data
and information, it's also a full extensible platform, which means as you hire more and more
agent employees and really build out your digital workforce, Lindy's going to enable those
agents to be interoperable and basically be able to work together in a seamless way. So again,
And if you are interested in diving in all the way to agents, in a matter of weeks, not months,
not years, email me, nLW at Bsuper.a.i, put Lindy in the title. And let's get your first digital
employee online. Hey, listeners, are you tasked with the safe deployment and use of trustworthy
AI? KPMG has a first of its kind AI Risk and Controls Guide, which provides a structured
approach for organizations to begin identifying AI risks and design controls to mitigate threats.
What makes KPMG's AI Risks and Controls Guide different is that it outlines practical control
considerations to help businesses manage risks and accelerate value.
To learn more, go to www.kpmG.org.us slash AI Guide.
That's www.kpmg.org slash AI Guide.
Today's topic is one that on first glance feels very day-regor for the moment we're in.
Specifically, China's Baidu has released two new AI models, which, as always, they claim,
to have performance as high as or at least close to the big American model, specifically
GPT, and to do so for cheap.
Except the cheap in this case is really cheap.
We're talking about a claim that Bidu's new Ernie model matches or exceeds the performance
of GPT 4.5 at about 1% of the price, $55 per million input tokens as opposed to $75 per million
input tokens for GPT4.5.
The other model that Bidu released Ernie X1, which is their reasoning model,
model is priced at even 50% of deepseeks, already low price. And so the sense that many people
have is that this is a serious ratcheting up of the AI price war. Today we're going to talk about
how this has been evolving and what it potentially means for a number of different groups.
Now, there is a sense, broadly speaking, that the price of AI was going to come down precipitously.
In fact, even holding aside the onslaught of Chinese models, intelligence has been getting
cheaper at a rate that far exceeds, for example, Moore's Law, which was the previous way that
the technology industry thought about the speed at which technology became less expensive.
There's a phrase that Sam Altman has been fond of,
intelligence too cheap to meter. In July, when OpenAI introduced GPT 4.0 Mini,
he pointed out that just two years earlier, the best model in the world was not only much,
much worse than the current models, but also a hundred times as much.
And of course, we've also seen, even within the big tech companies, price is a major competitive
feature. Amazon, which still hasn't exactly gotten its feet under it when it comes to its own
proprietary models, introduced its Nova Foundation family in December, and it was very clear that
part of the strategy was to compete on price. Google has also been trying to use price as a competitive
advantage. In February, when Google released its Gemini 2.0 Flash and Flashlight, again,
the major highlight was how much less expensive they were. Indeed, broadly speaking, the price of
LLMs and the intelligence they represent has been just absolutely collapsing. But of course,
all of this took on a new dimension when Deepseek launched, claiming that their model, which had very
similar performance over comparative open AI models, was trained for less than $6 million.
That revelation, unconfirmed as though it may be, has rocked the markets and hasn't really let them go.
When Deepseek R1 came out, people started calling it the Sputnik moment, launching a global race
for ever cheaper AI.
So profound was the psychological mark that Deepseek left in terms of recalibrating how people thought
about where China was in the AI competition, that everything subsequent to that has been,
is this the next deep seek moment? Last week we talked about the AI agent Manus, which many people
called China's second deep seek moment. And once again, while Manus hadn't innovated at the
foundation model level, it had created a consumer product that just seemed to beat everything
that was available here in the U.S. Now once again, we have people calling this Baidu drop
another deep seek moment. Byu writes, we've just unveiled Ernie 4.5 in Xxieck.
As a deep-thinking reasoning model with multimodal capabilities, Ernie X1 delivers performance on par with
Deep-Seek R1 at only half the price. Meanwhile, Ernie 4.5 is our latest foundation model and
new generation native multimodal model. Now, in terms of capabilities, these models have everything
you'd expect. They can analyze and summarize documents. They can solve complex problems.
But price is really what people are talking about. Now, some have pointed out that while Ernie's
X1 model is about half the cost of Deepseek's R1 reasoning model, deepseek's V's
3 non-reasoning model is still about half as much as Ernie 4.5, although both of those are, of course,
dramatically cheaper than both GBT40 and GPT4.5. In response to this, the memes were flying quickly.
With a video of a multi-car pile-up Jeffrey Townsend writes, a real-time view of investors in Gen.
AI Foundation models. Low-cost deep-seek, shocked open AI, now Baidu has released super low-cost
Ernie 4.5. China is driving the cost of AI way down. It's brutal. San DiModan writes,
China's AI firms are not only building fundamentally better models, e.g. Deepseek,
they're building fundamentally cheaper models, e.g. Baidu's. America cannot compete with this if it
continues. This is radical innovation, state-funded or otherwise. So what really are the implications
of this? Well, first of all, there's the stock market. And obviously we've seen that the emergence
of these models, and the idea that they might use much less compute to be able to get this sort of
performance threatens the narrative of companies like Nvidia, which have driven the rally for a couple
of years now. I think it's important to have a couple of caveats here. First of all, there's a lot more
going on than DeepSeek when it comes to the stock market woes. Right now, we're in a period of
extreme volatility and unpredictable futures, and markets are not just dealing with China and
AI, but also with tariffs, geopolitical realignment. So trying to parse out how much tech stock
underperformance has to do with that versus just a correction after two years of basically unfettered
up into the right is a little bit harder parsed than people might be making it seem. The other
question, of course, more structurally, when it comes to compute, is cost of inference.
The Wall Street narrative is still kind of stuck on the idea that the only use of compute is to
train new models, rather than to deliver those models in practice. The counter argument,
the one that companies like Nvidia have been making, is that the cheaper the models become,
the more people use them, the more people use the models, the more inference costs they
incur. And so the burden of the compute shifts to a different part of the stack, but still
remains. Regardless, it makes things look like a less clear bet, and that could have implications
for downstream funding as well. Now, when it comes to startup business models, it's a mixed
bag. In the short term, there's a lot that's amazing about this. Downward price pressure
means that all the startups out there can do and offer a lot more for a lot cheaper. The more
intelligence becomes available in a cost-effective way, the more startups are going to find
ways to use it. And that's a very good thing. In the long term, it could be a little bit more
challenging. If the price compression continues to be as severe as it looks, it could constrain and
limit the band of prices that startups can actually offer. One place that this will specifically
come to bear is with regard to agents. Right now, there are lots of different pricing models when
it comes to agents. People are experimenting with outcome-based pricing and generally trying to think
about things in new ways outside of the traditional SaaS model. But I would say that by and large,
they're still benchmarking it against the comparative human labor. If you have a sales or SDR agent,
the promise of that agent is that you're going to pay less than the equivalent human time would have
cost. However, in terms of how much less, companies are still benchmarking it against the human that
would have done the job before. And that still makes them pretty expensive. It seems highly likely to me
that someone is going to try to reverse this flow, and instead of pricing it on the basis of what
the comparative human time would have been, they're going to price it on the basis of the cost of
goods and have a radically cheaper price that undercuts the entire premise of that other model.
Now, once again, the countervailing pressure here is that if I'm right and in the future,
we don't just hire one agent to do the thing that a person used to do, but a thousand agents,
to do it in a totally different type of way. Maybe that all ends up in a wash. But still,
the point is that the price war will have impacts on both the startup side in terms of what they can
offer, as well as the enterprise side in terms of what they expect to buy. And then, of course,
there's the geopolitical dimension of all of this. One of the questions is, how much is this
intentional price warfare, is this China and Chinese companies doing something unsustainable,
in fact, and un-economical in order to cause harm to American competitors?
The answer is probably that it doesn't matter, as long as the companies engaging in the
price war have deep enough pockets to keep it going. Certainly the American companies aren't
loving it. In OpenAI's proposal for the USAI Action Plan, they basically argue that
Deep Seek and Chinese AI should be banned, which of course is one way to limit competition.
There are other strategies, though.
Robert Scobel, for example, wrote over the weekend,
if I were Mark Zuckerberg, I'd release a badass AI model for free and end this price for
once and for all.
Why?
Because the model that wins will collect more real-time data from all of its 3 billion users,
which will make its glasses and services better and more profitable.
And indeed, there is a sense that perhaps this is the path.
Mark Jeffrey writes,
AI smartness goes to infinity, AI price goes to free.
Open source-style vendors of add-on services and verticals built around AI win.
embodiment of AI in the world of atoms wins and becomes geometrically more valuable with intelligence
increases. There is a lot up in the air right now when it comes to the future of the business model of
AI. At this point, all that seems clear is that the business side of this is going to change
nearly as fast as the technology side. I hope if I've convinced you of anything today, it's that there
are actually meaningful implications of this to the AI you interact with, what price you pay for it,
and the opportunities that it creates for you. And so I will, of course, keep track of how these things
evolve and change over time. For now that, that's going to do it for today's AI Daily Brief.
Until next time, peace.
