The AI Daily Brief: Artificial Intelligence News and Analysis - The Claude Code Problem
Episode Date: August 16, 2025This episode examines how pricing challenges in AI coding platforms like Cursor and Claude Code reveal a fundamental shift in the software industry. While these tools currently struggle with unsustain...able economics - where users pay far less than actual compute costs - this mismatch signals AI's inevitable transition from premium software tool to essential utility infrastructure. Through analysis of emerging pricing models and the concept of "intelligence too cheap to meter," the episode explores how AI coding represents the first glimpse of a future where software operates like electricity or water - a commodity utility accessible to everyone rather than a luxury good. Brought to you by:KPMG – Discover how AI is transforming possibility into reality. Tune into the new KPMG 'You Can with AI' podcast and unlock insights that will inform smarter decisions inside your enterprise. Listen now and start shaping your future with every episode. https://www.kpmg.us/AIpodcastsBlitzy.com - Go to https://blitzy.com/ to build enterprise software in days, not months Vanta - Simplify compliance - https://vanta.com/nlwPlumb - The automation platform for AI experts and consultants https://useplumb.com/The Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdownInterested in sponsoring the show? nlw@breakdown.network
Transcript
Discussion (0)
Today on the AI Daily Brief, the Claude Code problem.
Before that in the headlines, is the White House about to take a stake in Intel?
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
Hello, friends, quick announcements before we dive into today's show.
First of all, thank you to our sponsors, Blitzy, Vanta, and Super Intelligent.
And if you are looking to avoid all of those sponsor announcements, you can head on over to
Patreon.com slash AI Daily Brief, where you can get an ad-free version for just three bucks a month.
And lastly, one more shout out to the new
KPMG series you can with AI that I am the host of. It's a seven-part series that is really designed
as a primer for those who are still just wrapping their heads around Enterprise AI. You can check it out
at www.kpmg.org.comg.com slash AI podcasts. Welcome back to the AI Daily Brief Headlines Edition,
all the daily AI news you need in around five minutes. Well, here's an interesting one. The Trump
administration is in talks to have the U.S. government take a stake in Intel. Bloomberg sources
said the deal is intended to shore up Intel's planned factory hub in Ohio. Intel had previously pledged
to turn the site into the world's largest chipmaking facility and were provisioned with government
subsidies out of the Biden administration's chips act to make it happen. However, slowing sales
over recent years has meant the facility made less and less sense and no meaningful progress has
been made. Earlier this year, reports stated that the administration was trying to put together
a TSM-led joint venture to take over parts of Intel's operations. In fact, those reports have
been popping up all year, but they've always been very murky as to exactly what the state of the
negotiations were. Still, it seemed pretty clear that TSM was being dragged to the table and wasn't
particularly keen to take on Intel's problems. But then last week, all of this got even louder when
Trump called on Intel's CEO Liputon to resign. He posted, the CEO of Intel is highly conflicted,
all caps, obviously, and must resign immediately. There is no other solution to this problem.
Now, there were no further details, but the outburst was likely related to allegations leveled by
Senator Tom Cotton earlier in the week. Cotton wrote a letter to the Intel board asking about
Tahn's previous CEO role at a chip design firm that contracted with the Chinese government.
In July, that firm agreed to a plea deal with the U.S. government to settle claims that it illegally
exported products to China between 2015 and 2021 when Tahn was CEO. By the end of last week, it emerged
that Tahn had been locked in a power struggle with Intel's board. The Wall Street Journal
reported that some board members were pushing for Intel to stop manufacturing their own chips.
It seemed that Tahn was at odds with Chairman Frank Yuri on the issue almost immediately after
he was appointed CEO back in March. Yuri, a former investment bank,
banker had already drawn up plans to sell off Intel's manufacturing business.
Ton, meanwhile, wanted to conduct a multi-billion dollar capital raise to fund a restructuring
of the business kicking off in July.
Yere and other board members reportedly blocked the fundraising as they wanted to move more slowly.
All of this is to say that the intrigue has been building for months, but there has been very
little clarity on how things were going to work out for Intel.
Now it appears that the Trump administration views Intel's manufacturing capabilities as a national
security issue.
And yet, neither the Intel board nor outside investors seem willing to put up the capital to turn around
the loss-making division. The solution of a TSM joint venture was always messy at best,
with the White House insisting they could only take a 49% share to ensure it wasn't majority
foreign-owned, and it was with all of that context that new reports came out, suggesting the
administration has discussed the prospect of taking a stake in Intel, specifically doing so in a meeting
with Ton on Monday. At this stage, we basically have no details on how a deal would be structured,
and the reporting stated that the talks are still in early stages. The WSJ painted the news as the
latest in a string of unconventional private sector interventions from Trump, obviously referring
among other things to the unprecedented 15% export charges on Nvidia and AMD products bound for
China. Now, if the Wall Street Journal was sort of indicating the irregularity of this engagement,
the New York Times was much less subtle in their critique, blaring the headline that Trump
has made himself commander-in-chief of the chip industry. Meanwhile, if you go on Twitter slash
you will variously find accusations of this being an example of fascism or, on the other hand,
socialism, just to give you a sense of how it's all going. Luke Groman had a more
measured take, trying to suss out why people are reacting differently to this than to some previous
government interventions. He writes, October 2008, the U.S. government invests 250 billion into failing
U.S. banks and Wall Street Cheers. August 2025, the U.S. government invests into U.S. defense
industrial base that is losing to China, and that just got out produced 4 to 1 in Ukraine by Russia
and Wall Street booze. Why the difference in reaction? Now, I have to say before we move on that the other
hilarious to me sub-story is that you'll remember that we talked about earlier in the week,
situational awareness. The hedge fund from 23-year-old former OpenAI staffer Leopold Aschenbrenner,
who had raised a billion and a half with no previous financial experience and was sitting on
something like 47% gains for the year. Ever since that article came out, people have been absolutely
harping on the dude for his intel calls. That's right, for his intel calls. Bucco Capital
this morning posted a video of Cam Ran freestyling with a pile of money in his hands and said,
this is Leopold Ashin Brenner today after being clowned on literally this weekend for being
neck deep in Intel calls. Wild times, man, wild times. Now, speaking of U.S. China, chip geopolitics,
the Financial Times reports that the delay of Deepseek's new model has been caused by a failed
training run on Huawei chips. The report suggests that Chinese authorities pushed Deepseek to adopt
Huawei's ascend chips rather than use Nvidia training clusters for the follow-up to their viral R1 model.
Sources said that the startup encountered persistent technical issues during the training process,
leading them to ultimately use Nvidia chips for training and Huawei for inference.
These issues say the reports were the main reason the model's launch was delayed back in May.
Industry Insiders in China said that the Chinese chips suffer from stability issues,
slower interchip connectivity, and inferior software compared to Nvidia GPUs.
Huawei reportedly had a team on site during DeepSeek's training runs,
but they still couldn't conduct a successful run on the Ascend chips.
Obviously, if accurate, this sheds new light on Chinese authorities's recent push
to force domestic firms to use Huawei chips.
If the chips aren't capable of completing advanced training runs, then obviously that's going to be a huge problem for the advancement of Chinese AI.
It also implies that the Chinese government is pushing their companies to use inferior or non-functional chips in pursuit of nationalist or protectionist goals.
Now the Chinese bots on X are out in force calling this fake news, suggesting of anything to me that it's probably more accurate than not.
This is now becoming an issue in financial reporting as well, with the president of Chinese tech giant Tencent saying that they have enough AI chips for training.
During their earnings call yesterday, Martin Lau told analysts,
we don't really have a definitive answer on the import situation of the U.S. chips.
I think there's a lot of discussion between the two governments,
and we're waiting to see exactly what comes out of that.
But from our perspective, we do have enough chips for training and continuous upgrade of our existing models,
and we also have many options for inference chips,
and we are also executing a lot of software improvement and upgrade in order to drive efficiency gains
and inference so that we can actually put more workload on the same number of chips.
If you've been paying close attention to this show all week,
you'll know that there's been a back and forth between the White House finally saying,
yes, it's okay to export these H20s, with a 15% take rate, of course. And then China's saying,
no, don't use those. So all of this is up in the air. The intrigue continues. But before we moved
in the main episode, let's bring it back to some fundraising news. One fairly big and kind of
unexpected one comes from Kohir. The company announced a half billion dollar raise at a
$6.8 billion valuation. Now, on the one hand, this is a fairly modest jump, at least by crazy
AI standards, from the five and a half billion they achieved during their last round a little over a year
ago. At the same time, the round was oversubscribed and it seems like Cohere were able to take their
pick of big strategic names. In addition to the fresh funding, Cohere added Joel Pinow as chief
AI officer. Pinow left Meta in April where she was the VP of AI research and a day-to-day leader
at their Fair Research Lab. The company also added Francois-Cadwick as CFO. Chadwick was involved in
Uber for 10 years, including serving as acting CFO in 2017. Now, what makes all this notable is that
this indicates that Cohere's second act really has some legs. The Canadian start to
startup was originally founded in 2019 and was initially involved in the foundation model and
chatbot competition. However, over the course of the last year or so, the company pivoted hard
towards deployment of on-premise AI for enterprises. The headline of their blog post reinforced
that this is the approach, saying Cohere raises $500 million at $6.8 billion valuation to accelerate
enterprise efficiency with Agentic AI. The new capital, they write, will enable us to accelerate
our efforts to make businesses and governments around the world vastly more efficient, simplifying
tedious tasks through agentic AI solutions. We aim to free up people to spend their time on the
interesting, challenging, and human parts of work, all while prioritizing data security.
This represents a security-first category of enterprise AI that is simply not being met by
repurposed consumer models. I actually think that Koh here have found a really important
and valuable niche here. The foundation model companies do not have the time, bandwidth, or appetite,
frankly, to go in and do the sort of customization work and implementation work that enterprises require
in many circumstances. At the same time,
time. The GSIs who are doing that work don't have anywhere near the type of talent and actual
technical capacity to go do that in any meaningful way as well. Enter, cohere, and there's probably
room for a couple other players in that space as well, but congrats to them, it is no mean feat to
transition a company at that stage this successfully. Lastly, today, it appears that cognition
AI is also raising a big new round. The Wall Street Journal reports that the startup has raised
$500 million at a $9.8 billion valuation. Reportedly Founders Fund is leading the round, which
more than doubles the company's valuation from back in March.
Now, Cognition most recently made a splash by acquiring the remains of windsurf after Google
Acqua hired their founder. And between that IP and this new round of money,
cognition is now armed with a ton of fresh capital to make a serious run to disrupting the
still nascent AI coding space. Now, speaking of the nascent AI coding space, that is the subject
of our main episode. So let's turn to that now.
This episode is brought to you by Blitzy, the Enterprise Autonomous Software Development Platform
with infinite code context.
Blitzy uses thousands of specialized AI agents that think for hours to understand enterprise-scale
code bases with millions of lines of code.
Enterprise engineering leaders start every development sprint with the Blitzy platform,
bringing in their development requirements.
The Blitzy platform provides a plan, then generates and pre-compiles code for each task.
Blitzy delivers 80% plus of the development work autonomously while providing a guide for the final
20% of human development work required to complete the sprint.
Public companies are achieving a 5x engineering velocity,
increase when incorporating Blitzy as their pre-I-D-E development tool,
pairing it with their coding co-pilot of choice to bring an AI-Native STLC into their org.
Blitzy is providing a limited time, 30-day free proof of concept for qualifying enterprises.
The team will provide a 5X velocity increase on a real development project in your org.
Visit blitzy.com and press book demo to learn how Blitzie transforms your STLC
from AI-assisted to AI Native.
That's BLITZY.com.
As a founder, you're moving forward.
fast towards product market fit, your next round, or your first big enterprise deal. But with AI
accelerating how quickly startups build and ship, security expectations are higher earlier than ever.
Getting security and compliance right can unlock growth or stall it if you wait too long.
With deep integrations and automated workflows built for fast-moving teams, Vanta gets you audit-ready
fast and keeps you secure with continuous monitoring as your models, infra, and customers evolved.
Fast-growing customers like Langchain, writer and cursor trusted Vanta to build a
scalable foundation from the start. And look, as someone who lives in the world of enterprise procurement,
I love how Vanta makes it easy to get compliance right. The last thing you need when you're trying
to win that big deal is to have it scuttled by something that Vanta has solved for over 10,000
companies. Go to vanta.com slash NLW to save $1,000 today through the Vanta for startups program
and join over 10,000 ambitious companies already scaling with Vanta. That's V-A-N-T-A-com
slash NLW to save $1,000 for a limited time.
If you are a regular listener, you will have heard about Super Intelligence Agent Readiness Audits at this point.
But I wanted to tell you today about the full suite of Agent Readiness products that go beyond just the initial readiness report.
Over the last six months, Super Intelligence has built out an entire agent planning suite.
We help you move from discovery to planning to implementation.
After you've completed your agent readiness audits, we help you double click on your most important use cases with what we call our use case planning reports.
These reports are going to help you understand what sort of technical preparation you need to do to be ready for a use case,
what challenges you might face in implementation, and whether you should be thinking about building, buying, partnering, or some combination.
After that, you can even get a spec document in what we call our technical blueprint that gives either your developers or the developers of the partner you work with what they need to build exactly the agent that you're looking for.
If you want to learn more about superintelligence agent planning suite, we built a custom GPT to answer your questions.
just go to bit.ly slash super super agent. That's bit.l.ly slash super super agent, all one word. And if you have any
questions, the agent can even help you book an appointment with our team. Welcome back to the AI Daily Brief.
There has been an increasingly loud conversation happening in AI world about the business model that
underlines AI coding. Investor Chris Pike recently called this the cursor problem, or more specifically
he titled a blog post published as a Google Doc as he does, but in conversations with my friend
Sean, better known as Swix, from late in space, he thought a more app name for this was the Claude
problem and I tend to agree. Now, just to give you a little bit of an example, Riley Brown really
sums it up without even meaning to when he tweeted late last night, I hate how good 4.1 opus is.
So expensive, but so good. It just one-shots full apps if you tell it what you want, which is
harder than it sounds. Now, Riley should know, not only is he one of the most prominent creators
around vibe coding, they just announced that their mobile vibe coding app raised over $9 million
bucks in a seed, meaning they've spent a lot of time trying to figure out how to get the best
performance out of these different models. Now, I have touched on this particular problem in an
episode sometime in the last week, but it was specifically in the context of this post from Antonio
Garcia-Martinez, who, as I mentioned then, I like and think is very smart, but basically
my argument was that he was drawing a too lazy comparison. He wrote,
Every tech bubble is initially pumped by some extraneous source of liquidity poured into
unsustainable growth. Web2 consumer use paid for VC ads to pump MAUs. Crypto used retail tokens
to pump user rewards to inflate usage, and hence the token. AI is using VC and inflated
equity to subsidize compute costs and inflate consumer usage. Now, he was talking broadly, not just
about AI coding, but the post that he was referring to is one which we'll read in a minute
and was specifically talking about AI coding. Now, my argument was not the
that there wasn't a misalignment in the business model where people were being charged
less than the cost of the actual services that they were using. My argument was that there was a
fundamental difference in the demand profile of these industries. The implication of his comparison
is that if you took away the subsidy, people would not use nearly as much of the service. I think
that that's not true with coding. In fact, I think that the problem is that it turns out that the
appetite and demand for AI coding is effectively unlimited. It is not just a one-to-one transition
from people who are already coding, it is unlocking and opening a new market, which I believe will be
as ubiquitous as word processing in the future. But I'm getting a little bit ahead of myself here.
The thing we're going to explore today is this Claude Code problem, the mismatch between what people
are paying for AI coding and what it's actually costing companies with an eye to understanding
what it says about where we are in the AI cycle and just how worried we should be about that mismatch.
The conversation started last week when we got a string of reporting suggesting that AI coding startups
as violently high growth as they were were actually facing some financial problems.
The information got their hands on Replitt's financials, which showed their gross margins had fallen
from 36% in February to negative 14% in April.
They bounced back a little bit since then, and CEO Amjad Massad insisted that unit economics
are not a problem.
The information reported that similar issues were happening at Lovable, while TechCrunch added
that Curser and Windsor were also facing cost pressures.
Nicholas Chariere, the founder of Moka, which is a vibe coding startup and back-end hosting
solution commented in the article that, quote, margins on all of the co-gen products are either
neutral or negative.
They're absolutely abysmal.
Now, at core, the issue here is, in fact, not even the margin on paid users.
Now, many people think about this issue as vibe coding apps subsidizing tokens for their
heaviest users.
At the end of July, for example, when Anthropic rolled out new weekly rate limits, they
pointed to a very small number of users who would be impacted.
They pointed to their biggest power users as causing the problems.
saying, some of the biggest ClaudeCode
fans are running it continuously in the background 24-7.
These uses are remarkable and we want to enable them,
but a few outlying cases are very costly to support.
For example, one user consumed tens of thousands
in model usage on a $200 plan.
And when this story came out,
one of the things that we discussed on this show
was how part of the challenge when it comes specifically to AI coding
is that it's sort of a, the more you use it, the more you use it situation,
where especially with the rise of background agents,
Not only are people accomplishing more with AI coding tools, they're doing so in a way that consumes
just a radically higher amount of tokens.
This was, it appeared, running headlong into the business model.
However, there's a whole separate dimension of this, which I think is actually wildly under-discust,
which is not about the power users, but about the fact that the very small number of paid users
have to subsidize all of the free usage.
And when only a tiny fraction of people are paying for a tool, that puts a lot of pressure
on those paid users.
So let's talk about how Pace Capital investor Chris Pike thinks about the problem.
He basically argues that there are two separate things that founders need to be thinking about.
Product Market Fit and Business Model Product Fit.
Product Market Fit is the one we all know.
He defines it as users repeatedly choosing your product.
Business model product fit, on the other hand, is defined by the extraction of value being
sustainably in excess and proportional to the cost of delivering value.
Now, before we get into the rest of his explanation about this, as relating to the coding tools,
It's important to note that this is not a phenomenon that's limited to the AI era.
In April, for example, the Atlantic wrote about how millennials got cheap Ubers, cheap on-demand
delivery.
I can certainly validate this.
I was there in San Francisco as this was all happening.
And none of the stuff that I had access to back then is available today for anywhere
near the same price.
And they made the explicit connection between this and cheap AI tokens now.
The question is, of course, what happens when the subsidy ends.
And that's really a lot of what Chris is talking to.
Chris writes,
cursor has relied on a subscription model that historically allowed for unlimited use.
That's a fixed revenue variable cost setup.
Without actual discipline, pricing, segmentation, caps, and exclusions,
this type of model drifts into the same ditch that killed movie pass, oyster, and forced
class pass to retire unlimited.
The pathologies rhyme.
First, cohorts invert.
Your most profitable users churn because they use the product the least and get the
least value.
Oftentimes they can get better value from a competitor who prices them more accurately or with
less breakage.
The remaining users are those who extract more value than they pay.
Over time, older cohorts morphed towards deeply negative gross margins.
Then top line masks rod.
New larger cohorts can briefly offset the drag, hiding the deterioration in early cohorts.
Revenue grows, margin quality quietly decays.
Chris says that the lesson is if there's an operational path to positive margins and future pricing power, temporary subsidies can be a bridge.
He also argues that venture capital is precisely the right instrument to facilitate these kinds of companies.
The question he has is, does this bridge exist for cursor?
And the challenge he points out is that cursor doesn't control two critical parts of its cost of goods sold.
It doesn't control model performance frontier, i.e. what quality model its users will demand,
and it doesn't control model input and output pricing, which is what cursor pays to OpenAI, Anthropic, etc.
He continues, if cursor steps down to cheaper, weaker models, the users who care about performance will notice in churn.
Those who can tolerate weaker models can get them cheaper elsewhere.
If it stays at the frontier while keeping prices flat, the variable real cost to service their heaviest users
will explode. In an effort to combat this, cursor has been forced to raise prices and institute usage
caps leading to user outrage and churn. Any time unlimited shows up in a variable cost business,
product market fit becomes a permanently open question. Are users here for the product or for the
subsidy? Would they still use as much or at all at true marginal cost? Until cursor prices
consumption in proportion to cost, it cannot know. So that's the discussion that Chris brought up,
which I think is a really interesting and important one. There are a few things confounding this,
though. There is an entire additional dimension to the cursor problem or the Claude Code problem
that was not present with these other examples. And that is that one, the quality of the good
sold is rising dramatically, and two, the cost of the good sold is coming down precipitously.
The cost of inference with AI and just the general token cost has come down at a rate that
absolutely no one anticipated. The problem and the reason we're still having this conversation
is that demand just grows even faster. However, the other part of this is really interesting to me,
which is this question of model quality.
I think Chris is right to point out that Cursor and any other company in this space
doesn't control model performance frontier or what quality of model users demand.
And so far, what's clear, at least according to studies like Menlo Ventures' mid-year
LLM market update, is that people are not switching between models because of price considerations,
they are entirely focused on getting better performance.
In other words, right now, all indications suggest that as much as users might complain
and hem and haw and squawk on Twitter in the short term, they're going to pay what anthropic charges.
Now, part of what makes that interesting, though, is that we are only just on the other side
of where these models are good enough to actually be in production workflows for a lot of coding
tasks. In fact, for some coding tasks, they're still not good enough. If, and this is, of course,
a big if, the rate of performance continues to increase. I wonder what the situation will be in a year.
Right now, it's very clear that at least the core base of users that exist,
right now for these coding tools, want the highest performing models, period, even if they're more
expensive. In a year, when today's state-of-the-art models are actually a bit older, and incredibly
cheap in comparison to whatever the state-of-the-art is then, will people who are using this for
incredibly large enterprise-grade workloads be willing to use the models that are today, state-of-the-art,
but in the future will not be state-of-the-art? Or will they always just want whatever the newest model is?
I, of course, can't answer that with any sort of assuredness, but I do think that just using today as a
snapshot in time doesn't give us a full picture because of the fact that we have only just hit
this frontier where these models are actually good enough. Now that we are operating entirely in the
context of all of these models being fairly high performance, how is that going to change in a year's
time? In the meantime, what I think is for sure is that we are going to see lots and lots of pricing
experiments. We've already seen shifts among some of the leading competitors right now. Replit is a great
case study in this. Earlier in the year, they had been experimenting with outcome-based pricing,
charging a flat fee per task. However, as the cost of coding began to rise, they started to lose
money with that model and switched to effort-based pricing in July. This is basically a version of
usage-based pricing, with Breplit charging based on the amount of compute a task required. Now,
this is where we've gotten a lot of the hemming and hawing and squeaking and squawking that I was
just talking about, because this can create some real sticker shock. When the price of certain
types of tasks increases 4 or 5x overnight, which isn't uncommon, obviously you're going to get
a lot of people complaining. At the same time, it's clear that usage-based
pricing is going to have fundamentally more sustainability than any sort of flat fee approach.
Another pricing model that seems interesting is a complete inversion of the business model.
In May, latent space noticed that the new coding agents from OpenAI and Google were being
handed out at zero cost, just pay for inference.
Our sense is that, especially with the frontier labs launching effectively unlimited usage
plans, the competitive war in coding agents has come to the point where the norm of charging
a premium over token usage as a coding agent GPT wrapper has now flipped to offering discounts
in order to get your usage data.
And this certainly presaged what we saw in the market.
Swicks posted at the time,
the market has now bifurcated quite hard between
we will max out every limit you have pro users,
and maybe I'll use it if it's free tire kickers.
It'll be interesting to see if the maximizers win
versus the more cautious people,
or if they're enthusiasts that are high on their own supply.
Another interesting observation is that the agent part of AI agent coding platforms
is getting rapidly commoditized.
One example of this is Klein,
where users bring their own API keys
and pay for inference directly.
Speaking with latent space last month,
founder Sao Oruzwan said,
our thesis is that inference is not the business model.
We want to give the user total transparency into price.
Give them confidence in spending however much it takes to get the work done.
There's enough ROI on coding agents that people are willing to spend money to get the job done.
Now, extending this idea even further as SoftGen.
The company was founded as a weekend project, grew incredibly fast,
and was acquired earlier this year by Sheridan Erickson,
the CEO of a rising ventures,
who just relaunched the platform with a mission to have
ultra-transparent Costco-style pricing. She wrote,
When we took over SoftGen in March, my goal wasn't just to understand the explosive AI tool
space. It was to get ahead of it and to leap where others might hesitate. Many told me not to
bother. Our competitors are well-funded in orders of magnitude larger. But the game is young,
AI coding tools will soon be a utility used by a billion-plus people. We're still at the
beginning of this race. More importantly, we're in a new era, where software builds itself in minutes
and even the best products can be replicated in short order. New eras mean new rules. When product
becomes commodity what will determine the winners. SoftGen believes the answer will be a quality
we're calling radical pro-usership. Radical pro-usership means being relentlessly on the user's side
in every possible way, from price to transparency to ownership. Radical pro-usership means
rethinking the playbooks that used to drive SaaS success, subscriptions, lock-ins, high markups,
hidden fees, and simply doing things better for the user and the user only. So what that means in
practice for them is that SoftGen has done away with the free plan. Basically, their assessment
is that the free plan creates economic problems for everyone else
and makes the models underneath unsustainable and forces companies
into those ultimately user exploitative types of relationships.
Again, Costco is the example here where they have an annual membership, $33 a year,
and then what they're calling wholesale AI usage pricing.
So basically, in addition to that $33 annually,
users are paying a transparent 15% fee on top of the raw cost of their API calls.
They've also set it up so that that fee decreases over time as more users join.
Basically, Sheriston and SoftGen's prediction is not only that current business models are
indefensible, but that sophomore moats in general are going away. In their place, her bet is that
customer loyalty is going to be the key moat moving forward. The thesis is not only that AI
tokens will become a commodity, but that software itself will become a commodity through AI-enabling
software on demand. At that stage, the entire software industry changes completely. It looks less
like the big tech era and more like a utility, like your water or electricity company.
And while that may seem insane now, I don't really know that it is. Right now, all of this challenge
is basically driven by the fact that AI coding is being priced, A, like software, and B, like a luxury
good. In a world where intelligence really is too cheap to meter, it feels almost totally inevitable
that the pricing structure will end up being much more like a utility where everyone has
some reasonable access to that thing. Now, this almost gets into a political discourse around
what rights to access people have, but I don't think that's insane. I think that that's actually
going to be a big part of the political discourse. Some people like I think biology have called this
universal basic AI. The point is, what we are seeing right now with the interesting pricing
challenges and business model intrigue around cursor, Claude Code, and all these other platforms,
is the first glimpses of AI, not as software tool, but as fundamental societal utility.
Might sound crazy to you now, but come back to me in a couple years and we'll see how crazy it
sounds then. For now, that's going to do it for today's AI Daily Brief. Appreciate you listening
or watching as always. Until next time, peace.
