The AI Daily Brief: Artificial Intelligence News and Analysis - 16 Ways Enterprise AI is Changing
Episode Date: June 17, 2025Enterprise AI is evolving quickly. Budgets are rising, agents are becoming essential, and companies demand state-of-the-art AI as soon as possible. Here are 16 insights from Andreessen Horowitz’s la...test analysis on how AI transforms the enterprise.Source: https://a16z.com/ai-enterprise-2025/Get Ad Free AI Daily Brief: https://patreon.com/AIDailyBriefBrought to you by:KPMG – Go to https://kpmg.com/ai to learn more about how KPMG can help you drive value with our AI solutions.Blitzy.com - Go to https://blitzy.com/ to build enterprise software in days, not months AGNTCY - The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at agntcy.org Vanta - Simplify compliance - https://vanta.com/nlwPlumb - The automation platform for AI experts and consultants https://useplumb.com/The Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdownInterested in sponsoring the show? nlw@breakdown.network
Transcript
Discussion (0)
Today on the AI Daily Brief, 16 trends in Enterprise AI.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
Hello, friends. Quick announcements before we dive in.
First of all, thank you to today's sponsors, Blitzy, Plum, Vanta, and Agency.org.
Second, if you are looking for an ad-free version of the show,
go to patreon.com slash AI Daily Brief, where you can get it for just $3 a month.
Lastly, as sometimes happens with a big, long report like this,
this episode got a little long, so we just have a main episode. We will be back to our normal
headlines plus main format tomorrow, but for now, let's dive in. Welcome back to the AI Daily Brief.
Today we are doing one of my favorite things on this show, which is when someone comes out with a set
of predictions or observations, we get to then go through them and talk about if and how what we're
seeing both at the AI Daily Brief and it super intelligent resonate with those trends and
observations or if we're seeing something different. Today's source for our show comes from
and Driesen Horowitz, who recently posted their 16 changes to AI in the Enterprise
2025 edition.
Now, this is the second year in a row that they've done this piece.
And back in 2024, there were a few big trends.
First of all, they saw budgets for generative AI skyrocketing.
Second, enterprises had a lot of concerns around enterprise data security.
Leaders from enterprises they found weren't building models from scratch, but instead
customizing models through fine-tuning.
Purchasing decisions were being heavily influenced by the existing cloud providers,
company was working with, and enterprises citing a dearth of existing high-quality vertical applications
were building tools rather than buying. Now, what you might notice wasn't there at all was, of course,
agents. In March of 2024, yes, there was conversation about agents, but we were still pre-reasoning
models. The big coding agent companies didn't exist yet, or at least weren't doing this. And so just based
on that, of course, that represented a very different time. So what have they found this year?
Because this is the contemporary piece, we're actually going to go through,
bullet by bullet, starting with their first, budgets are bigger than expected with no signs of
slowing down.
In fact, A16 and Z found that enterprise leaders expect an average of 75% growth over the next
year in their AI spend.
Positively, the growth in spend seems driven by the discovery of an implementation of new use
cases.
As one company said, we've been mostly focused on internal use cases so far, but this
year we're focused on customer-facing Gen.
AI where spend will be significantly larger, i.e., it's not like they sat down and said,
we're going to spend a bunch more money, it's just that the use cases demanded it.
Now, the one thing that I would add to this is that it's very clear that agents are having an
outsized impact on this as well. You might remember this slide from my presentation on why your
company needs to move faster on AI. It's from PWC, and it's about the AI budget increases
that are due to agentic AI. Basically, they found that, like, 88% of organizations were increasing
their AI budget due to agents, and about three quarters of them were increasing their budget
by 10% or more. The second trend that A16Z points to is the fact that this spend is coming from
different places. Here's a really important stat. Last year, innovation budgets represented around
a quarter of Gen A.I. Spending. That is now down to just 7%. Reallocated central IT budget
jumped from 28% last year to 39% this year, so a big jump. Business unit budget went from
21% last year to 27%. Net new central IT actually was down a little from 19 to 17% of
spend, and central R&D was up from 8% to 10%. Still, the point that they take away is that
Gen AISPend is graduating two permanent budget lines, and I think that's a good way of putting it.
This also sort of reflects something that I've been noticing a lot, which is that the
intention around agent pilots is very different than it was around other Gen AI pilots that we
saw maybe a year or a year and a half ago. A lot of these pilots still had the feeling of
flare of if. Meanwhile, all of these agent pilots are really just about figuring out what's
possible now, and building infrastructure for what is presumed will be available in the future.
I think that because organizations are in the mindset of redesigning themselves around agents,
it's natural to see the budgets shift consequently to different areas.
Number three is an interesting one because it sees enterprises behaving more like consumers.
If you are a consumer of AI, especially if you're someone who's enfranchised enough to be
listening to this show, you're probably the type of person who maybe uses different models for
different use cases. You might use Anthropics Claude for coding, for example,
but use GPT 4.5 for writing and 03 for business strategy tasks. A16Z finds that enterprises are also
using multiple models, and that model differentiation is the key driver. I think the thing that comes
through here is that organizations are just getting a lot more sophisticated as they put
more reps into these actual tools. A16Z writes, it's well known, for instance, that Anthropics models
excel encoding-related tasks, but there's more nuance to this claim. Within coding, some users report
that Claude performs better for fine-grained code completion,
Gemini is stronger and higher-level systems design and architecture. A reason for that, by the way,
might be just the difference in context window and Gemini being able to interact with a bigger code base all at
once. Still, this is a really big and important observation. I think a lot of people might have
assumed that enterprises would just be locked into whatever one tool they had based on central IT,
and instead, A16Z found the number of respondents who are using five or more models is all the way
up to 37%. Number four in this trend list, while there are lots of models, there is definitely
some clear consolidation and leadership around a handful. Specifically, A16C points out that OpenAI
continues to be in the lead with overall market share, but both Google and Anthropic made big strides
over the last year. Interestingly, one of the things that they found around OpenAI's models
is that in addition to companies using GPT40 and 03, 67% of OpenAI users have also deployed non-frontier models
in production, which is not the case for Google and Anthropic. That number is 41% for Google and just 27%
for Anthropic. So basically, the companies that are using Google or Anthropic are much more likely
to be concentrated just at their highest end offering. When it comes to growth, and this one
maybe won't surprise you, Google has gained more of a share within bigger enterprises.
Interestingly, A16C points to not just the fact that big enterprises are going to be inherently
more trustful if you think about compliance and legal departments of a big company like Google,
but also that Google has made a major play around its performance to cost ratio, and given how big
the use cases are for large enterprises, that cost consideration really seems to matter.
Large enterprises are also more likely to adopt open source models like Lama and Mistral,
which I think has a lot to do with the trend towards building, which we'll talk about in a little bit.
Overall, price is really interesting.
One of the things that's happened is that closed source models, especially non-frontier models,
have come down so precipitously in cost that customers are more frequently opting for those
closed source models, given that they still get other ecosystem benefits as opposed to moving
outside to open source. One customer said, the pricing has gotten appealing and we're already
embedded with Google. We use everything from G Suite to databases and their enterprise expertise
is attractive, whereas another company put it more simply, Gemini is cheap. For those who are
interested in the long-term trajectory of open source versus closed source, the fact that closed source
is seeing costs come down so precipitously is an interesting factor in how that battle might shake out.
One big change from last year comes in trend number six, which will surprise no one who's actually
paying close attention to enterprises but is a big difference from last year, which is that
fine tuning is viewed as less necessary as model capabilities improve. And this is really simple.
Basically, instead of taking a whole bunch of training data and taking the time to fine-tune models
on that data, now that context windows are really long and the models are smarter, you can just
use what's off the shelf. One enterprise said, instead of taking the training data and parameter-efficient
fine-tuning, you just dump it into a long context and get almost equivalent results.
Given how many startups were positioning themselves as helping enterprises fine-tune models,
this is obviously a shift with a lot of financial implications.
Trend number seven is all about reasoning models.
And the TLDR is that while enterprises are still early in their testing, they're pretty
excited about their potential.
Basically, enterprises are looking at these reasoning models, and frankly, I think the agents
that they represent, as opening up a new set of use cases.
23% of enterprises right now are using OpenAI's O3 model in production.
57% of A16ZU respondents said that reasoning models are slightly accelerating their adoption.
The thing that we see all the time at Superintelligent is that every single model improvement
opens up in some way some new use case. In other words, model improvements are not just about
doing the things that you are already doing a little bit better, a little bit faster, or a little bit cheaper,
is about doing things that weren't possible before. As one executive said, reasoning models allow us to solve newer,
more complex use cases, so I anticipate a big jump in our usage. What about why enterprises are
choosing different models? A16Z observes that the buying process is increasingly similar to
traditional enterprise software procurement, complete with quote checklists and price sensitivity.
The breakout chart here is really interesting, where we see changes in how important a
particular consideration was in the buying decision. The areas that went down the most as a major
consideration, were reasoning inaccuracy and reliability, whereas the area that went up the most
is cost of ownership. And I think this reflects back to what we just saw before around the idea
that this is moving into more long-term budgets. This is no longer just about innovation.
The usage of these tools is getting more ubiquitous across the organization, and so cost
comes back as a major consideration in a way that it just didn't have to be in the innovation stage.
Tread number nine is a little in the weeds, but I still think reflects a fairly significant change.
A16Z characterizes it as hosting preferences still vary widely, though enterprises have quickly built
trust for model providers over the last year.
Basically, the TLDR on this one is that companies have gotten a lot more comfortable hosting
directly with a model provider, even if it's a newer company like OpenAI and Anthropic,
as opposed to last year when most organizations were opting to access models through a cloud
provider that they already trusted and already had a relationship wherever was possible.
The biggest reason for this shift is that leaders, quote,
want direct access to the latest model with the best performance as soon as it's available.
Early access previews are important too.
One of the things that we see as a flagged problem from enterprises all the time
is the disparity between the quality of the models that they have access to in their personal
lives and in their consumer accounts as opposed to what they have access to at work.
This is increasingly becoming an issue.
It looks, for example, like Amazon might actually ditch its internally produced AI code companion
and instead just let people use cursor because that's what employees are demanding.
We hammer on enterprises all the time that this is potentially an area where they have the most
to gain because they have so much control.
The closer that they can get to just getting state-of-the-art models,
rather than being stuck with whatever's available through their standard cloud provider,
the more effective the use cases that the deploy are going to be.
This episode is brought to you by Blitzy,
the Enterprise Autonomous Software Development Platform with Infinite Code Context.
Blitzy is used alongside your favorite coding copilot as your batch software development
platform for the enterprise seeking dramatic development acceleration on large-scale codebases.
While traditional copilots help with line-by-line completions, Blitzy works ahead of the IDEE by first
documenting your entire code base, then deploying over 3,000 coordinated AI agents in parallel
to batch build millions of lines of high-quality code. The scale difference is staggering.
Copilots might give you a few hundred lines of code in seconds, but Blitzy can generate up to
three million lines of thoroughly vetted code. If your enterprise is looking to accelerate software development,
Contact us at blitzie.com to book a custom demo or press get started to begin using the product right away.
Today's episode is brought to you by Plum.
If you build agenetic workflows for clients or colleagues, you need to check out Plum.
Plum is the only AI-Native workflow builder on the market designed specifically for automation consultants,
with all the features you need to create, deploy, manage, and monetize complex automations.
Features like one-click updates that reach all your subscribers, user-level variables for personalization,
and the ability to protect your prompts and workflow IP.
Make your life easier, your clients happier,
and your business thrive with plum.
Sign up today at useplum.com.
That's Plum with a B forward slash NLW.
Today's episode is brought to you by Vanta.
In today's business landscape,
businesses can't just claim security,
they have to prove it.
Achieving compliance with a framework like SOC2,
ISO-2, HIPAA, GDPR, and more,
is how businesses can demonstrate strong security practices.
The problem is that navigating security and compliance is time-consuming and complicated.
It can take months of work and use up valuable time and resources.
Vanta makes it easy and faster by automating compliance across 35-plus frameworks.
It gets you audit-ready in weeks instead of months and saves you up to 85% of associated costs.
In fact, a recent IDC White Paper found that Vanta customers achieve $535,000 per year in benefits,
and the platform pays for itself in just three months.
The proof is in the numbers.
More than 10,000 global companies trust Vanta.
For a limited time, listeners get $1,000 off at vanta.com slash NLW.
That's V-A-N-T-A dot com slash NLW for $1,000 off.
Today's episode is brought to you by Agency,
an open-source collective for inter-agent collaboration.
Agents are, of course, the most important theme of the moment right now,
not only on this show, but I think for businesses everywhere.
And part of that is the expanded scope of what agents are starting to be
able to do. While single agents can handle specific tasks, the real power comes when specialized agents
collaborate to solve complex problems. However, right now there is no standardized infrastructure
for these agents to discover, communicate with, and work alongside one another. That's where agency,
spelled A-G-N-T-C-Y, comes in. Agency is an open-source collective building the Internet of agents,
a global collaboration layer where AI agents can work together. It will connect systems across
vendors and frameworks solving the biggest problems of discovery, interoperability, and scalability
for enterprises. With contributors like Cisco, crew AI, Langchain, and MongoDB, agency is breaking
down silos and building the future of interoperable AI. Shape the future of enterprise innovation,
visit agency.org to explore use cases now. That's agn tcy.org.
Now one really interesting conversation that comes up pretty frequently around AI is where
moats are going to exist in the future. If technology gets commoditized, and even state-of-the-art
models everyone catches up to in a matter of weeks, where is Lock-in going to come from? And one area
that appears to be on the rise is switching costs. Turns out that switching costs are getting
higher for AI as agents turn on more complex use cases. Last year, we found that most enterprises
were designing their applications to minimize switching costs and make models as interchangeable
as possible. As a result, many enterprises treated models.
as easy-come, easy-go.
That might have worked well for simple one-shot use cases,
but the rise of agentic workflows has started making it more difficult to switch between models.
Now, this makes a ton of sense.
Every once in a while, we'll try a different model inside our process,
either for the voice agent in our agent readiness audits,
or at some part of the LLM sequence by which we process the results of those voice agent interviews.
The changes when we do that can be dramatic and in not always positive ways.
Andresen points out that agendic workflows often require multiple steps to complete,
a task, so changing one part of a model's workflow could impact all downstream dependencies.
You can see this show up in competition with the agent platforms. Every agent platform is pushing
some idea of wanting to get a bigger set of the agent use cases for their platform because
companies are going to want agent interoperability. Now, I tend to think that even with high
switching costs, the market is going to totally force interoperability on the agent platforms,
but you can definitely see that this is a consideration that enterprises have, as they
invest in a particular agentic ecosystem. Number 11 is a really interesting one. This is about how
enterprises evaluate different models. And basically, A16Z found a big rise in using external
benchmarks as a proxy. Internal benchmarks actually went down a little bit, and project by project
benchmarks went down a little bit as well. I understand why they found this, but I think that this
might be a temporary thing. The problem is I often talk about on this show is that most of these
external benchmarks are really washed. We are crowding in around the very top ends of performance,
and frankly, we just need a different, more sophisticated set of evaluations. If you take the AI
engineer summit tracks as a leading indicator of where the market is heading, I think you're going
to see a ton of work over the next year around evals and model evaluation approaches that are going
to find their way to the enterprise as well. So one prediction that I have is between May 25 and
May 26, if they do the survey again, I would anticipate a big jump up and a return to internal
benchmarks as a major consideration in how they evaluate model performance.
Next up, we get into the build versus buy paradigm, and this is a fascinating question.
So the short of what A16Z found is that as compared to last year, where you remember
from the beginning of this show, a lot of companies were building their solutions by default because
there weren't good AI applications for their particular use case, that's shifting now.
A16Z writes, we've seen a market shift towards buying third-party applications over the last 12 months as the ecosystem of AI apps has started to mature.
It is absolutely the case that vertical agents and functional agents have been the biggest startup trend of the last six months at least.
And because of that, more and more of the use cases that are common across lots of different types of enterprises are starting to have high quality off-the-shelf-ish options.
I say off-the-shelf-ish because when it comes to agents,
nothing is as off the shelf as previous sort of enterprise software that we've seen.
There's still going to be some amount of customization and a whole bunch of work to wire together
the data that makes these agents work. So even the build versus buy paradigm as a conceit is a little
bit challenged by how AI works. And to some extent, this is just a natural evolution of software.
Enterprises did a bunch of building because they didn't see good options in the market for the
applications that they wanted. But then when those applications come online,
external companies who just focus on that thing are going to be in general in a much better position
than the enterprises to maintain and update and improve these specific functional and vertical
tools as opposed to what internal developers can do. And yet, I think that actually we're going
to see a bit of a bifurcation. I've become increasingly convinced, based on the huge array
of conversations that we have at super intelligent, that we're going to see organizations,
and maybe even different parts of organizations, fall into two distinct camps. One camp is
going to be where the off-the-shelf-ish solutions are good enough. So, things,
Think CPG companies with customer service agents, a lot of the use cases of random CPGX and random CPGY
are going to be so similar that you really can have a high quality off-the-shelf-ish option
that just needs a little bit of customization and wiring in with the data of the particular
company.
However, I also think that there are going to be certain areas in industries, particularly heavily
regulated, high-value industries, think finance, health care, where the default is going to be
forever to roll your own agentic solutions. I think this is going to be made more viable by the
rise of agents that can build other agents, which is the type of capability we're seeing from
companies like emergence. And those companies are going to be focused on building the infrastructure
to allow their teams to build and share and provision agents dynamically across their organizations.
There are going to be two almost totally different types of ecosystems and buying behavior around
these different approaches, with lots of opportunities for companies on all sides.
You can definitely see a little bit of the split in how regulated versus non-regulated industries
are interacting with different use cases.
In non-regulated industries, for example, it's fairly evenly split between organizations
that are using a third-party app versus a custom solution for data analysis, whereas in regulated
industries, the vast preponderance are using a customized solution as opposed to just like
20% who are using a third-party app.
What about pricing?
This is another thing that we frequently keep an eye on here at this show.
And basically what I've said before is that I think that we are in the experimental phase,
where it's pretty clear that the traditional per seat model isn't going to hold,
but that also a lot of these first attempts at outcome-based pricing are not necessarily
going to get it right out of the gate either. And that's basically what Andries and Horowitz found
as well. Their trend 13 is that buyers are struggling with outcome-based pricing. CIOs are
concerned around lack of clear outcomes, unpredictable costs, as well as attribution.
Now, companies are still interested in usage-based or hybrid-style models for AI applications,
and basically it feels like these outcome-based experiments are still just really nascent.
Only 15% of CIO surveyed said that they preferred outcome-based models,
as compared to, for example, 21% who said that they preferred seat-based,
and 39% who said that they preferred usage-based models.
When asked with their biggest challenges with outcome-based pricing,
lack of clear-measurable outcomes, was cited by 47% of respondents,
and unpredictable and unscalable costs were cited by 36%.
Another trend is that we are starting to see certain use cases become absolutely ubiquitous in
default.
Specifically, A16C calls out software development.
They write, software development has seen a step change in adoption, driven by a perfect
storm of extremely high quality off-the-shelf apps, a significant increase in model
capabilities, relevance for a broad set of companies and industries, and a no-brainer
ROI use case.
The percentage of enterprises that had software development as an in-production use case went from
less than 40% to over 70% in just one year.
Some of the other use cases that saw big gains were enterprise search, data analysis,
data labeling, and customer service actually saw a slight decline in in-production use.
Trend 15 has to do with how different solutions and use cases are coming into the enterprise,
and the short answer is that they're being driven by their employee's external usage.
A16Z writes, much of the early growth across leading enterprise AI apps has been driven by the
prosumer market. Many CIOs noted that their decision to purchase enterprise chat GPT was driven
by, quote, employees loving chat GPT. It's the brand name they knew. I talked before about how
Amazon might be using cursor in the future, so I think that this is just going to increase.
Lastly, and this is a big one. One of the things that was extremely notable about AI when it started
is how much incumbents had an advantage that they had not had with previous tech paradigm shifts.
The pattern that we had gotten used to was that each new tech paradigm shift creates a new set of incumbents,
who start as startups and move more quickly and nimbly and eventually out-compete and become the new dominant players.
However, there were some really big factors that incumbents had when it came to AI
that put them at the center of the industry much faster than they might otherwise have been.
One of those was established trust, obviously a huge issue when it comes to sensitive data
as a key part of the success rate of this new technology.
Existing distribution was another advantage for incumbents.
And then frankly, incumbents were some of the only funders who could actually play at the
capital level needed by companies like OpenAI.
The big, multi-billion dollar investments into companies like Anthropic and OpenAI by
companies like Microsoft and Google was not driven necessarily by OpenAI and Anthropics' desire
to work with their big tech competitors right out of the gate, but instead by the fact that they needed
to raise so much money that those were basically the only games in town. However, it appears that something
might be starting to shift. The 16th and last trend on this list is that AI-native quality and speed
are starting to outpace incumbents. And this gets back to the idea that enterprises are increasingly
looking to get access to whatever the best state-of-the-art model is as fast as possible after it gets
released. And one particular area where this is showing up is software development.
there is obviously an incredible difference between some of the first-gen-AI coding assistant tools
and the new agentic coding platforms, where once you've used cursor going back to GitHub co-pilot
is just hugely problematic. Enterprises who preferred AI-native companies did so overwhelmingly
because of their faster pace of innovation. So what is the story overall? In a word, it's acceleration.
Things continue to move faster, and enterprises are adapting at a surprisingly quick rate. You'll notice
that a lot of the changes in enterprise behavior almost reflect a consumer-style mindset.
The fact that they are willing to work with multiple different models, the fact that they
want the state of the art as fast as humanly possible. All of these things run counter to
the way that you might have expected enterprises to behave, but make complete sense in context.
I would obviously anticipate between this survey and the next year's survey to see even more
transformation as agents really come online as a key factor. I would expect to see some
updated questions about agents in production. But for now,
that is the story of enterprises as A16Z finds it. Overall, highly resonant to what we're seeing
with super intelligent and a lot here that's useful to chew on if you are a company thinking about
your AI strategy. For now, though, that's going to do it for today's AI Daily Brief. Until next time,
peace.
