The AI Daily Brief: Artificial Intelligence News and Analysis - The Biggest Trends from the AI Engineer World's Fair
Episode Date: June 7, 2025The AI Engineer World’s Fair highlighted key AI and agent world shifts. Top themes: evals, tiny teams, agent swarms, and the rise of coding agents. NLW breaks down the key trends and the alpha that ...exists in the program. Get Ad Free AI Daily Brief: https://patreon.com/AIDailyBriefBrought to you by:KPMG – Go to https://kpmg.com/ai to learn more about how KPMG can help you drive value with our AI solutions.Blitzy.com - Go to https://blitzy.com/ to build enterprise software in days, not months AGNTCY - The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at agntcy.org Vanta - Simplify compliance - https://vanta.com/nlwPlumb - The automation platform for AI experts and consultants https://useplumb.com/The Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdownInterested in sponsoring the show? nlw@breakdown.network
Transcript
Discussion (0)
Today on the AI Daily Brief, the most important trends coming out of the AI engineer world's fair.
And before that, in the headlines, June's fastest growing software vendors are all agent companies.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
All right, friends, quick announcement section.
First of all, thanks to today's sponsors, blitzie.com, plumb, vanta, and agency.org.
To get an ad-free version of the show, which starts at just $3 a month go to patreon.com.
slash AI Daily Brief. And other housekeeping reminders, one, we're starting to do a big sponsorship
push for the fall. There are surprisingly only a few slots left. So if you are interested, shoot me a note
at NLW at breakdown.network with the word sponsor in the subject. And I'm excited to see the cool things
you guys are building. One of the things that I love about the sponsors on this show is that they are
always interesting, dynamic, and just building genuinely awesome things. But with that, let's get into
today's headlines. Welcome back to the AI Daily Brief Headlines edition. All the
daily AI news you need in around five minutes. Every month, Ramp, which is a corporate card and
bill pay platform, processes billions of dollars of business expenses and uses that information as a way
to see what types of software is trending. One of the interesting stats they look at is which
software vendors are the fastest growing. And when it comes to customer growth right now,
it is all agent companies. Ramp's top five fastest growing software vendors for June 2025 were
in this order, Google One, Anthropic, Descript, N8N, and Lindy. About Google, they write,
Google One, a Google subscription targeting consumers, launched AI Pro and AI Ultra last month,
driving new subscriptions, and for the first time placing Google on our top vendors list.
Google's Gemini model still lags OpenAI and Anthropic in business adoption, according to our
latest Ramp AI index, but placement on this list suggests businesses are starting to take
advantage of Google's latest 2.5 Pro models, which are popular with coders. Now, of course,
you're familiar with Anthropic, Descript the reason that I think also counts as an agent company
is that their big push is basically to become like a cursor for video. Their integrated AI tool is
called Underlord, and it can do everything from auto-detecting ums and ahs and you knows and likes
and other vocal ticks to other more advanced agentic editing features. Maybe the most interesting
though was that N8N and Lindy, which both offer some version of automated and agentic workflow
builders are in this top five as well. Of N&N, Ramp said, users tell us that N&N's greatest
strength is its customizable, including the ability to add a human review step into
agentic workflows, where of Lindy, they wrote, users told us they used Lindy to take sales
templates and customize them for individual leads to drive higher conversion rates.
Now, of course, Ramp is going to be dealing with a particular slice of the business market.
It's going to be more tech-forward organizations. And so perhaps it's not surprising that they're a little
bit more attuned to these AI tools, but to see agent builders and automated workflow builders like
NADN and Lindy appearing on this top five fastest growing is, I think, an indication that agents are
not just something for the future, but are very, very real.
Next up, poor 11 Labs choosing the absolute craziest newsday ever to try to launch a new product.
I have been wondering for some time when we were going to get a new 11 Labs model.
We've been on the same version for so long that you guys have basically.
run me out of using it for Long Read Sunday, but we now have 11v3 alpha, which they call their
most expressive text-to-speech model ever. It supports more than 70 languages, multi-speaker
dialogue, and also has a new feature called audio tags so you can say things like excited,
sighs, laughing, whispers. And people's first impression of this is really positive.
Hey, Jessica, have you tried the new 11v3? I just got it. The clarity is amazing. I can actually do
whispers now like this.
Ooh, fancy.
Check this out.
I can do full Shakespeare now.
To be or not to be.
That is the question.
Nice.
Though I'm more excited about the laugh upgrade.
Listen to this.
I'm super excited to use this new idea of kind of tags or metadata to give more
information around how the output is supposed to sound.
This gives so much more fine grain control.
So I'm super excited to get in there and play around with it.
Give 11 Lab some love, like I said, they launched into absolute chaos yesterday. Go check out the model. It's 80% off for June. Again, no sponsorship, no shell. I just like the company. Obviously, I use their tools for things like Long Read Sunday. So I'm excited to check out V3, and I think you should go check it out as well. Some funding and performance news. Curser has apparently crossed the 500 million ARR mark, which is up 2.5x for March. Bloomberg writes that their latest round value the company at 9.9 billion.
Finally, one more startup that I'm excited to try that has a ton of buzz right now is Higgsfield.
The company has gone from zero to 11 million ARR in just eight weeks, and part of why its video
generation tools are so popular is that they offer, once again, reminiscent of what we just
saw with 11 Labs v3, the ability to control camera angles, to create consistent characters,
and to use more cinematic shots, meaning that people are actually using it to go create
ads right out of the gate.
Higgsfield represents a new generation of startups that are not just thinking about model performance
in general, but are actually building tooling for specific use cases to try to capture that
application layer that we keep talking about. So again, if you are doing anything with video
generation, go check out Higgsfield. You're going to be hearing a lot more about them if for
no other reason than they are just rocketing right now when it comes to their business.
For now, though, that is going to do it for today's AI Daily Brief Headlines edition.
Next up, the main episode.
This episode is brought to you by Blitzy.
If you're a technology leader, here's something that probably sounds familiar.
Your organization's competitive edge is buried in legacy code that desperately needs modernization,
but the resources required feel out of reach.
That was the case for a global investment analysis firm.
They needed to migrate 70,000 lines of complex MATLAB financial algorithms to Python.
Algorithms that drive investment decisions for trillions in assets.
Their estimate?
Months of high-cost specialized engineering work.
Instead, they partnered with Blitzy.
Blitzie's autonomous AI preserved mathematical precision and generated over 80% of the new codebase,
completing the migration with just five days of engineering time.
They cut the timeline by 95% and saved 880 engineering hours.
If your organization is facing similar modernization challenges,
visit blitzy.com to schedule a consultation and discover how AI power development can
transform your technical capabilities.
Today's episode is brought to you by Plum.
If you build agenetic workflows for clients or colleagues, you need to check out
Plum. Plum is the only AI-Native Workflow Builder on the market designed specifically for
automation consultants, with all the features you need to create, deploy, manage, and monetize
complex automations. Features like one-click updates that reach all your subscribers, user-level
variables for personalization, and the ability to protect your prompts and workflow IP.
Make your life easier, your clients happier, and your business thrive with Plum.
Sign up today at useplum.com. That's Plum with a B forward slash NLW.
Today's episode is brought to you by Vanta.
In today's business landscape, businesses can't just claim security, they have to prove it.
Achieving compliance with a framework like SOC2, ISO-27-01, HIPAA, GDPR, and more,
is how businesses can demonstrate strong security practices.
The problem is that navigating security and compliance is time-consuming and complicated.
It can take months of work and use up valuable time and resources.
Vanta makes it easy and faster by automating compliance across 35-plus frameworks.
It gets you audit ready in weeks instead of months and saves you up to 85% of associated costs.
In fact, a recent IDC White Paper found that Vanta customers achieved $535,000 per year in benefits,
and the platform pays for itself in just three months.
The proof is in the numbers.
More than 10,000 global companies trust Vanta.
For a limited time, listeners get $1,000 off at vanta.com slash NLW.
That's VANTA.com slash NLW for $1,000 off.
Today's episode is brought to you by agency, an open source collective for interagent
collaboration.
Agents are, of course, the most important theme of the moment right now, not only on this show,
but I think for businesses everywhere.
And part of that is the expanded scope of what agents are starting to be able to do.
While single agents can handle specific tasks, the real power comes when specialized agents
collaborate to solve complex problems.
However, right now there is no standardized infrastructure for these agents to do.
to discover, communicate with, and work alongside one another.
That's where agency, spelled A-G-N-T-C-Y, comes in.
Agency is an open-source collective building the Internet of Agents,
a global collaboration layer where AI agents can work together.
It will connect systems across vendors and frameworks,
solving the biggest problems of discovery, interoperability, and scalability for enterprises.
With contributors like Cisco, Crew AI, Langchain, and MongoDB,
agency is breaking down silos and building the future of interoperability.
A.I. Shape the future of enterprise innovation. Visit agency.org to explore use cases now. That's
AGMTCY.org. Welcome back to the AI Daily Brief. Today we are talking about the big
trends in the discussion among AI engineers. And here's why this is a relevant discussion for you,
even if you are not yourself, an AI engineer. Basically everything that comes next when it comes
to AI and agents is somewhere right now being conceived of, concocted, collaborated on, or
created by an AI engineer, right? This is the cohort who are not just thinking about how to use
today's technology, but about inventing the next technology to come. When it comes to the more
capable agent swarms that you'll be using in six months, the kinkter being worked out in the
rooms with the AI engineers right now. And so if you are trying to get a preview of the future,
understanding where the discourse is with AI engineers is one of your fastest paths to that.
Now, the AI Engineer World's Fair is part of the AI Engineering Summit family.
You might remember that a few months ago, I emceed their AI Engineering Summit in New York City,
and I've had Swix from Layton Space, who's one of the creators of this event on the show numerous times before,
and they have just completed their big annual World's Fair in San Francisco.
I unfortunately was not able to go this year because I have family stuff this weekend,
and I have upcoming travel next week, but I was still watching very closely from afar,
and I think that this set of content, even more than previous AI engineer summits in
world fairs, really gives you an incredibly detailed and fairly complete picture of where
the agent and AI world is headed. This was a dense three days. So much so that we even had
attendees like Ishaan Anand create their own little tools for allowing ChatGPT to go figure out
what to go to. You can see if you're watching just how densely packed things were at any given time,
there were about 10 different workshops or talks going on. And one of the best ways to try to understand
all the different areas is to look at the more than 20 different tracks they had. So in brief,
they had tracks for AI architects, AI product management, AI in action, AI in Fortune 500,
agent reliability, autonomy and robotics, design engineering, evals, general session, generative
media, graphragg, infrastructure, keynote, MCP, reasoning and REL, Retrieval and Search,
software engineering agents, security, tiny teams, vibe coding, voice, and workshops.
Now, obviously, even that is too packed to take on its own.
So I broke it into four themes that I see running throughout a bunch of these tracks
that I think broadly speak to what's going on.
Trend number one, to the surprise of no one, is agents.
They had tracks for agent reliability, software engineering agents, MCP,
which is, of course, key infrastructure for helping agents improve and take advantage
of other tools and knowledge sources.
Voice was a massive theme.
We talked about 11 Labs new release in our headlines today, and they were there at the event.
Open AI did a session about building voice agents.
There were keynotes about voice as well.
And so all in all, agents major theme for the conference across different tracks.
A second is what I'll call infrastructure and building, which honestly could in some ways
be bundled with agents, but the point here is that this is the meat of the builders part of
the conference, right?
You had tracks for MCP, for infrastructure, for retrieval and search, for security, and one for
evals, which we're going to come back to in a little bit.
One of the cool things that happened as part of the MCP track is that Anthropic actually put
out a request for startups as part of their presentation.
Their RFS included server, server, servers.
They want servers beyond dev tools.
They want sales servers, finance, legal education.
Basically, if MCP is going to help agents live up to their full possibilities, we need
servers in new domains.
Anthropic also wants to see people simplify server building.
They want both enterprise and integrate hosting, testing, and deployment tooling.
as well as automated MCP server generation.
Finally, they want to increase the AI security observability and auditing stack.
Security was a track that I found interesting because, secretly, this might be more relevant
for the Fortune 500 than the AI for the Fortune 500 track.
So much of what's holding back enterprise-grade deployments of agents and AI is issues around security,
and you saw just tons of sessions about cutting-edge thinking about this.
OpenAI did a session about safety and security for code executing agents.
There was a session about open standards and agent security, another session about
chief information security officer approved agent fleet architecture, which by the way gets
into another theme which we'll talk about in a minute, which is the shift towards thinking
about multi-agent orchestration, agent systems, agent swarms.
And anyways, if you spend any time at all on X slash Twitter, really digging into the AI
engineering community's response to this event, so many of the tweets and posts are about
the workshops in this sort of infrastructure and building mode.
Yes, the keynotes of course get a ton of attention, especially that from Greg Brockman,
but it was very clear from afar that people were there to build, and these were the places
where that was getting done.
A third theme, which I thought was really interesting, I called New Ways of Working.
So some of this is new roles, AI architects and AI product management, but one of the really
interesting subtracks was called Tiny Teams.
Now, obviously, this gets into some of the conversations that people have been having around
solopreneurs and seed-strapping and just broadly how much more you can do with smaller teams.
And many of the sessions here were from companies that were basically executing big, huge projects
with undersized teams. Gumloop did their path to be a 10-person unicorn. Gamma talked about
how small their team is and how they use agents to make that work. And of course, part of how
companies make that work is the last theme that I'll call out from these tracks, which is
agents and AI for coding. They had a vibe.
coding track as well as a software engineering agent track. And this was obviously a huge, huge focus,
given how much of what it means to be an AI engineer is changing based on this set of tooling
and capabilities. But let's hear from the man himself, Sean, better known as SWICs,
around what he thinks the big themes from the conference were. Yeah, how to do great AI PMing,
how to run a tiny team. We have a robotics track for the first time that is Tesla Optimus is speaking,
physical intelligence, Waymo.
Waymo just overtook Lyft.
Yeah, yeah, I saw that.
That already.
Voice is the hottest thing in terms of multalities.
Like everyone's sort of building with voice,
because I think it's like finally good enough.
And I think maybe the last thing I'll highlight to you
is we are also emphasizing security for the first time.
Security is like kind of a boring topic.
It's nobody really, really wants to talk about like how to secure your system,
but like they actually do now because they have real money running through their product.
So there's all that.
And then that is like roughly important, equal in size to the excitement about MCP.
And so we have an entire MCP track with the Anthropic team here.
Very cool.
Because they're nice enough to come by.
And that fills up the whole ballroom that we have.
Swix also did a mini keynote as is standard for these events.
And the slide that I saw that got the most attention was this one that I think should put
the dagger in the heart of the debate around what is or isn't an agent.
The slide reads,
the value of the AI product is in the value of the AI leverage on your effort.
Doesn't matter how agentic just increased the ratio of human input to valuable AI output.
His session was called designing AI intensive applications.
And the description read, whether you call it a workflow or an agent,
AI engineered applications are seeing user input to LLM call ratios go from 1 to 1,
i.e. chat, GBT, to 1 to 100, deep research and codex,
and even 0 to n, i.e. ambient and proactive agents.
How does AI engineering change as you build increasingly AI intensive applications?
And I think that this actually gets at one of the key themes that was underlying all of this,
which is this shift to multi-agent systems.
This was also one of the interesting segments from the keynote discussion with OpenAI co-founder Greg Brockman,
who basically argued that the AGI future doesn't look like one big AI in the sky,
but instead a panoply of specialized agents that can work together.
First of all, it's all on the table, right?
Maybe we reach a world where it's just like the AIs are so capable that we all, you know, just let them write all the code.
Maybe there's a world where that you have like one AI in the sky.
Maybe it's that you actually have a bunch of domain-specific agents that require a bunch of specific work in order to make it happen.
I think the evidence has really been shifting towards this like menagerie of different models.
And I think that's actually really exciting.
So there's actually a lot of power to be had by models that are actually able to use other models.
And so I think that that is going to open up just a ton of opportunity because, you know, we're heading to a world where the economy is fundamentally powered by AI.
We're not there yet, but you can see it right on the horizon.
And the economy is a very big thing.
There's a lot of diversity in it.
And it's also not static, right?
That I think when people think about what AI can do for us, it's very easy to only look at, well, what are we doing now and how does AI slot in and, you know, the percentage of human versus AI?
But that's not the point, right?
The point is, how do we get 10x more activity?
10x more economic output, 10x more benefit to everyone, and the barrier to entry will be lower
than ever. And so things like healthcare that you can't just, you know, it requires responsibility
to go in and think about how to do it right. Things like education where there's multiple
stakeholders, you know, the parent, the teacher, the student. Each of these requires
domain expertise, requires careful thought, requires a lot of work. And so I think that there
is going to be just like so much opportunity for people to build. And so, I'm
just so excited to see everyone in this room because that's the right kind of energy.
Beyond just Greg, there were a lot of great keynotes.
Conviction VC and fellow AI podcaster Sarah Goh made the very strong argument that the key
differentiator right now is execution capability. Product lead for Google's AI studio, Logan Kilpatrick,
not only talked about Google's triumphant year, but straight up launched their latest Gemini 2.5
pro update on the stage.
Logan's whole speech in Google's presence at this event, which was way bigger than just this one
keynote definitely shows how hard Google is competing for developers. And coming back to this theme of
coding agents and agentic IDEs, you can see in this video that it was standing room only for the keynote
with Winserve head of product engineering, Kevin Howe. So where do I think AI engineer is ahead of the
curve and you can get some specific alpha? Number one, evals. If you follow SWIX, he's been talking
about this a lot and finally had a chance to really bring it together. Just before the conference
he tweeted, after over a year of saying I need to do an evals conference, we finally have the
speakers and practitioners who lead these evals at work instead of trying to sell you on their
evals to do a dedicated evals track for the first time ever. Every AI engineer serious enough
about their product should work on their evals. Now, this is a big theme even outside this event.
Lenny Richitsky from Lenny's podcast and Lenny's newsletter just shared a long post about this
where he dumped a ton of quotes around how important this topic is.
Mary Tan saying evals are emerging is the real moat for AI startups. Kevin Wheel, OpenAI,
CPO saying writing evals is going to become a core skill for product managers. Mike Krieger,
Anthropic CPO, saying if there is one thing we can teach people, it's that writing evals
is probably the most important thing. And Greg Brockman saying evals are surprisingly often
all you need. Anyways, this is a huge topic, probably deserving of an entire show. It's something
that we've spent a ton of time on at super intelligent in terms of building evals into our agent-readiness
audit voice agent. And what tends to happen when Swicks and AI engineer put a spotlight on something
is that it tends to take a bigger share of the collective discourse after that, so I would expect to hear
a lot more about evals in the months to come. A second place where AI engineer is ahead of the
curve is definitely this tiny team theme. Now obviously they are not the only progenitors of this.
There are tons of people talking about solopreneurship and seed strapping, but bringing it together
as a discipline is, I think, new and really important. Swicks even tried to put some metrics around
this, saying there's an idea I'm trying to push of companies that have more millions in ARR than
employees. I think it's potentially a nice, simple definition for how to think about a successful
tiny team. So your revenue efficiency is so high because obviously if you pay each employee
less than a million dollars, you're probably profitable. And therefore, you don't actually
need the venture money except to points of marketing. And that's your choice. You can be profitable.
I have a six-person team making more than $40 million.
A third area where I think AI engineer is ahead of the curve
is something that we actually talked about after Microsoft Build as well,
which is that these folks are not talking about single agents and how capable they are.
They are talking about architecting agentic systems,
groups of different agents that can work together.
We obviously heard about this from Brockman a minute ago,
and there was also a product manager for AI coding at Google Labs,
who did a session called your coding agent just got cloned and your brain isn't ready.
The description reads,
Will the future engineer code alongside a single coding agent,
or will they spend their day orchestrating many agents?
Traditional development rewards synchronous focus.
This session dives into the significant mind shift required to move from sequential coding
to orchestrating parallel agents.
I think this is an absolutely massive theme.
It is a mindset shift.
It is an organizational shift.
It is an operational shift.
I've got an interview coming up in a couple of days while
traveling that will get even more into this. But basically, this AI engineer community is designing
for a world replete with agents and absolutely thinking about multi-agent systems. Now, if you have
been listening to all of this, and by the way, I have no affiliation with AI engineer. They're
not sponsoring anything. I just love what they do. One of the extra cool things is that they put
basically all of this content live for free on the web. You can go to their YouTube, which is
YouTube.com slash at AI.orgeneer, and watch all of these keynotes and many of the sessions underneath as
well. So I will conclude by saying a big congrats to SWIX and the entire team at the AI
Engineer World's Fair. For those of you who are there, let me know how it was what you think
the big thing coming out of it were and what you think people who weren't there should take away
from it. For now, though, that is going to do it for today's AI Daily Brief. Thanks as always
for listening or watching. And until next time, peace.
