Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 755: Managing the AI Capability Gap: AI Is More than Ready. Most Companies are Not (Start Here Series Vol 19)
Episode Date: April 14, 2026Would you show up to compete in a Formula One race in a bike? 🚴♂️Like.... you could. But you'd get smoked. Yet, that's the exact AI strategy that 99% of enterprises are going thro...ugh when it comes to AI adoption. And the studies prove that not-so-hot-take to be true. The capability gap between what today's frontier AI models can do and what enterprise companies actually use them for is jaw-dropping. So, how do you manage it? Join us for our latest Start Here Series show to find out. Managing the AI Capability Gap: AI Is More than Ready. Most Companies are Not -- An Everyday AI Chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageToday's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:AI Capability Gap Definition & UrgencyFrontier AI vs. Human Expert BenchmarksAnthropic AI Knowledge Worker Usage StudyTop 6% AI Company Adoption StrategiesFive Causes of the AI Capability GapRecursive Self-Improvement in AI ModelsAI Adoption vs. Organizational Workflow DesignClosing the AI Capability Gap with MetricsAI Automation in Professional Knowledge WorkManaging AI Risk Tiers & Process RedesignTimestamps:00:00 Understanding the AI capability gap05:32 Jack Clark on AI progress08:42 The AI adoption gap explained11:00 Start Here series introduction14:48 Evaluating AI on real tasks16:08 Rapid advancements in AI capabilities20:43 AI's impact on work and skills24:36 Latest AI models improving products27:00 AI adoption challenges and capability gaps32:07 Automating workflows and assessing risk34:19 The AI capability gap reportKeywords: AI capability gap, AI adoption, artificial intelligence, organizational adoption, business workflow automation, AI models, AI performance benchmark, industry professionals, FrontierSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist.
Transcript
Discussion (0)
This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips.
Listen daily for practical advice to boost your career, business, and everyday life.
Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio.
Just describe what you want to create and the assistant handles the rest,
orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface.
You direct the outcome.
The assistant accelerates execution.
There's been a lot of talk lately about these AI models that are scary good.
And even more chatter about how many of the brightest minds in AI and all the tech AI CEOs
feel that we've already reached artificial general intelligence and are racing toward super
intelligence.
But what does that matter to you or your company?
Because right now, those are kind of just theoretical issues.
It's kind of like when you see an email on a Friday and you look at it and you're like, yeah, that's a Monday problem.
But do you know what's scarier than some private AI model that's scary good or artificial superintelligence?
The AI capability gap. That's the scariest of all. Why? Because it's very real.
And unlike that Friday afternoon email that you're maybe going to tackle on Monday, the AI capability gap is a today problem.
And if you don't address it head on, it's going to slowly stall your company's growth.
And if your company isn't growing, the AI capability gap will undoubtedly stop your company dead in its tracks.
And this is not an exaggeration.
And the timing here is specifically urgent.
So we're going to hit Rewind today and explain the capability gap in AI and why AI is racing ahead of your company.
And we're going to show you how to slow it down a bit and catch it.
All right.
You ready?
This is part of our Start Here series on Everyday AI.
Let's get into it.
So if you are new here, well, let's just get to the big picture.
AI capability right now is outpacing business adoption.
And it has changed recently.
And the pace is too fast for any of us.
Right now, Frontier AI.
And when we talk about it.
about actual capabilities. This is not an exaggeration. Frontier AI models when used correctly,
and that's the big asterisk here, they now match or exceed human professionals on most
defined knowledge work tasks. And there's a massive gap that persists between what AI could,
in theory, automate today and what organizations are actually using it for, which is often
just topical. There's a McKinsey study that said only about 6% of organizations are generating
meaningful profits despite widespread AI adoption. And the bottleneck is not actually the AI models.
You could have made that argument maybe mid-2025, but not anymore in 2026. It's actually about
organizational adoption, workflow design, and training and education. So here's what we're going to
tackle on today's show. So stick with me for the next. I'm going to try to make this one 25 minutes.
We'll see. All right. Stick with me for the next.
22-ish minutes, and you're going to learn the benchmark proving that AI outperforms human experts,
and it's not even close.
You're going to know more about Anthropics' new-ish study, revealing most knowledge workers barely touch AI's true automation potential.
You're going to learn why the AI capability gap only started to show itself over the past few months.
It's actually kind of new.
And what the top 6% of AI performing companies actually do that everyone else completely ignores.
All right.
Welcome to Everyday AI.
And this is our Start Here series.
This is the essential podcast series to both learn the AI basics and for AI experts to double down.
All right.
I started this thing because after 750 plus episodes, everyone always said, Jordan, great
podcast.
Right.
When they found it, they're like, where do I start?
I have no clue.
And I'm like, I also have no clue.
So that's why I started these Start Here series.
It's best if you are brand new here, start an order.
All right.
These are faster, you know, podcasts, usually about 20.
25 to 35 minutes. But if you listen on 2X, they're even half that, right? But this is a great way to listen from
one to now volume 19. And then also make sure to go to start here series.com. That's going to give
you free access to our exclusive inner circle community. You can't find access right now.
Any gap report card. All right. So a little bit more on that at the end. But trust me,
you are going to want to repose today's episode. I'm just saying. All right. And if you miss our
last start here series in volume 18, and this was episode 750, we went over the vibe coding boom,
why vibe coding isn't going away and how it's both good and bad. So make sure to go check that
one out. But now let's talk about how you can actually manage the AI capability gap and why AI
is more than ready and companies are not. All right. And I'm not the only one talking about this.
I've been talking about this now since I think late 20, 25, but a lot of
of super smart people are starting to dive in deep on the AI capability gap. I really like what
Jack Clark said. He is the Anthropic co-founder of Anthropic. And here's what he said in a post
a couple of weeks ago. He said most of AI progress has this flavor. If you have a bit of
intellectual curiosity and some time, you can very quickly shock yourself with how amazingly
capable modern AI systems are. But you need to have that magic combination combination of time
and curiosity. And otherwise, you're going to consume AI like most people do as a passive viewer
of some unremarkable synthetic slop content. Or at best, just asking your LLM of choice how to
roast a turkey and keep it moist. Or Tony Box lights spinning, but not playing music. What do I do?
And all the amazing advancements are mostly hidden from you. All right. So that's what Jack Clark
said in a kind of viral post that he had a couple of
the weeks ago on X.
And I think this is very telling.
And the two things that he talked about, I think are very true.
All right.
To start, well, first to even realize the AI capability gap, let alone close it, you have
to be extremely curious.
And the reason why I say that is because if you work, how you've been working for the
last few decades, you're not going to discover or your team, your organization is not
truly going to discover that capability gap because AI is not meant for the average knowledge worker,
right? That's why I talk about all the time how I hate, absolutely hate the concept of upskilling
because AI is not something that you sprinkle on top of a seasoned knowledge worker, right? Like myself,
I've been working full time for 20-ish years, right? I can't just sprinkle AI on the top. I have to
unlearn, right? So you do have to have the time and you have to be curious on, hey, what does my role
look like if I completely started over.
And if I kind of forgot everything that got me to the point I am in my career, and that's
what you have to do.
And you also have to have to have a lot of time and you have to devote a lot of time to it.
Olivia Moore from A16C talked about, she said, Open AI dropped a state of enterprise
reports across a million customers.
This was a couple of months ago.
But her response, she said, the gulf between AI power users and everyone else is wide.
The 95th percentile user spends six cents six times more messages.
than the median with coding, writing, and analysis showing the biggest gaps.
So, yeah, those people that are power users, well, because they actually understand the capability
gap, they're using AI all the time.
This is why myself, and I'm not saying this is like some weird flex.
I'm saying this because I'm part of this, you know, power user group.
When I say that I use AI for 10 to 12 hours a day, right, that's how much I'm working.
You know, it's not like I'm working 16 hours a day.
I am only working in AI every single day.
No matter where I am, I'm using AI from beginning to end in literally every single step in between.
And it has completely reshaped my workflow.
Kevin Ruse, New York Times columnist, said this.
And I really like how he put this.
He said, I follow AI adoption pretty closely.
And I have never seen such a yawning inside, outside gap.
People in SF in San Francisco are putting multi-agent cloud store,
in charge of their lives, consulting chatbots before every decision,
why are heading to a degree only sci-fi writers, writers dared to imagine.
People everywhere are still trying to get approval to use co-pilot in teams
if they're using AI at all.
It's possible that the early adopter bubble I'm in has always been this intense,
but there seems to be a cultural takeoff happening in addition to the technical one,
not ideal.
And I'll say from my experience, I think there's five main reasons that the AI capability gap has really been exposed in 2026.
And number one, the models of what they can actually do, right?
I'm just going to roll through these things really quickly.
So number one, what AI models can actually do.
Number two, what business leaders think AI can do.
Number three, AI literacy.
Number four, human skill set.
And number five, AI access.
Now, let me break those five things down a little bit more in depth.
So, number one, what AI models can actually do?
A year ago, right?
So I don't know when you're listening to this start here series episode.
You know, could be in April, 2026.
You might be listening to it at the end of 2026.
I don't know.
Right.
But a year ago in the beginning of 2025, you could, for the most part, understand what
AI models are capable of.
Today, you absolutely can't.
And it is literally getting to the point of science fiction.
All of these math problems and science problems that have been plaguing researchers for decades are now being solved.
I look at them and I have no clue.
I read the papers and I'm like, okay, I don't have any clue.
I can follow and understand it a year ago.
What AI models can actually do today is mind-boggling.
Number two, what business leaders think AI can do?
I get to talk to a lot of smart people.
Adobe just introduced an entirely new way to create, bringing the power and precision of
its creative suite into one conversational experience.
Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative
AI studio.
Powered by Adobe's creative agent, Firefly AI Assistant lets you start with your vision, just
describe what you want, and shape the outcome as it takes form with the Assistant.
The Assistant orchestrates multi-step workflows drawing on 60-plus pro-grade tools across
Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom, Exploreau,
for us and more to help bring your ideas to life.
You can also get started with creative skills, a growing library of pre-built workflows for
common creative tasks, like batch editing photos, creating mood boards, portrait retouching,
and creating social variations.
Every step the assistant takes is visible so you can refine, redirect, or take over at any
time.
You stay in the driver's seat as the creative director.
Adobe Firefly AI assistant now in public beta.
See it today at
at Firefly.adobby.com.
It's really, right?
I've had hundreds of guests on this show.
I'll say maybe a handful that I've talked to.
And I'm like, yes, this business leader fully understands what AI can do.
For the most part, and that's not their fault necessarily because most people in their role,
they have to be an expert at one application, right?
You can't be in like very few business leaders know what AI can do.
I feel fairly confident that I'm in that category, but only because this is all I do.
If I had another job, right, and I was just an AI champion at a company, you can't.
Like you literally can't.
You used to be able to.
You can't anymore because literally every single day, right, whether you're talking about
Google, Microsoft, Open AI, Anthropic, now meta's back in the conversation, perplexity,
et cetera, right?
You literally can't have an actual job where you have to produce something for a company
and still understand what AI can do.
Not anymore.
You used to be able to juggle that.
Now, unless you're literally working 20 plus hours a day, you can't do it.
All right.
Number three, AI literacy.
And this kind of goes, you know, hand in hand with, you know, number two, what business
leaders understanding.
So there's a difference between understanding AI's capabilities and then AI literacy, because
you have to actually be able to speak the language foundationally, which, again, most
people skip over that.
Right?
They go straight to the bells and the whistles.
They go straight to clicking the button, not knowing what happens, you know, from A to X.
They only want to get to Y in the result that brings on Z.
Number four is human skill set.
All right.
This is different than literacy.
You have to have the foundational understanding, but then you also have to have these skills, right?
I know a lot about basketball, but my basketball skill set not great anymore, right?
I think I peaked in eighth grade.
But you have to have that human skill set.
And then last but not least is AI access.
So that's access to the tools, to the technology in your role.
Okay.
So what can AI models actually do?
Right.
And I do want to spend a little bit more time on that because that will better explain
this gap that I think started manageable.
And now it's very hard to manage that capability gap because it has turned into a
friggin' canyon.
So the reason being is because.
AI models are better than expert humans.
All right.
I've talked about this evaluation a couple of times, but the more I follow these AI benchmarks
and all of these other things, right?
There's dozens of them.
I think there's about five that matter.
Right.
And one of them, and I think probably the most important one, is GDP Val.
So this is a benchmark from OpenAI.
And the reason why I think is the most valuable benchmark to look at is because it's about
creating business value.
So more or less, this benchmark.
benchmark evaluates AI models against real professional deliverables across 44 different high GDP
occupations. So these are things that you and I may do, you know, going in and doing research
on a fast moving market segment and then creating an artifact like a, you know, a pitch deck or
creating a spreadsheet, something like that. These are real world domain specific tasks where
both a human and an AI model from start to finish, completed a task, and then submitted their
final version to a panel of experts who are experts in that field.
And expert judges then compared the unlabeled AI and human outputs in a blind head-to-head
pairwise comparison.
And what we've seen now is, as an example, the best model for actually creating front-to-back
economically valuable outputs like a human would is the open AI's GPT 54 model and it matches or exceeds
industry professionals in 83% of these evaluations right I remember when some of these first
GDP valve numbers came out and I'm like oh that's that's pretty impressive right when we were in the
30 40% but now it's undeniable and I do assume that by the end of the year that number is
going to be like in the mid-90s, right?
Like one of my predictions is it was going to get to 80% and we're already at 80% is just
a huge jump.
That's why I keep talking about the AI capabilities are just running too fast for most
business leaders to understand and keep up.
And it's nearly doubled.
That GDP BAL score has nearly doubled in five months.
So when I talk about the last, you know, since essentially late 20, 25, I'm going to talk about
why, but it has been, we've seen more developments in the past five months than we have in the
five years prior, at least when it comes to large language models, and it's not even close.
Right. So in October, the best AI model scored on that was a 47%.
So it hasn't quite doubled, but it's nearly doubled in about five or six months.
And right now, AI handles most defined professional tasks yet.
most organizations barely use models in this way.
It's like, did you know, right, as an example, your, you know, GPT and Claude models can do this.
They can create artifacts by default.
You don't have to have anything special turn on.
It can just create spreadsheets, presentations, Word docs, et cetera, all personalized based on your company's info.
All right.
Next, I want to talk very briefly about the anthropic labor data study that came out,
about two months ago.
I did go over this in more depth.
So if you're interested in this,
go listen to episode 730.
Here's essentially, similarly,
whereas the GDP valve benchmark
looks at actual economic output,
this anthropic labor data study
looks a little bit more of the capability gap.
So that's why I think these two
kind of this benchmark,
open AI benchmark in the anthropic labor data study
in tandem, really tell a powerful story of where we're at and why this capability gap even
exists. So in this study, Anthropic mapped over 20,000 work tasks, right? So they actually
used some public jobs data from the federal government. So they mapped over 20,000 work tasks
across 800 occupations to millions of real AI conversations with their clawed chatbot. These
were anonymized, but essentially they said, okay, what are people using Claude for?
And they're matching millions of these chats to 20,000 work tasks across these 800 occupations
from this federal data.
And what they found in theory was that computer and math rolls could solve 94% of,
sorry, today's most capable AI models could automate a problem.
to 94% of computer and math tasks, right?
But they were only seeing 33% usage, right?
Because people assume, oh, if computer and math, yeah, everyone's using AI for that because
everyone knows that's the lowest hanging fruit, right?
Anything in coding software development, et cetera.
Yet, even in the area where people assume, oh, yeah, literally everyone, if you're working
in anything computer, right, software engineering, anything dev, anything with
math, of course, you're using AI.
They said, well, actually not.
Right.
And even office, admin, business, financial, and legal roles all revealed the same deep
kind of adoption shortfall.
Right.
So for our live stream audience, you can always see the video version of this on our website
at your everyday AI.com.
So I have the theoretical capability and observe usage by occupational category.
Kind of map that Anthropic put together.
Very fascinating.
But all this shows is in certain categories, right?
as management, business and finance, computer and math, you know, architecture and engineering,
legal is another high one, arts and media, right?
Like, AI's capabilities when mapped to literal the actual work tasks that the federal government
uses to define these roles, I mean, most of them have coverage in the 80 to 90 percent.
yet the observed or sorry the that's the theoretical coverage so in theory today's most powerful models
could do anywhere from 80 to 90% of the actual work right computer and math was the high one where
they had an observed coverage of a little more than 30% but these other areas are not even
20% right they're for the most part you know in the 20 some of them below 20 so you have this huge gap
where in most of these general cases, right, like management, you know, in theory, today's AI models,
if you understand the capabilities, they can automate 85 plus percent.
Yet, you don't even have a 20 percent coverage.
That is a crazy gap, right?
Where essentially you have a magic wand that if you know how to use the magic wand and you know
the magic words and you say, poof, work be gone, poof, the work is gone, but people don't know.
because it's impossible to keep up with.
And some other things that they found on this study,
which is actually the workers that are most impacted,
they said that workers most exposed to AI right now
earn 47% more in hold graduate degrees at nearly a four times rate.
So essentially, I think people early on assume that AI would displace
or would potentially have the capabilities to displace workers
who were maybe a little more junior or, you know,
weren't as high up the kind of quote unquote knowledge work totem pole, so to speak.
And what they found was the exact opposite, right?
It is those people who have those higher degrees.
And then conversely, people with physical and manual occupations registered the lowest exposure,
according to Anthropics study.
So how did we get here so quickly, right?
How do we get, as an example, from GDP Val, where literally five and a half months,
ago. It was even, right? It was a little less. It was about a 47% tie or win rate against humans,
right? It was a coin flip. On it, the off the shelf model that you can pay $20 a month for was as
good as a, you know, a human with a decade of experience who specializes in that to now,
humans aren't going to be able to compete in a couple of months, right? By blind benchmarks,
almost every single time within probably six months, humans are always going to
prefer the AI model. How do we get here so quickly? Right? Two years ago, the AI models were
not good at all. They're actually pretty bad, right? Especially, obviously we have today's
comparison to draw back on. But one of the biggest reasons I think is recursive self-improvement.
Stick with me here if you're not super technical. All right. So recursive self-improvement or RSI.
It's a concept within artificial general intelligence where essentially an AI system improves its
own source code, architecture or training data, leading to a more capable model, which then improves
itself further, creating a self-reinforcing loop. All right. So why am I talking about this kind of
strange niche concept called recursive self-improvement on something about managing the AI capability
gap? Well, because at the end of 2025, the AI company started to either directly admit or to
kind of allude to the fact that they were all now using recursive self-improvements on their
models, right? So now their quote unquote big models were good enough that it could start writing
its own code. It could start improving itself, right? I think, you know, Anthropic with their
Claude code, probably one of the most consequential products of the AI life cycle outside of
chat GPT. You could make the argument that Claude code is one of the most important outside
of chat chbteebte, right? The lead at Anthropics Claude code says, yeah, we don't write code
anymore. You know,
Claude codes,
Claude, right?
And we've seen the same inferred by different researchers at OpenAI as well.
You know, Google has been a little less direct, but they've still alluded to the fact that a lot of their models today,
not just the smaller versions that are distilled from the biggest versions, but the biggest
versions are being improved by themselves, right?
And these new big, scary models, right?
Like Anthropics, mythos, or, you know, Open AIs, whatever.
it's called, you know, spud or glacier, right?
If you're listening to this in six months, these names, these code names don't matter anymore.
But one of the reasons that maybe the general public isn't getting their hands on them is, well,
they're too compute intensive, but they are using these models internally to improve the
best consumer models and to also ship new products.
And that's why Anthropic, as an example, their ship rate in February, in March was straight up
off the charts, right? And then we found out later, well, one of the reasons was because they were
using this mythos model to put out a lot of this, a lot of these new features that we're all using
now. And this is why, right? Because a year ago, it would have taken a team, even using AI,
it would have taken teams way longer. But now that you have this kind of recursive self-improvements
or models improving themselves and building new features that use those models, right? This is why it's
hard now to keep up.
So this started, like I said, in 2025.
And ever since, large language model capabilities have far outpaced enterprise training
and learning and development.
You know, even the companies that wanted to do it right, I think they could, they could do
it, you know, in, you know, quarter two, quarter three of last year.
But now you can't, right?
unless you have an entire segment of people,
you know,
maybe 5% of your total workforce,
that literally has no deliverables.
And all they do, right,
I've said this all along,
your team needs a bunch of me's,
where all they do all day is they just scope different AI models.
They play with AI releases,
they sandbox thing.
They're not building anyone for anything.
They're just building solutions for what they think the company needs,
and then they're training those people.
But companies don't have that, right? And that's why, at least right now, it is nearly impossible to keep up.
And most companies, though, claim AI adoption, but they can't actually prove real results. And it's getting even harder and harder, right? That McKinsey study that I talked about in 2025 said that 88% of organizations are using AI, but only 6% are generating meaningful business profits. And right now, I think leadership confidence is running far ahead of what frontline practitioners report actually.
seeing on the ground. That's the thing, right? So a lot of times the people who are in charge of
AI at certain companies now because it's getting easier to build, I think maybe their hands are on
keyboard or their eyes are on monitoring agents a little bit more and they are getting removed
from what the frontline practitioners are actually experiencing. So not only is it pretty hard
to manage that gap just from a technical perspective, but from a change.
management in a people management perspective, it's getting even hard. And I think that's why some
AI capability gaps are closing, but others are still remaining stuck. So as an example, right,
AI is helping AI-enabled humans close some gaps, but many still exist and are getting worse.
So as an example, coding performance, right, this in a lot of different benchmarks,
this surge from single digits to 90% on structured benchmarks in three years in terms of what
the AI itself was capable to do. And that helps, obviously, the humans that use this as part of
their daily workflow close a big chunk of that gap individually. But you now have this, you know,
PhD level models with, you know, science and standardized math capabilities that are rapidly
approaching the ceiling of the current tests. So you aren't even necessarily able to know by the
benchmarks what the capability gap is because these benchmarks are becoming saturated. So we,
even the AI community doesn't even fully understand these models capabilities because the benchmarks, right, a lot of them have been stuck in the 90 to 95 percentile.
You know, a lot of them are at 98, 99. So I think the benchmarks themselves are getting saturated.
So even the people building the models aren't even fully aware or understand what these models are actually capable of.
So let's talk about how you can actually manage this gap and start to close it.
All right.
Right now, the top performers, right?
The top companies are investing more than 20% of their digital budgets to fundamentally rework their processes.
All right.
Let me repeat that.
20% of their digital budget.
So whatever your digital budget is, I'm guessing most companies are probably at 1%.
maybe 5%. Very few. Only the top performers are investing that 20% of their digital budget to
rework their day-to-day knowledge processes, right? Because you need to learn to separate workflows
into risk tiers with verification and human approval matched to each level. In organizations that
wait for AI to become ready, they're going to find that their competitors have already
capture the advantage. Because in the same way, right, just to draw a little parallel here,
what Anthropic, and I keep saying they've won 2026 so far, well, it's because they were first,
right? They were first to at least internally close that capability gap and put it to work, right?
And that's why they've been able to outpace their competitors. I don't know how long it'll last.
We'll see because I think the other labs have closed.
that gap and I think we'll start to see that soon. But think of that within your own organization,
right? The first to close the gap is going to be able to accelerate at a pace that we haven't seen
before, right? That's why you see, unfortunately, a lot of these big companies, you know, block as an example
cutting 40% of the workforce and they seem fairly confident that they're going to be able to
actually grow revenue because of the way they've completely reworked their organization. Actually,
Jack Dorsey had a very fascinating essay that we shared about in our newsletter about how they're essentially flipping the work pyramid on its head.
So here's what I want to have you focus on.
One metric.
Okay.
I want to make this very digestible for you and hopefully actionable as we close out today's show.
I want you to track the percentage of AI-assisted workflow steps right now that are accepted without any rework or incidents.
And here's why.
Because you probably, most organizations,
they kind of get their AI plan or their AI training for the year, right?
Or some companies, unfortunately, it's kind of like one time.
Maybe the companies that are really investing heavily, you might get it once a quarter.
But when was the last time that you used the most capable thinking models?
I'm talking about GPT-5-4 Pro.
I'm talking about Opus 4-6 with extended reasoning.
I'm talking about Gemini 3-1 Pro with a higher thinking budget, right?
When was the last time that you used those and track the percentage of your workflows that get accepted without rework or incident?
And then I want you to break that metric down by risk tier, right?
So the low risk, you know, drafting versus the high stakes legal or financial outputs.
So organizations that measure and understand that operational reliability for the high-stakes,
percentage of AI-assisted workflows that can be accepted at a lower risk.
You will be surprised that depending on what your team does, depending on what your company does,
depending on what sector you're in, you'd be surprised without too much investment, aside from
just reverse engineering your current day-to-day processes, you'd be surprised to say that
about 30 to maybe 60% of a lot of the work that many of us do, right?
I'm not talking about, you know, people in specialized role.
I'm saying if your organization has a thousand employees, you know, if you look at that work
collectively, you'd be surprised.
I would say 30 to 60%.
If you rescope everything, would fall in that can essentially be automated without rework or
incident and is in a lower risk or a medium risk tier.
And if you're not measuring that on an ongoing basis, you can't scale.
But that's step one to managing the AI capability gap.
And you have to be doing this, right, at least monthly.
Again, a year ago, you could get away with quarterly.
The way models, right, if you're using a model from last quarter, good luck, right?
That's showing up to an F1 race in a bicycle.
Good luck.
You're going to get smoked.
You don't see it a chance.
So you can no longer take these year-long pilots, these quarter-long plans.
You have to be agile to actually understand, number one, the capability gap, but to begin to manage it.
All right.
I hope this was helpful in our start here series.
Like I said, make sure to repost today's episode on LinkedIn.
Here's why we put together the AI capability gap report cards to help you know where your organization stands when it comes to, number one,
understanding this gap and number two, how you can actually tackle it.
So in this report card, it's a great guide for you and your team to go through it together
to understand the latest capabilities of all the models and how they break down for different
types of work.
All right.
So if you didn't know, yes, if you're listening on the podcast, this is actually live streamed
on LinkedIn.
So in the podcast show notes, we always put a link to today's LinkedIn.
show. So just go click repost and I will send that capability gap report card your way. All right.
Thank you for tuning in. If you haven't already, please go to start here series.com.
That's going to give you free access to the inner circle community. And you can go listen to all of the
Start Here series in order in the playlist that we have. And you can go read and listen to all of the shows
there. So thank you for tuning in. We hope to see you back tomorrow and every day for more everyday
AI. Thanks y'all. Meet Firefly AI assistant.
Now live in Adobe Firefly, the Allman One Creative AI Studio.
Just describe what you want to create in your own words and the assistant handles the rest,
orchestrating multi-step workflows across Adobe Creative Cloud apps,
including Photoshop, Premiere Express, and more in one conversational interface.
You direct the outcome while the assistant accelerates execution.
Stand control with the ability to step in and refine at any time.
See it today at firefly.adobie.com.
And that's a wrap for today's edition of every.
Day AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating.
It helps keep us going. For a little more AI magic, visit Your EverydayaI.com and sign up to
our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.
