Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 755: Managing the AI Capability Gap: AI Is More than Ready. Most Companies are Not (Start Here Series Vol 19)

Episode Date: April 14, 2026

Would you show up to compete in a Formula One race in a bike? 🚴‍♂️Like.... you could. But you'd get smoked. Yet, that's the exact AI strategy that 99% of enterprises are going thro...ugh when it comes to AI adoption. And the studies prove that not-so-hot-take to be true. The capability gap between what today's frontier AI models can do and what enterprise companies actually use them for is jaw-dropping. So, how do you manage it? Join us for our latest Start Here Series show to find out. Managing the AI Capability Gap: AI Is More than Ready. Most Companies are Not -- An Everyday AI Chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageToday's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:AI Capability Gap Definition & UrgencyFrontier AI vs. Human Expert BenchmarksAnthropic AI Knowledge Worker Usage StudyTop 6% AI Company Adoption StrategiesFive Causes of the AI Capability GapRecursive Self-Improvement in AI ModelsAI Adoption vs. Organizational Workflow DesignClosing the AI Capability Gap with MetricsAI Automation in Professional Knowledge WorkManaging AI Risk Tiers & Process RedesignTimestamps:00:00 Understanding the AI capability gap05:32 Jack Clark on AI progress08:42 The AI adoption gap explained11:00 Start Here series introduction14:48 Evaluating AI on real tasks16:08 Rapid advancements in AI capabilities20:43 AI's impact on work and skills24:36 Latest AI models improving products27:00 AI adoption challenges and capability gaps32:07 Automating workflows and assessing risk34:19 The AI capability gap reportKeywords: AI capability gap, AI adoption, artificial intelligence, organizational adoption, business workflow automation, AI models, AI performance benchmark, industry professionals, FrontierSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. There's been a lot of talk lately about these AI models that are scary good.
Starting point is 00:00:51 And even more chatter about how many of the brightest minds in AI and all the tech AI CEOs feel that we've already reached artificial general intelligence and are racing toward super intelligence. But what does that matter to you or your company? Because right now, those are kind of just theoretical issues. It's kind of like when you see an email on a Friday and you look at it and you're like, yeah, that's a Monday problem. But do you know what's scarier than some private AI model that's scary good or artificial superintelligence? The AI capability gap. That's the scariest of all. Why? Because it's very real.
Starting point is 00:01:32 And unlike that Friday afternoon email that you're maybe going to tackle on Monday, the AI capability gap is a today problem. And if you don't address it head on, it's going to slowly stall your company's growth. And if your company isn't growing, the AI capability gap will undoubtedly stop your company dead in its tracks. And this is not an exaggeration. And the timing here is specifically urgent. So we're going to hit Rewind today and explain the capability gap in AI and why AI is racing ahead of your company. And we're going to show you how to slow it down a bit and catch it. All right.
Starting point is 00:02:15 You ready? This is part of our Start Here series on Everyday AI. Let's get into it. So if you are new here, well, let's just get to the big picture. AI capability right now is outpacing business adoption. And it has changed recently. And the pace is too fast for any of us. Right now, Frontier AI.
Starting point is 00:02:39 And when we talk about it. about actual capabilities. This is not an exaggeration. Frontier AI models when used correctly, and that's the big asterisk here, they now match or exceed human professionals on most defined knowledge work tasks. And there's a massive gap that persists between what AI could, in theory, automate today and what organizations are actually using it for, which is often just topical. There's a McKinsey study that said only about 6% of organizations are generating meaningful profits despite widespread AI adoption. And the bottleneck is not actually the AI models. You could have made that argument maybe mid-2025, but not anymore in 2026. It's actually about
Starting point is 00:03:26 organizational adoption, workflow design, and training and education. So here's what we're going to tackle on today's show. So stick with me for the next. I'm going to try to make this one 25 minutes. We'll see. All right. Stick with me for the next. 22-ish minutes, and you're going to learn the benchmark proving that AI outperforms human experts, and it's not even close. You're going to know more about Anthropics' new-ish study, revealing most knowledge workers barely touch AI's true automation potential. You're going to learn why the AI capability gap only started to show itself over the past few months. It's actually kind of new.
Starting point is 00:04:00 And what the top 6% of AI performing companies actually do that everyone else completely ignores. All right. Welcome to Everyday AI. And this is our Start Here series. This is the essential podcast series to both learn the AI basics and for AI experts to double down. All right. I started this thing because after 750 plus episodes, everyone always said, Jordan, great podcast.
Starting point is 00:04:24 Right. When they found it, they're like, where do I start? I have no clue. And I'm like, I also have no clue. So that's why I started these Start Here series. It's best if you are brand new here, start an order. All right. These are faster, you know, podcasts, usually about 20.
Starting point is 00:04:38 25 to 35 minutes. But if you listen on 2X, they're even half that, right? But this is a great way to listen from one to now volume 19. And then also make sure to go to start here series.com. That's going to give you free access to our exclusive inner circle community. You can't find access right now. Any gap report card. All right. So a little bit more on that at the end. But trust me, you are going to want to repose today's episode. I'm just saying. All right. And if you miss our last start here series in volume 18, and this was episode 750, we went over the vibe coding boom, why vibe coding isn't going away and how it's both good and bad. So make sure to go check that one out. But now let's talk about how you can actually manage the AI capability gap and why AI
Starting point is 00:05:46 is more than ready and companies are not. All right. And I'm not the only one talking about this. I've been talking about this now since I think late 20, 25, but a lot of of super smart people are starting to dive in deep on the AI capability gap. I really like what Jack Clark said. He is the Anthropic co-founder of Anthropic. And here's what he said in a post a couple of weeks ago. He said most of AI progress has this flavor. If you have a bit of intellectual curiosity and some time, you can very quickly shock yourself with how amazingly capable modern AI systems are. But you need to have that magic combination combination of time and curiosity. And otherwise, you're going to consume AI like most people do as a passive viewer
Starting point is 00:06:33 of some unremarkable synthetic slop content. Or at best, just asking your LLM of choice how to roast a turkey and keep it moist. Or Tony Box lights spinning, but not playing music. What do I do? And all the amazing advancements are mostly hidden from you. All right. So that's what Jack Clark said in a kind of viral post that he had a couple of the weeks ago on X. And I think this is very telling. And the two things that he talked about, I think are very true. All right.
Starting point is 00:07:07 To start, well, first to even realize the AI capability gap, let alone close it, you have to be extremely curious. And the reason why I say that is because if you work, how you've been working for the last few decades, you're not going to discover or your team, your organization is not truly going to discover that capability gap because AI is not meant for the average knowledge worker, right? That's why I talk about all the time how I hate, absolutely hate the concept of upskilling because AI is not something that you sprinkle on top of a seasoned knowledge worker, right? Like myself, I've been working full time for 20-ish years, right? I can't just sprinkle AI on the top. I have to
Starting point is 00:07:50 unlearn, right? So you do have to have the time and you have to be curious on, hey, what does my role look like if I completely started over. And if I kind of forgot everything that got me to the point I am in my career, and that's what you have to do. And you also have to have to have a lot of time and you have to devote a lot of time to it. Olivia Moore from A16C talked about, she said, Open AI dropped a state of enterprise reports across a million customers. This was a couple of months ago.
Starting point is 00:08:14 But her response, she said, the gulf between AI power users and everyone else is wide. The 95th percentile user spends six cents six times more messages. than the median with coding, writing, and analysis showing the biggest gaps. So, yeah, those people that are power users, well, because they actually understand the capability gap, they're using AI all the time. This is why myself, and I'm not saying this is like some weird flex. I'm saying this because I'm part of this, you know, power user group. When I say that I use AI for 10 to 12 hours a day, right, that's how much I'm working.
Starting point is 00:08:50 You know, it's not like I'm working 16 hours a day. I am only working in AI every single day. No matter where I am, I'm using AI from beginning to end in literally every single step in between. And it has completely reshaped my workflow. Kevin Ruse, New York Times columnist, said this. And I really like how he put this. He said, I follow AI adoption pretty closely. And I have never seen such a yawning inside, outside gap.
Starting point is 00:09:18 People in SF in San Francisco are putting multi-agent cloud store, in charge of their lives, consulting chatbots before every decision, why are heading to a degree only sci-fi writers, writers dared to imagine. People everywhere are still trying to get approval to use co-pilot in teams if they're using AI at all. It's possible that the early adopter bubble I'm in has always been this intense, but there seems to be a cultural takeoff happening in addition to the technical one, not ideal.
Starting point is 00:09:49 And I'll say from my experience, I think there's five main reasons that the AI capability gap has really been exposed in 2026. And number one, the models of what they can actually do, right? I'm just going to roll through these things really quickly. So number one, what AI models can actually do. Number two, what business leaders think AI can do. Number three, AI literacy. Number four, human skill set. And number five, AI access.
Starting point is 00:10:19 Now, let me break those five things down a little bit more in depth. So, number one, what AI models can actually do? A year ago, right? So I don't know when you're listening to this start here series episode. You know, could be in April, 2026. You might be listening to it at the end of 2026. I don't know. Right.
Starting point is 00:10:35 But a year ago in the beginning of 2025, you could, for the most part, understand what AI models are capable of. Today, you absolutely can't. And it is literally getting to the point of science fiction. All of these math problems and science problems that have been plaguing researchers for decades are now being solved. I look at them and I have no clue. I read the papers and I'm like, okay, I don't have any clue. I can follow and understand it a year ago.
Starting point is 00:11:02 What AI models can actually do today is mind-boggling. Number two, what business leaders think AI can do? I get to talk to a lot of smart people. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative AI studio. Powered by Adobe's creative agent, Firefly AI Assistant lets you start with your vision, just
Starting point is 00:11:40 describe what you want, and shape the outcome as it takes form with the Assistant. The Assistant orchestrates multi-step workflows drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom, Exploreau, for us and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks, like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any
Starting point is 00:12:17 time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at at Firefly.adobby.com. It's really, right? I've had hundreds of guests on this show. I'll say maybe a handful that I've talked to.
Starting point is 00:12:40 And I'm like, yes, this business leader fully understands what AI can do. For the most part, and that's not their fault necessarily because most people in their role, they have to be an expert at one application, right? You can't be in like very few business leaders know what AI can do. I feel fairly confident that I'm in that category, but only because this is all I do. If I had another job, right, and I was just an AI champion at a company, you can't. Like you literally can't. You used to be able to.
Starting point is 00:13:11 You can't anymore because literally every single day, right, whether you're talking about Google, Microsoft, Open AI, Anthropic, now meta's back in the conversation, perplexity, et cetera, right? You literally can't have an actual job where you have to produce something for a company and still understand what AI can do. Not anymore. You used to be able to juggle that. Now, unless you're literally working 20 plus hours a day, you can't do it.
Starting point is 00:13:34 All right. Number three, AI literacy. And this kind of goes, you know, hand in hand with, you know, number two, what business leaders understanding. So there's a difference between understanding AI's capabilities and then AI literacy, because you have to actually be able to speak the language foundationally, which, again, most people skip over that. Right?
Starting point is 00:13:56 They go straight to the bells and the whistles. They go straight to clicking the button, not knowing what happens, you know, from A to X. They only want to get to Y in the result that brings on Z. Number four is human skill set. All right. This is different than literacy. You have to have the foundational understanding, but then you also have to have these skills, right? I know a lot about basketball, but my basketball skill set not great anymore, right?
Starting point is 00:14:21 I think I peaked in eighth grade. But you have to have that human skill set. And then last but not least is AI access. So that's access to the tools, to the technology in your role. Okay. So what can AI models actually do? Right. And I do want to spend a little bit more time on that because that will better explain
Starting point is 00:14:41 this gap that I think started manageable. And now it's very hard to manage that capability gap because it has turned into a friggin' canyon. So the reason being is because. AI models are better than expert humans. All right. I've talked about this evaluation a couple of times, but the more I follow these AI benchmarks and all of these other things, right?
Starting point is 00:15:08 There's dozens of them. I think there's about five that matter. Right. And one of them, and I think probably the most important one, is GDP Val. So this is a benchmark from OpenAI. And the reason why I think is the most valuable benchmark to look at is because it's about creating business value. So more or less, this benchmark.
Starting point is 00:15:26 benchmark evaluates AI models against real professional deliverables across 44 different high GDP occupations. So these are things that you and I may do, you know, going in and doing research on a fast moving market segment and then creating an artifact like a, you know, a pitch deck or creating a spreadsheet, something like that. These are real world domain specific tasks where both a human and an AI model from start to finish, completed a task, and then submitted their final version to a panel of experts who are experts in that field. And expert judges then compared the unlabeled AI and human outputs in a blind head-to-head pairwise comparison.
Starting point is 00:16:12 And what we've seen now is, as an example, the best model for actually creating front-to-back economically valuable outputs like a human would is the open AI's GPT 54 model and it matches or exceeds industry professionals in 83% of these evaluations right I remember when some of these first GDP valve numbers came out and I'm like oh that's that's pretty impressive right when we were in the 30 40% but now it's undeniable and I do assume that by the end of the year that number is going to be like in the mid-90s, right? Like one of my predictions is it was going to get to 80% and we're already at 80% is just a huge jump.
Starting point is 00:16:56 That's why I keep talking about the AI capabilities are just running too fast for most business leaders to understand and keep up. And it's nearly doubled. That GDP BAL score has nearly doubled in five months. So when I talk about the last, you know, since essentially late 20, 25, I'm going to talk about why, but it has been, we've seen more developments in the past five months than we have in the five years prior, at least when it comes to large language models, and it's not even close. Right. So in October, the best AI model scored on that was a 47%.
Starting point is 00:17:33 So it hasn't quite doubled, but it's nearly doubled in about five or six months. And right now, AI handles most defined professional tasks yet. most organizations barely use models in this way. It's like, did you know, right, as an example, your, you know, GPT and Claude models can do this. They can create artifacts by default. You don't have to have anything special turn on. It can just create spreadsheets, presentations, Word docs, et cetera, all personalized based on your company's info. All right.
Starting point is 00:18:08 Next, I want to talk very briefly about the anthropic labor data study that came out, about two months ago. I did go over this in more depth. So if you're interested in this, go listen to episode 730. Here's essentially, similarly, whereas the GDP valve benchmark looks at actual economic output,
Starting point is 00:18:31 this anthropic labor data study looks a little bit more of the capability gap. So that's why I think these two kind of this benchmark, open AI benchmark in the anthropic labor data study in tandem, really tell a powerful story of where we're at and why this capability gap even exists. So in this study, Anthropic mapped over 20,000 work tasks, right? So they actually used some public jobs data from the federal government. So they mapped over 20,000 work tasks
Starting point is 00:19:04 across 800 occupations to millions of real AI conversations with their clawed chatbot. These were anonymized, but essentially they said, okay, what are people using Claude for? And they're matching millions of these chats to 20,000 work tasks across these 800 occupations from this federal data. And what they found in theory was that computer and math rolls could solve 94% of, sorry, today's most capable AI models could automate a problem. to 94% of computer and math tasks, right? But they were only seeing 33% usage, right?
Starting point is 00:19:50 Because people assume, oh, if computer and math, yeah, everyone's using AI for that because everyone knows that's the lowest hanging fruit, right? Anything in coding software development, et cetera. Yet, even in the area where people assume, oh, yeah, literally everyone, if you're working in anything computer, right, software engineering, anything dev, anything with math, of course, you're using AI. They said, well, actually not. Right.
Starting point is 00:20:15 And even office, admin, business, financial, and legal roles all revealed the same deep kind of adoption shortfall. Right. So for our live stream audience, you can always see the video version of this on our website at your everyday AI.com. So I have the theoretical capability and observe usage by occupational category. Kind of map that Anthropic put together. Very fascinating.
Starting point is 00:20:38 But all this shows is in certain categories, right? as management, business and finance, computer and math, you know, architecture and engineering, legal is another high one, arts and media, right? Like, AI's capabilities when mapped to literal the actual work tasks that the federal government uses to define these roles, I mean, most of them have coverage in the 80 to 90 percent. yet the observed or sorry the that's the theoretical coverage so in theory today's most powerful models could do anywhere from 80 to 90% of the actual work right computer and math was the high one where they had an observed coverage of a little more than 30% but these other areas are not even
Starting point is 00:21:26 20% right they're for the most part you know in the 20 some of them below 20 so you have this huge gap where in most of these general cases, right, like management, you know, in theory, today's AI models, if you understand the capabilities, they can automate 85 plus percent. Yet, you don't even have a 20 percent coverage. That is a crazy gap, right? Where essentially you have a magic wand that if you know how to use the magic wand and you know the magic words and you say, poof, work be gone, poof, the work is gone, but people don't know. because it's impossible to keep up with.
Starting point is 00:22:07 And some other things that they found on this study, which is actually the workers that are most impacted, they said that workers most exposed to AI right now earn 47% more in hold graduate degrees at nearly a four times rate. So essentially, I think people early on assume that AI would displace or would potentially have the capabilities to displace workers who were maybe a little more junior or, you know, weren't as high up the kind of quote unquote knowledge work totem pole, so to speak.
Starting point is 00:22:37 And what they found was the exact opposite, right? It is those people who have those higher degrees. And then conversely, people with physical and manual occupations registered the lowest exposure, according to Anthropics study. So how did we get here so quickly, right? How do we get, as an example, from GDP Val, where literally five and a half months, ago. It was even, right? It was a little less. It was about a 47% tie or win rate against humans, right? It was a coin flip. On it, the off the shelf model that you can pay $20 a month for was as
Starting point is 00:23:17 good as a, you know, a human with a decade of experience who specializes in that to now, humans aren't going to be able to compete in a couple of months, right? By blind benchmarks, almost every single time within probably six months, humans are always going to prefer the AI model. How do we get here so quickly? Right? Two years ago, the AI models were not good at all. They're actually pretty bad, right? Especially, obviously we have today's comparison to draw back on. But one of the biggest reasons I think is recursive self-improvement. Stick with me here if you're not super technical. All right. So recursive self-improvement or RSI. It's a concept within artificial general intelligence where essentially an AI system improves its
Starting point is 00:24:02 own source code, architecture or training data, leading to a more capable model, which then improves itself further, creating a self-reinforcing loop. All right. So why am I talking about this kind of strange niche concept called recursive self-improvement on something about managing the AI capability gap? Well, because at the end of 2025, the AI company started to either directly admit or to kind of allude to the fact that they were all now using recursive self-improvements on their models, right? So now their quote unquote big models were good enough that it could start writing its own code. It could start improving itself, right? I think, you know, Anthropic with their Claude code, probably one of the most consequential products of the AI life cycle outside of
Starting point is 00:24:50 chat GPT. You could make the argument that Claude code is one of the most important outside of chat chbteebte, right? The lead at Anthropics Claude code says, yeah, we don't write code anymore. You know, Claude codes, Claude, right? And we've seen the same inferred by different researchers at OpenAI as well. You know, Google has been a little less direct, but they've still alluded to the fact that a lot of their models today, not just the smaller versions that are distilled from the biggest versions, but the biggest
Starting point is 00:25:22 versions are being improved by themselves, right? And these new big, scary models, right? Like Anthropics, mythos, or, you know, Open AIs, whatever. it's called, you know, spud or glacier, right? If you're listening to this in six months, these names, these code names don't matter anymore. But one of the reasons that maybe the general public isn't getting their hands on them is, well, they're too compute intensive, but they are using these models internally to improve the best consumer models and to also ship new products.
Starting point is 00:25:52 And that's why Anthropic, as an example, their ship rate in February, in March was straight up off the charts, right? And then we found out later, well, one of the reasons was because they were using this mythos model to put out a lot of this, a lot of these new features that we're all using now. And this is why, right? Because a year ago, it would have taken a team, even using AI, it would have taken teams way longer. But now that you have this kind of recursive self-improvements or models improving themselves and building new features that use those models, right? This is why it's hard now to keep up. So this started, like I said, in 2025.
Starting point is 00:26:35 And ever since, large language model capabilities have far outpaced enterprise training and learning and development. You know, even the companies that wanted to do it right, I think they could, they could do it, you know, in, you know, quarter two, quarter three of last year. But now you can't, right? unless you have an entire segment of people, you know, maybe 5% of your total workforce,
Starting point is 00:27:00 that literally has no deliverables. And all they do, right, I've said this all along, your team needs a bunch of me's, where all they do all day is they just scope different AI models. They play with AI releases, they sandbox thing. They're not building anyone for anything.
Starting point is 00:27:18 They're just building solutions for what they think the company needs, and then they're training those people. But companies don't have that, right? And that's why, at least right now, it is nearly impossible to keep up. And most companies, though, claim AI adoption, but they can't actually prove real results. And it's getting even harder and harder, right? That McKinsey study that I talked about in 2025 said that 88% of organizations are using AI, but only 6% are generating meaningful business profits. And right now, I think leadership confidence is running far ahead of what frontline practitioners report actually. seeing on the ground. That's the thing, right? So a lot of times the people who are in charge of AI at certain companies now because it's getting easier to build, I think maybe their hands are on keyboard or their eyes are on monitoring agents a little bit more and they are getting removed from what the frontline practitioners are actually experiencing. So not only is it pretty hard
Starting point is 00:28:19 to manage that gap just from a technical perspective, but from a change. management in a people management perspective, it's getting even hard. And I think that's why some AI capability gaps are closing, but others are still remaining stuck. So as an example, right, AI is helping AI-enabled humans close some gaps, but many still exist and are getting worse. So as an example, coding performance, right, this in a lot of different benchmarks, this surge from single digits to 90% on structured benchmarks in three years in terms of what the AI itself was capable to do. And that helps, obviously, the humans that use this as part of their daily workflow close a big chunk of that gap individually. But you now have this, you know,
Starting point is 00:29:05 PhD level models with, you know, science and standardized math capabilities that are rapidly approaching the ceiling of the current tests. So you aren't even necessarily able to know by the benchmarks what the capability gap is because these benchmarks are becoming saturated. So we, even the AI community doesn't even fully understand these models capabilities because the benchmarks, right, a lot of them have been stuck in the 90 to 95 percentile. You know, a lot of them are at 98, 99. So I think the benchmarks themselves are getting saturated. So even the people building the models aren't even fully aware or understand what these models are actually capable of. So let's talk about how you can actually manage this gap and start to close it. All right.
Starting point is 00:29:50 Right now, the top performers, right? The top companies are investing more than 20% of their digital budgets to fundamentally rework their processes. All right. Let me repeat that. 20% of their digital budget. So whatever your digital budget is, I'm guessing most companies are probably at 1%. maybe 5%. Very few. Only the top performers are investing that 20% of their digital budget to rework their day-to-day knowledge processes, right? Because you need to learn to separate workflows
Starting point is 00:30:26 into risk tiers with verification and human approval matched to each level. In organizations that wait for AI to become ready, they're going to find that their competitors have already capture the advantage. Because in the same way, right, just to draw a little parallel here, what Anthropic, and I keep saying they've won 2026 so far, well, it's because they were first, right? They were first to at least internally close that capability gap and put it to work, right? And that's why they've been able to outpace their competitors. I don't know how long it'll last. We'll see because I think the other labs have closed. that gap and I think we'll start to see that soon. But think of that within your own organization,
Starting point is 00:31:13 right? The first to close the gap is going to be able to accelerate at a pace that we haven't seen before, right? That's why you see, unfortunately, a lot of these big companies, you know, block as an example cutting 40% of the workforce and they seem fairly confident that they're going to be able to actually grow revenue because of the way they've completely reworked their organization. Actually, Jack Dorsey had a very fascinating essay that we shared about in our newsletter about how they're essentially flipping the work pyramid on its head. So here's what I want to have you focus on. One metric. Okay.
Starting point is 00:31:54 I want to make this very digestible for you and hopefully actionable as we close out today's show. I want you to track the percentage of AI-assisted workflow steps right now that are accepted without any rework or incidents. And here's why. Because you probably, most organizations, they kind of get their AI plan or their AI training for the year, right? Or some companies, unfortunately, it's kind of like one time. Maybe the companies that are really investing heavily, you might get it once a quarter. But when was the last time that you used the most capable thinking models?
Starting point is 00:32:31 I'm talking about GPT-5-4 Pro. I'm talking about Opus 4-6 with extended reasoning. I'm talking about Gemini 3-1 Pro with a higher thinking budget, right? When was the last time that you used those and track the percentage of your workflows that get accepted without rework or incident? And then I want you to break that metric down by risk tier, right? So the low risk, you know, drafting versus the high stakes legal or financial outputs. So organizations that measure and understand that operational reliability for the high-stakes, percentage of AI-assisted workflows that can be accepted at a lower risk.
Starting point is 00:33:12 You will be surprised that depending on what your team does, depending on what your company does, depending on what sector you're in, you'd be surprised without too much investment, aside from just reverse engineering your current day-to-day processes, you'd be surprised to say that about 30 to maybe 60% of a lot of the work that many of us do, right? I'm not talking about, you know, people in specialized role. I'm saying if your organization has a thousand employees, you know, if you look at that work collectively, you'd be surprised. I would say 30 to 60%.
Starting point is 00:33:47 If you rescope everything, would fall in that can essentially be automated without rework or incident and is in a lower risk or a medium risk tier. And if you're not measuring that on an ongoing basis, you can't scale. But that's step one to managing the AI capability gap. And you have to be doing this, right, at least monthly. Again, a year ago, you could get away with quarterly. The way models, right, if you're using a model from last quarter, good luck, right? That's showing up to an F1 race in a bicycle.
Starting point is 00:34:28 Good luck. You're going to get smoked. You don't see it a chance. So you can no longer take these year-long pilots, these quarter-long plans. You have to be agile to actually understand, number one, the capability gap, but to begin to manage it. All right. I hope this was helpful in our start here series. Like I said, make sure to repost today's episode on LinkedIn.
Starting point is 00:34:48 Here's why we put together the AI capability gap report cards to help you know where your organization stands when it comes to, number one, understanding this gap and number two, how you can actually tackle it. So in this report card, it's a great guide for you and your team to go through it together to understand the latest capabilities of all the models and how they break down for different types of work. All right. So if you didn't know, yes, if you're listening on the podcast, this is actually live streamed on LinkedIn.
Starting point is 00:35:21 So in the podcast show notes, we always put a link to today's LinkedIn. show. So just go click repost and I will send that capability gap report card your way. All right. Thank you for tuning in. If you haven't already, please go to start here series.com. That's going to give you free access to the inner circle community. And you can go listen to all of the Start Here series in order in the playlist that we have. And you can go read and listen to all of the shows there. So thank you for tuning in. We hope to see you back tomorrow and every day for more everyday AI. Thanks y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio.
Starting point is 00:36:07 Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of every. Day AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating.
Starting point is 00:36:45 It helps keep us going. For a little more AI magic, visit Your EverydayaI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.