Everyday AI Podcast – An AI and ChatGPT Podcast - AI in 2026: 7 reasons why the pace of AI this year will far exceed 2025.

Episode Date: January 6, 2026

You still using AI to..... write emails? 🤔You know these things can like.... access your dynamic data, plan, use tools and create work outputs just like us, right? Chances are, you're still us...ing LLMs like a back-and-forth chatbot straight outta November 2022. But 2026 is gonna slap you in the face, because the rate of adaption is gonna be undeniable. Join us for our first #HotTakeTuesday of 2026 for the 5 reasons why. AI in 2026: 5 reasons why the pace of AI Adoption this year will far exceed 2025 -- An Everyday AI Chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:Five Pillars of 2026 AI AccelerationReasoning by Default in Large Language ModelsAgentic Scaffolding for Autonomous AI ExecutionFrictionless Data Integration with RAG PipelinesExponential AI Task Endurance MetricsAI Models Delivering Economically Meaningful WorkGDP Valve Benchmark for Enterprise AIAdvanced Front-End LLMs vs Old Transformer ModelsTimestamps:00:00 "AI's Untapped Potential in 2026"05:32 "Embracing AI as Team OS"08:50 "Evolution of Reasoning AI Models"12:34 "Enhanced AI Tools and Scaffolding"16:04 "Gemini Versions and Data Grounding"19:24 "Measuring AI Task Proficiency"21:07 "Exponential Growth of AI Models"25:56 "Can ChatGPT Create Spreadsheets?"29:10 "Benchmarking AI for Economic Impact"30:51 "LLMs Outpace Humans in Efficiency"34:05 "Everyday AI: Subscribe & Thrive"Keywords:AI in 2026, pace of AI adoption, frontier AI capabilities, enterprise automation, agentic scaffolding, reasoning models, hybrid reasoner, large language models, economically meaningful work, AI agents, ChatGPT, Claude Opus 4.5, Google Gemini 2.5, Anthropic, OpenAI CEO of applications, Fiji Simo, proactive super assistant, operating system for enterprises, task endurance, exponential task durationSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the all-in-one creative AI studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. The gap between how most people use AI today and what it's actually capable of is mountainous.
Starting point is 00:00:54 That's because many business leaders are driving by only looking in the rear view mirror. Many iron prizes are primarily still using AI to polish their blog posts or as a Google search replacement. And that's one of the main reasons. I think 2025 wasn't quite the breakout year for AI that many thought it might be. But that's also because the technology in itself in the first half of the year needed a lot of elbow grease and duct tape. It's not the case in 2026. Today's models are so good.
Starting point is 00:01:31 You can sneeze and accidentally create a million dollar app. I mean, with little experience and a few clicks, you can send a swarm of capable agents to go accomplish real work for you. So why the change in Outlook for 2026? Well, I've got five reasons. And on today's first edition of Hot Take Tuesday for 26, I'm not holding anything back. And we're going to be laying out the five reasons why I think the pace of AI adoption will far exceed what happened in 2025. All right. You ready to get into it? Hope you are. Let's go. If you're new here, welcome.
Starting point is 00:02:13 My name's Jordan Wilson and welcome to Everyday AI. This thing is your daily live stream podcast and free daily newsletter helping everyday business leaders like you and me. Not just keep up with all the AI advancements because they happen every single day, but how we can make sense of it. No BS, no spin. Just grab the most important aspects that can grow our company and our career. So it starts here with the unedited, unscripted live stream podcast, but to be the smartest person in AI, your company, make sure to go to our website at your everyday AI.com, sign up for the free daily newsletter and little tease here. Talked about this on the show yesterday, but we are launching
Starting point is 00:02:53 our AI inner circle community this week. We have our new prime prompt polish course done. It is self-paced. It is ready. We've technically been working on this for like almost two years. So make sure to tune in to tomorrow show and keep an eye out on the newsletter. All right, but let's get straight into it. I'm not going to make you wait any longer for the five reasons. So here is what they are, what I'm calling the five pillars of AI acceleration for 2026 that maybe weren't there in 2025. So number one is reasoning by default.
Starting point is 00:03:37 So large language models are now, a natural thought partner and they were just kind of a fun tool maybe last year for a lot of people. Number two is agentic scaffolding. So we've gone from passive knowledge to active execution with tools. Number three is frictionist, a frictionless data integration. So rag pipelines, if I'm being honest, I'm bearish, right? I'm very bullish on what most front and large language models offer now. is one-click versions to bring your company's dynamic context to the front.
Starting point is 00:04:18 Number four is exponential task endurance. So going from what large language models were capable of, maybe a year, a year and a half ago, short sprints, to now long-term projects. And then number five, and this is last but definitely not least, is economically meaningful work. Yeah, large language models are no longer about writing better blog posts or replacing Google. They are about doing the actual work that you're doing now, which I think in 2026, more than ever, to get the most out of AI, it really requires a mindset shift. And I think this became pretty evident to maybe a lot of people.
Starting point is 00:05:04 I've been saying this for a very long time. but a tweet that has picked up a lot of steam in the last day came from the OpenAI CEO of applications Fiji Simo. So Fiji wrote a tweet and then a blog post. We'll make sure to link it in today's newsletter if you want to check it out. So she said, Frontier AI is far more capable than how most people actually use it. And then in the blog post, just kind of talked about how 2026, at least for Open AI, is about closing that gap through better products, not just better models. You know, she talked about how Chad GPT is positioned to become a true personal super
Starting point is 00:05:45 assistant, proactive and personalized, connected to real services and focused on getting concrete tasks done. And then she said that for businesses, OpenAI aims to be the operating system for enterprise automation with agents that reliably handle real work at scale. All right. Not trying to say, I told you so, but I've been calling Chad Chepti and other large language models, the AI operating systems for multiple years now and saying that even if you are a co-pilot organization, you need to move all of your day-to-day meaningful work inside of a
Starting point is 00:06:19 front-end large language model and start using them as a team. All right. But hopefully today, when I talk about the five big reasons why, I think you'll start to understand, yeah, we should probably start doing this if we haven't already. So let's look at reason number one, right? Why is the pace of AI going to be so much faster in 2026 than it was in 2025? And the first thing is, well, the models are better by default. So what's interesting to note is a stat that's not talked about a lot. Last year, Sam Altman said that only 7% of queries involved reasoning models. And that to me is absolutely nutty.
Starting point is 00:07:06 All right, so if you are kind of newish or not super technical, let me explain very briefly, kind of this big shift that happened in 2025. And it was a slow shift, right? Because maybe a lot of the, a lot of how people view large language models is the original chat chept. Right. A friendly chat bot that's, you know, really fun, can spit out, you know, large blocks of text and sometimes hallucinates.
Starting point is 00:07:36 and that's not really what large language models are today. I'd like to say that there's almost a line in the sand. And to me, that line, it's not between, you know, normal AI and agents. It's actually between, you know, kind of quote unquote old school transformer models versus new reasoning models. And I think that this is turned large language models from something that humans have to put a lot of work into, into being a true agentic partner because the ability to reason is an absolute game changer. All right. So like I said, models from yesterday year, they were great.
Starting point is 00:08:15 You know, nothing wrong with a GPT-40 or a, you know, Gemini 2 or a Clawsonnet 3,5, whatever model you want to throw out there, right? But they weren't that good, if I'm being honest, right? What it required. You had to be a real dork like me, right? You had to really put in the work on prompt engineering. You had to really almost be obsessive to get human level output out of large language models in early 2025 or in 2024, right? For a lot of reasons.
Starting point is 00:08:50 But one of them is just because these models didn't think they couldn't plan. They weren't, they didn't exhibit human traits. Right. They just spit out as quickly as possible next token predictions, right? Reasoning models are much different. They think they plan. They can agentically start going down a certain path and then decide, oh, this is in the right path. And they can then go backwards and start down a different path, much like a human would, right? Looking at, if you have never done this before, go into Chad GPD, use one of the thinking models, ask it a very tough question.
Starting point is 00:09:29 read the summarized chain of thought and you'll see exactly what I'm talking about and why this leap between non-reasoning models and reasoning models is actually exponential, right? And it's crazy because it didn't really happen until kind of the middle of 2025, which is one of the reasons. I still think people had this old original November 2020 viewpoint of chat GPT when it came to AI implementation in the enterprise, which is a dangerous, viewpoints to have. So as an example, right, when we talk about when did, you know, agentic or reasoning or hybrid models come to the forefront? Well, it wasn't until the middle of the year. Right. So Google brought Gemini 2.5, their first hybrid reasoner in
Starting point is 00:10:23 March of 2025. Anthropic brought Claude 3.7.7. on it in February. And then it wasn't really until, you know, most paid ChadGBT users started to get the 03 model right around the same time. So it honestly wasn't even until, you know, that first quarter or middle of the year that people even got a taste of what reasoning models were. And I still don't think, right? Even just by looking at that 7% of users were actually using them.
Starting point is 00:10:59 I think as they become the default, right, I think that changes what companies think AI can do for them or should be doing for them. And reasoning models, I think, help close the gap that let non-technical people get much better result from front-end large language models because you don't have to, you know, do a bunch of, you know, advanced prompting to get the most out of them. All right. So let's now go to reason number two or pillar number two, which is, well, they now have hands and feet to act. This is the agentic scaffolding that models have. So that's the technical term, right, agentic scaffolding. But this is what I mean by that. Models now know when to enable tool calling. They can plan out their responses when to write code or use computer vision without. being told by the human, right? So many times I will send a prompt, right, whether we're talking about Gemini 3 Pro, Claude 4-5 Opus, you know, Gemini 5, or sorry, GPT52 thinking pro,
Starting point is 00:12:12 one of my favorite models, right? And I think that it's going to be straightforward. And it's like, okay, it's got a reason, it's going to go do a little research on the web, it's going to look at my data and that's it. And then all of a sudden, I'm like, oh, no, it's using computer vision. And it's using Python.
Starting point is 00:12:27 And it's doing all, these other things that maybe I didn't expect it to. And then when I look at it, I'm like, oh, wow, I'm getting much better results than I thought I would because the model took advantage of all this advanced scaffolding. So what does this mean? Well, the agented capabilities are there by default in these reasoning models. So that is more or less the AI's ability to act as an agent by independently planning, you know, executing a series of steps to reach whatever goal it is. For example, you know, if you ask, you know, what are these models, to plan a trip. It doesn't just look up flights. It automatically might check your calendar. It
Starting point is 00:13:03 might look up live weather. It searches multiple booking sites at once and might present a final itinerary to you in one go and create different charts and graphs that you didn't even know you need. And you're like, wait, I actually really needed this. You know, and then on the scaffolding side, this is more of the kind of invisible support system that large language models have. That's built around the models and it gives it the tools to be agentic. Right. So in the chat chbtee interface, for example, the scaffolding includes the model's ability to access a web browser, code execution, you know, the memory of your previous chats, previous
Starting point is 00:13:38 files, right? So that just allows it to kind of step outside of its own brain to verify facts or run calculations or to take multiple passes, you know, at a certain problem that you may be throwing at a large language model. And I do think the combination of these two things, it's taking. large language models, I think in my mind, from input output devices to problem solution, right? I find myself, especially over the past 18 months, spending much, much, much more time working with large language models, telling them more about problems and giving them data
Starting point is 00:14:14 about problems, and then investigating different solutions together. So I'm not using large language models very much anymore or as much anymore for simple input output. It is really thought partnership. All right. So let's go into reason number three, AI adoption in 2026 is going to far outpace 2025. And that is, rag is one click down, right? Yeah. All right.
Starting point is 00:14:40 I'm sure I'm going to get something from an engineer here and be like, no, Jordan, you're an idiot. You're wrong. All right. I'm simplifying here. I'm simplifying here, as I always do for our more non-technical audience. So let me talk a little bit about rag, why it's important to understand, right? And you don't have to be an expert. on, you know, vector databases or anything like that, right?
Starting point is 00:15:00 The simple way to think of this is traditional RAG, right, very popular, you know, even before Chad GBT came out, but, you know, especially once Chad GBT came out, I think in 2023, early 2024, RAG was all the rage, right? Retrieve blog, augmented generation. To say simply, it was a way to insert your company's most important dynamic data in front of the model. So before it even went to its training data, essentially it would look at your databases first and then just kind of pass off the relevant insight to the large language model. I'm oversimplifying there, but hopefully that on a podcast, that can make sense for you without showing you, you know,
Starting point is 00:15:40 fancy graphs that I have on screen here. Right. You needed that in 2021, 2022, 2023. Not so much anymore, right. Everything is moving to the front end, right? Not everything, but for non-technical, everyday knowledge workers, right? You're in HR, you're in finance, you're in marketing, you're in PR, comms, you're an executive leadership, whatever, right? You probably, if I'm being honest, you probably don't need to go pay someone, you know, six, seven, eight figures to go build a rag pipeline because now with a couple of clicks and this is one of the biggest changes that I think was overlooked in 2025. You know, Open AI, I do think led the way. I think Anthropic caught up in a huge way. In Google through one of their Gemini, people don't, I feel a lot of people don't even
Starting point is 00:16:42 know that Gemini has two different versions. They have the version that probably most people use. And then they have a business and enterprise version as well that's completely different. So I have two completely different versions of Gemini that kind of do different things, right? But one, I think has a little bit better way to ground your data, right? So when we talk about grounding, you know, that is a simplified way to talk about the benefits of having a rag pipeline, right? So now whether you want to talk about things like, you know, connectors, you know, chat GPUs connectors or now they're kind of leaning more into apps, right? Google calls them different things. Same thing with Anthropic.
Starting point is 00:17:21 They have connectors or integrations, but you can bring your entire Google Drive, right? If you use Box, if you, you know, your Outlook calendar, your, you know, your Gmail, wherever it is that your company's data lives within literally two to three clicks, it can be indexed, waiting there, dynamic. and in certain cases with a little bit of know-how, not a lot, you know, you can get that grounded truth from large language models, right? The holy grail of what we thought we would never get, right? These things are just spitting out, you know, more lies than a politician two weeks before re-election, right? No, not anymore. You know, having this ability to connect AI to business reality was a major bottleneck, but I don't think it is anymore. All right. Let's keep going.
Starting point is 00:18:17 Number four here is AI can work for a very long time. All right. And this is important for a lot of reasons. I think the easiest way to illustrate this point is to talk about coding and to specifically even talk about a kind of exact kind of stat here from meter, which is the model evaluation and threat research. So meter, so that's METR, they're essentially a nonprofit
Starting point is 00:18:59 that test if AI models are becoming powerful enough to be dangerous, right, to simplify it. And they've been doing this for a while. And usually it's kind of through the lens of coding, because that's a great way that you can kind of get this time horizon. So let me, you know, so instead of just giving, you know, different AI models a score, they measure and models stamina for finishing long
Starting point is 00:19:25 projects measured in human hours. So what they look at, it's called a 50% time horizon or the effective horizon. All right. So that's kind of what has been popularized by meter. So in short, it measures the length of a task. that an AI can successfully complete at least half of the time. That is that 50% horizon. Okay.
Starting point is 00:19:49 And this is the human hours scale. So the task length is measured by how many hours it would take an expert human to finish the same job. So this isn't how long a model can work autonomously on its own. This is essentially can a model, you know, by default, you know, do a task that would take a human an hour. Can they do a task that would take a human four hours, six hours? And can they do it, you know, 50% of the time? So what's interesting, and I'm going to go ahead and share something else on my screen on my screen here for the live stream audience. So hopefully you guys can
Starting point is 00:20:36 can see this. What's interesting is, I think the new model from Anthropic, at least on this meter, metric completely changed, you know, what people think large language models are possible of now, right? So it's hard to describe to our podcast audience, but I'll say this, hockey stick, right? Everything else was pretty linear, right? Looking at growth from, you know, GPD 3.5, GPT4, right? And at this point, you know, even middle of last year, or no, let's let's go back about two years. Right.
Starting point is 00:21:18 So when you look at some of these, some of these models, some of the GBT models, they couldn't even do 30 minutes reliably. Right. They couldn't do a task that would take a human 30 minutes. They couldn't do it reliably, you know, more than two years ago. 2025, we started to see some huge growth with reason. models with models that had that agenic scaffolding that I talked about, right? So you saw some pretty big jumps from 04 mini from GPT5, right? And at that point, you're getting into the one hour range, the two hour range. Then you saw GPT51 Codex, you know, two and a half hour, almost three
Starting point is 00:22:02 hour range. And then Claude Opus 4.5 thought it was Gemini 3 Pro in the fact that that it went nanobananas and shot straight up the charts and is now nearly at the five-hour mark. So why is that important? Well, you have to look at the rate of growth. And what's happening now is legit the definition of exponential, right? Because the time horizon previously was doubling every seven months on average going back to 2019. So now, It's accelerated a little bit more in late 2024. So it went from seven months doubling to now it was doubling every four months. But now it is nuts, right?
Starting point is 00:22:56 Now it is nuts because as of late 2025, the top AI models are, like I said, well past the one hour mark. But if the current trend continues, if the current trend continues, if the Current trend continues that we saw set with Claude Opus 4.5. That means that AI models, as early as 2027, maybe 2028, we'll be able to do a month of human work and get it correct. Let me repeat that. Maybe as early as next year or 2028. Adobe just introduced an entirely new way to create. create, bringing the power and precision of its creative suite into one conversational experience.
Starting point is 00:23:51 Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative AI studio. Powered by Adobe's creative agent, Firefly AI Assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the Assistant. The Assistant orchestrates multi-step workflows, drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including photo. Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks,
Starting point is 00:24:28 like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adobie.com. AI models are going to be able to complete what would take a human a month. Okay. Yeah.
Starting point is 00:25:03 And that hockey stick curve just hit in 2025. And it's a lagging effect, right? So many of these things are a lagging effect. I am absolutely mind boggled at when I talk to. Smart people, right, all the time. I'm not saying on the show that you all get to hear. I, what's crazy is, you know, I talk to other people outside of the show, right? About AI companies, maybe hire us, you know, maybe I'll jump on a call with people.
Starting point is 00:25:32 And sometimes I'm shocked, right? Smart people, growing companies don't even know, oh, there's a difference with a reasoning model. Or they're using the free version of chat GPT, not even knowing anymore. and they're like, oh, yeah, well, you know, if I ask it a hard question, it'll route me to a thinking model. It's like, nope, Open AI got rid of that. That router doesn't exist anymore. It sends you to a bad model, right? And the majority of people, I think 95% of people are using the free version of chat GPT.
Starting point is 00:26:05 Literally, enterprise companies that are making billions of dollars are using, in some instances, the free version of chat chbt, which is I would not recommend, right? So again, this is just another thing that signals just this huge gap, you know, going back to that Fiji quote from earlier, that tweet that she put out, just what models are capable of today versus what people are using them for, right? They're still using the 2022 version of Chad GPT, not knowing you can go out and legit do hours of human quality work. All right. And that brings me to the last number five. The fifth and I think maybe the most important reason why the pace of AI adoption in 2026 will far exceed, exceed 2025.
Starting point is 00:27:01 And the easiest way to say it is, well, AI is finally doing economically meaningful work in a big way and out of the box. Let me ask you a yes or no question. All right. And answer it silently, live stream audience. You know, go ahead and put it in there.
Starting point is 00:27:20 I'll do a little 10 second delay. All right. Can chat GPT without any, you know, additional features or anything like that? Can it create a PowerPoint? Not the agent mode.
Starting point is 00:27:34 The normal version of chat GPT, yes or no. Can it create a PowerPoint? All right. Give everyone a second here. You at home too. listening in the car, you know, walking your dog, whatever you're doing on the treadmill, on the treadmill with your dog. Question number two, can chat GPT without any additional bells
Starting point is 00:27:54 and whistles, can it create spreadsheets? I mean, these have historically been two things, at least when you look at chat GPT, well, and Google Gemini and not as recently Anthropics Claw. These have kind of been one of the Achilles heels, right? It's like, all right, well, I can get all this great content, not of these large language models, but it puts in a chart and I got to copy and paste it, and I got to go create a spreadsheet, right? Or this just gives me stuff to then I got to go build, you know, an outline or, you know,
Starting point is 00:28:25 gives me an outline, but I still got to go create a PowerPoint. All right. If you were playing at home, yeah, Chad GPT by default can do those things now, right? Previously, it could only do it in the agent mode and it was super slow and it really wasn't good. But the new model, the GVT52 thinking model, can create.
Starting point is 00:28:44 Excel spreadsheets by default. It can create PowerPoints that actually look pretty good with visuals by default. Yeah, I was doing a little bit of testing over the weekend, just some kind of PowerPoint building between Claude 4-5 Opus and GBT 52 with thinking mode. And they're both very good. Sorry, consultants. It's going to be a tough 2026 because that's what consultants do. They go do a bunch of research.
Starting point is 00:29:15 They make some spreadsheets and do PowerPoints. Right. Yeah. But number five, yes, it's not just they can do this. It's not just, oh my gosh, you know, cool, Claude and ChadGBT can make decent PowerPoints and spreadsheets. It's not just that they can actually, on their own, complete economically meaningful work at or above an expert level.
Starting point is 00:29:45 All right. So I've talked about this once or twice on the show before. I should probably do an entire dedicated show on GDP Val. Maybe I'll get the creator of the benchmark on the show at some point this year. All right. So this is a in a benchmark that Open AI created. And I've said this once or twice before. I love when a frontier company makes a benchmark, they publish the results, and they aren't the leader.
Starting point is 00:30:16 And it's one of their competitors, right? Because that's exactly what happened when OpenAI announced GDP Val, right? They were not the best model, right? It was Claude Opus. All right. So here's what GDP Val is. It stands for gross domestic product valued evaluation. So essentially, it measures.
Starting point is 00:30:36 real world economic deliverables instead of just, you know, random puzzles or, you know, multiple choice questions or something off offline data sets, which is what a lot of these benchmarks do, right? You know, essentially benchmarks, uh, to over simplify things, it's either, you know, there's these kind of like ACTs for large language models, these offline, uh, tests. There's these complex, right, kind of like arc AGI. There's these complex like reasoning puzzles, uh, you know, kind of like spotting patterns. You know, those are all cool. benchmarks and all, but what do we use them for? What do we use large language models for? We use them to do economically valuable work, right? Which is why I think this GDP Val is such an
Starting point is 00:31:16 important metric to look at, right? But this is where models, it evaluates models ability to create actual work artifacts, right? Legal briefs, engineering CAD blueprints, nursing care plans, financial spreadsheets, whatever, right? Things that the economy requires. And it's a broad professional scope. So the benchmark consists of more than 1,300 specialized tasks across 44 distinct occupations, right? In the nine sectors that contribute most to the U.S.'s GDP. And then there's blind expert grading. So outputs are graded via blind pairwise comparison by human industry professionals who have an average of 14 years of experience. And then they judge whether the AI work is superior, equal, or inferior to a human expert's attempt.
Starting point is 00:32:10 So it's kind of just like a blind, you know, a blind taste test on real work that all of us do, right? Literally all of us. And well, here's what they found. They found that the model, GPD 52, and I believe they used the thinking version, achieved a 74% win tirade, right? Whereas the model from just a couple months prior, GPT5, that was kind of not well received, right? It got a 38% wind tie rate. So it almost doubled its wind tie rate of creating economically valuable work.
Starting point is 00:32:53 So not only that, obviously, well, it's preferred to the expert human work with a 74% wind tie rate. But it is obviously extremely efficient and it completed those economically valuable tax. 11 times faster at less than 1% of the cost. So if you need, number one, if you need a reason to invest more heavily in large language models, in moving your day-to-day knowledge work tasks inside of a front and large-language model, there's your answer.
Starting point is 00:33:31 It does better than human experts, or at least a wind-tie rate of three-fourths, It is 11 times faster and it costs less than 1%. So if you're still thinking of large language models as that cute, cheeky, chat GPT, right, the fun little chat bot. Oh my gosh, look at this. It wrote a haiku. Oh, cute. Oh, look at this.
Starting point is 00:33:56 It helped me seem less mad in an email. No. Large language models, if you know what you're doing, they instantly connect. to your business data. They can think like experts. They can use tools that experts sometimes might not even know how to use. They can work for hours on end and they're doing work better than human experts. That is more economically valuable.
Starting point is 00:34:31 So there you go. Buckle up. 2026 is going to be wild. So that is a quick. recap on the five reasons why. So I hope this was helpful. If it was, please tell someone about it. It's great that you listen. I love it. Appreciate your support. But don't keep this thing your own secret in 2026. Right. I can only keep doing this thing for another 700 episodes if you tell someone about it. So please repost this on LinkedIn. If you find it helpful, if you're listening on the podcast,
Starting point is 00:35:06 please leave us a rating, you know, follow the show. I'd really appreciate that and keep an eye later this week, maybe tomorrow, maybe Thursday. We'll see we're going to be launching the AI Inner Circle. It is a free community. We have a lot of special things planned aside from our, you know, updated prompt engineering course, which I just finished recording like 72 hours ago. it is hot, fresh off the presses. So make sure to keep your eyes and ears peeled.
Starting point is 00:35:41 And you got to do that at our newsletter at your EverydayaI.com. So thanks for tuning in. Hope to see you back tomorrow and every day for more Everyday AI. Thanks y'all. Meet Firefly AI Assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest. Orchestrating multi-step workflows across Adobe Creative Cloud apps,
Starting point is 00:36:08 including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going.
Starting point is 00:36:40 For a little more AI magic, visit Your EverydayAI.com. and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.