Everyday AI Podcast – An AI and ChatGPT Podcast - OpenAI's Code Red: Is Google taking ChatGPT's Crown?

Episode Date: December 9, 2025

OpenAI is (reportedly) in full panic mode. 🚨All hands on deck, Code Red status. So.... what happened? How did OpenAI go from defining the AI category to getting beat by competitors they once trounc...ed? And, is it too late for them to turn it around? Or will Google permanently take the AI crown? Tune in... we've got hot takes. OpenAI's Code Red: Is Google taking ChatGPT's Crown? An Everyday AI Chat with Jordan WilsonNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion:Thoughts on this? Join the convo and connect with other AI leaders on LinkedIn.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:OpenAI "Code Red" Panic ExplainedGoogle Gemini 3 vs ChatGPT BenchmarksNovember AI Model Release ImpactAnthropic Claude Opus 4.5 Coding LeaderOpenAI Financial Strain & Revenue AnalysisMarket Share Shift: ChatGPT vs GeminiChatGPT Default Model Quality CritiqueEnterprise AI Adoption: OpenAI vs AnthropicOpenAI Ads Controversy and User BacklashApp Stickiness and Query Length DataTimestamps:00:00 "Everyday AI for Business Leaders"07:23 AI Leadership Questioned09:28 "Before and After ChatGPT"12:36 "OpenAI's Shift and Competition"15:58 OpenAI's Massive Infrastructure Growth21:13 "Google Surges, Challenges OpenAI"23:53 "Cloud Ovis 4.5 Review"28:45 OpenAI Criticized Over Ad Controversy31:36 "ChatGPT's Default Model Critique"33:58 "Optimize Your Chat Model Usage"35:49 "Choosing GPT-5 Over Gemini 3"40:20 AI Narrative Overblown in Tech43:57 "AI Tools and Code Red"45:24 "Build Apps with AI Studio"Keywords:OpenAI, code red, ChatGPT, Google Gemini, Gemini 3, AI race, large language models, Anthropic, Claude Opus 4.5, AI leadership, market share, AI benchmarks, LLM development, GPT-5.1, thinking models, default chat model, model performance, user growth, AI chatbots, reasoning multimodality, enterprise AI adoption, CapEx spend, custom AI chips, Broadcom, Nvidia,Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. According to reports, open AI has sounded the alarm.
Starting point is 00:00:51 That's because there's some increased competition from Google and even some surprising efforts from other AI players. And that's left Open AI seemingly starting to lose its footing as the clear frontrunner in AI. And they've officially now called a code red, like all hands on deck. code word for kind of panic mode. And they're reportedly hitting a pause on some of the chat GPT bells and whistles that many of us love.
Starting point is 00:01:20 And they're trying to get back to square one, making better models, the area that Google has taken their lunch money on in 2025. So can open AI even pause? I mean, they have more deals right now than going out of business, bed bath and beyond on Black Friday. And will a code red pause even work? After all, Open AI has essentially started a handful of other businesses, ventures, and verticals that they weren't even involved with a year ago.
Starting point is 00:01:54 So on today's show, we'll answer if Open AI is still the king of the AI Hill or if this code red situation going on right now means that they've maybe lost the crown to Google. All right. I'm excited to get into it. I hope you are too. Welcome to Everyday AI. What's going on, y'all? My name's Jordan Wilson, and this is for you.
Starting point is 00:02:14 This is your daily live stream podcast and free daily newsletter, helping everyday business leaders like you and me. Now just keep up with all the AI news and developments. Yeah, you can tell by our opener there, there's a ton, but how we can make sense of them and make sound decisions to grow our companies and our careers. So if you're joining me on that journey, it starts here with the unedited, unscripted, live stream podcast. But if you want to take it on the next level, make sure you go to our website.
Starting point is 00:02:37 That's your cheat code. It is your everyday AI.com. We're going to be in our free daily newsletter, which you can sign up for there. We're going to be recapping the highlights from today's show, as well as bringing you all of the other AI news that you need to know. All right. Now let's get into it. Here's what you need to know about the new reports that say open AI is kind of in panic mode.
Starting point is 00:02:58 They're hitting that siren. It is code red. They are losing ground. So on today's show, here's what we're going to dive into. We're going to investigate the internal code. red panic inside of Open AI following a brutal November. We're going to break down the data showing how Google and even Anthropic have overtaken chat GPT in some key areas and answer the ultimate question.
Starting point is 00:03:22 Is chat TBT's default era over? And if so, who owns the future? I'm not going to make you wait. I'm going to go ahead and answer at least one of those questions right now. Is Google taking chat GPT's crown? Well, kind of. of, but also no, not at all. And here's the explanation in that. I think that over the past three years, right, we are now more than three years into the chat GPT era, which has obviously come to define
Starting point is 00:03:53 AI. And I think that right there is enough to say that no, chat GPT is not going to be losing its crown, at least when it comes to mind share, when it comes to just the sheer number of people that they have using and relying on the platform every day, more on that later of the show. Because right now, there's more players, there's more use cases, there's more verticals, so there's more hills to be a king of, right? Whereas, you know, even two years ago, it wasn't really like this, right? It was AI chatbots, but now we have, you know, agentic models by default. We have all of this scaffolding, all of these bells and whistle.
Starting point is 00:04:37 that opening eye is apparently going to hit pause on, right? We have agents. We have business data that is synced dynamically right into these front end large language models, right? Bringing rag of 2023 to our fingertips in a matter of minutes. So the space has changed so drastically over the past three years since Chad GBT kind of introduced us to the space, which is ironic since Google actually invented the transformer and had a kind of technically a working version of something like a chat gbt public before chat gbt even did
Starting point is 00:05:13 so while i do think google may permanently take the lead in large language model development and i'd say in 2025 they've been the winner hands down i don't think open ai has much to worry about when it comes to market share well at least not until we got all this code red uh talk from open a i so here's what's going on if you missed any of this in a nutshell. Like, well, why? Why are they doing this code red? What's it all about? So according to some reporting from both the information in the verge and others, OpenAI CEO Sam Altman reportedly issued a company-wide code red on December 2nd, so in the middle of last week, pausing some other big initiatives, such as ads and some other projects. And this kind of internal emergency came after Google's Gemini 3 Pro,
Starting point is 00:06:06 launched on November 18th and out of, you know, 20 pretty widely accepted benchmarks as industry standards. Yeah, Google Gemini 3 overtook GPT 5.1 on all of them except one. So almost a clean sweep for Gemini, Gemini's new model, Gemini 3 Pro. And follow that up. Yeah, code red. A lot of things happening at once. You also had Claude, Anthropics Claude Opus 4.5 come out. the week after Gemini 3 Pro.
Starting point is 00:06:39 And that became kind of the first model to score over in 80% on some real world, real world coding benchmarks. Right. So even before all of this, and I'm going to get into some of the discrepancies and how we use chat chvety and what model you're selecting, right? But if I'm being honest, the default model of chat chpT, not really good. All right. And so they.
Starting point is 00:07:05 they just got hit with big release after big release and that has reportedly caused Open AI to hit that big panic button. I'm wondering if in these big companies, is there actually a button? Can someone tell me, right? Friends at Big Tech, let me know. That would be great if there was. It would be so cool. But I just imagine, right? Like someone just sweating in the back and being like, yeah, got to pop up the button.
Starting point is 00:07:29 Time to hit it. But, you know, this is for the first time I'd say since, chat GPT came out at the end of November 22, I say it's the first time that their position has been questioned as the overall AI leader. I would say even, you know, mid-November, early November of this year, so five weeks ago, I'd say if you looked at everything across the board, you'd say, nope, open AI is still comfortably in the lead. I don't think you can really say that anymore.
Starting point is 00:08:02 And that's maybe what's caused. Open AI to get back to the basics here. So here's what's important. All right. So this is not the first code red because what's interestingly, what's interesting enough is Google was actually the first company to kind of put out a code red when it came to AI. So a lot of people, well, most people know Google invented the transformer, the underlying technology that ultimately led to chat gbt and other large language models.
Starting point is 00:08:39 But people don't even know. Google had a working and live version of a AI chatbot, essentially an early preview version of Lambda in their AI kitchen well before there was a chat GPT. But it was extremely limited. They didn't put a lot of resources behind it. It was a research preview, right? It was essentially for geeks, you know. Kind of like me.
Starting point is 00:09:07 So when this came, right, it was bad news for Google. I think Google thought that maybe, you know, this technology going mainstream was still maybe years away, right? They were pioneers in the technology. But the chat GPT moment, right? And that's what it has become. It is a line in the technological sand and in the, I think overall they will, when we look back at it, it's going to be kind of like an AC, you know, BC kind of or BC AC kind of situation, right?
Starting point is 00:09:45 When we look at how businesses operate, just kind of there's like before internet, after internet. I think it's gonna be before Chad Chubt, after chat ChbT or before large language models and after large language models. And it was that chat Chd-chipt moment that caused that. And that did cause Google to go into what we're talking about now,
Starting point is 00:10:02 which is their code red moat. And although I think that their initial response was not the best, right, looking back at the early Bard days, their kind of snafu rollout in December of 2023. I mean, since December of 2024, so about a year ago, I think hands down, Google has been the leader in the AI space, right? I've always said it's one A, one B, depending on how you look at it. But I do think, if you look at all things considered, Google has made up more ground. Open AI was way, way far ahead in November of 2024, you know, versus Google. And now they're neck and neck. So Google has grown more over the past 12 or 13 months. And I don't even think that's close. So now it's the other
Starting point is 00:10:55 side of the coin. Now it is Open AI calling the code red after Gemini 3's launch and some other things. And like I said, this is the first time that I think public perception and even the markets, right, have said that maybe Google's the leader, right? So Wells Fargo now prices Google stock as the AI race leader for the first time since 2016. So here's a little bit more on the details of this code read. So this reportedly went into an internal memo and it said, yo, we got to stop with the bells, the whistles, the features, and we got to get back to core model development. So this directive cited temporary economic headwinds from Google's benchmark topping releases
Starting point is 00:11:40 and also might lead to a new open AI model getting released as early as CheckWatch today. So a report from the verge said we might get a version of Chad GPT, maybe called GBT 5.2, as early. as today. I've also kind of heard or seen it could be Thursday, right? We've seen the dates float around, but it could be as early as literally today. So if you are listening to this podcast, it's a live stream podcast, 7.30 a.m. Central Standard Time in Chicago, right? But maybe you're listening to this on your lunch break or your commute on the way home from work. There could already be a model out, right? We'll obviously have that news when it does break in our newsletter. But this essentially, all of this news accelerated.
Starting point is 00:12:28 the launch of this new model because we had heard previously that it might be released in January. And then we heard, oh, maybe the second or third week of December. And now all of a sudden, it might be December 9th. So how the heck did this happen? Right? How did Open AI go from, from being honest, not really having any true competitors for the first 23 and a half months? and having an almost unfair advantage, right, when it came to market share, when it came to features,
Starting point is 00:13:05 when it came to UI, UX, when it came to just branding. I mean, you talk to anyone, you say something about AI and, you know, everyone's like, you mean chat, GBT, right? They, they beat everyone to the punch and had an insurmountable lead. Yet here we are being like, are they losing it? Have they lost it? And in two years, are they even going to be in the conversation of the 1A, 1B? Well, here's one reason why I think we're in this position.
Starting point is 00:13:35 Open AI's focus for better or worse is much wider than it was a year ago, right? You could make an argument, depending on how you categorize these things. You can make an argument now that Open AI has five, you know, or Open AI is five different businesses or has five very unrelated or not very much related. departments. So as an example, right, you have your frontier large language models, right? That's your, this year, GPD5 series, the actual models being built. Right. Then you have your consumer ecosystem, right? Because some of these players out there, they just make models. And that's it. Right. They put them out on the API or whatever, but they're not building an ecosystem. Obviously, with 800 million weekly active users, Open AI is actively investing in a very vibrant
Starting point is 00:14:26 ecosystem and that comes with it a lot of challenges a lot of those bells and whistles a lot of these things that take time right they have a browser uh you know their atlas browser they have the um you know obviously the chat gbt app across uh web mobile desktop soar up with video gpt image right they have all these things then they have their enterprise and developer platforms so the business and enterprise tiers of chat chbti uh the API side the real time agents SDK etc right and then they have their new kind of in upcoming hardware division or ambitions. We'll see, right? So custom AI chips, right? They're getting in the chip game like in video, like Google with Google's TPUs. Right. So it was reported that they're going to be working with Broadcom to develop an in-house chip that they will
Starting point is 00:15:17 use and probably be selling or renting out to others. And then also on the hardware side, We've seen a lot of reports recently that we cover here on our Monday's Monday new shows. Open AI has been posting like everyone from Apple's hardware division, but it seems like Open AI is going to be making a serious play in the AI hardware space. We don't know what the device is going to be. We've seen reports. It's not going to be a phone. It's not going to be, you know, earbuds or glasses. It's going to be something else.
Starting point is 00:15:48 But they're building in AI hardware division and they're getting real big players to go with. And then last but not least, they're an infrastructure company now. They are. They have more than a trillion dollars of third party infrastructure commitments, right? Building like literally the Stargate deal with Oracle and others. I mean, they're building just huge, huge data centers. The CAPEX spend is enormous, right? $100 billion plus already committed with an Invidia deal slash investment.
Starting point is 00:16:22 people calling circular funding or vendor financing, whatever you want to call it. Right. Those are areas that Open AI was not even playing it for the most part a year ago, right? Aside from maybe the consumer ecosystem and a little bit on the developer platform, but at least half of these things didn't really exist a year ago. So maybe this is the give and take that a company must make in order to grow at the rate at which they, want to grow. Sometimes you have to maybe throw expensive spaghetti at the wall and see what's going to stick. But this split focus has led to reported financial strain. So according to reports from the Wall Street Journal and Fortune, Open AI projects a $22 billion spend versus a $13 billion
Starting point is 00:17:16 in revenue in 2025, although Sam Altman did say that that number in revenue is going to be closer to 20 billion or a little above yet it could anywhere be from you know a couple billion dollars loss to up to nine billion dollars loss this year and there have been estimates right and this estimate was wild uh this h this hsbc estimate that said uh open i may be meeting a two hundred seven billion dollar funding shortfall between now in 2030 uh wow so is this just a case of open AI being spread too thin. Is this an AI startup that, you know, maybe should have just been an AI chatbot company and they wanted to take over the world and maybe they shouldn't have.
Starting point is 00:17:59 And now they're losing their, you know, luster. They're maybe relinquishing what they were known for. Or is it something more than that? All right. We're to find out in a minute. Quick break. Word from our sponsors. This podcast is sponsored by Google.
Starting point is 00:18:17 Hey, folks. I'm Amar, product and design lead at Google. Google DeepMind. Have you ever wanted to build an app for yourself, your friends, or finally launched that side project you've been dreaming about? Now you can bring any idea to life, no coding background required, with Gemini 3 in Google AI Studio. It's called vibe coding and we're making it dead simple. Just describe your app and Gemini will wire up the right models for you so you can focus on your creative vision. Head to AI.studio slash build to create your first app. So I would say part of this slipping from the top of the AI mountain for Open AI is because of its split focus, but the timing of everything else has added to it undoubtedly.
Starting point is 00:18:59 Right. So November has been a pile up of new model releases. One thing people don't talk about, well, Open AI rolled out their own very powerful model in mid-November GPT5.1 pro, right? But very few people have access to that because you have to be on the $200 a month. plan. But then, obviously, the big news here came on November 18th when Google launched Gemini 3 Pro, calling it their most intelligent model ever, and then the next week, Anthropic, followed up with Claude Opus 4.5, really finding its space, targeting coding agents, and computer use. So let's quickly just look at Gemini 3's reception, and it was good, right?
Starting point is 00:19:42 people went nutty and rightfully so I'd say when Gemini 3 came out Gemini 3 Pro it shattered every benchmark right you know getting scores on benchmarks that you know maybe a year ago we thought would not be possible for multiple years you know one of them scoring a 37.5 on humanity's last exam one of the newer kind of you know data sets or benchmarks since a lot of the previous ones like MM MMU, MMLU, a lot of these previous benchmarks have kind of been saturated, right? Because all the models are, you know, scoring off the charts or, you know, close to a hundred on some of these.
Starting point is 00:20:26 So it's like, all right, some of them, some of them are saturated, but one that is not yet is humanity's last exam and Gemini just, a Gemini three leapfrogged in front of everyone. And then just their sheer user growth all the way up to 650 million monthly active users, right, versus Open AI's 800 million weekly active users still. I think that's grown from 400 million just a couple of months ago. So Gemini is on fire and Google is on fire in terms of user growth. And pretty good anecdote we shared here, this quote, I think the day it came out or the day after it came out, Salesforce CEO Mark Benioff publicly switched and put out a tweet.
Starting point is 00:21:08 that's like he said he's been using chat chad chvety every year spent a little bit of time with jemini three and he's like i'm not going back and i think that was a sentiment uh kind of echoed by a lot of people uh right it was that big of a leap not just that big of a leap but i think that uh kind of the default chat mode inside of chat chbt has been stalled again more on that here in a minute so google has significantly even into Open AI's market share. So this is a graph here for our live stream audience. You can see and podcast audience, if you ever wanted to know, I never have anything overly visual.
Starting point is 00:21:50 But if you want the video version, I don't know. You can take screenshots of me making weird faces. You know, for the most part, you don't need a lot of video. But you can always see that on our website at your everyday AI.com. But you will see here, this is according to similar web data, which is extremely accurate. looking at the market share over time, chat chip V-D versus Google Gemini. So a year ago, OpenAI had 87% of the market share now 71%. So a huge drop-off.
Starting point is 00:22:21 Whereas conversely, Google Gemini has been growing faster than anyone else. So a year ago, they only had 5.7% market share. And they've nearly tripled that now at 15%. in the same period of time, you know, maybe, you know, some of the other players have maybe grown half percent, one percent, right? Yeah, Google Gemini tripled. And most of that, right, came from Chad Chibati's lunch. So when I say that they're eating Chatshapiti's lunch on the model side and a little bit eating into their big pie, you know, the numbers support that. And also, the sleeping beauty herself, Anthropic finally.
Starting point is 00:23:06 woke up from its summer slumber and decided to ship something and buy something an extremely impressive model. Yeah, we gave it the live run through here on the show a couple of weeks ago. It failed some of our live test, but that's why I always tell people, right? You have to put it through your own paces, right? I think a lot of times when I'm doing tests on models, it's sometimes when the servers are overloaded, things get buggy. But Anthropics, Claude Obis 4.5, extremely impressive, right, especially on any software development coding benchmarks. So the first model to ever get more than an 80% on the suite bench verified, which is the industry's kind of, you know, number one real world, real world coding benchmark. Right.
Starting point is 00:23:57 Now, probably do a show at some point, maybe later this year or early next year, just really talking about why I think coding is more. important from a benchmark perspective and from a company perspective than most people realize, right? And it's a little deep, right? It gets into AGI and some other things, but I'll probably talk about that. So not only that, but Anthropic finally, right? And I said this on the show a year ago, I said Anthropic is not going to survive as a company unless they reduce their prices to be competitive. Their API prices have never been competitive. And obviously if you're on a paid plan using Claude on the front end, you can't use it. It stinks, right? The limits are so low, but they finally cut their prices by 67%. So again, when I said November was kind of a tough month
Starting point is 00:24:46 for Open AI, well, it was because its competitors came out and they came to play. They woke up for no reason and chose absolute violence. So speaking of Anthropic, the IPO rumors did not help Open AIs cause either. So this was last week, just days before the reported Open AI code read some rumors on anthropic, you know, laying the groundwork for an initial public offering. So this is from the Wall Street Journal. I'm going to read an excerpt here. So it says, excuse me, yeah, still still fighting this little bug, this little throat thing here. Got my medicine, my Topo Chico. So it says Anthropic, which has a growing number of business users because of the capabilities of its quad-chop chatbot encoding in other arenas, expects to break even for the first time in
Starting point is 00:25:46 2008. The documents show. By contrast, OpenAI forecasts its operating losses that year to swell to about 74 billion or roughly three-fourths of revenue thanks to ballooning spend on computing costs. The chat chip-tipt maker also expects to burn through roughly four. 14 times as much cash as Anthropic before turning a profit in 2030. All right. In layman's terms, here's what that means.
Starting point is 00:26:14 And I've been saying this all along, right, that I know that Anthropic is much more profitable per customer. And I think that they're playing two very different games, right? Anthropic is never going to catch Open AI when it comes to users, right? But what they are likely crushing Open AI on is revenue per user. You know, even CEO Sam Altman for Open AI has said that on many plans, they lose money by default, right? Yet this reporting is showing that Anthropic is expected to be profitable by 2028, which is pretty early because a lot of earlier estimates said best case, open AI might not turn a profit until 2030, right, which is normal. A lot of people look at this and, you know, they, they put on their, you know, AI is a bubble. Oh my gosh.
Starting point is 00:27:06 You know, Chad GPD is so popular. Yet they're, you know, losing billions of dollars. That's how startups work, right? I've given long, long receipts on this show, you know, companies that were in the red for 10 years or longer in are now the biggest and most profitable companies in the world. That's how it works, right? If you're going to be a world-changing company, you start as a startup. You take on debt in. for the most part, you're probably going to be burning through billions of dollars for five to 15 years.
Starting point is 00:27:35 That's how it goes. And hopefully then you make it out on the other side as, you know, a Fortune 100, Fortune 50 company. And a lot of people don't realize by the path right now. Open AI is the largest private company in the world. And if they were a public company today based on their evaluation, they would be a Fortune 20 or a top 20 company in the U.S. Right. So they are no small beans. and it is relatively normal to be burning through cash.
Starting point is 00:28:02 However, when compared to Anthropic, this might be another thing, right? The Gemini 3 release the Opus 4.5. And then the news, right, that Open AI or sorry, Anthropic is seemingly being looked at a little bit more favorably on the markets and by potential investors than Open AI is. So speaking of the public narrative punching Open AI this week, there's another thing that happened in the last couple of days right before the code read. There's this kind of little, we'll call it an ad controversy, right? I don't know. I chuckle at this one. But it caused Open AI a lot of bad press.
Starting point is 00:28:46 And according to reports, ads is one avenue that they're going to pause on. So essentially, you had a lot of people complaining on Twitter, right? And they're like, I'm a paid subscriber, right? I'm paying $20. I'm paying $200 a month. And look at this. I'm getting an ad. It was a narrative that maybe opening I, and I said so myself, probably the UI,
Starting point is 00:29:09 ux of this was not the best. It was actually an in-app nudge. People might argue, right? It's not technically an ad, even if it might look like an ad and you could be construed as an ad, right? A lot of this was nudges. usually unrelated to what you were searching for, right? So usually an ad is something related to what you are searching for, right?
Starting point is 00:29:32 So if you're, you know, chatting with chat GPT about the latest, I don't know, Bears game or doing some deep research on something and you get a little nudge for installing the Peloton app on chat GPT, right? So it's not sending you to other websites or pushing products. It's trying to get you to install chat GPT apps, right? a big play that they're trying to do there as well. So these are technically in-app nudges to try to get you to use chat GPT apps. People lost their mind and said, oh my gosh, I'm paying money and here's these ads, right?
Starting point is 00:30:07 So opening I did disable the feature two days ago after chief research officer Mark Chen admitted on Twitter that quote unquote, we fell short and then they disabled the feature. So more or less, open AI didn't even put ads out there and everyone lost their noodles right and there's all these you know articles that came out in the press about how this was just a bad look so it's been a bad uh tough tough couple of weeks especially uh the last maybe two weeks for open a i that has maybe led to this code red but ready saving the hot take tuesday the hot take for near the end of the show here I think here's the secret.
Starting point is 00:30:54 Here's the key. Here's what no one's talking about. No one. Ready? This is one of those small little nuggets that, you know, if you are sitting around dorkin, dorkin with your friends, your colleagues, you know, if you're on an AI team, you know, at your work, here's one thing you can pull out of your hat and everyone's going to look at you and be like, wow, you know what you're talking about. I think one of the reasons why Chachipti is losing its ground.
Starting point is 00:31:21 is because the default chat model is bad. Right? And Sam Altman said in August that previously only 1% of free users used a thinking model. And only 7% of paid users used a thinking model. Those numbers have increased, right? I believe to now up to 24% of paid users. But still, the overwhelming. majority of everyone using chat gbt is not using paid models or sorry not using thinking models i would never
Starting point is 00:32:04 and my wife always gives me a hard time uh right because she's like hey can you you know chat gbt this quick and i'm like yeah i can't uh right and then two minutes later she's like okay what's the answer and i'm like you got to give it time right i mean i got to use the right otherwise just going to you know Google's AI mode, right? If you want the juice, you got to wait. I would never, I would never use the default chat GPD model for absolutely anything. And I, you know, might lose a couple friends at Open AI for saying that, but I don't think you should either because it's not good.
Starting point is 00:32:38 And the stats show that. All right. So let's look at LM Arena. Talk about, unfortunately, Open AI losing some of their ground, even their best model, their high thinking model. Right now is. not a top five model. All right.
Starting point is 00:32:54 So it is at number six on LM Arena. LM Arena is essentially blind taste test for large language models. So it's not a top five model, which is crazy. It's crazy to say because essentially all the way through the end of 20, 24, Open AI was always the top model on LM Arena. And it was usually never even close. All right. So now they've lost that.
Starting point is 00:33:20 But here's the thing. 93% of you users and probably you if you're listening, sorry, I'm calling you out. And I need to call you out because if you're listening to this podcast still after 32 minutes, I want you to get better results. I want you to know. Here's the reality. If you are using the default chat mode, right, the instant or sometimes if you're using auto and it doesn't just, it doesn't route you to a thinking model.
Starting point is 00:33:46 You're not using a good model. You're using a poor model. Right. So GBT5.1 is the 15th highest ranked model on LM Arena. And the previous version, GBT5, chat 21st, right? It's not just LM Arena. If we look at artificial analysis, a great resource, third party benchmarking. It's an aggregate of all these different benchmarks, right? You'll see GPD 51 high right there tied for second. with Claude Opus 4.5, but behind Gemini 3 Pro, okay, so tied for second, GPD 51, high, good model, right? Default model, not even in top 20. Oof, that's one of the reasons. That's one of the reasons why Gemini and other people are taking the crowd, right? And that's why I think, right, my Twitter feed is just,
Starting point is 00:34:49 literally just people talking about AI. And I've seen story after story people just being like, it seems like Chachapit is not as good as it was before. And it's like, yes, stop using the default model. Use the thinking mode, right? That's why sometimes you got to use multiple tools. Because, yeah, if I want something fast, I might just use Gemini 3 or use Google AI mode. Right.
Starting point is 00:35:11 But sometimes if I want something done the right way specifically, and if I want to see in the chain of thought, come on Google. please add this in the chain of thought if i want to see where the model is deciding agentically to go and to research right i can see that in gbt5 pro or gbt5 thinking uh right i can't see that um unfortunately in jemini three pro so you shouldn't be using the default model ever until they improve it right and maybe by this time that i'm done yapping um on this podcast the gbt52 will be out or you know maybe later this week and maybe it'll be a different story. Maybe it'll be a good base model.
Starting point is 00:35:52 But that's one of the reasons why I think that so many people have just been like, oh, chat ChPD is not very good anymore or why they're losing so much market share. And it hasn't really been talked about, well, the default chat model that reportedly before 93% of even paid users were using. And now about 76% of paid users are still used. using that model. And it's not good, right? I'm sorry. When I say it stinks, it's obviously world-changing model, right? You know, four years ago, if we ever would thought we would have something like this, but comparatively, it's not good, right? You don't want to be handing off
Starting point is 00:36:33 important business tasks or anything, you know, that matters to you to a model that's not even a top 20 model. Not a good call. You shouldn't be doing it. So starts to wrap up here. Is this Is the code red warning, a sign that ChachapD has officially lost the AI crown? No, it's not. It's not. Here's why. ChatTBD still dominates. Like I said, 800 million weekly active users.
Starting point is 00:37:03 Also, OpenAI just came out with a new kind of state of AI in the Enterprise report yesterday that we shared about in our newsletter. That's why you got to read the newsletter, right? We don't miss anything. It shows that their user, their user, their user, message volume has increased eightfold over the past year, which is insane. Also, the new Ramp AI index shows that 36% of U.S. businesses use ChadGBT Enterprise versus only 14% for Anthropics. So yeah, I still don't think from a market share perspective, it's still Chad Chabit. And that's going to take a while, right? Google's doing a lot
Starting point is 00:37:45 to claw into that. But also it's all these, I think, the benchmark headlines and the social media drama have probably outpaced the actual real world usage shifts. And so far, it's just Google that's actually getting close in that department.
Starting point is 00:38:02 So no, Chad GPD hasn't lost its major crown. Although I think there's more crowns and maybe, you know, Chad GPD had five of them, you know, a year ago or 13 months ago, they don't anymore. the new reality, I think there's three companies and they now hold three very different crowns. So, Anthropic leads in coding and agenic models for now, right?
Starting point is 00:38:25 This could obviously change tomorrow next week, next month. I think Google leads in reasoning, multi-modality and distribution, right? They have an unfair advantage, right, with Google AI mode, but also rolling out Gemini in search, Android, cloud, everywhere else. I mean, Google Gemini is everywhere, right? I sneeze and I think Google Gemini is in my Kleenex. Right. My wife just screenshot of me. It's like, oh my gosh, like Google Gemini's in, you know, Google Maps. It's everywhere, right? The distribution advantage that they have is unfair. But still, open AI, even after all that, they still lead in consumer mindshare. The word AI is still synonymous with chat chabit. And I don't think that is going to change anytime soon. Right. And the only time, the only reason we're having this conversation, right, It's because if you're listening to this myself, right, we are in a bubble, right? I'm not saying, though, the AI bubble.
Starting point is 00:39:23 It's going to burst, the dot com. No, I'm saying we are in an AI bubble, right? If you follow this frequently, if you're using it all day, every day, if you're passionate, if you're an AI enthusiast, you're probably paying way closer attention to this than 99% of the world. But luckily, I kind of live in Chicago, not kind of live in Chicago. I live in Chicago, right? But what I'm saying is I'm kind of away from the big kind of ongoing narrative.
Starting point is 00:39:49 And I think that the narrative in Silicon Valley in San Francisco right out in California, it gets blown up a little bit. So because it just seems like it's, you know, life and death, everything is AI. And yes, it's hard to succeed in business or even it's going to become harder to have a personal life that's thriving without using AI. because it is literally everywhere. You can't use, you know, really any app without using AI right now. But I think it's just this narrative has been a little bit overblown. But here's the other thing.
Starting point is 00:40:26 Hot take to end it here. Open AI is not going anywhere, right? Yeah, the market share, but let me share some more stats here from a similar web. So they put out a recent stat on app stickiness, right? So even though OpenAI has lost some of their market share to Google Gemini, stickiness or the daily reliance that an app has, Chad GPD has a 33% stickiness rate. All right.
Starting point is 00:41:01 And most other large language models, perplexity, you know, co-pilot, Gemini, you know, Geminiis is actually pretty low at 5%, GROC, 14%, clawed 16%. What does this mean? The people that are using chat chbtee are using it way more frequently than any other large language model. And it's not even close. It is stickier. Right. And what does that mean? And why am I bringing this up at the end? Ads. Yeah, I know people are going to hate it. But another stat. All right, from a similar web. So it showed the average query length. And I think this is important. Right. The average. average Google query length is 3.4.
Starting point is 00:41:45 3.4 words. The average AI mode, query length inside Google's AI mode, is 10.4 words. In chat GPT, it is 60 words. So what does this mean and why does this have anything to do with chat GPT, not losing its crown? Ads are coming. Yes, they reportedly got paused with this code read,
Starting point is 00:42:07 although right before that, the week before this, we did see instances or, or references to Chad GTPT ads in an Android chat Chbett app leak. But ads are coming, right? And yeah, they may not be coming now until the first quarter of 2026. But these last two charts show you why Open AI is in no trouble. The platform is stickier than anything else, and it's not even close. And the context that users share with Chad GPDT is so much.
Starting point is 00:42:40 it is much more personalized than anything else. All right. So that's a wrap, y'all. But let me just say this. It's probably a good thing that Open AI has this code read, right? Because like I said, one of the underlying issues that no one is talking about is the default model that so many people are using day to day. Maybe you've been using it for your business. Maybe you've been using it to launch a side project.
Starting point is 00:43:11 just using it in your personal life. I don't know. And maybe you've noticed like a lot of people, hey, it seems like this Chad GBT is not as good as it once was. Or, oh my gosh, like, you know, Salesforce CEO Mark, Mark Benioff when he, you know, is using chat GVT every day. And, you know, yeah, maybe there's some recency bias there since GVT5 launch in August, not very good. GVD51, not very good, especially when you're using the default chat model. But then when you go use the default chat model of Google Gemini, you're getting Gemini 2 point or you're getting Gemini 3 pro right so there's a big drop off there so it is probably for opening eyes best interest to go through this code red to hit a pause even though i'd love to see
Starting point is 00:43:55 you know atlas get you know developed more and faster and in pulse and even ads i i maybe i'm the only one i can't wait for chat chb t ads right but maybe it's a good thing because maybe Open AI is going to catch back up in the model game, the game that they technically created and dominated. But for now, Google's kind of taking the crown. But if I'm Open AI, I'm not sweating. All right. I hope today's show was helpful and maybe a little bit entertaining, bringing you Hot Take Tuesday. Thank you for tuning in. If you miss anything, don't worry, it's all going to be recapped on our website in today's daily newsletter at your everydaya.com. Thanks for tuning in. Hope to see you back tomorrow and every day for more.
Starting point is 00:44:39 everyday AI. Thanks y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us.
Starting point is 00:45:26 If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.