Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 754: Anthropic’s ‘scary’ new model, Microsoft Copilot’s ‘Code Red,’ OpenAI’s Superintelligence New Deal and more
Episode Date: April 13, 2026

Should we tax AI robots and only work 4 days? 🤖 OpenAI thinks so. Speaking of OpenAI, you're gonna wanna learn Codex with the changes coming this week. Speaking of changes... did you see Meta's new model? It's actually REALLY good. Oh, and Anthropic created a model they say is so scary, they can't release it. (Just another week in AI news, apparently.) If you missed anything, we'll get you caught up.

Anthropic’s ‘scary’ new model, Microsoft Copilot’s ‘Code Red,’ OpenAI’s Superintelligence New Deal and more -- An Everyday AI chat with Jordan Wilson

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:
Anthropic’s Claude Desktop Power User Redesign
OpenAI Codex Super App Expansion Leaks
Anthropic Managed Agents Beta Launch Details
OpenAI’s Superintelligence New Deal Policy
Meta Muse Spark Model Benchmark Results
Microsoft Copilot Code Red Performance Push
OpenAI CEO Sam Altman Targeted Incidents
Anthropic Mythos Cybersecurity Model Withheld
ZAI GLM 5.1 Open Source Model Outbench
Alibaba Happy Horse One Video Model Ranking

Timestamps:
00:00 Claude interface redesign
04:24 OpenAI consolidating tools into Codex
07:21 Using Anthropic's platform basics
10:32 OpenAI's superintelligence policy proposal
13:16 AI's impact on jobs
18:04 Discussing LM Arena rankings
20:42 Early challenges with Microsoft Copilot
25:34 Altman addresses AI challenges
29:18 AI's impact on jobs and cybersecurity
30:46 Project Glasswing cybersecurity initiative
36:29 AI software, smart glasses, and subscriptions
38:35 AI industry updates and projections
41:00 Weekly content schedule overview

Keywords: Anthropic, Claude managed agents, scary new AI model, OpenAI, Sam Altman, Mic

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info.)

Start Here ▶️ Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com. Also, here's a link to the entire series on a Spotify playlist.
Transcript
This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips.
Listen daily for practical advice to boost your career, business, and everyday life.
Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio.
Just describe what you want to create and the assistant handles the rest,
orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface.
You direct the outcome.
The assistant accelerates execution.
Anthropic has a new scary model.
Very scary things are happening to OpenAI CEO Sam Altman.
Microsoft is reportedly panicking about Copilot's performance.
And somehow Meta's newest AI model is crushing it.
Yet that's not even half of what's moving and important in the AI world right now.
That's because OpenAI and Anthropic
both confirmed big desktop releases this week.
Oh, and OpenAI wants to tax robots and let humans only work four days.
Yay?
All right.
If you missed any of that or everything else that we're going to go over today,
don't worry.
That's what we're here for.
This is the AI News that Matters.
If you're new here, welcome to Everyday AI.
We do this, well, every day.
It's your daily live stream podcast and free daily newsletter,
helping everyday business leaders like you and me,
not just keep up with everything that's happening in the AI world,
but how to make sense of it and to get ahead and grow your company and career.
So if that's what you're trying to do,
awesome.
It starts here with the unedited,
unscripted live stream podcast,
but to be the smartest person in AI at your company,
make sure to go to our website at YourEverydayAI.com.
All right,
there you can sign up for the free daily newsletter.
And we'll tell you everything else that's happening today.
But on Mondays,
we give you the AI news that matters.
So don't spend all week, you know, spending multiple hours a day, reading things and being like, oh my gosh, is this real?
Is this fake?
No, just join us on Mondays.
It's a great way to kick off the week.
I do this literally nonstop 24-7 to help you.
All right.
So let's start off with our first piece of AI news, which is actually a big one.
And it hasn't happened yet.
But the companies have confirmed that it is.
That's because, well,
things are heating up.
Just like the weather here in April in Chicago, it's like, my gosh, it's finally more than 20 degrees.
But as the weather is heating up, so is the AI competition.
So we are getting new releases this week from Anthropic and from OpenAI.
So here's what we have, via reporting from TestingCatalog.
So Anthropic is preparing Epitaxy.
Epitaxy?
I don't know.
What are these code words?
Can we get some easier ones?
Epitaxy, a major power-user redesign of Claude.
So according to details uncovered from Claude Code, the internal project is
codenamed Epitaxy, and it signals a big shift toward a more professional,
power-user-focused desktop experience that could ship this week.
So the redesigned interface introduces a single-window layout with dedicated panels for
plans, tasks handled by subagents, and diffs,
plus live code previews and support for working across multiple repos.
So Anthropic is also developing a new coordinator mode, which would allow Claude to manage
and delegate work across multiple parallel sub-agents while concentrating on higher-level
planning and synthesis. Users will also be able to reportedly create agents directly
inside the app on the fly. All right. Now with OpenAI, we've seen a lot of
rumors swirling lately, but it seems like they're all but confirmed, as multiple members of
OpenAI did say that they're going to be shipping some major updates this week in Codex.
And, well, that's because they're quietly starting to test a new scratchpad feature for
Codex that would let users run multiple tasks in parallel, a move that points to a big
expansion of Codex beyond just coding and into a central hub for AI-driven work.
So yeah, essentially with this scratchpad, you type out a bunch of things, right?
It can be notes, and then they turn into chats, and then Codex just does them.
So think of it as if you were to, you know, leave yourself to-dos and then Codex just does them.
Right.
So pretty cool.
So these references suggest that OpenAI is also consolidating ChatGPT, the Atlas browser,
and the different software and agentic engineering tools into the single super app,
which we've been talking about
over the last couple of weeks,
but it does appear, anyways, that Codex may be
kind of the final landing platform.
I'm not sure if that's ultimately gonna be true
if they're gonna name it something else,
but it does look like a lot of these rumored features
of bringing the different functionalities all into one
are at least being debuted in Codex.
So we have seen reports that Codex may be the ultimate winner here,
at least when it comes to desktop software.
So yeah, if you don't know, you can use ChatGPT on the web, but also the desktop software.
You have the ChatGPT app and then you have the Codex app.
So in the new kind of leaks here from OpenAI, one of the most telling discoveries is a heartbeat system designed to maintain persistent connections with long-running tasks.
And, well, if that sounds like OpenClaw, yeah, that's because that approach does closely mirror
systems already used by OpenClaw and Anthropic's managed agent project, Conway, which we talked
about last week, making OpenAI's move a clear competitive response as a desktop play.
Separately, social media posts from OpenAI employees featuring snowflake emojis
(yeah, we're talking about snowflake emojis here on the show)
have sparked speculation about a new model release codenamed Glacier that some believe to be
GPT-5.5, raising the possibility that OpenAI could pair a major platform launch with a model
upgrade in the coming days.
So, yeah, maybe we'll see the rumored GPT-5.5.
Maybe we'll see the full super app, or maybe this week we'll just see a Codex release with
some of the other features kind of baked into Codex.
My guess would be the latter, but on this one, my guess is as good as yours.
I don't have any inside intel on this one, at least. All right. Next. Anthropic has launched their new
Claude Managed Agents to make AI agents easier to build, run, and scale. So it is in
public beta and it's offering a full production stack that lets developers build and deploy
cloud-hosted AI agents without managing infrastructure themselves, which makes this a notable step
toward more practical enterprise ready agentic AI.
So if that sounds super confusing, well, it might be.
So you do have to use this on the back end in Anthropic's platform.
So you're not using this in Claude.ai, FYI, right?
So you're not going to go to Claude.com.
You're going to be using Anthropic's platform.
So the good thing is, well, you don't have to have a paid Claude account to do this.
You just have to at least have a credit card on file because you will be charged for usage.
So yes, you can do
this more on the technical side. But the cool thing is, if you've used the GPT builder in
ChatGPT, it's kind of like a version of that. Right. So you can simply chat with
Claude to help you build agents in this new Claude Managed Agents, but you can also go a little
bit more technical and under the hood. Right. And the cool thing is it can connect to basically
any MCP server. It can connect to anything. Right. So in the same way that you might use Claude Code,
and you might not know how to do any of this coding,
but it's using the terminal and connecting to all these API services and doing all these magical things,
that's kind of what Claude Managed Agents looks like inside of Anthropic's platform.
So I did get to play with it for a little bit, I think on Friday.
So I haven't spent multiple hours, but it does seem like a pretty simple way to build agents,
but then to have them contained in Anthropic's kind of sandbox.
You don't have to worry about deploying it out on your own.
So the platform handles sandbox code execution, credential management, scoped permissions,
checkpointing and end-to-end tracing.
Meaning teams can focus on defining the tasks, tools, and guardrails,
while Anthropic's orchestration system manages the tool use, context, and error recovery.
So Claude Managed Agents also supports long-running autonomous sessions that persist through disconnections
and includes multi-agent coordination, allowing one agent to spin up and manage
others to parallelize (that's a hard word to say) complex work. So it sounds great in theory,
right? And it is. However, I will warn you: running parallel agents is great, right? Especially if
you're using Claude Code or if you're using Codex. Just keep in mind, if you are using Claude Managed
Agents, yeah, all that spinning up of subagents,
it's going to cost you because you're paying via usage.
You're paying via tokens.
You're paying via the API.
So keep that in mind.
Sounds great.
And it is, right?
I've tested it.
And, you know, I instantly had an agent that connected to, you know, my email newsletter
and, you know, all these other services that had MCP data, right, which is great.
So you can just say, hey, I have all these services, you know, go connect to them.
It'll bring up an authorization page.
You click a couple of things.
and all of a sudden you have an agent, right?
Let's say there's five pieces of software that you use all the time, right?
And you're like, okay, I could, you know, try to piecemeal this together or, well,
this is where this new release from Anthropic really, really works because not only will
it just kind of build it for you.
Yes, you do have to authorize the agent, but then you can just run it in the sandbox.
But like I said, the cost will add up fairly quickly.
All right.
A new deal for superintelligence.
That is our next story because OpenAI published a 13-page policy document titled Industrial Policy
for the Intelligence Age: Ideas to Keep People First.
So this is what a lot of people are calling the Superintelligence New Deal, and it outlines
how governments should tax, regulate, and redistribute wealth from AI as the technology rapidly
reshapes the economy. So the blueprint argues that AI progress is accelerating so quickly
that the U.S. may need a new social contract, right, comparable to the Progressive Era or the
New Deal, to address risks like mass job displacement, cyberattacks, and social instability.
So OpenAI proposes bold new ideas, including a national public wealth fund
funded partly by AI companies, taxes on automated
labor to replace shrinking payroll taxes, and a four-day, 32-hour work week that shares AI productivity
gains with workers. So the document also calls for treating AI access as a basic right for workers
and schools, creating containment plans for dangerous autonomous systems, and triggering automatic
expansions of unemployment and wage support when AI-driven disruption hits preset levels.
So parts of this, I think, were really good.
Good, right? And if you didn't get a chance to read this, we shared this in our newsletter last week, but that's why you should be subscribing to our newsletter.
So I think parts of this are great in theory. Many of these things will never see the light of day because many of them require the government to act in some official capacity.
And this is coming from someone that used to cover the government as a journalist.
The government doesn't work like that, especially today's federal government. I don't think anything of
this magnitude will see the light of legislation in the next, I don't know, three to five
years. Right. So what we should really be following is the states. And we will see if states
adopt anything like this. Obviously, I would keep an eye on California, which is where all the big
tech companies, or mostly all the big tech companies are headquartered. So a couple of things I
kind of wanted to point out, right? Like the robot tax, very popular.
A lot of people have talked about that.
That makes sense.
And you do have to, I guess, tip your hat to OpenAI for saying like, okay, yeah, like,
if AI takes all these jobs, we need to have money to help all the humans and we should
be taxing the robots.
You know, that makes sense.
But then on the other hand, you know, they're essentially saying that AI needs to be deemed
a basic right.
So, you know, on one hand, they're like, okay, well, this thing that we're selling, you know,
we need to call it a basic human right.
But at the same time, we're like, we know it.
We know it's probably going to take a lot of jobs.
And so we need to do something about that.
So I've talked about this a lot over the course of the last three years.
I'm not going to bore you with my hot takes.
But overall, I do think AI is going to change what full-time employment means in the U.S.
I think ultimately AI will replace more full-time jobs than it will create.
But I do think the future of work is, well, a lot of people that aren't even entrepreneurs,
they're going to have multiple knowledge-work side hustles, right?
So, I don't know, if you're a lawyer, maybe you get laid off from your law firm.
Instead of being a full-time employee,
you might just have 10, you know, very niche lawyer side gigs, right?
Yeah, that's kind of the way I see things shaking out.
But yeah, we'll see.
All right.
Next piece of AI news.
This one was kind of shocking.
Yeah.
Meta has a new model.
and it's actually pretty good.
Yeah.
So Meta announced their new Muse Spark.
That's their new AI model after investing a ton of money and a ton of time.
We're talking billions of dollars and more than a year.
So like I talked about on our Friday features, I did get to sneak this one in on our new Friday features.
But, you know, I said it's been a year since Meta released Llama 4.
In AI time, that feels like a decade, right? It seems like almost everyone wrote Meta Llama, or sorry, Meta, off because they didn't really come up with anything after Llama. But we knew that they had some big shifts internally, and it looks like their first model, anyways, is fairly impressive. So yeah, the company, you know, they had an acquihire of more than 14 billion dollars for Scale AI and its CEO, Alexandr Wang. Then the company reportedly offered some engineers
pay packages worth hundreds of millions of dollars to staff the new MSL, or the Meta superintelligence
team.
So Muse Spark is the first model in a series that was known internally first as Avocado.
So that was the codename.
So if you've been listening to the show, we've been talking about that.
And right now, it's initially available only on Meta's AI app and their website.
And they do have plans to essentially
replace the Llama models anywhere with the new Muse Spark.
The other thing to keep in mind, well, unlike previous open releases via the Llama series,
the new Muse Spark is not open source.
So it is closed.
It is proprietary.
Right now it's only for free, right?
So presumably that will change.
And right now it is not available via the API, although the team at Meta
did say that they will be rolling out the API soon.
So according to independent evaluations from Artificial Analysis,
Muse Spark already matches top models from Google,
OpenAI, and Anthropic in language and visual tasks,
but falls behind in coding and abstract reasoning,
tying for fourth place in broad AI tests.
Yeah, I was actually fairly shocked, right?
So I talk about Artificial
Analysis a lot on the show. It's essentially kind of like an aggregator, right? So it takes all these different benchmarks and all these different scores from all these different places and gives all the models a score. All right. So right now, Google and OpenAI are tied with their respective models. And then in technically second place, you have Claude with Opus 4.6. And now in third, technically, you have Muse Spark, right? Which is
pretty impressive.
The other thing you have to think, I think people are looking at this a little bit
differently because meta did say that they've rebuilt this model from the ground up.
Right.
So this is not, according to meta, just a new version of Lama that's been improved upon.
This is what meta says, a built from scratch new model.
And the fact that it's doing that well already, just one point behind Claude Opus 4.6
on Artificial Analysis.
And what is maybe even crazier is on Arena, right?
So we talk about Arena, formerly LM Arena.
So this is the blind taste test.
And it's also third right now on LM Arena,
although that could change at any second because it's only by like one point.
But regardless, it's a top five model by benchmarks and by user preference,
which, if I was putting money on this beforehand,
I would have said they were probably going to be in more of the five-to-eight range.
So fairly impressive.
And a lot of people were kind of dragging meta, right?
Because they released their benchmarks and they're like, okay, well, meta released all these
benchmarks and they're not even really top on any of them.
But when you think about it, this is technically their first model in this series.
And it's, you know, top two, three, four, depending on what you look at.
I don't know.
I'm impressed.
I've used it.
My actual usage is mixed, right?
because I'm a very heavy GPT-5.4 Pro user,
and I was giving it very complex tasks.
You know, there is also a new, what is it called?
It's called contemplating mode, right?
Which kind of runs these multiple agents simultaneously.
So that was the thing I was like really looking forward to
because I'm like, oh, my gosh, this thing runs, you know,
16 agents at a time or something like that.
And it's supposed to be comparable to, you know,
Gemini Deep Think or OpenAI's GPT-5.4 Pro.
To me, I wasn't as impressed with the new contemplating mode, but I was maybe more impressed
actually with its coding abilities, its writing abilities. So yeah, you have a new model to try
out, at least. Going from an impressive model to a company that is maybe not impressed with
its current AI outputs. That is because, according to reports, Microsoft is under a Copilot
Code Red. All right. So this is according to BNP Paribas analyst Stefan Slowinski,
who reported that Microsoft CEO Satya Nadella has declared a Copilot Code Red inside of
Microsoft, signaling an all-out push to enhance Copilot's performance and user experience.
So the urgency comes as investors express frustration over Copilot's limited traction,
despite Microsoft's leadership in software in general.
So Nadella's initiative reportedly includes the upcoming launch of the E7 suite,
which I believe should be here around the beginning of May,
with ongoing updates and new features planned throughout the year
to accelerate Copilot's adoption and usefulness.
So according to Slowinski, the initial feedback on Copilot is improving.
Adobe just introduced an entirely new way to create,
bringing the power and precision of its creative suite into one conversational experience.
Meet Firefly AI Assistant, now live in the Adobe Firefly app, the All In One Creative
AI Studio.
Powered by Adobe's Creative Agent, Firefly AI Assistant lets you start with your vision, just
describe what you want, and shape the outcome as it takes form with the Assistant.
The Assistant orchestrates multi-step workflows, drawing on 60-plus pro-grade tools across Adobe
Creative Cloud apps, including Photoshop,
Illustrator, Premiere, Lightroom, Express, and more to help bring your ideas to life.
You can also get started with creative skills, a growing library of pre-built workflows for
common creative tasks like batch editing photos, creating mood boards, portrait retouching, and creating
social variations. Every step the assistant takes is visible so you can refine, redirect,
or take over at any time. You stay in the driver's seat as the creative director.
Adobe Firefly AI assistant now in public beta.
See it today at firefly.adobe.com.
That suggests Microsoft's renewed focus could pay off as it leads to better user satisfaction and market perception.
But the competitive threat from rivals such as Anthropic is a major reason behind the Code Red strategy,
as Microsoft aims to stay ahead in Enterprise AI tools.
So Slowinski also
noted that Azure could still outperform expectations due to growing demand for tokens and higher
GPU pricing, even if internal usage increases further.
Here's the thing with Microsoft, right?
It's no secret that the enterprise has been rather frustrated with Copilot, right?
They were one of the first out of the gate, right?
You technically had ChatGPT first, but I mean, Copilot was the first, like, serious enterprise business AI tool.
And I think a lot of enterprises who adopted early and invested heavily, right, in 2023 and 2024, maybe they've been disappointed in the last, you know, two years or so as you've seen Google, Anthropic, and OpenAI really just take off.
However, if I'm Microsoft, I'm not exactly worried, right?
They're the only company that has the green flag, for the most part, across the entire enterprise, right?
It's much easier for Microsoft Copilot to break its way through the enterprise, although, obviously, Google, OpenAI, and others have been really cracking that space.
But in the end, I'm not super concerned.
Top level, if I'm Microsoft, yes, you've got to make Copilot better.
Yes, a lot of people don't enjoy using it.
A lot of Copilot users are jumping ship, specifically to OpenAI and to Google.
But I don't know.
Microsoft's a big investor in Anthropic.
Microsoft is the biggest single investor in OpenAI.
So yes, it's bad if they're losing users to Microsoft, or, sorry,
my gosh, I can't speak today.
It's bad if they are losing users to OpenAI or Anthropic,
but in the end, they're still just making money off that anyways.
So we'll see if this Copilot Code Red leads to anything.
We did see similar stories earlier this year that, you know,
Satya Nadella was going full PM mode, right?
Like product manager, he's rolling up his sleeves,
sitting down with the product team.
So I'm actually, and like I told some people
this at an in-person event last week in Chicago, I'm actually
bullish on Microsoft. I've seen a lot of what they've released the last couple of weeks.
Essentially what they're doing, I'm not going to say they're white-labeling a lot of
products, right? But they came out with a version of Copilot, their Copilot Cowork,
which is very similar to Anthropic's Cowork. It's really good, right? They have their new tasks
feature, which is really good. It's similar to some features on Anthropic and OpenAI, just
scheduled tasks.
So I think Microsoft has actually been shipping a lot.
I think for those companies that maybe haven't found the utility in Microsoft Copilot,
it's actually more of a training and education problem versus a model problem, because now
you get the best of both OpenAI and Anthropic when you're using Microsoft.
All right.
Let's get to some scary stuff happening to OpenAI CEO Sam Altman.
Yeah, this was shocking to read about over the weekend.
So OpenAI CEO Sam Altman's San Francisco home was targeted twice over the past four days, raising concerns about the risks facing tech leaders in the AI sector.
So the latest incident happened early Sunday morning when suspects in a car allegedly fired a round of shots at Altman's property before fleeing the scene.
So police quickly traced the vehicle using surveillance footage.
and arrested two suspects later that morning.
So officers searching the suspects' residence found three firearms,
and both individuals were booked for negligent discharge of a firearm.
So this attack followed a Friday morning incident in which a 20-year-old man from Texas
allegedly threw a Molotov cocktail at Altman's home.
So security at Altman's property extinguished the fire from the Molotov cocktail,
and no injuries were reported in either incident.
But the two attacks come as Altman has publicly voiced concerns about the societal impact and anxiety surrounding AI, calling it the largest change to society in a long time.
So the rapid succession of attacks underscores the growing tensions and security risks for leaders at the forefront of AI development.
So Altman did respond in a blog
post after the incident on Friday.
And he was also critical of a New Yorker article that questioned his trustworthiness,
acknowledging the impact of those negative narratives.
So Altman did admit past mistakes, including being, you know, conflict-averse and
mishandling issues with the OpenAI board, but emphasized his commitment to improving
OpenAI's mission.
He called for less dramatic rhetoric in the AI industry, advocating for broad
technology sharing and urging constructive debate to avoid further real-world harm.
Here's the harsh reality, right?
I'm going to say this as someone that lives in Chicago.
And that's important because I think maybe the majority of our listeners are not from Silicon Valley, right?
But I know, you know, there's other, you know, popular tech publications where the majority of people are from Silicon Valley.
Silicon Valley is a bubble in a bubble, right?
I don't quite think that Silicon Valley and all the big AI frontier labs truly understand, right,
because I don't think you truly understand unless you live it, what the rest of the world or the rest of the U.S. feels about AI.
And the reality is, most people don't want it.
Most people don't like it.
Most people view AI as a threat.
So unfortunately, this is an
extremely unfortunate incident that happened.
But I think that we're going to continue to see this with AI leaders from all the big companies.
I think this is going to be, unfortunately, an ongoing issue:
their literal safety, right?
Because as people start losing their jobs to AI, right?
You can't just get mad at the cloud, right?
Unfortunately, it's people like Sam Altman, people like Dario
Amodei from Anthropic, people like Sundar Pichai, you know, people like Satya Nadella,
it's the faces of these big, you know, four or five companies, you know, Mark Zuckerberg
at Meta as well.
These are the people that people are going to be angry at, right?
Because unlike, you know, the internet, there was really no face of the internet.
I guess you could say maybe Bill Gates.
But ultimately, the internet was a very slow change
to jobs. It was a slower change to the economy. Yes, you had the dot-com boom and bust, but things
with AI are moving much, much faster. And I don't think that people in Silicon Valley necessarily,
largely, understand how the rest of the U.S. really feels about AI. And yeah, I think that,
unfortunately, we're going to see ugly incidents. And I don't want it to happen, right? And I hope
all the leaders of these AI companies stay safe because ultimately I am very optimistic about
AI's future and doing more good than bad.
Hopefully it's able to cure diseases and do all of these great things.
But yes, it's going to cause a lot of unemployment at the same time.
And people are going to be mad.
So this is terrible.
I hope it doesn't happen again.
But unfortunately, I do think that the leaders of AI tech companies are going to
have to be, you know, doubling up their security as the rest of the kind of U.S.
finally sees what AI is capable of in terms of job displacement.
All right.
Last but not least, more scary stuff.
A model so scary, Anthropic can't release it.
So Anthropic has announced its new Mythos preview model, which they say is so powerful
at finding software vulnerabilities that the company is keeping it private.
raising concerns about both cybersecurity and access to advanced technology.
So Anthropic said its new Mythos preview model has found thousands of critical vulnerabilities
across major operating systems and web browsers, including a 27-year-old flaw in OpenBSD
and a 16-year-old bug in FFmpeg, all of which had previously gone undetected.
So essentially, they're saying
that their new Mythos model is a cybersecurity whiz, and it's able to find thousands of these
zero-day bugs that millions of human researchers could never find.
But the company is not, at least for now, releasing the model publicly.
Instead, they have their new Project Glasswing, which is essentially a group of companies,
and they're giving access to Mythos to these companies.
And they're essentially saying use this to harden up your software to make your software better
because when a model like this kind of hits the streets, right?
We want these, you know, big tech companies to be safe.
And we want the technology that people use to not be exploited by a model like Mythos or similar.
Right.
So the company right now is sharing it only with partners such as Apple, AWS, Google, Nvidia,
Microsoft, and 40 other organizations as part of Project Glasswing.
And that is kind of their defensive cybersecurity initiative.
But this move marks the first time in the modern AI era that a major model is being
withheld from the general public, but is being released privately due to concerns over its
potential misuse, creating a significant knowledge and technology gap between elite
companies and the broader public.
So yes, there were times early on, right?
Like, I even remember OpenAI way back in the day, because I was using their, you know,
their early, uh, GPT, I forget if I was using GPT-2 or GPT-3, uh, technology, right,
like back in 2020. I remember there was a time they were like, oh, we're not going to release this
model because, you know, it could, you know, write lies about people, and, you know, they eventually
released it. It wasn't that they just released it to 40 companies. So there have been times in
the past when companies have said something like, oh my gosh, our model's too good. We're not going to
release it. But they eventually
did release it, right? This one with Anthropic, presumably they'll eventually release a version of
Mythos. Maybe it's a stripped-down version, but it seems like at least for the short or medium term,
for the first time, there's a huge tech divide, right? There is, you know, the democratization
of AI may no longer be a thing anymore, right? So it's like, oh, we had a great run for the last,
you know, four or five years when, you know, the Fortune 100 companies,
you know, were using the same thing as, you know, small mom-and-pop shops.
So that time may now be gone with this new Mythos model.
So the company claims that Mythos was not intentionally trained to be a cyber threat,
but its advanced coding abilities led to the discovery of vulnerabilities that even the top human experts and previous AI tools missed.
So what's my take on this?
I mean, I did a whole episode so you can go listen to that, 752, so I'm not going to spend
too long on it. I mean, part of me, I think Anthropic made the right move here, right?
If they are truly actually concerned about this being a model that could be a cyber threat,
okay, that's great. To me, I don't know. You know, Anthropic and its CEO have had a lot of,
I won't say boy-who-cried-wolf moments, but they've had a lot of instances in the past
where they're really hyping things up, right?
They're like, oh, you know, AI is going to, you know, take all coding jobs.
And then they said AI is going to take, you know, half of white-collar jobs.
And those things might ultimately come to fruition.
I don't know.
But to me, it seems like this was a strategic play with Glasswing, right?
You get everyone talking, you know, about how it's this new dangerous model, right?
And, you know, I don't know.
To me, I think Anthropic had a huge,
an embarrassing data leak, right?
A couple of weeks ago.
And they know that they're going to be going for an IPO here,
presumably in quarter three or quarter four.
And they need something, right?
They need something in between that.
You can't have your last big, you know,
international splash on the news radar to be,
oh,
that time you accidentally leaked your source code to your most popular product
on the internet, right?
That's not a good time.
And then like four months later,
you know, at least outside of the AI scene, right?
Like we're talking about AI, you know,
we're talking about Anthropic every day.
But I'm saying the entire world, right?
The entire world was talking about Anthropic and that code leak.
Anthropic needed something, I think, to divert the attention from,
oh my gosh, we accidentally just leaked the source code to our most popular product
to Claude Code, right?
And we're getting ready for an IPO here.
We need to start to spin up a new narrative.
So, you know, now it seems like this new narrative,
whether it's 100% true, 50% true, I don't know, right?
I don't know.
I would say it's 50% true.
They are actually concerned about, you know, releasing this publicly because, yeah,
it could create a lot of, a lot of bad, bad actors.
We'll just say that, right?
With all the software that we use, yet at the same time,
I do think this is a little bit of pre-IPO marketing and, you know,
just trying to flex on everyone and saying, yeah, look at how good our models are.
All right. So that is it for the big stories of the day, or sorry, of the week.
We're going to end with our what's new and what's next. So this is a combination of, you know, some leaks, some rumors and just, you know, some pieces of news, some updates that came out this week
that, you know, we just didn't have enough time to give full attention to.
So we're going to go quick here. So we're starting: Google adds notebooks inside of Gemini
with full bi-directional sync to NotebookLM.
Yeah, so it's kind of like projects,
but it also works with NotebookLM.
Pretty cool.
So leaks show that Anthropic may be building
a Lovable-esque full-stack software building program.
That would be crazy.
Brad Gerstner of Altimeter Capital said
that companies are already using OpenAI's Spud model
and it rivals Claude's Mythos.
OpenAI launched a $100 a month
ChatGPT Pro tier with 10x Codex access. So yeah, if you didn't want to pay the $200 a month,
but you wanted more than the $20 a month Plus plan, now you have the mid tier, $100 a month Pro
tier. Apple is testing four premium-material smart glasses designs powered by AI and paired to your iPhone,
targeting a launch in 2027. Spotify now creates podcast playlists from natural language prompts.
Just, I don't know, maybe you ask for the best Everyday AI episodes.
Try that.
All right, Alibaba's Happy Horse One unexpectedly took the top spot in the AI video arena.
Yeah, it looks really good.
Better than Seedance, better than Veo 3.
Sorry, Veo 3.1, at least for now.
Speaking of models that made a splash, ZAI released their GLM-5.1,
which is not only the new state-of-the-art model for open source,
but it also outbenched top frontier models like GPT-5.4, Opus 4.6, and Gemini 3.1 Pro on SWE-bench.
That's huge, right?
An open source model, I mean, you got to have like a supercomputer thing to actually download this thing, right?
But it's open source.
And it outperformed the big three on SWE-bench, which is one of the most popular benchmarks for software engineering.
Microsoft updated its Copilot terms because previously,
they had that Copilot was for entertainment purposes only.
So, yeah, there is some criticism on that, so they changed it.
Next, a DC court allowed the Pentagon to blacklist Anthropic,
but other agencies can still contract with Anthropic.
So, yeah, the ongoing kind of battle might be closer to being closed.
We'll see if that gets appealed.
Next, according to an Axios report,
OpenAI is projecting $100 billion in ad revenue by 2030.
Nebius is in talks to acquire AI21 Labs for up to $3 billion, according to reports.
OpenAI has partnered with Upwork so users can hire freelancers directly in ChatGPT.
Goldman Sachs came out with a new report that said AI-displaced workers face lower earnings
and higher unemployment risk for a decade.
LM Arena has released the full history of its AI leaderboards as a public data set.
OpenAI is testing a new image generation model in ChatGPT A/B tests and on LM Arena.
We talked about them testing it on LM Arena, but now they're also testing it inside of ChatGPT in A/B tests.
So that would presumably be their new V2 of their image model.
Google Workspace launched a feature where Gemini suggests the best meeting times for everyone.
I mean, we've been needing that for like 20 years.
All right.
Next, Pika launched an AI self-video chat beta where your AI agent talks, remembers, and acts in real time.
Google quietly dropped this one.
It's called AI Edge Eloquent.
It is a free offline dictation app on iOS using Gemma models.
It's super impressive too, FYI.
Speaking of Google, they're preparing a Jules V2 coding agent that can set goals and drive
improvements without prompting.
The Gemini app now lets you create interactive 3D simulations and models inside of the actual
chat, which is really cool, just to visually explain things.
Google also expanded its finance tools globally with new AI capabilities.
Claude Cowork hit general availability for all paid plans, and
Meta signed a $21 billion deal with CoreWeave to expand AI cloud capacity.
That was a lot.
All right. I hope this was helpful.
I got a little tongue tied there.
It just happens, right?
There's so much going on.
Even I struggled to talk about it.
So don't spend hours every single day trying to keep up.
Join us on Mondays as we bring you the AI news that matters.
If you are newer here, right, on Wednesdays, we go hands on.
We usually do a deep dive on one tool.
So make sure to check out today's newsletter.
We'll probably do a poll on that.
So what do you want to see?
And then on Fridays, we do our AI Feature Friday,
which is where we usually do a handful of new features that you can start using now.
And on Tuesdays and Thursdays, we kind of rotate our shows.
So I hope this was helpful.
If you're listening on the podcast, do me a favor.
Leave a review for us.
I'd really appreciate that after you subscribe to the podcast.
So thank you for tuning in.
If you haven't already, please go to YourEverydayAI.com.
Sign up for the free daily newsletter.
Thanks for tuning in.
I hope to see you back tomorrow and every day for more Everyday AI.
Thanks, y'all.
And that's a wrap for today's edition of Everyday AI.
Thanks for joining us.
If you enjoyed this episode, please subscribe and leave us a rating.
It helps keep us going.
For a little more AI magic, visit YourEverydayAI.com
and sign up to our daily newsletter so you don't get left behind.
Go break some barriers and we'll see you next time.
