The AI Daily Brief: Artificial Intelligence News and Analysis - How AI Solved a Massive Coding Challenge for Morgan Stanley
Episode Date: June 5, 2025AI coding tools are now fixing problems in old software, helping non-coders build apps, and making work faster for big companies. Microsoft, Google, and Amazon use AI to write much of their code, whil...e Morgan Stanley used AI to update millions of lines of COBOL, saving thousands of developer hours.Get Ad Free AI Daily Brief: https://patreon.com/AIDailyBriefBrought to you by:KPMG – Go to https://kpmg.com/ai to learn more about how KPMG can help you drive value with our AI solutions.Blitzy.com - Go to https://blitzy.com/ to build enterprise software in days, not months AGNTCY - The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at agntcy.org Vanta - Simplify compliance - https://vanta.com/nlwPlumb - The automation platform for AI experts and consultants https://useplumb.com/The Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdownInterested in sponsoring the show? nlw@breakdown.network
Transcript
Discussion (0)
Today on the AI Daily Brief, a case study in how Morgan Stanley used AI to solve a very
intractable coding problem. Before then, in the headlines, a set of new features that make
chat GPT for business even more powerful. The AI Daily Brief is a daily podcast and video about
the most important news and discussions in AI. All right, friends, quick announcements here before we
dive in. First of all, thank you to today's sponsors, blitzie.com, Vanta, agency, and super
intelligent. A quick reminder once again that if you are looking for an ad-free version of the show,
you can get it at patreon.com slash AI Daily Brief. And one quick announcement, a couple jobs that are
open at Superintelligent right now. We are absolutely inundated right now with these agent readiness
audits that you hear about all the time on the ads. And I am looking for a copy editor and a layout
designer to help with those reports. These are contract positions, but there will be a fairly high volume.
So it could be a really good one for freelancers. Again, I am looking for a copy editor and
as well as a layout designer.
If you are interested, shoot me a note at Jobs at B-Super.A.I with report in the subject.
Again, that's Jobs at B-Super.A.I.
But with that, let's get into today's headlines.
Welcome back to the AI Daily Brief Headlines edition,
all the daily AI news you need in around five minutes.
OpenAI made a quick announcement today about chat GPT for business.
First of all, they gave us some numbers.
They say they now have 3 million paying business users,
and they are adding a slew of new features to make their tooling more powerful.
The one that people are most interested in is called connectors.
Basically, chat GPT can now plug into Google Drive, Dropbox, Box, SharePoint, OneDrive, and others,
and users can more easily access it to get answers from storage spreadsheets and documents.
Now, this is a bigger deal than I think it at first appears.
Enterprise Search is a massive category that companies like Glein and others have been really trying hard
to corner the market on inside the enterprise.
for the last year or so. And this is OpenAI directly going after that market. The company
wrote in a release, ChatGPT will structure and clearly present the data and respect your organization's
existing permissions on the user level. The update also includes a bunch of other features that,
once again, have OpenAI competing with other standalone tools. There's a record mode to take
notes on meetings, and the integration which we got promised a couple of months ago with MCP is
now coming into the enterprise sphere. OpenAI writes, Workspace Admins can also
also now build custom deep research connectors using model context protocol and beta.
MCP lets you connect proprietary systems and other apps so your team can search, reason, and act on
that knowledge, alongside web results in pre-built connectors.
This was just happening as I was recording the show, but like I said, I think it's maybe
even a slightly bigger deal than it seems at first glance.
Another little feature update?
OpenAI is also rolling out basic memory features for free users.
They wrote,
We're starting to roll out a lightweight version of memory improvements to free users.
In addition to existing saved memories, chat GPT now references your recent conversations to provide
more personalized responses.
Now, we've talked about how chat GPT memory isn't necessarily always a slam dunk.
Sometimes people have a bunch of different use cases going on, and they leave memory off
so that one chat doesn't influence the next chat.
All in all, though, I think that those tradeoffs are well worth it and that memory is a really
powerful feature.
Still, rolling out memory as widely as possible seems to fit with OpenAI strategy of building a, quote,
AI super assistant that deeply understands you and is your interface to the internet.
In strategy documents revealed in the Google antitrust case, OpenAI laid out a plan to ship
this assistant during the first half of this year.
Agenetics, advanced reasoning, and of course memory were all part of a coherent product
that's all about, quote, making life easier for their users.
It also fits with Sam Altman's view that young people are using AI as a life coach, commenting,
they don't really make life decisions without asking chat GPT what they should do.
In that frame, it of course makes sense to push memory out to free users as a priority.
Basically, give as many people as possible the ability to experience what a personalized AI that
remembers everything about you is like.
One last OpenAI story.
Even as it was happening a couple of years ago, you just knew that there was going to be a movie
about the boardroom drama.
Sure enough, a new film called Artificial will cover the tumultuous few weeks of 2023 when
Sam Altman was fired and then rehired, leading to a near complete turnover in OpenAI's
leadership.
Amazon MGM Studios has greenlit the production, which the Hollywood reporter says,
is, quote, being put together at lightning speed.
At this stage, nothing has been fully locked down.
They're in talks with the director of Call Me By Your Name,
and actors like Andrew Garfield are in conversations.
Sources say that Amazon is looking to shoot the film across San Francisco
in Italy this summer, so we could be watching this thing before long.
Lastly, today, a fun, useful one for you all out there.
Notebook LM users can now share their notebooks using a link.
The latest feature for the Google product allows anyone on the internet
to check out the research you've been collating with the AI tool.
Viewers won't be able to edit what's in the notebooks, but they will be able to ask the AI
questions about it and interact with generated content like audio overviews.
Essentially, it's the same sharing functionality from Google's productivity suite being transferred
over into their viral AI product. And while this is completely inevitable as a feature,
it's also a pretty powerful addition that really does open up some new use cases.
People were already using notebook LM for corporate communications, especially by sharing audio
overviews. Giving anyone the ability to ask follow-up questions and interact with generative features
opens up new functionality for information sharing. Rather than only sending out the final product like
a generated podcast, users could now share an interactive AI knowledge base as widely as they want to.
We're also watching in real time as AI companies try to figure out how to make AI use less of a
solo experience. And that's on both the consumer and the enterprise side. Ultimately, I think that this
will definitely continue to expand the use of notebook L.M and make it even more useful at work,
and I'm excited to see what people do with it. For now, though, that is going to do it for today's
AI Daily Brief Headlines edition. Next up, the main episode. This episode is brought to you by Blitzy,
the Enterprise Autonomous Software Development Platform with Infinite Code Context. Blitzy is used
alongside your favorite coding copilot as your batch software development platform for the
enterprise seeking dramatic development acceleration on large-scale code bases. While traditional
co-pilots help with line-by-line completions, Blitzy works ahead of the IDEE by first documenting your
entire code base, then deploying over 3,000 coordinated AI agents in parallel to batch-build millions
of lines of high-quality code.
The scale difference is staggering.
Copilots might give you a few hundred lines of code in seconds, but Blitzy can generate up to
3 million lines of thoroughly vetted code.
If your enterprise is looking to accelerate software development, contact us at blitzie.com
to book a custom demo or press get started to begin using the product right away.
Today's episode is brought to you by Vanta.
In today's business landscape, businesses can't just claim security, they have to prove it.
Achieving compliance with a framework like SOC2, ISO-27-01, HIPAA, GDPR, and more,
is how businesses can demonstrate strong security practices.
The problem is that navigating security and compliance is time-consuming and complicated.
It can take months of work and use up valuable time and resources.
Vanta makes it easy and faster by automating compliance across 35-plus frameworks.
It gets you audit ready in weeks instead of months and saves you up to 85% of associated costs.
In fact, a recent IDC White Paper found that Vanta customers achieve $535,000 per year in benefits,
and the platform pays for itself in just three months.
The proof is in the numbers.
More than 10,000 global companies trust Vanta.
For a limited time, listeners get $1,000 off at vanta.com slash NLW.
That's VANTA.com for $1,000 off.
Today's episode is brought to you by agency, an open source collective for interagent collaboration.
Agents are, of course, the most important theme of the moment right now, not only on this show,
but I think for businesses everywhere.
And part of that is the expanded scope of what agents are starting to be able to do.
While single agents can handle specific tasks, the real power comes when specialized agents
collaborate to solve complex problems.
However, right now there is no standardized infrastructure for these agents.
to discover, communicate with, and work alongside one another.
That's where agency, spelled A-G-N-T-C-Y, comes in.
Agency is an open-source collective building the Internet of Agents,
a global collaboration layer where AI agents can work together.
It will connect systems across vendors and frameworks,
solving the biggest problems of discovery, interoperability,
and scalability for enterprises.
With contributors like Cisco, Crew AI, Langchain, and MongoDB,
agency is breaking down silos and building the future of interoperability.
A.I. Shape the future of enterprise innovation. Visit agency.org to explore use cases now.
That's AGMTCY.org.
Today's episode is brought to you by superintelligence, specifically agent readiness audits.
Everyone is trying to figure out what agent use cases are going to be most impactful for their
business, and the agent readiness audit is the fastest and best way to do that.
We use voice agents to interview your leadership and team, and process all of that information
to provide an agent readiness score, a set of insights around that score, and a set of highly
actionable recommendations on both organizational gaps and high-value agent use cases that you should
pursue. Once you've figured out the right use cases, you can use our marketplace to find the right
vendors and partners, and what it all adds up to is a faster, better agent strategy.
Check it out at besuper.a.i or email agents at besupor.a.i to learn more.
Welcome back to the AI Daily Brief. At this stage, AI powered coding is
undeniably one of the most powerful and increasingly mainstream use cases for AI and this early
generation of agents that are starting to be deployed to production. In the consumer realm,
obviously, we've talked a ton over the last few months about vibe coding. And this is obviously
a big tent with some loose terminology that includes both AI assistant companies that are
used by existing software engineers to improve their processes, take certain types of burdensome
activities off their plate so they can be more focused on higher order issues.
Vibe coding also, however, is about bringing new people into the coding sphere, allowing people
who weren't technical before be able to use English as their new coding language to build applications.
This has been an incredible trend that is not only impressive for the speed at which it's happening,
but also in that it is the use case of AI that opens up other use cases of AI.
The more that AI is able to code, the more it's able to use code to solve other problems.
These tools have become so ubiquitous, in fact, that they've also been at the center of the question of whether AI is going to have a negative impact on jobs.
You've probably seen that Federal Reserve chart of the massive decrease in software development job postings from its COVID peak to now.
And yet if there has been one area where it felt like AI tools and vibe coding specifically really wasn't up to the task, it was in the context of the enterprise.
The issues have been numerous.
two short context windows to not be able to handle legacy codebases, design patterns that aren't
optimized for many contributors who can come in and out of projects. All of the issues replete with
big burdensome legacy codebases. This is not really what these vibe coding tools were designed for,
and so their traction and relevance inside the enterprise has been a little limited, which is not to say
that AI coding isn't having a big impact for organizations that have thrown themselves into it.
The CEOs of both Microsoft and Google have claimed that as much as 30% of their code is produced by AI,
and Amazon staff are reportedly putting pressure on management to provide internal access to
cursor as a matter of urgency.
And for as much as the conversation around AI and coding has led to a conversation around
job replacement, when one starts to dig in, we're actually finding quite a few examples
of stories of where AI and AI coding tools specifically aren't just being used.
to do things that were annoying before, but are actually opening up possibilities that were
literally impossible before.
AI's ability to ingest a huge amount of data is intersecting strongly with financial firms in
Wall Street.
Logistics companies are developing AI systems to optimize supply chains like never before,
and as the models improve, we're starting to see these big, not previously possible
things come to the realm of AI coding as well.
Following the release of Anthropics Claude Opus 4 last month, and you will remember
that Anthropics Claude models have been the go-to for developers for some time now,
one veteran developer on Reddit said the model had managed to fix their white whale bug
that had cost them hundreds of hours over several years. The post reads,
Background, I'm a C++ dev with 30 plus years experience, X-Fang staff engineer. I'm generally
the person on the team that other developers come to after they struggled with a problem for a week,
and I would solve it while they're standing in my office. But today, I was humbled by Claude Opus 4.
I gave it my white whale bug which arose from a re-architecting refactor that was done four years ago.
The original refactor span around 60,000 lines of code, and it fixed a whole slew of problems,
but it created a problem in an edge case when a particular shader was used in a particular way.
It used to work, then we re-architected and refactored, and it no longer worked.
I've been playing on and off trying to find it, and must have spent 200 hours on it over the last few years.
It's one of those issues that's very annoying, but not important enough to drop everything to investigate.
I worked with Claude Code running Opus for a couple of hours.
I gave it access to the old code as well as the new code
and told it to go find out how this was broken in the refactor.
And it found it.
Turns out that the reason it worked in the old code
was merely by coincidence of the old architecture
and when we changed the architecture,
that coincidence wasn't taken into account.
So this wasn't merely an introduced logic bug.
It found that the changed architecture design
didn't accommodate this old edge case.
This took around a total of 30 prompts and one restart.
I've also previously tried GPT 4.1, Gemini 2.5, and Claude 3.7, and none of them could make any progress whatsoever. But Opus 4 finally found it.
And so on this theme of AI not just helping people but solving problems that were somewhat unsolvable before,
today in the Wall Street Journal, we have another one of those stories. The WSJ is reporting that Morgan Stanley has used AI to solve one of the biggest problems for these legacy codebases,
which is updating legacy programs that were written in Cobol.
If you are younger or not a professional programmer, you likely have never had the joy of dealing
with COBOL. The programming language was first developed in 1959 and was fairly ubiquitous during
the early days of computing. Back during that era, computerized systems were so expensive that they
were only deployed against some of the highest value use cases across society. Think banking databases,
air traffic control, and nuclear facilities. This slowly expanded out over the decades,
but remained an extremely high-ticket item. Until the mid-1980s, with the mid-1980s, with the same time,
the first personal computers, essentially every computerized system in the world was programmed in this
language. The language is dense, monolithic, and difficult to use even for experts. It became obsolete
by the 1990s with much better programming languages coming along, but many of the systems that
used COBOL were so critical that they couldn't easily be replaced. In fact, there's been a
persistent fear that with the retirement of COBOL developers, it would become essentially impossible
to maintain these systems, let alone embark on rewriting of the programs. One area where this language
is still omnipresent is in banking infrastructure, and those critical systems were viewed by some as a
ticking time bomb. Morgan Stanley, however, has taken on the Goliath project of rewriting all of their
cobalt systems into modern language with the help of AI. Using an in-house fine-tune of open AI's
models, the bank created a system that can translate legacy code into plain English specs that developers can
used to rewrite it. According to the company's global head of technology, Mike Peezy, since the
AI's introduction in January, it's reviewed 9 million lines of code and saved developers 280,000 hours.
IBM, which was a major provider for the mainframes that use COBOL in the early days,
have been working on their own AI systems for migrating the language into Java.
To give a sense of scale of the problem, IBM's pitch was that their coding assistant could cut
the task of updating legacy systems down to one or two years rather than several years.
But that tool hasn't emerged thus far, so Morgan Stanley built their own.
PZ said, we found that building it ourselves gave us certain capabilities that we're not
really seeing in some of the commercial products. He said that off-the-shelf tools might evolve
to deliver those capabilities, but, quote, we saw the opportunity to get the jump early.
The journal writes, Morgan Stanley was able to train the tools on its own code base, including
languages that are no longer or never were in widespread use. Now the company's roughly 15,000
developers based around the world can use it for a range of tasks including translating legacy
code into plain English specs, isolating sections of existing code for regulatory inquiries and other
asks, or even fully translating smaller sections of legacy code into modern code. Now the tool
is technically capable of rewriting code automatically, but it doesn't necessarily know how to make
it efficient or take advantage of the features of modern languages. That's why humans are still in the loop,
and largely using the AI as a parser to understand the functionality of the legacy code.
Rather than paying highly skilled technical experts that know how to read that legacy code to
painstakingly write up specs, Morgan Stanley is using AI to automate that process.
PZ said that he's not expecting to see smaller headcounts in his software engineering department
because of AI.
Instead, he anticipates having a lot more code being produced.
The company currently has hundreds of AI use cases in production,
aimed at all manner of growth and efficiency targets.
And thanks to this AI moonshot of rewriting their entire legacy codebase, these AI automations
can now be deployed against modern code rather than programs that were written decades ago.
PZ said, you're always modernizing in tech.
Today, with AI, this becomes even more important.
So to recap, this problem of these legacy codebases was so big, hairy, intractable, difficult
that it has been kicked down the can for literally decades at this point,
because no one wants to take the time to just fix it.
However, at some point we were going to get to a time when no one even really knew how to interact
with these languages anymore, and we would have been up the proverbial creek without a paddle at
that point.
Even in this very incremental version that doesn't do the full code translation automatically,
you're still talking about a financial giant saying that they've saved 280,000 human hours
this year.
Vibe coding specifically and AI coding tools in general may not have fully infiltrated the enterprise
yet, but a few more stories like this and you better believe that they're going to get there soon.
For now that that is going to do it for this sort of case study version of the AI Daily Brief,
appreciate you listening or watching as always, and until next time, peace.
