Moonshots with Peter Diamandis - The State of AI: Elon’s $1T Package, Apple’s $600B for Trump & How Startups Win w/ Dave, AWG & Blitzy Founders Brian Elliott & Sid Pardeshi | EP #193
Episode Date: September 9, 2025Get access to metatrends 10+ years before anyone else - https://qr.diamandis.com/metatrends Salim Ismail is the founder of OpenExO Dave Blundin is the founder & GP of Link Ventures Dr. Al...exander Wissner-Gross is a computer scientist and founder of Reified, focused on AI and complex systems. Blitzy is an autonomous custom software supercharged by Generative AI. It was co-founded by Brian Elliott, a serial entrepreneur, and Sid Pardeshi, an ex-NVIDIA software architect with 27 Generative AI patents to his name. – My companies: Reverse the age of my skin using the same cream at https://qr.diamandis.com/oneskinpod Apply to Dave's and my new fund:https://qr.diamandis.com/linkventureslanding –- Connect with Peter: X Instagram Connect with Dave: X LinkedIn Connect with Salim: X Join Salim's Workshop to build your ExO Connect with Alex: Website LinkedIn X Email Connect with Blitzy: X LinkedIn Listen to MOONSHOTS: Apple YouTube – *Recorded on September 6th, 2025 *The views expressed by me and all guests are personal opinions and do not constitute Financial, Medical, or Legal advice. Learn more about your ad choices. Visit megaphone.fm/adchoices
Transcript
Discussion (0)
A piece of muse I saw yesterday that had me scratch in my head, which was the trillion-dollar pay package for Elon.
Elon Musk could officially become the first trillionaire.
Just about a trillion dollars in stock.
It's a striking number, but the benchmarks that he has to meet are also equally striking.
He's not just the leader of the company. He's the marketing voice.
If we really do expect to find ourselves in an abundant society soon, we should expect to have a lot of trillionaires in our society.
money will start to have far less value than ever before.
If we're really on the verge of abundance, then what comes after that?
What's going to remain scarce, even as energy and intelligence, the cost of both of those goes to zero.
But stuff is changing so quickly.
Media is changing quickly.
Elon is paving a new path for what it means to be a great corporate CEO.
But it's going to change again and it's going to change again and it's going to change again.
A trillion dollars here, a trillion dollars there.
How do you compete?
What's your moat?
So, Brian and Sid, welcome.
Pleasure to have you both.
Now that's the Moonshot, ladies and gentlemen.
Everybody, welcome the Moonshots.
The news that really matters in your life.
I'm here with my Moonshot mates, Dave Blundon, Alex Wiesner Gross.
And we're going to have a conversation today about David versus Goliath.
We're talking about trillion-dollar commitments, trillion-dollar pay packages.
It's insane.
But first of all, one of the most important pieces of news is, Alex, I'm reading the comments.
And what I keep on hearing is people want you, ask you a question, just have you talked
for the entire episode.
They love what you have to say.
So I'm not going to do that today, but everybody, if you're new to moonshots, Alex has been
just receiving huge fan mail because of his brilliance.
It's an honor to be here, Peter.
I think I got maybe two adoption offers.
I think you did.
Dave, are you jealous, Dave?
Am I jealous of Alex?
I mean, I'm surrounded by so many people that are so brilliant.
I don't know.
So I'm used to it by now.
That's one of the most important things is finding incredibly brilliant people to have around you in life.
Yeah, the old saying you're the average of the five people he spend the most time with.
And if you've got a community of individuals that are really uplifting you and challenging you, that have you do the best you can,
And that's critically important.
So one of the things I love about moonshots in our WTF episodes is sort of measuring
toe to toe and trying to really have this conversation in a meaningful fashion.
We're missing Salim, Miss Meal again.
Salim, we miss you, buddy.
I think he's back in India.
He's probably got, you know, a crate full of iPhone 17s coming, but we'll be talking to
him soon about that.
So I want to open up with a piece of news I saw yesterday that had me scratch in my head,
which was the trillion-dollar pay package for Elon for Tesla.
It's like, that's extraordinary.
Have you guys gotten an offer from your boards for a trillion-dollar paycheck?
A trillion?
Well, yeah, if you hit the metric, the metrics have to be, you know,
multi-trillion-dollar market cap.
But sure, why that?
It's just a fraction of what you create.
It's if Elon grows Tesla to an $8 trillion company,
so it doubles the size of Microsoft and Nvidia,
he'll earn a trillion dollars.
not like he needs help
becoming the first trillionaire on planet Earth.
Yeah, but he's unique.
He's really changed the definition
of what it means to be a CEO.
And he's not just the leader of the company.
He's the marketing voice.
Most car companies will spend 7% of revenue
or thereabouts on marketing.
Tesla spends zero because Elon's a one-man force of nature
driving consumers to the product.
And we're going to meet a couple entrepreneurs
in a minute here that have that same
like we embrace social media.
we're creating morale and momentum like you wouldn't believe.
But, you know, that's a 20% pay package as opposed to the normal five.
Yeah.
It's worth it.
I agree with you.
If you're a founding CEO and if you're very, very shy, that's not going to bode well for the company.
I know when I'm investing in companies, I'm looking for a CEO who's a great communicator,
who's able to go out there in front of the crowd, able to convey their passion what he or she is loving in life.
and for all of the insanity Elon does with chainsaws or whatever the case might be,
it grabs attention and people either love it or hate it.
One of the many reasons we love Alex so much is because he's not just brilliant,
but he has very, very high situational awareness.
That's really rare around the brilliant community.
But stuff is changing so quickly.
Media is changing quickly.
Elon is paving a new path for what it means to be a great corporate CEO,
but it's going to change again.
going to change again and it's going to change again. So, you know, if you map your behavior to the
change, but most people study backward in time, and they say, well, what did Jack Welch do? Or what did
Genghis Khan do? You know, like, it's okay, but that's not going to, anyway, we'll go on,
this. Brant will go too long if I go too far as. I also think, Peter, it's worth noting if
we really do expect to find ourselves in an abundant society soon, we should expect to have a lot
of trillionaires in our society.
We will, and then we'll have an expectation that money will start to have far less value than
ever before, right?
I mean, you and I have had this conversation, Alex, about a post-capitalist society, right?
Thoughts on that?
Do you still believe that's going to be the case?
Well, so I think that that's always the question.
What does so-called late-stage capitalism even look like to the extent the concept to make
sense. If we're really on the verge of abundance, then what comes after that? And I think the
what comes after abundance is closely tied to what remains scarce in an abundant society. In Star Trek,
a common foil, energy is relatively abundant. Intelligence is relatively scarce. The ability to
travel between stars relatively scarce. So the question I would ask is, what's going to remain scarce,
even as energy and intelligence, the cost of both of those goes to zero.
That is a critical question.
You know, my end point here, my mental experiment is if I, you know, in Eric Drexler's
parlance, if I build a number of assemblers that are able to rearrange atoms and I drop an
assembler into my hand and I say, hey, make me, you know, five copies of yourself and I give each
of you an assembler.
And the assembler is able to use energy and matter, resident, and build anything.
and I drop an assembler into the soil here, and it starts pulling the atoms together to make
me an electric Ferrari, and it says, I need a little bit of titanium, a little bit of lithium,
you add it, and all of a sudden you've got an electric Ferrari.
Everything starts to become effectively zero marginal cost, and that becomes a pretty cool society
where anyone can do anything.
Does it?
Or, I mean, again, not to overindex on Star Trek, but in Star Trek, everyone has replicators,
but not everyone gets to travel between the stars.
So maybe that the new post-scarce ability is the ability to travel outside the solar system.
Well, we're going to find out because we're getting there really fast.
On the news item of trillion dollars here, a trillion dollars there, there's a dinner with Tim Cook and Sam Altman and Mark Zuckerberg and Trump.
And I guess during this dinner, an offer was made by was it Zuck first or was it Tim first to invest?
$600 billion into the U.S. economy.
I think it was Tim first.
I can't tell from the clips, actually, because they get cut and mingled.
But then the other one matches it, and all of a sudden, over dinner, Trump is getting
$1.2 trillion of commitments into the U.S. economy.
You missed the really fun punchline there.
Tim Cook had it all scheduled, planned, and budgeted, and then Mark said, I'll match that.
It was like a YMC fundraiser.
If you can do that, I can do that.
I'm sure his CFO is back in Silicon Valley going, what the hell?
hell did he just commit to? Oh, my God. But it is, there's a third piece that comes out in this
related story, which is Altman announces to his employees that he expects Open AI to be the most
capital intensive company in history. And what was the number? Alex, $119 billion of additional
investment between now and 2029. It was something like that. I mean, do you remember it was a whole,
what, a year and a half ago or so that this number of five or six or seven trillion dollars
of CAPEX into AI chips was being floated. And a lot of people laughed at that. And yet,
and yet, we're finding ourselves a year and a half later in a world where it is entirely plausible
that the true amount of capital expenditure in fabs and AI chips and data centers and new
energy sources completely exceeds that. I think, yeah. I'll file that. I'll file that away.
way because you're dead right and this is the effect we see all the time something insanely
mind-blowing is predicted six months in the future everybody is like impossible then it actually
happens and then they're like oh yeah well it's just part of life and this trend is is going to you know
it's happening over and over and over again but the numbers you just quoted yeah everybody was like
oh sam's just blowing smoke there's no way that's that's you know that's trillion being thrown
around, but that's not a real word. That's just sort of a euphemism. And then here we are just a few
months later. You're going to hear some benchmarks, actually, like Sweet Bench later in this
podcast, where things have just been crushed that the timelines will blow your mind. Well,
we'll get to it when we get to it. Well, I guess the point I'm making here is there's a huge
amount of capital flowing here. I mean, we never, I mean, go back to when all of us were starting
our careers in the 90s and in the dot-com era, the idea that'd be trillion dollars.
movements of capital in any particular company or any particular industry, which is mind
boggling. And here it is routine. But this is in some sense, this is sort of a wonderful
opportunity with trillions potentially of of CAPEX being invested. There's going to be an
expectation, I would assume, by capital markets, that there's going to be enormous revenue
generation that pops out of those trillions in CAPEX. And the question you have to ask yourself is,
what form does that take? At some point, with trillions invested, I think there's probably a reasonable
expectation that entire classes of labor, of services are going to be automated and the cost of
what we currently construe as labor is going to be driven down to zero. And then perhaps at some point
immediately after that, you start to need transformative science, inventions, discoveries that will
really justify the trillions of capex. So it's sort of a blessing in disguise.
I would argue, trillions of CAP-X is going to motivate the demand and the supply of utterly
transformative discoveries and inventions soon. Otherwise, why invest trillions in this?
Yeah. The concern, of course, a lot of folks have is around inflation and are these
dollars really inflated dollars? We're going to find out. But it's interesting, Dave.
And in particular, as a venture capitalist, you know, seeing the valuation.
of companies going at this level.
I mean, for the average public, how do you get into any of these companies when they're
coming out at, you know, multi-hundred billion dollar and trillion dollar, you know, sort of
valuations here?
I mean, being able to get in early is one of, I think, the areas that you've been focusing
on.
One of the other companies that came out of Link Ventures, your early check in here, was
Merckor.
And I just saw Merckor has gotten a offer at a table.
$10 billion valuation.
You know, you must be pretty happy about that.
It's a $10 billion valuation, but it's also a half a billion dollar revenue run rate
after two years, which is completely unprecedented.
So go back.
When did you invest in Mercor?
Two years ago, first funding, you know, right?
What you really want to look for is undervalued, underappreciated talent and not so much
concepts.
But they had the concept right already.
It's rare.
But 18-year-olds old, you know?
I mean, that's not a lot of people invest in the 18-year-old gang.
So a couple of 18-year-olds come forward with this idea.
Do you remember what the opening valuation was when you invested?
30 million plus or minus five.
Okay.
And so 30 million to 10 billion in two years' time.
Yeah, yeah.
It's got to shatter all kinds of records.
But again, I don't want people to feel like that's a bubble because the revenue growth also
shattered all kinds of records.
Yeah.
And so from a cold start, I don't think anything like that's,
ever been done before. But you're going to see a lot more of them now, too. They just are setting the
trend for many, many other companies. I think what's different about them is they're inspiring
an age of people that normally would be, would have been uninvestable five years ago,
10 years ago. And now it's kind of, wow, mainstream. We've made that point that the average age
of a unicorn, VC-backed unicorn a decade ago was sort of mid-30s, right, in terms of the average age
of the founders. And today, I think, Dave, what you found out of the investments we're doing,
especially out of MIT and Harvard, it's age 20 to 23. And these guys were 18 when they started?
Yeah, 18. They got through one year of college, and then they got frustrated with the pace like
everybody does. But they met in high school. Which goes to the point, Alex, you made last time
on the last WTF episode, which was, listen, if you really believe we're sort of post-AGI,
on the verge of, you know, ASI, advanced superintelligence, going to college during those years
and trying to get credits versus building something, it's not the right trade.
It's going to distort all sorts of societal cues and societal expectations.
The best, I would say, fiction treatment that I've seen of this is a novella by Werner
Vingy, Fast Times in Fairmont High, where you see this start to completely distort the way secondary education
is run in this country, and you start to see high school students and middle school students
suddenly spending all of their time doing startups. And I think it's entirely plausible. We find
ourselves in a near future that looks a lot like. I agree. And one of the points here,
I think that we need to realize, or people need to realize, is the MAC, you know, sort of peak
creativity. If you measure it by when a Nobel laureate does their Nobel Prize winning work,
or not when they get their prize, but when they actually did their work, is typically in the
first half of your 20s.
Alex, do you have the data there at all off the top of your tongue?
Not at my fingertips, and I've seen those statistics too, and I've seen how they vary from
field to field purportedly math versus physics versus other fields.
I also tend to discount this notion because I expect that in the very near future, most of the
innovation is actually going to come either from pure AIs or some sort of human AI hybrid.
So I view those statistics maybe self-servingly as more of a retrospective.
This is how things used to be at best versus how they're going to be in the future.
Every week, my team and I study the top 10 technology metatrends that will transform
industries over the decade ahead.
I cover trends ranging from human robotics, AGI, and quantum computing to transport, energy,
longevity, and more.
There's no fluff.
Only the most important stuff that matters, that impacts our lives, our companies, and our careers.
If you want me to share these Metatrends with you, I writing a newsletter twice a week, sending
it out as a short two-minute read via email.
And if you want to discover the most important Metatrends 10 years before anyone else, this
reports for you.
Readers include founders and CEOs from the world's most disruptive companies and
entrepreneurs building the world's most disruptive tech.
It's not for you if you don't want to be informed about what's coming, why it matters,
and how you can benefit from it.
To subscribe for free, go to Demandis.com
slash metatrends to gain access to the trends 10 years before anyone else.
All right, now back to this episode.
Well, there's another company I want to talk about here, and we have a couple of guests to
join us.
If you're an entrepreneur, listen to how they built this company.
This is a company on the doorstep of being a unicorn itself, a company that you're going
to hear a lot about in the coming years.
Dave, you want to introduce our guests and Blitzy?
God, I cannot wait. So we have Brian Elliott, Sid Farnashi, the founders of Blitzy. My son, Jack, interned with them this summer. And I tell you, it drove my wife a little nuts. She started thinking, wow, we're going to play a lot of tennis this summer and have a great time. Jack got so wrapped into the culture of Blitzie so quickly. It's the most high-energy place I think I've ever seen. Morale is off the charts. And so he pulled in his best friend,
from high school, Yashba Lachetti, they pulled in a couple of other young computer science
majors at Northwestern and a couple other places. The whole gang worked all summer on
Sweebench and crushing numbers, but I tell you, the morale of this group is like nothing
I've ever seen. The mission is incredibly cool and fun. So I can't wait to tell you all about it.
So Brian, he is a West Point alum. My one experience in life with West Point was Rick Dalzell,
who was my biggest and most important customer I ever had, actually.
He ran all things complicated at Walmart, you know, massive logistics, half a million people
moving around.
And then he got poached by Jeff Bezos to work at Amazon.
So he was the number two guy at Amazon right when they were in total hypergrowth.
And he had a West Point background, really understood morale, people, logistics.
And then, you know, Brian went to Harvard Business School after that.
Sid went to Bits, which is, you know, the MIT of India.
It's actually statistically much harder to get into than MIT, if you can believe that,
I can.
So top of the class.
MIT let me in, so it's got to be some flaws there.
Oh, so humble.
So, Sid, after that, we spent a long time at Nvidia and saw it go from tiny to monstrous.
So that's got to be inspiring.
And I actually don't know their story before that, but they met at Harvard Business School.
And they're, I think, inspiring to a different class of people.
You know, they were already in a career path.
and then AI hits the world, but they're nimble.
You know, they're not going to watch it happen.
And this is way too rare, you know, people remapping their entire life to take advantage
of what's happening right now.
So I hope a lot of the listeners today get a lot out of their backstory and their transition
to building this incredible company, Blitzy.
Yeah, and I really want to frame the story here as David versus Goliath.
We've heard about a trillion dollars here, a trillion dollars there.
And how do you compete in that world, right?
If you're a young entrepreneur, you're building a company, and you're you're, you're
you're wondering, are you going to get literally decimated in the wake of Google or OpenAI or
XAI just, you know, happening to release a particular feature? How do you compete? What's your
moat? So, Brian and Sid, welcome. Pleasure to have you both. And where are you guys this
morning? I am in one Kendall Square here at the Link Studio offices with MIT right behind me. So we just
walk over the talent from the MIT AI Lab right over to here to work, which works well for us.
Nearby in Cambridge. Fantastic.
The first thing we had to overcome was convinced Dave that we could be successful when we're not
19 years old. So he totally flipped this paradigm on, I'm funding these young people.
And I said, you know, Dave, when I was these young, these kids' ages, I was, you know,
I was out, you know, across the ocean fighting at war. And I think I'd been enough experience
to hopefully have a second career here in technology. But Peter, I love
the question that you're asking, right, which is how on earth do you compete with these frontier
AI labs, with Google, right, with Open AI? And there's really two reactions you can have when
there's this trillion dollar investment, right? There's the reaction where you've built a company
that you say, oh, no, right? They're going to steamroll over me. Yeah. And then there's the reaction
where when every single model gets better and the combination of those models makes you
your product much better, you're jumping for joy.
So we just got a trillion dollars of R&D for Blitsey.
And we're rising a rising tide and you can float on top of all of that.
That's perfect.
But it's critical to find that product market that is able to benefit from the rise of these technologies.
The thing is, you know, this is the second time we're seeing this happening.
So I've, you know, I was at Nvidia back in 2016 and I heard all about this.
story of Kudah, and I saw Jensen believe in that. That was pre-Gen AI. The term
gen AI didn't exist, right? And I've been working on genetic AI models since the attention
is all you need paper came out. So Jensen was asked to stop investing in Kudah. He started
this back in 2006, right? And it was negative to the company. So, but he still invested in it.
He still believed in AI. He worked with researchers and built the technology to solve problems that
he foresaw, and that is very relatable to what we're doing. So Blitzie, if you go into this data for sure,
but it's very unique in terms of how the product is built. It is specific to the enterprise,
and it is based on the opportunity that we've seen over many years, you know, working at largest
companies like Nvidia. You know, I think we should begin for those who don't know Blitzie explaining
what it is. I want to get into like, you know, where were you guys when you said,
aha, we're going to build this, right?
I love that founding story.
And then I'll unleash Alex Weezner Gross on you
to ask the most intelligent, important questions.
Well, let me tell you what it does today,
and then I'll tell you the humble beginnings of all of this, Peter.
So this is an enterprise-grade autonomous software development platform.
So we ingest and understand up to 100 million plus lines of code
where most single LLM tools are stuck with this finite ability to understand context.
We've developed some really unique context engineering systems to understand enterprise-scale code bases.
From there, an enterprise will express their work from a development perspective, whether that's
a cobal to Java upgrade very common in these old financial service institutions that use us,
or steady-state development work. Witsy will send off the most compute-intensive workload in the
entire AI code generation space. We've done a 12-hour run. We've done multi-week runs for massive
of scale code basis, ultimately delivery high quality pre-validated, pre-compiled, pre-tested code,
right?
In our view, our thesis is we want to increase the quality of code at any cost because the
other side of a pull request that comes from AI code gen is human labor, which is exponentially
more expensive.
And so that's really the view that we have, enterprise scale, high quality code.
All right, Brian.
I want you to slow that down for a second.
which is to say a lot of companies out there,
a lot of traditional companies in particular
in the finance world, you're saying,
insurance world have large code bases.
They have software that they have inherited for,
how old are some of the software systems that you're playing in?
I mean, we're talking about PL1.
Oh, my God.
These are old school financial service institutions
that, quite frankly, for a long time,
have been afraid to touch the code because the cost to get something modern, this wasn't worth it.
Okay. So you've got a company out there running COBOL from, what, 20, 30 years ago?
Yeah, that's right. And their system is operating. It's working. It's not doing anything significantly
useful given that it's 20 or 30 years old. And do they have people that can still, you know, patch that
code? I mean, are there
engineers still around? They make
you and Dave look like spring chickens.
Spring chickens.
Oh, yeah, I know you're how long have to have a guy, Peter.
You look great.
So you've got this problem where
you're too scared to touch it
because you'd have to do a wholesale replacement.
And so they call in the blitzie guys.
And you come in and you're
able to do what
for this 30-year-old chunk of code?
For starters, we give them visibility.
So we'll ingest, index, and understand the state of their underlying source code,
of which times they rarely have somebody that understands the entirety of tens of millions
of lines with code.
It's actually an impractical problem to know that.
And then we allow them to execute large-scale transformations.
So whether that's getting onto a modern technology stack or that's adding required
functionality, these businesses, these enterprises are stuck with the inability to layer
artificial intelligence on top of their existing code base because it is so old, so
antiquated, and that's so little visibility of what they're doing. Massive value. Massive value
creation. It's a mind-blowing experience, too. If you take 10 million lines of unintelligible,
undocumented code, and you run it through blitzie, and then you say, tell me what it does
in plain English. Explain to me where there are bugs. It's just you're talking to the code. It's
mind-blowing. One of my favorite uses of Gen. AI is to give it some
patent that is unintelligible and say, how does this, what does this do? And how could I use it
in my company, right? The ability to take something that's complex and make it understandable to
grok it fully to use that term. Thank you, Robert Heinlein. Well, Sid's got 27 patents. So I had to
use that to understand what the heck you did it at in video. So, Sid, you had Nvidia for five
years? Eight years. Eight years. I heard, I heard Jensen recently say that most of the
executives there are now are now billionaires. Did you make it to that status?
Well, I held out to my stock. I wrote the way from like double digit billion to trillion
dollars, but then I got two degrees. You can do the math, Peter. And then what happened?
I got two degrees from Harvard, so they took all the money. You owe no way.
Wait a minute. Oh, my God. Let's not get started on the expense and value of degrees from Harvard or
MIT or any in any of these Ivy League schools.
So this is a David versus Goliath story, and I want to understand, you know, you've got incredible
success.
But before we get to that story, you guys are both at Harvard Business School.
When did this idea germinate?
What was the causative agent that said, okay, let's build that.
What was that founding story like?
Yeah.
If you can rewind the clock back to the GPT 3 to 3.5 era, right, where these things could code,
but it wasn't what we're experiencing today, right?
At that time, Sid and I were doing a pro bono project for our favorite local bakery here in Boston
as a part of our time at Harvard.
And they mentioned they're about to spend $300,000 or $400,000 on a new mobile app,
which got our attention, some young enterprise and entrepreneurs.
And so Sid and I went home, and we did what is.
now, you know, two and a half years later called Vibe Cody, and we built them the application
overnight, right? But through that experience. Literally over the weekend. Over the night,
yeah, yeah. Which now is like no big deal, but if we didn't pay, did they pay you, you know,
$200,000 for that? We should act like it took longer. I think that was our, that was our first
mistake. But was so clear during that time is that Sid and I were actually the bottleneck
for development. And so we were, you know, we, you know, we,
You get an error and you beat it back to the system and then you give it to a different model, right?
And then through that practice, you're able to get much higher quality code.
And so we said, if we could just invent a system where all the commoditized development work could be removed, right?
And we could have multiple models going back and forth, iterally refining and getting to code that compiles and tests.
That is going to be what the future looks like, right?
And we learned that by doing, by being hands-on, and then having an idea of what the enterprise needs from Cid's experience and building towards that.
Were you guys already friends or did the All-Nighter making friends?
Yeah, yeah, we've only gotten, I mean, Sid's actually the godfather to my, to my, like a son, later not.
Okay, that's important.
Investing in best friends is pretty good.
I mean, and that's another part of the story, Dave, that we've talked about is some of the most successful companies are when best friends get together and just build 24-7.
It is a, you know, I say this to the entrepreneurs that I coach, you know, being.
an entrepreneur and having co-founders, you're going to spend more time with your co-founders
in the trenches than you do with your husband or wife or kids. It's an intense period of time
and you better pick somebody or some buddies that you love spending time with. Yeah, I always tell
people, imagine you're on a long international flight, one of those 14-hour flights and you're
sitting right next to somebody. Do you walk off the plane feeling good and having fun or do you walk
off the plane not waiting to get away from this person? Your startup's going to feel just
like that every day. It can be work or it can be fun. It just depends on the personalities and the
match. So Dave, what do you find most exciting about Blitzy just from as investor? And so they
came through Link Studios and Link Investment. And yeah, tell a little bit of that story, if you
would. Well, they're definitely much more experienced than a lot of the entrepreneurs around the
studio. So they had the plan and the idea of fully baked on arrival. We were still first money in
and still gave them space and support
and have made a bunch introductions,
but they already had it more than figured out.
So that's, you know, that's not all the companies fit that profile.
They're also, you know, very different.
A lot of the companies coming right out of dorm rooms
will do image generation.
They'll do, you know, they don't understand the word cobalt
or PL1 to save their lives.
And so when I look across the range of business plans
that are right in front of us with AI,
more of them fit into the,
you need to understand the domain space
then you can just think of it in a dorm room space.
There's maybe two-thirds, one-thirds, rough numbers.
And so what I'm hoping with Brian and Sid
is that they inspire a ton more people to go after these.
These are still multi-trillion dollar markets.
But they're not, you know, AI girlfriend.
They're not, you know, apartment search.
Another photo sharing app.
Yeah, another photo sharing app.
They're, you know, and they get really deep.
You know, you've got all these, you know, manufacturing,
you know, semiconductor manufacturing automation.
you know, that's really deep.
You get insurance actuarial risk adjustment.
That's very deep.
This one is actually nice in that it's very, you know,
code generation is very broad.
So it's a huge market.
But it's also deep in that, you know,
like refactoring 10 million line code bases is a pretty deep knowledge set.
The other thing that's really cool about Blitzy to me
is that we have all these code generation products.
So I use cursor.
We've got windsurf, replit, lovable.
Companies are all worth billions now.
But they're, you know, I write a lot of code or I tell it to write a lot of code.
It creates a button for me.
I say, I don't like that button, make this other widget.
And, you know, you're doing it in real time.
But you can't build something really big.
You know, and when you put cursor in full agent mode, it's right in no man's land.
It sits there and grinds for like five or seven minutes, which is too long to wait, but too short to build something substantial.
So they're getting stuck in no man's land.
Blitzie just said, no, we're going all the way to the other end where it's going to run all night long.
or all week long, like Brian was just saying, and come back with something really big.
And that's just fundamentally a much different engineering problem than what lovable
replet, cursor, windsurf are doing. It's just a different kind of company. I don't know of any
other company that's there. Amazing. Well, we're here today to announce a particular piece of news
as well, some groundbreaking news. Is it Brian or Sid, which one do you want to talk about what
Sid, please, you are the inventor of the technology here.
Sure.
So. Tell us.
You know, so every time a new model comes out, they benchmark on this leaderboard, which is called
SweeBench verified.
The leaderboard itself was built by OpenAI.
It is a subset of suibench.
It contains 500 problems that were vetted by the researchers at OpenAI, and they confirm
that these are solvable problems, and these are worth testing models on.
And it's been ubiquitous.
So every time when the new one comes out, you always see results.
the current top of the leaderboard on the sweepbench website as a filming is 75.2%.
So we hired a bunch of extremely talented interns.
So Dave, going back to Jack, if we could hire him today, we would.
That's how good some of these interns are.
And every single intern who worked for us, they were amazing.
We were, you know, we really credit this to their effort and to Nirajj who led the efforts
on our end.
But we ran Blitzie on sweepbench.
And as it turns out, these are 12 repositories, but they have 500 branches.
And that equates to 400 million lines of code if you ingest them on Blitzie, right?
We've ingested anywhere from 1 to 2 billion lines of code overall on Blitzie,
depending on how you count it, because you count updates and the whole raw thing as well.
So it's 2 billion lines of code if you count all of the updates, right, including Sweet Bench.
So we ingested all of that.
We ran Blitzin on solving the problems.
And our final result, accounting for everything that we've tested and verified using SBCLI, was 86.8%.
That is a significant jump over the current leaderboard on the website.
And the last time this was done was when Devin had a 13% jump from 1% to roughly 14. some percentage.
So we've come a really long way with the system.
and the primary reason that we were able to achieve this,
echoing some of the points that we made earlier,
is we're very different from the existing tools of structure, right?
So one thing is you can reproduce these results in production using Plyssie.
We've not added any custom saffolding just for Swaybench.
We've not tampered with any of the features as to achieve this.
We've seen reports from, you know, some of the other labs that claim that even though,
for example, let's say, a latest frontier model claims 80% on SviBench if you actually run it
and I reproduce it, you get 60%, right?
And we wanted to not have that problem.
We care deeply about reproducibility and the practical real-word applicability, right?
Which three-bench verified has been vetted to be good at.
So you can reproduce these results, and it's live as of today.
Amazing.
Hey, Alex, help us understand how big and important this particular hallmark is for the company and for the world.
give us some background here sure well well first to brian and sid congratulations on your
announcement i think there i would expect there's going to be an enormous amount of interest
from the community once they they hear these results in in trying blitzie and in reproducing those
results so congratulations in advance on the onslaught of interests that i expect who will receive
i think to answer peter's question i think software engineering is arguably the
first major vertical of human labor that is very high economic value, very high productivity
that is perhaps succumbing to automation. So any sort of step function improvement in
software engineering is arguably super transformative to the global economy. And maybe just
pivoting on that thought, one of the first things that I was wondering when I heard that you
would be announcing these results, and maybe jumping back eight months, we all.
remember when Deepseek, aka High Flyer, launched R1 January of this year, and there was sort of an
aha moment all around the world. They didn't just announce a new reasoning model. They announced,
and maybe this got a lot less attention, they announced a bunch of new open source libraries
at the systems level, like a new file system. So one of the first things that I was wondering
when I learned that you'd be making this announcement is the whole world is sitting on the
sort of palimcess of legacy libraries and operating system code, billions and billions of lines
of code, Linux, Python, Gnu, all of these libraries. Is there something that you and Blitsey
and this new remarkable capability that you're announcing can do to speak to what can we do
to improve the performance of this entire tech stack that the whole world runs on at this
point. That's a fantastic point of question. You know, we've been running some of these
experiments. We've been taking some of these open source libraries that, for example, was written
MATLAB for one of our customers that they were using and we converted it to Python.
Matlab was written 20 years ago. It was specific to Windows and we made it OSGnostic.
We've run these POCs all the time where, you know, we go from OSGnostic to OS specific to
agnostic, from traditional to modern. But we've also been running other kinds of POCs where, for
example, we picked an Nvidia repo. We identified an issue that was marked as open.
And we just put Blitzie added and we saw it. We created the pull request that's all the issue.
So if you think about that problem, you can identify bugs, issues, feature requests in any of the modern frameworks and systems.
The system is not limited by how much code, how big the repo is. So you can send Blitzie added. It will come back with the solution.
You don't like it. You can iterate over it. You can trade five projects, get five different pull requests.
see how that works and deploy it, all within a matter of days.
I think that's a fundamental shift that really is going to change the way people work
with open source and also close-source technology.
Amazing.
What is the largest repository of code that you actually tackled?
Brian, you want to say?
Yeah, I got it.
So we frequently see 20 million lines, but I think the absolute largest we've seen
is about 60 million lines if we want, would it successfully.
That's crazy.
Just for comparison, for fun, if you had to guess, how long would it take in terms of human labor hours to do that?
It's insane.
It's insane.
Brian, I think we frequently sculpt these, right, as part of the POC process.
And I think you could probably speak to that.
So there's a question of, like, how long would it take to GROC, 60 million lines or code?
And the reality is it's just too big for a human to understand.
So you might pay, you know, it's into $100 million in three years, and they might come back with some more.
some diagrams over the 60 million lines of code.
And by the way, by the time they did that,
what they came back with would be out of date.
Exactly, which is why you can see that, you know,
essentially the industry has been stuck.
This is why your airlines are always misrouted
and they can't get their software updated.
This is this kind of fundamental problem.
So we, every time we run Blitzy,
we actually estimate for the clients in production
all our enterprise clients, how many hours were automated.
And the CIOs love this because it's like the KTIA,
they go give the forward on how many hours they've automated away
by their intelligent vendor selection.
But Gronkine is sort of an impossible problem,
but the real value is in the code generation, right?
Being able to accurately affect and develop code
and accelerate that life cycle for the development team
while in that large underscerned underscoring corpus.
But importantly, like Alex,
I know you want all developers to sort of go away
and we're going to live in a society of abundance,
but I think it's kind of going to go in the opposite direction
where it's almost an infinite demand for code, right,
and for software development.
And so Blitzie doesn't do everything.
Blitzie does about 80%
of the quantum of work on average for these large-scale problems,
but it knows exactly what it doesn't do, which is really the power.
And it kind of hands off that batch of work to the human developers to finish things out.
So we get a really clean full request, plus human labor at the end to accelerate the development,
but not sort of remove the need for the developers altogether.
So I have just, if I may, just to pull in the thread, Brian, I mean, I would argue we're about to enter sort of an age,
not necessarily of just abundance, but of great projects when it's possible to send
a lot, basically let lots of automation loose on the world and fix all the problems,
solve everything, as it were. In the case of Blitzy, this is letting AI agents lose on
an enormous sprawling legacy code base and just fix everything. I think it's a good name for
the episode, solve everything. Or a book or, you know, fill in the blank.
But are you familiar?
Maybe you're tracking this project.
I love this idea.
The Great Refactor.
Yeah, yeah, I love this.
What is that, Alex?
What's the Great Refactor?
So the Great Refactor, I love this.
This is like a classic solve everything concept.
This is we've built our whole civilization on a bunch of software libraries that could be better
maintained that are filled in many cases with legacy memory vulnerabilities.
There are statistics out there that most of the insecurity of present-day.
software is due to the way the software is written that exposes them to certain type of
cybersecurity vulnerability, memory vulnerabilities. And if we can only rewrite all of these
libraries that sort of, if you know the meme that goes around of the entire stack being
built on, of civilization being built on just like one block, it hangs by a thread,
if we could rewrite all of these libraries and dependencies and software supply chain
upstreams that our whole civilization depends on in rust,
or some other memory-secure language,
suddenly that would fix almost all.
That would solve everything in terms of so many vulnerabilities.
I think it had like 200 customers sent me that project.
Like they just immediately saw that.
It became hot news of the day, and they all sent it to me like,
oh, are you guys going to do this?
And I said, well, are you going to pay for it?
Like, I think it's worth it.
Here's the most critical thing, right?
If you think about this idea where you can give these projects,
three factors or whatnot to AI and have it come back
with the code, right? You can do that with any chatbot. You can give it to any AI, have a
right. Getting code from AI is a commodity, right? But if you add constraints to that problem where
the code needs to replicate the existing functionality, it needs to compile, all the unit test needs
to pass, it should not have newly added security vulnerabilities, and all the other items that are,
you know, crucial to the enterprise or the problem itself, right, that make it valuable. That's when
the challenges begin, right? Now, that is not something you can achieve. So, Sid, my question is,
I've got 3.2 billion lines of code, which is my genome. Can you compile that for me? Can you,
go and identify a more, you know, fix the, fix the broken parts? As long as LLMs can write, as long as it's
in the language that LMs are trained on understanding, we can do it. The scale is the problem that
we've solved for. And the other problem we've solved for is making sure the requirements match
so that when we put you back together or edit you, you actually look like you, right? It's validated
that it is you. We didn't change or break something that we shouldn't have. Alex, what do you think
about? Yeah, so maybe narrowly on the bio, I, I, there are many other projects that that speak
the language of the genome and the proteome that I think, Peter, for rewriting your genome,
you'll have the opportunity over the next few years to use one of these biological sequence-based
foundation models to do some variant of that. I do want, for Brian and Sid, though, really
pull on the economics of this. So I really want to press you guys. When we talk about the great
refactor or some of these great projects to basically rewrite the source code basis for much of
our civilization today, and you think about the economics of that. And there is a school of thought
that says we're seeing generative AI hyper deflate by 10x per year or so, an order of
magnitude cost reduction every year. At what point in your minds do you think using Blitzy
or maybe competitive tools does it become reasonably economical to basically rewrite all
of the legacy code out there that civilization depends on? I mean, I would argue from a value
perspective, it's there today because the value that this would be providing to society is just
dramatic. Now, this is a question of, like, who's the payer? A line of code from Blitsey is
100x more cost basis than a line of code from sort of any other provider, which is
maybe 100x less than it would be from a human developer. So we're talking about huge
or is a magnitude difference. And so would it be worth it from a society value to rewrite all
the software today with Blitzie? Absolutely. But am I going to continue to serve these financial
service institutions and insurance companies first that are readily paying me today? Like, yes. And so I think
If you grab the capital funding for us to break even on this AWG,
we'll start rewriting all the, we'll rewrite Linux TV just too.
Brian, I'm going to insert another topic in here.
You know, you guys shot a really cool podcast.
It's on your LinkedIn, where you're just bantering between the two of you
about the fact that the definition of truth within large-scale software has always been
the functional code.
You know, here it is.
This is the final thing.
It runs, the PL1 code that does all the nav accounting for the mutual funds, you know,
over at State Street.
It's like millions of lines of legacy PL1.
But that debugged code is the core asset.
And then the documentation is just something around the edges.
Post-Blitzy, the truth moves to the documentation because you can regenerate the code
overnight anyway.
And so your actual core asset has moved from code to a document, but it's going to move again.
And this is where it was really cool to hear you guys bantering around, like, well,
what is then the foundational truth of this piece of, because, because, because, you know,
like Alex is saying, the entire infrastructure of society is about to move and also expand
100 or 1,000 or a million X, you know, because code is so cheap to create all of a sudden,
we have much, much more of it. So you've got a much bigger world. But the ground truth is
some other format than just, you know, PL1 or COBOL or Python code. It's this human-readable
today spec becomes the central asset. That's a big shift.
The source. Yeah, the spec is still an abstract.
action layer, right? And so that's easy for the human to look at, right? The real source of
truth or understanding is actually we create a customer-specific hybrid graph vector database that
understand exactly what is going on from a functionality perspective. And you could change that
functionality from one language to another, but we are capturing the core essence of what is
required there. And we can display that as a spec, which is 200 pages, but of 20 million lines
of code, that's a intermediate representation. And really we want to get back to the
the core DB level understanding.
That's the core asset for these folks.
And Blitzy makes it.
It's the property of the enterprise.
Hey, everybody.
There's not a week that goes by
when I don't get the strangest of compliments.
Someone will stop me and say,
Peter, you've got such nice skin.
Honestly, I never thought,
especially at age 64,
I'd be hearing anyone say
that I have great skin.
And honestly, I can't take any credit.
I use an amazing product called One Skin,
OS01, twice a day, every day.
The company was built by,
four brilliant PhD women who have identified a 10 amino acid peptide that effectively reverses
the age of your skin. I love it and like I say, I use it every day twice a day. There you have
it. That's my secret. You go to Oneskin.co and write Peter at checkout for a discount on the
same product I use. Okay, now back to the episode. Brian, given your background, you know, in the military,
I mean, probably the one institution that's got the, you know, largest repository of, of, of
ancient code has got to be the U.S. government, right? I mean, so can you attack all of that?
I mean, unlock massive productivity. There's a lot of fear that the U.S. is a falling empire.
It's inability to, you know, understand and legislate efficiently. I mean, couldn't you have,
like, just a single massive impact on the U.S. government? You know, Peter, I live in Back Bay here in
Boston, and so does this lead investor at Incutel, or so we tell.
me because he keeps strutting into me when I'm walking to work and just bumping in and see how things are going, which has to be skeptical.
But the short answer is a short answer is yes.
Like, the U.S. customer is sort of a fantastic end customer to ultimately modernize to get your flights there on time to make getting IDs easier, right?
And so this is like an absolutely part of critical infrastructure that is a target customer that we have certainly.
Do we have them today?
No, 12 months will we'll be serving them?
Like, I think yes.
Yeah, I mean, that could catapult you into a, you know, a decade-billion-dollar company easily, just landing that kind of a customer.
Because once you've modernized, you know, for one of the agencies, they're all going to want it.
Yeah, I think we have the most top-secret security clearance to patent ratios of any company out there.
Fascinating.
I'd like to maybe pull on the theme that we've talked about on the pot in the past, the elephant in the room, which is recursive self-improvement.
So how much of Blitzy is written by Blitzie?
A lot. So it's interesting, right?
This is actually a question of like, where's the core value?
And I would say every single sprint that we do from a software development perspective is driven by Blitzy.
And so I would say a significant amount of the corpus of the code is driven by us.
Now, there are algorithms that are not about writing code, right?
they're about core invention.
So I think most of the company's core IP is not going to be the software that it
trades, but it's going to be some core invention that this is around.
And think about Google with page rank, right?
Then back to the literary page, not actual pages.
But page rank is really the core source of their original IP.
And SIDS invented a number of algorithms at the core of Blitzy that allow us to, for instance,
always compile code to never have circular dependencies.
So you could actually rewrite sort of a corpus of Blitzy with Blitzy pretty readily and pretty
quickly and it would look like it, but would it do what we do?
The answer is sort of no, and it gets the question, like, what is the source of IP for
companies?
And it's got to be a breakthrough and invention.
If you think about what we're doing, right, for these large companies, we're telling them,
for many of them, we're telling them how to use AI, right?
We're coaching them and consulting with them.
How do you can not do that unless you've actually used it yourself and perfected the process,
right?
Because they're not just making the engineering velocity changes or the, you know, tool user changes.
they're also making the process changes.
And that's why this is critical.
Alex, I'd love you to take a second and dive into the Sway bench metric here.
Again, if you could, for those who are not familiar, you know,
a little bit of the origin, you said from Princeton and ratified by Open AI,
but what is it measuring and who were the top of the leader boards before?
And then I want to get into the conversation about how do you compete against the MAG7
or against, you know, sort of the frontier models in this regard?
So let's contextualize it first, understanding what the sweep bench metric
really is. Sure. So for content. And why do we start with the title of the white paper? Because
that's going to drop same day that this podcast does. So people are going to need to find the paper.
And we should put a link to the white paper in the in the show notes here as well. Alex, please.
Sure. So maybe let me just speak to Sweebench. So Sweebench is a benchmark that measures the
ability for AI systems to solve typical software engineering, SWE,
for SWI tasks in the specific form of responding and solving issues on GitHub, a very popular
source code management system.
So SWEBENCH as a whole, not SWEBENCH verified, consists of a couple thousand instances
of tasks in which the central challenge that's posed to an AI is to respond to an issue
in a code base. And what would happen in a normal software engineering context is, what kind of
issue, Alex? So a wide range of issues could be bugs that need to be fixed, other performance
issues. And the usual workflow in a software engineering context is an issue will be identified
and pull request will be submitted. So identifying an issue, responding to an issue,
submitting a pull request that responds to an issue, satisfying unit tests.
These are all standard parts of what would be archetypically considered software engineering.
And SweeBench, I think, is sort of an excellent industry standard at the moment that
attempts to capture the life cycle of high-value-ed labor that a typical software engineer
would perform.
Now, I'll let the guys respond to their white paper.
Sorry, yeah. And so I'll respond by also answering your core question, Peter, which is how does one compete with the MAG7 in this utterly important labor task, right? And the reality is there's a significant amount of the MAG7 at the core of what Blitzie does. So we use Gemini's models. We use anthropics models. We use open AI's models. But what's unique about this technology moment in time is when you use these models against one another, the quality.
moves up quite dramatically. And when you use them against one another, hundreds of times, right,
in hundreds of different combinations, which hundreds of different tool sets and prompts,
the combination goes up even more exponentially, right? And so really, it's the art of orchestration
through what we call extended inference time validation to move up the quality of code,
ensuring at every moment in time the system has the right context to operate, despite the
large-scale underlying code base. So we say we're excited when Gemini or OpenAI release a new model,
like our product gets just dramatically better than a single time.
That's a really important insight, right?
Building so that the better your components are, the stronger you are as a whole.
And that's a unique niche.
When did you realize that?
I mean, that's sort of a fundamental for you.
I think we realized it back when we were just initially to figure out this first project together.
But I'll let's say expand on this.
I think we made a bet and we said that there's not a doubt.
Like we were building this when models had 5,000 tokens of context, right?
And we made a bet that there's no doubt that context windows are going to expand
and the models are going to get better at writing code.
Now, do we want to go and compete with the Mac 7 and build our own model?
Or do we want to stand in the holders of giants and use the technology to solve the problems
that they're actually meant to solve?
And that's exactly what we did.
Amazing.
And just for context, again, who had the record before you?
Who had the record last week?
I think there were some open source labs and there's also been some other, you know, unpublished.
reports that have claimed, you know, around 80%, but the 86.8% that we're claiming has,
is unprecedented in the highest number we have seen.
I think ByteDance was the most recent Trey model to be at the top there.
So, you know, we can rebrand this U.S. versus China if you want to.
Every time I hear 80-something percent, 90-something percent, I'm thinking that these benchmarks
are getting super saturated, right?
And so where does this go next?
I mean, what are you going to measure when you're at 100 percent?
Yeah, we talk about this in the white paper.
Like, this is, we certainly need new benchmarks, but the, the reason we didn't do
street bench verified for a long time is it's just not representative of the scale
of problems that most people use splits and forth.
So the typical pull request size is like 100 lines of code, and the largest repository
is a million lines within this.
So we really need a set of benchmarks that's on Linux and VS code, which is 20 million
and four million lines, respectively, with holdouts against those, trying to do larger scale work
to ultimately show how far we can push the bounds on autonomy.
Yeah, and in terms of Peter's saturation question, the paper does a really, really good job
of describing the landscape of benchmarks and Swee Bench in particular and the need for a new
benchmark.
But one of the points it makes is that when you score 86% on this benchmark, that's effectively
very close to 100% because the remaining subset of questions are just flawed. They're not
harder. They're just not, they're not structured well. And so you've basically capped out this
benchmark now. So if folks want to look up the paper, what's the name of the paper? Do you have
that? We, Alex is urging us to retitle it. So I'll give you the subtitle, which is a domain-specific
context engineering paired with extended inference sign validation breaks and barriers of
LLM-driven software development.
So that's really what we're talking about.
That rolls off the tongue and onto the floor.
It is a technical favor.
You know, you can search Blitzy and Sid's name.
He has a searchable name.
Brian Elliott, there are thousands of Brian Ellis.
It's a turn to ask.
But Sid Pardeschi plus Blitzy will get you to the paper.
Okay, perfect.
Alex, where do you want to take this next?
What have you found important and fascinating about what Blitzie is doing?
What's the implications in the long term, buddy?
It's so interesting, so many different directions to go in.
Maybe just to go back to this idea of great projects,
because I think Blitzy has the potential to be sort of an embodiment of an era
when we just, again, turn all the AI agents loose on all of the problems and a discipline.
So what I heard, I think, Brian, you say a few moments ago,
was something like 100x price difference per line of code or profile between blitzie with
its, you know, again, congratulations state-of-the-art performance announcement and other competing
tools. When I hear you say 100x price difference, I immediately internally say, oh, well, that's
just two years worth of cost hyper deflation. So you're sort of two years more expensive than the
competition is the way that I heard that. So if you project forward two years, three years,
four years, do you think we will find ourselves in a world where AI really has, the great refactor
has been completed? And we've rewritten all of our foundational systems with Blitzy or maybe
copycats of Blitzy. Do you think we find ourselves in a near future like that?
I would say pontificate first here. I think, you know, I think we will, we will
see the models get significantly better at doing this and the cost go down. The key thing that
I would like to underscore is a lot of the approach at some of the labs. And then what's happening
right now is the labs aren't really making money on the inference that they're running,
right, from the models. But what we're doing is because we're using the labs and we're able
to charge a premium, right, for the work that Plixie does and also provide the validations with
it, we're not losing money on the code that we're writing, right?
So as this equation improves over a period of time, the difference that Blitzie is able to create
is going to also grow.
So I think somewhere in that double negative, I heard the answer is that, yes, as hyper deflation
kicks in, call it an order of magnitude cost reduction per year, maybe more, not only does
Blitsey become very profitable, but also becomes very feasible to start to tackle these
solve everything level grand challenges in software engineering.
Yes.
So I've been attempting to kick the tires on Blitzy myself. My first project with Blitzy was
I wanted to rewrite Python, the very popular programming language. And I gather you, Brian,
and Sid, you've had access to your own product longer than I have, which is only
been two days. Have you tried to take some large-scale project? I think, Brian, you mentioned
Linux a few minutes ago. Have you tried to take some large project and either say, gosh, I want
to ask Blitzie to improve performance by 10% on some relatively mature code base or add some
crazy transformative new feature. Have you tried that?
Yeah, we did a fun project. We've done a number of these acts. But the, we did a lot of
One of the most fun thing we did, we onboarded VS code, and we said, hey, add a chat experience to VS code.
At the time, there was, you know, Cursor, I still is right, one of the biggest tools out there.
We tried, and we did, we built one of a subset of the features of Cursa using FITC, and we tried using that internally, right?
So anytime we consider SaaS products at this point of time, we're trying to first see if we can replicate that internally using Plitzy.
And if we're, you know, a few months out or a few years out, we'll say, hey, let's just use the SaaS products at the starting point and consider rebuilding it later on.
I think a lot of enterprises will do this.
Have you thought, I mean, so sort of free marketing advice before the public, have you thought about taking all of these open source projects that are in many.
cases, starved of core development team members and Hungary for human capital or human capital
equivalents, taking these projects and just aggressively setting loose the AI agents to submit
very friendly, very polished poll requests to these projects to launch improvements.
We've actually done that for MS flow, I believe, right, Brian?
Yeah, yeah, we've done this, especially for one of the enterprises specifically rely on.
It's quite possibly the best BDR, which is just sending pull request to open.
source libraries.
And we, there's an open source one right on the homepage of our website that I think
you'll find fascinating Alex, which is, it goes all the way back we're talking at the beginning
of the episode.
So AWS invented this or created this repository specifically to be incredibly messy mainframe
code, right?
So all different styles to represent, you know, different decades of people working on it.
And ultimately, say, like, how would one use code generation to be able to move this from
target cobal to target Java?
Oh, cool.
We ran that through Blitzy.
I guess mainframe is such a big problem in all these large, even government organizations.
And we moved it from COBOL to Java completely autonomously with the ability to compile
right out of the box.
And now there's sort of like remaining development to work on some runtime stuff.
But this is a multi-year-long project to move mainframe from a target messy coal into Java.
And the results of that have been probably one of the best business development tools that we've ever created.
How fast did you do it?
How long did that take?
A couple days?
Yeah, if you caught everything from start to end,
it was a week's worth of inference.
In Justin, yeah.
Amazing.
So if we project that going, maybe, if I made back to Peter's question
about what the human equivalent of this is.
Do you have any metrics that you can point to for either cost savings relative to
humans for a given unit of code, a line of code, or per file,
or how much faster than humans this is in general?
We typically see from a speed perspective,
a 5x philosophy difference when this is brought into the enterprise.
And the biggest challenge is really like operational deployment, right?
And so you're used to sort of starting your work the same day that you pick,
pull up your ID.
And so what we're having these organizations do is sort of start to sprint
for the next development work the week prior.
And so instead of developers starting with tickets and tasks,
they're starting with code that's mostly written.
and a project guide with all of the human tasks to begin.
So we really focus on this 5X.
Anytime we engage in enterprise to work with us,
we say pick a real world project
that you have upcoming next quarter.
Let us prove a 5X difference
and we should do the remaining development work
so we have an end-to-end solution.
And if we do that, then you do dot puts it across the org.
And it's incredibly successful
because people don't realize the cost of coordination in development
and all of this sort of requirements
to actually get a piece of full software out
that when you can offload a significant chunk of that to agents, the velocity gains are
honestly unbelievable.
That's a really cool insight because a lot of the younger teams are going into enterprises
and then they've never been in enterprises before and they're saying, look, the raw code
generation is 1,000 X, 10,000 X.
You're like, yeah, but what's it going to do for me in my enterprise?
And there are very few that are credible in saying, well, we actually have done it and
we know the final answer and right now it's 5X apparently.
But they don't know like the overhead of the, you know,
The organization and the documentation and all of the things that happened before you can even start code generation.
And so it's really nice, actually, to have at least one vendor that understands how to get the real thing.
We actually need this to work in the end.
It can't just be a hypothetical 1000X.
It's three years from now.
We've got digital superintelligence has landed.
It's come out of, you know, I'm going to put my bets on Google, but we'll see.
What does Blitzy look like?
Yeah, I think Blitzie is the core system of record and system of action for software development in the enterprise environment, right?
And so the source of truth moves from documentation and code, which is sort of like this hybrid today, to organizations relying on Blitzie's hybrid graph vector database that understands a core functionality, and organizations are going to be able to move incredibly quickly from a software perspective, and the source of value is going to be sort of core idea.
that's not easy to replicate.
To add more into that, you know, Peter,
there's always going to be some tasks
where it's better to have a human in the loop
and do them sequentially, right?
You're solving a problem that has never been solved before
and you need quick feedback from AI, right?
That's always going to be there where you use the co-pilots,
but there's always going to be this other category of tasks
where you can automate them away, right?
Build the code, run it, deploy to production,
and execute maintenance.
Blitzie now, you know, gives you the code.
The code is the final output.
but we're going to go into autonomously maintaining, deploying,
and keeping the applications running.
So you're not going to need humans for specific sections of the entire enterprise.
It's all going to be driven by AI.
I think Alex was about to paint kind of a two-year view.
He asked a question about, you know, what's the force multiplier today?
But then I think we were going to next segue into,
okay, but there's 100x and another 100x coming.
So I would love to finish that thought, Alex.
Totally.
Absolutely. So there's a lot of, to Dave's point and Brian and Sid, I think you were starting to gesture in this direction as well.
There's a lot of interest in the benchmark out of meter that's measuring the effective time of autonomy, the characteristic time scale over which AI systems, including AI co-gen systems, can basically operate without a human intervention, sort of like a disengagement with a driverless car.
how far can it drive without a human needing to take the wheel, as it were?
So I'm curious, have you thought about the characteristic time scale over which Blitsey is
able to do autonomous co-gen, or the human equivalent, really, of autonomous coding before which
the human needs to step back into the loop and be involved.
Right now, if I remember correctly, the current state of the art is something like one to three
hours. There's a nice, very clean on a, at least on a semi-log plot expectation that, and I think
we've discussed this previously, if you projected out a decade or two, we get to many, many years
and perhaps hundreds of millions of years and in a few decades, where does Blitzy fall in this
apparent exponential trend towards exponentially increasing times without humans needing to be in the
loop?
I think this is a X-Ex project. If you think, if you think,
Think about all the pieces needed to achieve this, right?
Let's take a very small example.
Let's take the ADWS example, right?
There's the part where you identify the requirements,
decide what you need to do, get the code,
and there's a part where you get it all the way to production,
and if you look at those parts,
each of them have already been automated in isolation.
For example, CICD, how do you deploy the code to production?
You have automations for that.
Debugging, security analysis,
monitoring in production, tracing, and viewing the logs,
ensuring that the system is not doing anything malicious.
All of these items exist today, and there are, you know, these blue layers that we are now seeing,
like, for example, MCP, A2A, right, that allow agents to form this mesh and automate work.
The only thing that's left to be done, really, in my opinion, for these projects, is to just connect the dots.
And that's exactly what we're working on.
So do you answer your question, you know, how far out?
I would say we are months out, actually, from delivering projects completely autonomously,
as long as they meet a certain set of criteria and conditions.
So what I just heard you say, Sid, correct me if I'm wrong, is that the documentation writers, the spec writers, are the new limiting factor for the speed of software developments. Is that correct?
That is correct.
How do you think about automating that process, if at all?
That is also automated. So if you go to chat GPT, right, and you ask it to write documentation at will following your criteria.
The reason we have document, you know, writers in the first place is to have quality, right? We have concerns that models.
lose context or a period of time and they skip and omit things or they can be gamed into,
you know, adding things that you don't want. And we're really concerned for quality control,
this is why we have humans. But as you can see, we've solved the context problem for large
code basis. And there's nothing really stopping anyone for that matter from effectively adding
in the right safeguards and, you know, layers of protection to ensure that we minimize the need for
humans. I think it's a matter of us becoming comfortable with AI doing that. And I definitely see
that happening over the coming months.
To try to be the downside.
Begging for a follow-up white paper because Alex's question is infinitely recursive, right?
If you said, okay, well, then that's not the constraint, then what, you know, because
there's always going to be a constraint is turtles all the way down.
You've got to just ask, okay, what's the next?
Speed of light, buddy, speed of light.
Yeah.
And also, yeah, Douglas Adams, right, that famously pointed to that the problem is far more,
far harder to pose than the solution and to the extent that the new limiting factor is the
spec writer or the prompt engineer, whatever we end up calling it in the future, I really
would like to press you sit on this. Like when do we get our automated program manager, product
manager, spec designer, documentation writer, if that really is the limiting factor for the speed
of software engineering in the near future. Automation all the way to do. I'll tell you something,
Alex. You know how the Blitzie platform works? We have these thousands of patients.
And each of these agents has a persona.
There is a product manager agent.
There is a software architect agent.
There is a QA agent.
And there is an agent that writes the prompts for the other agents.
So all of the challenges that you're describing are live today in production with the Plytzee panel.
This is such an important question, though, because, you know, I know it sounds very hypothetical.
But, you know, look at this timeline on Sweebench here.
This is only 18 months ago that you got like 12%.
And now it's saturated.
It's only been 18 months.
So, you know, like, what we think of as the distant scientific future, you know, it's all science fiction.
It's only a year, a year and a half in the future.
Star Trek's coming, buddy.
It's really, really hard to anticipate.
Yeah.
Now, I love this question.
Listen, I want to, I want to wrap.
This is another way paper waiting to happen.
I want to wrap this episode with a conversation amongst all of us on a particular topic.
We opened up talking about trillion dollar pay packages, trillion dollar investments.
you know, numbers that are extraordinary.
And the sovereign funds, the venture funds, family offices are just supporting this with massive
capital inflows.
And so the question is that I put to all four of you, think about competing in the long
term with the MAG7, who've got this incredible access to capital.
How should founders consider going about that, right?
What's your advice to others who are getting in here during this period of exponential growth in the AI economy?
How do you compete?
How do you think about that?
I think you want to be a large customer of those folks as well.
I mean, we are major customers of all the AI frontier lines.
And so they're quite excited that we're going to continue to push the bounds of autonomy.
And their market paths are going to continue to grow probably dramatically in line with that return on investment.
and Blitzie is going to ride those waves as well.
And so if you're happy when they're successful
and they're happy when you're successful,
that I think you're in a pretty good strategic position.
But there's one more thing, you know, to add on to that
what I'd like to go back to what Dave said, right?
Mercor, for example,
it was able to do that because it went deep.
I think that's also the case for us, right?
We've seen the enterprise,
by the enterprise side of the challenge
and the enterprise perspective and the security roadblocks
and the product roadblocks and the process gaps
that stop them from taking the full advantage of the products.
So if you're an entrepreneur or a founder,
and you've seen this personally,
and you've struggled with this problem,
and you think you have a solution that addresses the core of that,
and you've been able to test that with the actual enterprise
and demonstrate effectiveness,
I think you're holding on to something that is core.
So you're saying understand the problem deeply.
Yes.
Understand the problem.
Because, look, you have these Macs there, and they're giants.
They have all the money.
That's fine, right?
you're an entrepreneur, you're nimble, you can find the right investors.
We were grateful to find, you know, Dave who believed in us the moment we pitched it.
And we were able to get just a right amount of capital to get started.
And that's really all you need.
If you have the right talent, the right amount of capital, and you have the right problem
that you're going after that you're convinced about because you've experienced and solved it,
then you're going to be so nimble and make these moves and get a product out that is
significantly better than anything that the Max 7 can put together because they're struggling
with their own challenges.
like bureaucracy and all of the hurdles that they have to go through to actually put out
politics struggling with what to say over dinner with Donald Trump exactly
while they distracted all of that you can build a cake-ass product get it to market solve real
world problems and you've changed the world effectively Alex what's your thought how do you
how do founding entrepreneurs compete with companies I mean I remember famously you know
Amazon was out there as a platform for people to sell their products
But then when Amazon saw a product, they had incredibly high, you know, margin and growth,
they would clone the product and compete directly.
How do you keep from that happening?
Two words.
Solve everything.
In the name of this episode, solve everything.
The world is filled with so many problems that a startup standing on the shoulders of the trillions of dollars of Kappex that are being invested in cloud,
AI chips, fabs, energy are now poised to solve so many problems, thousands of problems.
I think Brian and Sid, and again, congratulations on the benchmark announcement, are well-poised
potentially to solve the problem that we face of decades of civilizational software
croft legacy code that's just piled up without enough human capital to invest in reinventing it.
And now I think we're arguably on the verge of doing that.
That's one of thousands of problems, entire domains that could be solved.
Protein folding was solved by AlphaFolt, essentially overnight transforming a subset of structural biology.
So many more opportunities.
Before I go to you, Dave, I just want to remind people, you know, I define an entrepreneur
as someone who finds a juicy problem and solves a juicy problem.
And the more entrepreneurs in the world, the more problems that get solved, the better the world is.
That's why we're going to hit on this over and over again.
I think the career of the future is being an entrepreneur, finding a problem, falling in love with the problem, not the solution, not the tech.
Because if you understand the problem deeply, as the tech evolves and continues, you're going to use the newest version to go and solve that problem.
And again, some of my favorite lines, the best way to become a billionaire is help a billion people.
And the world's biggest problems are the world's biggest business opportunities.
So that's what entrepreneurship means.
Dave, you see hundreds and thousands of companies.
You've got how many companies right now in the Link Studios?
28 in the building and about 50 total.
Amazing.
What are you, when you're looking to invest in a young entrepreneurial team like Brian and Sid or like the founders of Mercore or, again, some of the incredible unicorns that we've backed out of Link exponential ventures, what are you looking for to make sure that that company isn't going to get?
get disrupted in the wake of a, you know, open AI or Google slight jog to the right.
You know, it's funny, Kevin Wheel, we asked that exact question in that podcast we did two weeks
ago, and he answered exactly the way I had hoped he would answer, which is in a world where
the foundation model companies get to AGI and can do virtually anything, are you just going
to take over the world? And, you know, Kevin was really clear that maybe we can do that, maybe we
can't we probably can't anyway but even if we could we don't want antitrust to come in here and
break us up you know that's the last we want a huge thriving ecosystem of partners that give us money
you know it's blitzie one of those companies that gives us money yes therefore they're our best
friend go conquer the world i love that take over change the entire foundation of all legacy code base
make a trillion dollars and give us half of it we'll all be happy i took that's what they want
i took a two hour walk yesterday with a dear friend of mine here who runs a large venture fund
And we're talking about the notion that his bet was, you know, Google had so much more capability than they unleashed.
And they said, look, it's an open AI, go and do as much of this as you can because we need someone out there competing with us.
Otherwise, we'll get broken up for antitrust reasons, which is a fascinating idea.
You need viable competition to help you price, to help you remain on the edge, to help you not be, you know, sort of broken.
down by the government.
And be a good partner.
That was when Google was growing like crazy and we had all these portfolio companies.
We made a ton of gains.
But be a good partner to Google while they're growing like crazy.
And now it's the foundation.
Just be a good partner.
Talks them all the time.
Make sure you know where they're going and they'll love you.
Amazing.
I have a selfish question.
I don't know if we're running out of time here.
No, that's fine.
It'll close with your selfish question.
Okay.
Okay.
Well, this is always looking for traits.
Like this has obviously been one of our best investments.
and the sky's the limit from here.
And I'm always looking for trades of success.
And the morale at Blitzy is like nothing I've ever seen.
You know, which is not a no-brainer.
When you're doing video generation for a movie studio or whatever,
it's easy to keep high morale,
but when you're doing, you know,
five million lines of code, core, cobal conversion.
But yet you guys have just this crazy thriving culture.
And Sid mentioned, you know, we were first money in.
I don't remember why we loved the deal so much.
I do remember we absolutely was a no-brainer to invest in you guys.
So two things jump out at me.
One of them is BITS, which is just the hardest place in the world to get into.
And in video, which is, you know, you've seen growth.
The other one is Brian, I think you had Army Ranger.
Bangalore Institute of Technology?
That's the BILA.
BILA Center of Technology.
Yeah.
It's a cool name, though, BITS.
It's like MIT, BIT, but it's BITS.
And it was, you know, it was, by the way, MIT you designed the curriculum for BITS.
So that statement was actually true.
Oh, that's cool.
The other one, though, was, you know, Brian, I think you had not just West Point, but Ranger training, which is freakishly hard.
And then first boots on the ground in Syria, so literally the first people touching a war zone.
So I got to feel like there's something in those experiences that puts you a cut above in terms of building a team, managing logistics, building morale.
So any clues there that other founders can pick up on and build on?
Yeah, I think some evidence of being incredibly mission-driven and ambitious is what you would
see if you were an anthropologist looking at both of our backgrounds.
But if you take me, for instance, and if you fast forward or I guess rewind to 2017
when I was serving in the 75th Ranger Regiment, the mandate was going to Syria.
There's about 2,000 ISIS fighters in that old Raqa.
We're going to send you with 100 guys, recruit everybody.
else and take back the city, right? And oh, by the way, we can't let anybody in the United
States know we're here because we're there covertly, right? And to be able to sort of go in
and solve that problem, like that's a very ambitious undertaking where we're like conquering
cities isn't something that like most people have spent their time doing. So when you look at
for the level of ambition of the company, everybody here at the business, at the business,
let's see, has that ethos, right? And the very first thing we do when we interview is we
screen for ambition and the ability to invent and create. We have those core values. And if you
if you talk to any single person that sits in this building right here, they will tell you
and they're right that what we're doing is one of the most important things they will do in their
lifetime. Because the economic expansion that the globe gets, the GDP expansion that you get
from automating software development or at least huge chunks of software development, there's
almost no better incremental use of energy than driving towards that goal.
Wow. That's a beautiful thought.
Are you guys a 996 or 997 shop?
It's Saturday today, we'll be an estimate tomorrow.
Yeah, we're a 997 kind of crew.
Oh, my God.
Just to back Dave up on his 997 job.
That was a mistake.
Well, I hope my dance was the last leader on Street Bench Verified, so we can work longer than
them and beat them on the leaderboards.
Oh, my God.
Guys, listen, congratulations on hitting that new benchmark.
But more importantly, thank you for the work that you're doing from the companies that will benefit, from our government that will benefit, from the world that will benefit.
You know, this is you're upgrading the DNA of industries and of our planet.
So grateful for you.
Alex, Dave, any closing thoughts here?
I'm just super excited to see what you can get, what you guys can bring to the future.
I would, very few things would excite me more on.
the software engineering front than a few years from now to learn that the entire software stack
that I run, that companies that I work with run has been 99% rewritten by Blitzie, by Blitzies
agents to remove all the vulnerabilities, improve all the performance. I think it's the sort of
challenge before you guys that sets us on the road to recursive self-improvement and abundance
and also solving everything in in software solving everything that's my that's my phrase for the day let's
solve everything it's better it's better to dave's 997 than tang ping uh the opposite of 996
line flat in response to overwork yeah my closing thought definitely everybody read the white paper
the title may sound very complex but the paper itself is very very readable uh so please read it
And then your takeaway will be, wow, okay, now we need a new benchmark.
Inside baseball, Blitzie's already working with MIT to create the next generation benchmark.
But catch up to what they did right here by reading the paper.
To all our subscribers, thank you for following moonshots and WTF episodes.
We're grateful for your time.
We hope that, you know, in spending the time with us, you're able to understand how incredibly powerful this technology is for transforming our world, our lives,
creating a future of abundance.
I hope this counters all the dystopian news you get on the 6 and 7 o'clock news.
That stuff I don't watch.
This is the stuff I focus on.
I hope you do too.
I'm grateful to my Moonshot partners, AWG, Dave Blundon, Saleem, wherever you are,
transiting the Atlantic to come back here to the U.S.
And again, Brian and Sid, congratulations on your epic wins,
excited for your future success.
Every week, my team and I study the top 10 technology metatrends
that will transform industries over the decade ahead.
head. I cover trends ranging from humanoid robotics, AGI, and quantum computing to transport,
energy, longevity, and more. There's no fluff. Only the most important stuff that matters,
that impacts our lives, our companies, and our careers. If you want me to share these metatrends
with you, I writing a newsletter twice a week, sending it out as a short two-minute read via email.
And if you want to discover the most important meta-trends 10 years before anyone else, this reports for you.
Readers include founders and CEOs from the world's most disruptive companies and entrepreneurs building the world's most disruptive tech.
It's not for you if you don't want to be informed about what's coming, why it matters, and how you can benefit from it.
To subscribe for free, go to Demandis.com slash Metatrends to gain access to the trends 10 years before anyone else.
All right, now back to this episode.
Thank you.
Thank you.
Thank you.