Moonshots with Peter Diamandis - The State of AI: Elon’s $1T Package, Apple’s $600B for Trump & How Startups Win w/ Dave, AWG & Blitzy Founders Brian Elliott & Sid Pardeshi | EP #193

Starting point is 00:00:00 A piece of muse I saw yesterday that had me scratch in my head, which was the trillion-dollar pay package for Elon. Elon Musk could officially become the first trillionaire. Just about a trillion dollars in stock. It's a striking number, but the benchmarks that he has to meet are also equally striking. He's not just the leader of the company. He's the marketing voice. If we really do expect to find ourselves in an abundant society soon, we should expect to have a lot of trillionaires in our society. money will start to have far less value than ever before. If we're really on the verge of abundance, then what comes after that?

Starting point is 00:00:38 What's going to remain scarce, even as energy and intelligence, the cost of both of those goes to zero. But stuff is changing so quickly. Media is changing quickly. Elon is paving a new path for what it means to be a great corporate CEO. But it's going to change again and it's going to change again and it's going to change again. A trillion dollars here, a trillion dollars there. How do you compete? What's your moat?

Starting point is 00:00:56 So, Brian and Sid, welcome. Pleasure to have you both. Now that's the Moonshot, ladies and gentlemen. Everybody, welcome the Moonshots. The news that really matters in your life. I'm here with my Moonshot mates, Dave Blundon, Alex Wiesner Gross. And we're going to have a conversation today about David versus Goliath. We're talking about trillion-dollar commitments, trillion-dollar pay packages.

Starting point is 00:01:25 It's insane. But first of all, one of the most important pieces of news is, Alex, I'm reading the comments. And what I keep on hearing is people want you, ask you a question, just have you talked for the entire episode. They love what you have to say. So I'm not going to do that today, but everybody, if you're new to moonshots, Alex has been just receiving huge fan mail because of his brilliance. It's an honor to be here, Peter.

Starting point is 00:01:52 I think I got maybe two adoption offers. I think you did. Dave, are you jealous, Dave? Am I jealous of Alex? I mean, I'm surrounded by so many people that are so brilliant. I don't know. So I'm used to it by now. That's one of the most important things is finding incredibly brilliant people to have around you in life.

Starting point is 00:02:12 Yeah, the old saying you're the average of the five people he spend the most time with. And if you've got a community of individuals that are really uplifting you and challenging you, that have you do the best you can, And that's critically important. So one of the things I love about moonshots in our WTF episodes is sort of measuring toe to toe and trying to really have this conversation in a meaningful fashion. We're missing Salim, Miss Meal again. Salim, we miss you, buddy. I think he's back in India.

Starting point is 00:02:39 He's probably got, you know, a crate full of iPhone 17s coming, but we'll be talking to him soon about that. So I want to open up with a piece of news I saw yesterday that had me scratch in my head, which was the trillion-dollar pay package for Elon for Tesla. It's like, that's extraordinary. Have you guys gotten an offer from your boards for a trillion-dollar paycheck? A trillion? Well, yeah, if you hit the metric, the metrics have to be, you know,

Starting point is 00:03:06 multi-trillion-dollar market cap. But sure, why that? It's just a fraction of what you create. It's if Elon grows Tesla to an $8 trillion company, so it doubles the size of Microsoft and Nvidia, he'll earn a trillion dollars. not like he needs help becoming the first trillionaire on planet Earth.

Starting point is 00:03:26 Yeah, but he's unique. He's really changed the definition of what it means to be a CEO. And he's not just the leader of the company. He's the marketing voice. Most car companies will spend 7% of revenue or thereabouts on marketing. Tesla spends zero because Elon's a one-man force of nature

Starting point is 00:03:43 driving consumers to the product. And we're going to meet a couple entrepreneurs in a minute here that have that same like we embrace social media. we're creating morale and momentum like you wouldn't believe. But, you know, that's a 20% pay package as opposed to the normal five. Yeah. It's worth it.

Starting point is 00:04:00 I agree with you. If you're a founding CEO and if you're very, very shy, that's not going to bode well for the company. I know when I'm investing in companies, I'm looking for a CEO who's a great communicator, who's able to go out there in front of the crowd, able to convey their passion what he or she is loving in life. and for all of the insanity Elon does with chainsaws or whatever the case might be, it grabs attention and people either love it or hate it. One of the many reasons we love Alex so much is because he's not just brilliant, but he has very, very high situational awareness.

Starting point is 00:04:38 That's really rare around the brilliant community. But stuff is changing so quickly. Media is changing quickly. Elon is paving a new path for what it means to be a great corporate CEO, but it's going to change again. going to change again and it's going to change again. So, you know, if you map your behavior to the change, but most people study backward in time, and they say, well, what did Jack Welch do? Or what did Genghis Khan do? You know, like, it's okay, but that's not going to, anyway, we'll go on,

Starting point is 00:05:07 this. Brant will go too long if I go too far as. I also think, Peter, it's worth noting if we really do expect to find ourselves in an abundant society soon, we should expect to have a lot of trillionaires in our society. We will, and then we'll have an expectation that money will start to have far less value than ever before, right? I mean, you and I have had this conversation, Alex, about a post-capitalist society, right? Thoughts on that? Do you still believe that's going to be the case?

Starting point is 00:05:35 Well, so I think that that's always the question. What does so-called late-stage capitalism even look like to the extent the concept to make sense. If we're really on the verge of abundance, then what comes after that? And I think the what comes after abundance is closely tied to what remains scarce in an abundant society. In Star Trek, a common foil, energy is relatively abundant. Intelligence is relatively scarce. The ability to travel between stars relatively scarce. So the question I would ask is, what's going to remain scarce, even as energy and intelligence, the cost of both of those goes to zero. That is a critical question.

Starting point is 00:06:16 You know, my end point here, my mental experiment is if I, you know, in Eric Drexler's parlance, if I build a number of assemblers that are able to rearrange atoms and I drop an assembler into my hand and I say, hey, make me, you know, five copies of yourself and I give each of you an assembler. And the assembler is able to use energy and matter, resident, and build anything. and I drop an assembler into the soil here, and it starts pulling the atoms together to make me an electric Ferrari, and it says, I need a little bit of titanium, a little bit of lithium, you add it, and all of a sudden you've got an electric Ferrari.

Starting point is 00:06:51 Everything starts to become effectively zero marginal cost, and that becomes a pretty cool society where anyone can do anything. Does it? Or, I mean, again, not to overindex on Star Trek, but in Star Trek, everyone has replicators, but not everyone gets to travel between the stars. So maybe that the new post-scarce ability is the ability to travel outside the solar system. Well, we're going to find out because we're getting there really fast. On the news item of trillion dollars here, a trillion dollars there, there's a dinner with Tim Cook and Sam Altman and Mark Zuckerberg and Trump.

Starting point is 00:07:31 And I guess during this dinner, an offer was made by was it Zuck first or was it Tim first to invest? $600 billion into the U.S. economy. I think it was Tim first. I can't tell from the clips, actually, because they get cut and mingled. But then the other one matches it, and all of a sudden, over dinner, Trump is getting $1.2 trillion of commitments into the U.S. economy. You missed the really fun punchline there. Tim Cook had it all scheduled, planned, and budgeted, and then Mark said, I'll match that.

Starting point is 00:08:01 It was like a YMC fundraiser. If you can do that, I can do that. I'm sure his CFO is back in Silicon Valley going, what the hell? hell did he just commit to? Oh, my God. But it is, there's a third piece that comes out in this related story, which is Altman announces to his employees that he expects Open AI to be the most capital intensive company in history. And what was the number? Alex, $119 billion of additional investment between now and 2029. It was something like that. I mean, do you remember it was a whole, what, a year and a half ago or so that this number of five or six or seven trillion dollars

Starting point is 00:08:43 of CAPEX into AI chips was being floated. And a lot of people laughed at that. And yet, and yet, we're finding ourselves a year and a half later in a world where it is entirely plausible that the true amount of capital expenditure in fabs and AI chips and data centers and new energy sources completely exceeds that. I think, yeah. I'll file that. I'll file that away. way because you're dead right and this is the effect we see all the time something insanely mind-blowing is predicted six months in the future everybody is like impossible then it actually happens and then they're like oh yeah well it's just part of life and this trend is is going to you know it's happening over and over and over again but the numbers you just quoted yeah everybody was like

Starting point is 00:09:29 oh sam's just blowing smoke there's no way that's that's you know that's trillion being thrown around, but that's not a real word. That's just sort of a euphemism. And then here we are just a few months later. You're going to hear some benchmarks, actually, like Sweet Bench later in this podcast, where things have just been crushed that the timelines will blow your mind. Well, we'll get to it when we get to it. Well, I guess the point I'm making here is there's a huge amount of capital flowing here. I mean, we never, I mean, go back to when all of us were starting our careers in the 90s and in the dot-com era, the idea that'd be trillion dollars. movements of capital in any particular company or any particular industry, which is mind

Starting point is 00:10:10 boggling. And here it is routine. But this is in some sense, this is sort of a wonderful opportunity with trillions potentially of of CAPEX being invested. There's going to be an expectation, I would assume, by capital markets, that there's going to be enormous revenue generation that pops out of those trillions in CAPEX. And the question you have to ask yourself is, what form does that take? At some point, with trillions invested, I think there's probably a reasonable expectation that entire classes of labor, of services are going to be automated and the cost of what we currently construe as labor is going to be driven down to zero. And then perhaps at some point immediately after that, you start to need transformative science, inventions, discoveries that will

Starting point is 00:10:57 really justify the trillions of capex. So it's sort of a blessing in disguise. I would argue, trillions of CAP-X is going to motivate the demand and the supply of utterly transformative discoveries and inventions soon. Otherwise, why invest trillions in this? Yeah. The concern, of course, a lot of folks have is around inflation and are these dollars really inflated dollars? We're going to find out. But it's interesting, Dave. And in particular, as a venture capitalist, you know, seeing the valuation. of companies going at this level. I mean, for the average public, how do you get into any of these companies when they're

Starting point is 00:11:39 coming out at, you know, multi-hundred billion dollar and trillion dollar, you know, sort of valuations here? I mean, being able to get in early is one of, I think, the areas that you've been focusing on. One of the other companies that came out of Link Ventures, your early check in here, was Merckor. And I just saw Merckor has gotten a offer at a table. $10 billion valuation.

Starting point is 00:12:05 You know, you must be pretty happy about that. It's a $10 billion valuation, but it's also a half a billion dollar revenue run rate after two years, which is completely unprecedented. So go back. When did you invest in Mercor? Two years ago, first funding, you know, right? What you really want to look for is undervalued, underappreciated talent and not so much concepts.

Starting point is 00:12:28 But they had the concept right already. It's rare. But 18-year-olds old, you know? I mean, that's not a lot of people invest in the 18-year-old gang. So a couple of 18-year-olds come forward with this idea. Do you remember what the opening valuation was when you invested? 30 million plus or minus five. Okay.

Starting point is 00:12:45 And so 30 million to 10 billion in two years' time. Yeah, yeah. It's got to shatter all kinds of records. But again, I don't want people to feel like that's a bubble because the revenue growth also shattered all kinds of records. Yeah. And so from a cold start, I don't think anything like that's, ever been done before. But you're going to see a lot more of them now, too. They just are setting the

Starting point is 00:13:05 trend for many, many other companies. I think what's different about them is they're inspiring an age of people that normally would be, would have been uninvestable five years ago, 10 years ago. And now it's kind of, wow, mainstream. We've made that point that the average age of a unicorn, VC-backed unicorn a decade ago was sort of mid-30s, right, in terms of the average age of the founders. And today, I think, Dave, what you found out of the investments we're doing, especially out of MIT and Harvard, it's age 20 to 23. And these guys were 18 when they started? Yeah, 18. They got through one year of college, and then they got frustrated with the pace like everybody does. But they met in high school. Which goes to the point, Alex, you made last time

Starting point is 00:13:53 on the last WTF episode, which was, listen, if you really believe we're sort of post-AGI, on the verge of, you know, ASI, advanced superintelligence, going to college during those years and trying to get credits versus building something, it's not the right trade. It's going to distort all sorts of societal cues and societal expectations. The best, I would say, fiction treatment that I've seen of this is a novella by Werner Vingy, Fast Times in Fairmont High, where you see this start to completely distort the way secondary education is run in this country, and you start to see high school students and middle school students suddenly spending all of their time doing startups. And I think it's entirely plausible. We find

Starting point is 00:14:40 ourselves in a near future that looks a lot like. I agree. And one of the points here, I think that we need to realize, or people need to realize, is the MAC, you know, sort of peak creativity. If you measure it by when a Nobel laureate does their Nobel Prize winning work, or not when they get their prize, but when they actually did their work, is typically in the first half of your 20s. Alex, do you have the data there at all off the top of your tongue? Not at my fingertips, and I've seen those statistics too, and I've seen how they vary from field to field purportedly math versus physics versus other fields.

Starting point is 00:15:14 I also tend to discount this notion because I expect that in the very near future, most of the innovation is actually going to come either from pure AIs or some sort of human AI hybrid. So I view those statistics maybe self-servingly as more of a retrospective. This is how things used to be at best versus how they're going to be in the future. Every week, my team and I study the top 10 technology metatrends that will transform industries over the decade ahead. I cover trends ranging from human robotics, AGI, and quantum computing to transport, energy, longevity, and more.

Starting point is 00:15:47 There's no fluff. Only the most important stuff that matters, that impacts our lives, our companies, and our careers. If you want me to share these Metatrends with you, I writing a newsletter twice a week, sending it out as a short two-minute read via email. And if you want to discover the most important Metatrends 10 years before anyone else, this reports for you. Readers include founders and CEOs from the world's most disruptive companies and entrepreneurs building the world's most disruptive tech.

Starting point is 00:16:13 It's not for you if you don't want to be informed about what's coming, why it matters, and how you can benefit from it. To subscribe for free, go to Demandis.com slash metatrends to gain access to the trends 10 years before anyone else. All right, now back to this episode. Well, there's another company I want to talk about here, and we have a couple of guests to join us. If you're an entrepreneur, listen to how they built this company.

Starting point is 00:16:39 This is a company on the doorstep of being a unicorn itself, a company that you're going to hear a lot about in the coming years. Dave, you want to introduce our guests and Blitzy? God, I cannot wait. So we have Brian Elliott, Sid Farnashi, the founders of Blitzy. My son, Jack, interned with them this summer. And I tell you, it drove my wife a little nuts. She started thinking, wow, we're going to play a lot of tennis this summer and have a great time. Jack got so wrapped into the culture of Blitzie so quickly. It's the most high-energy place I think I've ever seen. Morale is off the charts. And so he pulled in his best friend, from high school, Yashba Lachetti, they pulled in a couple of other young computer science majors at Northwestern and a couple other places. The whole gang worked all summer on Sweebench and crushing numbers, but I tell you, the morale of this group is like nothing I've ever seen. The mission is incredibly cool and fun. So I can't wait to tell you all about it.

Starting point is 00:17:41 So Brian, he is a West Point alum. My one experience in life with West Point was Rick Dalzell, who was my biggest and most important customer I ever had, actually. He ran all things complicated at Walmart, you know, massive logistics, half a million people moving around. And then he got poached by Jeff Bezos to work at Amazon. So he was the number two guy at Amazon right when they were in total hypergrowth. And he had a West Point background, really understood morale, people, logistics. And then, you know, Brian went to Harvard Business School after that.

Starting point is 00:18:10 Sid went to Bits, which is, you know, the MIT of India. It's actually statistically much harder to get into than MIT, if you can believe that, I can. So top of the class. MIT let me in, so it's got to be some flaws there. Oh, so humble. So, Sid, after that, we spent a long time at Nvidia and saw it go from tiny to monstrous. So that's got to be inspiring.

Starting point is 00:18:38 And I actually don't know their story before that, but they met at Harvard Business School. And they're, I think, inspiring to a different class of people. You know, they were already in a career path. and then AI hits the world, but they're nimble. You know, they're not going to watch it happen. And this is way too rare, you know, people remapping their entire life to take advantage of what's happening right now. So I hope a lot of the listeners today get a lot out of their backstory and their transition

Starting point is 00:19:01 to building this incredible company, Blitzy. Yeah, and I really want to frame the story here as David versus Goliath. We've heard about a trillion dollars here, a trillion dollars there. And how do you compete in that world, right? If you're a young entrepreneur, you're building a company, and you're you're, you're you're wondering, are you going to get literally decimated in the wake of Google or OpenAI or XAI just, you know, happening to release a particular feature? How do you compete? What's your moat? So, Brian and Sid, welcome. Pleasure to have you both. And where are you guys this

Starting point is 00:19:38 morning? I am in one Kendall Square here at the Link Studio offices with MIT right behind me. So we just walk over the talent from the MIT AI Lab right over to here to work, which works well for us. Nearby in Cambridge. Fantastic. The first thing we had to overcome was convinced Dave that we could be successful when we're not 19 years old. So he totally flipped this paradigm on, I'm funding these young people. And I said, you know, Dave, when I was these young, these kids' ages, I was, you know, I was out, you know, across the ocean fighting at war. And I think I'd been enough experience to hopefully have a second career here in technology. But Peter, I love

Starting point is 00:20:16 the question that you're asking, right, which is how on earth do you compete with these frontier AI labs, with Google, right, with Open AI? And there's really two reactions you can have when there's this trillion dollar investment, right? There's the reaction where you've built a company that you say, oh, no, right? They're going to steamroll over me. Yeah. And then there's the reaction where when every single model gets better and the combination of those models makes you your product much better, you're jumping for joy. So we just got a trillion dollars of R&D for Blitsey. And we're rising a rising tide and you can float on top of all of that.

Starting point is 00:20:55 That's perfect. But it's critical to find that product market that is able to benefit from the rise of these technologies. The thing is, you know, this is the second time we're seeing this happening. So I've, you know, I was at Nvidia back in 2016 and I heard all about this. story of Kudah, and I saw Jensen believe in that. That was pre-Gen AI. The term gen AI didn't exist, right? And I've been working on genetic AI models since the attention is all you need paper came out. So Jensen was asked to stop investing in Kudah. He started this back in 2006, right? And it was negative to the company. So, but he still invested in it.

Starting point is 00:21:38 He still believed in AI. He worked with researchers and built the technology to solve problems that he foresaw, and that is very relatable to what we're doing. So Blitzie, if you go into this data for sure, but it's very unique in terms of how the product is built. It is specific to the enterprise, and it is based on the opportunity that we've seen over many years, you know, working at largest companies like Nvidia. You know, I think we should begin for those who don't know Blitzie explaining what it is. I want to get into like, you know, where were you guys when you said, aha, we're going to build this, right? I love that founding story.

Starting point is 00:22:15 And then I'll unleash Alex Weezner Gross on you to ask the most intelligent, important questions. Well, let me tell you what it does today, and then I'll tell you the humble beginnings of all of this, Peter. So this is an enterprise-grade autonomous software development platform. So we ingest and understand up to 100 million plus lines of code where most single LLM tools are stuck with this finite ability to understand context. We've developed some really unique context engineering systems to understand enterprise-scale code bases.

Starting point is 00:22:46 From there, an enterprise will express their work from a development perspective, whether that's a cobal to Java upgrade very common in these old financial service institutions that use us, or steady-state development work. Witsy will send off the most compute-intensive workload in the entire AI code generation space. We've done a 12-hour run. We've done multi-week runs for massive of scale code basis, ultimately delivery high quality pre-validated, pre-compiled, pre-tested code, right? In our view, our thesis is we want to increase the quality of code at any cost because the other side of a pull request that comes from AI code gen is human labor, which is exponentially

Starting point is 00:23:27 more expensive. And so that's really the view that we have, enterprise scale, high quality code. All right, Brian. I want you to slow that down for a second. which is to say a lot of companies out there, a lot of traditional companies in particular in the finance world, you're saying, insurance world have large code bases.

Starting point is 00:23:46 They have software that they have inherited for, how old are some of the software systems that you're playing in? I mean, we're talking about PL1. Oh, my God. These are old school financial service institutions that, quite frankly, for a long time, have been afraid to touch the code because the cost to get something modern, this wasn't worth it. Okay. So you've got a company out there running COBOL from, what, 20, 30 years ago?

Starting point is 00:24:16 Yeah, that's right. And their system is operating. It's working. It's not doing anything significantly useful given that it's 20 or 30 years old. And do they have people that can still, you know, patch that code? I mean, are there engineers still around? They make you and Dave look like spring chickens. Spring chickens. Oh, yeah, I know you're how long have to have a guy, Peter. You look great.

Starting point is 00:24:46 So you've got this problem where you're too scared to touch it because you'd have to do a wholesale replacement. And so they call in the blitzie guys. And you come in and you're able to do what for this 30-year-old chunk of code? For starters, we give them visibility.

Starting point is 00:25:08 So we'll ingest, index, and understand the state of their underlying source code, of which times they rarely have somebody that understands the entirety of tens of millions of lines with code. It's actually an impractical problem to know that. And then we allow them to execute large-scale transformations. So whether that's getting onto a modern technology stack or that's adding required functionality, these businesses, these enterprises are stuck with the inability to layer artificial intelligence on top of their existing code base because it is so old, so

Starting point is 00:25:36 antiquated, and that's so little visibility of what they're doing. Massive value. Massive value creation. It's a mind-blowing experience, too. If you take 10 million lines of unintelligible, undocumented code, and you run it through blitzie, and then you say, tell me what it does in plain English. Explain to me where there are bugs. It's just you're talking to the code. It's mind-blowing. One of my favorite uses of Gen. AI is to give it some patent that is unintelligible and say, how does this, what does this do? And how could I use it in my company, right? The ability to take something that's complex and make it understandable to grok it fully to use that term. Thank you, Robert Heinlein. Well, Sid's got 27 patents. So I had to

Starting point is 00:26:19 use that to understand what the heck you did it at in video. So, Sid, you had Nvidia for five years? Eight years. Eight years. I heard, I heard Jensen recently say that most of the executives there are now are now billionaires. Did you make it to that status? Well, I held out to my stock. I wrote the way from like double digit billion to trillion dollars, but then I got two degrees. You can do the math, Peter. And then what happened? I got two degrees from Harvard, so they took all the money. You owe no way. Wait a minute. Oh, my God. Let's not get started on the expense and value of degrees from Harvard or MIT or any in any of these Ivy League schools.

Starting point is 00:27:04 So this is a David versus Goliath story, and I want to understand, you know, you've got incredible success. But before we get to that story, you guys are both at Harvard Business School. When did this idea germinate? What was the causative agent that said, okay, let's build that. What was that founding story like? Yeah. If you can rewind the clock back to the GPT 3 to 3.5 era, right, where these things could code,

Starting point is 00:27:36 but it wasn't what we're experiencing today, right? At that time, Sid and I were doing a pro bono project for our favorite local bakery here in Boston as a part of our time at Harvard. And they mentioned they're about to spend $300,000 or $400,000 on a new mobile app, which got our attention, some young enterprise and entrepreneurs. And so Sid and I went home, and we did what is. now, you know, two and a half years later called Vibe Cody, and we built them the application overnight, right? But through that experience. Literally over the weekend. Over the night,

Starting point is 00:28:07 yeah, yeah. Which now is like no big deal, but if we didn't pay, did they pay you, you know, $200,000 for that? We should act like it took longer. I think that was our, that was our first mistake. But was so clear during that time is that Sid and I were actually the bottleneck for development. And so we were, you know, we, you know, we, You get an error and you beat it back to the system and then you give it to a different model, right? And then through that practice, you're able to get much higher quality code. And so we said, if we could just invent a system where all the commoditized development work could be removed, right? And we could have multiple models going back and forth, iterally refining and getting to code that compiles and tests.

Starting point is 00:28:47 That is going to be what the future looks like, right? And we learned that by doing, by being hands-on, and then having an idea of what the enterprise needs from Cid's experience and building towards that. Were you guys already friends or did the All-Nighter making friends? Yeah, yeah, we've only gotten, I mean, Sid's actually the godfather to my, to my, like a son, later not. Okay, that's important. Investing in best friends is pretty good. I mean, and that's another part of the story, Dave, that we've talked about is some of the most successful companies are when best friends get together and just build 24-7. It is a, you know, I say this to the entrepreneurs that I coach, you know, being.

Starting point is 00:29:28 an entrepreneur and having co-founders, you're going to spend more time with your co-founders in the trenches than you do with your husband or wife or kids. It's an intense period of time and you better pick somebody or some buddies that you love spending time with. Yeah, I always tell people, imagine you're on a long international flight, one of those 14-hour flights and you're sitting right next to somebody. Do you walk off the plane feeling good and having fun or do you walk off the plane not waiting to get away from this person? Your startup's going to feel just like that every day. It can be work or it can be fun. It just depends on the personalities and the match. So Dave, what do you find most exciting about Blitzy just from as investor? And so they

Starting point is 00:30:07 came through Link Studios and Link Investment. And yeah, tell a little bit of that story, if you would. Well, they're definitely much more experienced than a lot of the entrepreneurs around the studio. So they had the plan and the idea of fully baked on arrival. We were still first money in and still gave them space and support and have made a bunch introductions, but they already had it more than figured out. So that's, you know, that's not all the companies fit that profile. They're also, you know, very different.

Starting point is 00:30:38 A lot of the companies coming right out of dorm rooms will do image generation. They'll do, you know, they don't understand the word cobalt or PL1 to save their lives. And so when I look across the range of business plans that are right in front of us with AI, more of them fit into the, you need to understand the domain space

Starting point is 00:30:55 then you can just think of it in a dorm room space. There's maybe two-thirds, one-thirds, rough numbers. And so what I'm hoping with Brian and Sid is that they inspire a ton more people to go after these. These are still multi-trillion dollar markets. But they're not, you know, AI girlfriend. They're not, you know, apartment search. Another photo sharing app.

Starting point is 00:31:17 Yeah, another photo sharing app. They're, you know, and they get really deep. You know, you've got all these, you know, manufacturing, you know, semiconductor manufacturing automation. you know, that's really deep. You get insurance actuarial risk adjustment. That's very deep. This one is actually nice in that it's very, you know,

Starting point is 00:31:32 code generation is very broad. So it's a huge market. But it's also deep in that, you know, like refactoring 10 million line code bases is a pretty deep knowledge set. The other thing that's really cool about Blitzy to me is that we have all these code generation products. So I use cursor. We've got windsurf, replit, lovable.

Starting point is 00:31:53 Companies are all worth billions now. But they're, you know, I write a lot of code or I tell it to write a lot of code. It creates a button for me. I say, I don't like that button, make this other widget. And, you know, you're doing it in real time. But you can't build something really big. You know, and when you put cursor in full agent mode, it's right in no man's land. It sits there and grinds for like five or seven minutes, which is too long to wait, but too short to build something substantial.

Starting point is 00:32:18 So they're getting stuck in no man's land. Blitzie just said, no, we're going all the way to the other end where it's going to run all night long. or all week long, like Brian was just saying, and come back with something really big. And that's just fundamentally a much different engineering problem than what lovable replet, cursor, windsurf are doing. It's just a different kind of company. I don't know of any other company that's there. Amazing. Well, we're here today to announce a particular piece of news as well, some groundbreaking news. Is it Brian or Sid, which one do you want to talk about what Sid, please, you are the inventor of the technology here.

Starting point is 00:32:56 Sure. So. Tell us. You know, so every time a new model comes out, they benchmark on this leaderboard, which is called SweeBench verified. The leaderboard itself was built by OpenAI. It is a subset of suibench. It contains 500 problems that were vetted by the researchers at OpenAI, and they confirm that these are solvable problems, and these are worth testing models on.

Starting point is 00:33:18 And it's been ubiquitous. So every time when the new one comes out, you always see results. the current top of the leaderboard on the sweepbench website as a filming is 75.2%. So we hired a bunch of extremely talented interns. So Dave, going back to Jack, if we could hire him today, we would. That's how good some of these interns are. And every single intern who worked for us, they were amazing. We were, you know, we really credit this to their effort and to Nirajj who led the efforts

Starting point is 00:33:47 on our end. But we ran Blitzie on sweepbench. And as it turns out, these are 12 repositories, but they have 500 branches. And that equates to 400 million lines of code if you ingest them on Blitzie, right? We've ingested anywhere from 1 to 2 billion lines of code overall on Blitzie, depending on how you count it, because you count updates and the whole raw thing as well. So it's 2 billion lines of code if you count all of the updates, right, including Sweet Bench. So we ingested all of that.

Starting point is 00:34:17 We ran Blitzin on solving the problems. And our final result, accounting for everything that we've tested and verified using SBCLI, was 86.8%. That is a significant jump over the current leaderboard on the website. And the last time this was done was when Devin had a 13% jump from 1% to roughly 14. some percentage. So we've come a really long way with the system. and the primary reason that we were able to achieve this, echoing some of the points that we made earlier, is we're very different from the existing tools of structure, right?

Starting point is 00:34:54 So one thing is you can reproduce these results in production using Plyssie. We've not added any custom saffolding just for Swaybench. We've not tampered with any of the features as to achieve this. We've seen reports from, you know, some of the other labs that claim that even though, for example, let's say, a latest frontier model claims 80% on SviBench if you actually run it and I reproduce it, you get 60%, right? And we wanted to not have that problem. We care deeply about reproducibility and the practical real-word applicability, right?

Starting point is 00:35:25 Which three-bench verified has been vetted to be good at. So you can reproduce these results, and it's live as of today. Amazing. Hey, Alex, help us understand how big and important this particular hallmark is for the company and for the world. give us some background here sure well well first to brian and sid congratulations on your announcement i think there i would expect there's going to be an enormous amount of interest from the community once they they hear these results in in trying blitzie and in reproducing those results so congratulations in advance on the onslaught of interests that i expect who will receive

Starting point is 00:36:08 i think to answer peter's question i think software engineering is arguably the first major vertical of human labor that is very high economic value, very high productivity that is perhaps succumbing to automation. So any sort of step function improvement in software engineering is arguably super transformative to the global economy. And maybe just pivoting on that thought, one of the first things that I was wondering when I heard that you would be announcing these results, and maybe jumping back eight months, we all. remember when Deepseek, aka High Flyer, launched R1 January of this year, and there was sort of an aha moment all around the world. They didn't just announce a new reasoning model. They announced,

Starting point is 00:36:58 and maybe this got a lot less attention, they announced a bunch of new open source libraries at the systems level, like a new file system. So one of the first things that I was wondering when I learned that you'd be making this announcement is the whole world is sitting on the sort of palimcess of legacy libraries and operating system code, billions and billions of lines of code, Linux, Python, Gnu, all of these libraries. Is there something that you and Blitsey and this new remarkable capability that you're announcing can do to speak to what can we do to improve the performance of this entire tech stack that the whole world runs on at this point. That's a fantastic point of question. You know, we've been running some of these

Starting point is 00:37:48 experiments. We've been taking some of these open source libraries that, for example, was written MATLAB for one of our customers that they were using and we converted it to Python. Matlab was written 20 years ago. It was specific to Windows and we made it OSGnostic. We've run these POCs all the time where, you know, we go from OSGnostic to OS specific to agnostic, from traditional to modern. But we've also been running other kinds of POCs where, for example, we picked an Nvidia repo. We identified an issue that was marked as open. And we just put Blitzie added and we saw it. We created the pull request that's all the issue. So if you think about that problem, you can identify bugs, issues, feature requests in any of the modern frameworks and systems.

Starting point is 00:38:30 The system is not limited by how much code, how big the repo is. So you can send Blitzie added. It will come back with the solution. You don't like it. You can iterate over it. You can trade five projects, get five different pull requests. see how that works and deploy it, all within a matter of days. I think that's a fundamental shift that really is going to change the way people work with open source and also close-source technology. Amazing. What is the largest repository of code that you actually tackled? Brian, you want to say?

Starting point is 00:39:02 Yeah, I got it. So we frequently see 20 million lines, but I think the absolute largest we've seen is about 60 million lines if we want, would it successfully. That's crazy. Just for comparison, for fun, if you had to guess, how long would it take in terms of human labor hours to do that? It's insane. It's insane. Brian, I think we frequently sculpt these, right, as part of the POC process.

Starting point is 00:39:26 And I think you could probably speak to that. So there's a question of, like, how long would it take to GROC, 60 million lines or code? And the reality is it's just too big for a human to understand. So you might pay, you know, it's into $100 million in three years, and they might come back with some more. some diagrams over the 60 million lines of code. And by the way, by the time they did that, what they came back with would be out of date. Exactly, which is why you can see that, you know,

Starting point is 00:39:50 essentially the industry has been stuck. This is why your airlines are always misrouted and they can't get their software updated. This is this kind of fundamental problem. So we, every time we run Blitzy, we actually estimate for the clients in production all our enterprise clients, how many hours were automated. And the CIOs love this because it's like the KTIA,

Starting point is 00:40:08 they go give the forward on how many hours they've automated away by their intelligent vendor selection. But Gronkine is sort of an impossible problem, but the real value is in the code generation, right? Being able to accurately affect and develop code and accelerate that life cycle for the development team while in that large underscerned underscoring corpus. But importantly, like Alex,

Starting point is 00:40:26 I know you want all developers to sort of go away and we're going to live in a society of abundance, but I think it's kind of going to go in the opposite direction where it's almost an infinite demand for code, right, and for software development. And so Blitzie doesn't do everything. Blitzie does about 80% of the quantum of work on average for these large-scale problems,

Starting point is 00:40:44 but it knows exactly what it doesn't do, which is really the power. And it kind of hands off that batch of work to the human developers to finish things out. So we get a really clean full request, plus human labor at the end to accelerate the development, but not sort of remove the need for the developers altogether. So I have just, if I may, just to pull in the thread, Brian, I mean, I would argue we're about to enter sort of an age, not necessarily of just abundance, but of great projects when it's possible to send a lot, basically let lots of automation loose on the world and fix all the problems, solve everything, as it were. In the case of Blitzy, this is letting AI agents lose on

Starting point is 00:41:27 an enormous sprawling legacy code base and just fix everything. I think it's a good name for the episode, solve everything. Or a book or, you know, fill in the blank. But are you familiar? Maybe you're tracking this project. I love this idea. The Great Refactor. Yeah, yeah, I love this. What is that, Alex?

Starting point is 00:41:50 What's the Great Refactor? So the Great Refactor, I love this. This is like a classic solve everything concept. This is we've built our whole civilization on a bunch of software libraries that could be better maintained that are filled in many cases with legacy memory vulnerabilities. There are statistics out there that most of the insecurity of present-day. software is due to the way the software is written that exposes them to certain type of cybersecurity vulnerability, memory vulnerabilities. And if we can only rewrite all of these

Starting point is 00:42:21 libraries that sort of, if you know the meme that goes around of the entire stack being built on, of civilization being built on just like one block, it hangs by a thread, if we could rewrite all of these libraries and dependencies and software supply chain upstreams that our whole civilization depends on in rust, or some other memory-secure language, suddenly that would fix almost all. That would solve everything in terms of so many vulnerabilities. I think it had like 200 customers sent me that project.

Starting point is 00:42:51 Like they just immediately saw that. It became hot news of the day, and they all sent it to me like, oh, are you guys going to do this? And I said, well, are you going to pay for it? Like, I think it's worth it. Here's the most critical thing, right? If you think about this idea where you can give these projects, three factors or whatnot to AI and have it come back

Starting point is 00:43:10 with the code, right? You can do that with any chatbot. You can give it to any AI, have a right. Getting code from AI is a commodity, right? But if you add constraints to that problem where the code needs to replicate the existing functionality, it needs to compile, all the unit test needs to pass, it should not have newly added security vulnerabilities, and all the other items that are, you know, crucial to the enterprise or the problem itself, right, that make it valuable. That's when the challenges begin, right? Now, that is not something you can achieve. So, Sid, my question is, I've got 3.2 billion lines of code, which is my genome. Can you compile that for me? Can you, go and identify a more, you know, fix the, fix the broken parts? As long as LLMs can write, as long as it's

Starting point is 00:44:04 in the language that LMs are trained on understanding, we can do it. The scale is the problem that we've solved for. And the other problem we've solved for is making sure the requirements match so that when we put you back together or edit you, you actually look like you, right? It's validated that it is you. We didn't change or break something that we shouldn't have. Alex, what do you think about? Yeah, so maybe narrowly on the bio, I, I, there are many other projects that that speak the language of the genome and the proteome that I think, Peter, for rewriting your genome, you'll have the opportunity over the next few years to use one of these biological sequence-based foundation models to do some variant of that. I do want, for Brian and Sid, though, really

Starting point is 00:44:46 pull on the economics of this. So I really want to press you guys. When we talk about the great refactor or some of these great projects to basically rewrite the source code basis for much of our civilization today, and you think about the economics of that. And there is a school of thought that says we're seeing generative AI hyper deflate by 10x per year or so, an order of magnitude cost reduction every year. At what point in your minds do you think using Blitzy or maybe competitive tools does it become reasonably economical to basically rewrite all of the legacy code out there that civilization depends on? I mean, I would argue from a value perspective, it's there today because the value that this would be providing to society is just

Starting point is 00:45:34 dramatic. Now, this is a question of, like, who's the payer? A line of code from Blitsey is 100x more cost basis than a line of code from sort of any other provider, which is maybe 100x less than it would be from a human developer. So we're talking about huge or is a magnitude difference. And so would it be worth it from a society value to rewrite all the software today with Blitzie? Absolutely. But am I going to continue to serve these financial service institutions and insurance companies first that are readily paying me today? Like, yes. And so I think If you grab the capital funding for us to break even on this AWG, we'll start rewriting all the, we'll rewrite Linux TV just too.

Starting point is 00:46:10 Brian, I'm going to insert another topic in here. You know, you guys shot a really cool podcast. It's on your LinkedIn, where you're just bantering between the two of you about the fact that the definition of truth within large-scale software has always been the functional code. You know, here it is. This is the final thing. It runs, the PL1 code that does all the nav accounting for the mutual funds, you know,

Starting point is 00:46:30 over at State Street. It's like millions of lines of legacy PL1. But that debugged code is the core asset. And then the documentation is just something around the edges. Post-Blitzy, the truth moves to the documentation because you can regenerate the code overnight anyway. And so your actual core asset has moved from code to a document, but it's going to move again. And this is where it was really cool to hear you guys bantering around, like, well,

Starting point is 00:46:56 what is then the foundational truth of this piece of, because, because, because, you know, like Alex is saying, the entire infrastructure of society is about to move and also expand 100 or 1,000 or a million X, you know, because code is so cheap to create all of a sudden, we have much, much more of it. So you've got a much bigger world. But the ground truth is some other format than just, you know, PL1 or COBOL or Python code. It's this human-readable today spec becomes the central asset. That's a big shift. The source. Yeah, the spec is still an abstract. action layer, right? And so that's easy for the human to look at, right? The real source of

Starting point is 00:47:36 truth or understanding is actually we create a customer-specific hybrid graph vector database that understand exactly what is going on from a functionality perspective. And you could change that functionality from one language to another, but we are capturing the core essence of what is required there. And we can display that as a spec, which is 200 pages, but of 20 million lines of code, that's a intermediate representation. And really we want to get back to the the core DB level understanding. That's the core asset for these folks. And Blitzy makes it.

Starting point is 00:48:05 It's the property of the enterprise. Hey, everybody. There's not a week that goes by when I don't get the strangest of compliments. Someone will stop me and say, Peter, you've got such nice skin. Honestly, I never thought, especially at age 64,

Starting point is 00:48:19 I'd be hearing anyone say that I have great skin. And honestly, I can't take any credit. I use an amazing product called One Skin, OS01, twice a day, every day. The company was built by, four brilliant PhD women who have identified a 10 amino acid peptide that effectively reverses the age of your skin. I love it and like I say, I use it every day twice a day. There you have

Starting point is 00:48:42 it. That's my secret. You go to Oneskin.co and write Peter at checkout for a discount on the same product I use. Okay, now back to the episode. Brian, given your background, you know, in the military, I mean, probably the one institution that's got the, you know, largest repository of, of, of ancient code has got to be the U.S. government, right? I mean, so can you attack all of that? I mean, unlock massive productivity. There's a lot of fear that the U.S. is a falling empire. It's inability to, you know, understand and legislate efficiently. I mean, couldn't you have, like, just a single massive impact on the U.S. government? You know, Peter, I live in Back Bay here in Boston, and so does this lead investor at Incutel, or so we tell.

Starting point is 00:49:30 me because he keeps strutting into me when I'm walking to work and just bumping in and see how things are going, which has to be skeptical. But the short answer is a short answer is yes. Like, the U.S. customer is sort of a fantastic end customer to ultimately modernize to get your flights there on time to make getting IDs easier, right? And so this is like an absolutely part of critical infrastructure that is a target customer that we have certainly. Do we have them today? No, 12 months will we'll be serving them? Like, I think yes. Yeah, I mean, that could catapult you into a, you know, a decade-billion-dollar company easily, just landing that kind of a customer.

Starting point is 00:50:06 Because once you've modernized, you know, for one of the agencies, they're all going to want it. Yeah, I think we have the most top-secret security clearance to patent ratios of any company out there. Fascinating. I'd like to maybe pull on the theme that we've talked about on the pot in the past, the elephant in the room, which is recursive self-improvement. So how much of Blitzy is written by Blitzie? A lot. So it's interesting, right? This is actually a question of like, where's the core value? And I would say every single sprint that we do from a software development perspective is driven by Blitzy.

Starting point is 00:50:45 And so I would say a significant amount of the corpus of the code is driven by us. Now, there are algorithms that are not about writing code, right? they're about core invention. So I think most of the company's core IP is not going to be the software that it trades, but it's going to be some core invention that this is around. And think about Google with page rank, right? Then back to the literary page, not actual pages. But page rank is really the core source of their original IP.

Starting point is 00:51:13 And SIDS invented a number of algorithms at the core of Blitzy that allow us to, for instance, always compile code to never have circular dependencies. So you could actually rewrite sort of a corpus of Blitzy with Blitzy pretty readily and pretty quickly and it would look like it, but would it do what we do? The answer is sort of no, and it gets the question, like, what is the source of IP for companies? And it's got to be a breakthrough and invention. If you think about what we're doing, right, for these large companies, we're telling them,

Starting point is 00:51:40 for many of them, we're telling them how to use AI, right? We're coaching them and consulting with them. How do you can not do that unless you've actually used it yourself and perfected the process, right? Because they're not just making the engineering velocity changes or the, you know, tool user changes. they're also making the process changes. And that's why this is critical. Alex, I'd love you to take a second and dive into the Sway bench metric here.

Starting point is 00:52:03 Again, if you could, for those who are not familiar, you know, a little bit of the origin, you said from Princeton and ratified by Open AI, but what is it measuring and who were the top of the leader boards before? And then I want to get into the conversation about how do you compete against the MAG7 or against, you know, sort of the frontier models in this regard? So let's contextualize it first, understanding what the sweep bench metric really is. Sure. So for content. And why do we start with the title of the white paper? Because that's going to drop same day that this podcast does. So people are going to need to find the paper.

Starting point is 00:52:36 And we should put a link to the white paper in the in the show notes here as well. Alex, please. Sure. So maybe let me just speak to Sweebench. So Sweebench is a benchmark that measures the ability for AI systems to solve typical software engineering, SWE, for SWI tasks in the specific form of responding and solving issues on GitHub, a very popular source code management system. So SWEBENCH as a whole, not SWEBENCH verified, consists of a couple thousand instances of tasks in which the central challenge that's posed to an AI is to respond to an issue in a code base. And what would happen in a normal software engineering context is, what kind of

Starting point is 00:53:29 issue, Alex? So a wide range of issues could be bugs that need to be fixed, other performance issues. And the usual workflow in a software engineering context is an issue will be identified and pull request will be submitted. So identifying an issue, responding to an issue, submitting a pull request that responds to an issue, satisfying unit tests. These are all standard parts of what would be archetypically considered software engineering. And SweeBench, I think, is sort of an excellent industry standard at the moment that attempts to capture the life cycle of high-value-ed labor that a typical software engineer would perform.

Starting point is 00:54:16 Now, I'll let the guys respond to their white paper. Sorry, yeah. And so I'll respond by also answering your core question, Peter, which is how does one compete with the MAG7 in this utterly important labor task, right? And the reality is there's a significant amount of the MAG7 at the core of what Blitzie does. So we use Gemini's models. We use anthropics models. We use open AI's models. But what's unique about this technology moment in time is when you use these models against one another, the quality. moves up quite dramatically. And when you use them against one another, hundreds of times, right, in hundreds of different combinations, which hundreds of different tool sets and prompts, the combination goes up even more exponentially, right? And so really, it's the art of orchestration through what we call extended inference time validation to move up the quality of code, ensuring at every moment in time the system has the right context to operate, despite the large-scale underlying code base. So we say we're excited when Gemini or OpenAI release a new model,

Starting point is 00:55:20 like our product gets just dramatically better than a single time. That's a really important insight, right? Building so that the better your components are, the stronger you are as a whole. And that's a unique niche. When did you realize that? I mean, that's sort of a fundamental for you. I think we realized it back when we were just initially to figure out this first project together. But I'll let's say expand on this.

Starting point is 00:55:42 I think we made a bet and we said that there's not a doubt. Like we were building this when models had 5,000 tokens of context, right? And we made a bet that there's no doubt that context windows are going to expand and the models are going to get better at writing code. Now, do we want to go and compete with the Mac 7 and build our own model? Or do we want to stand in the holders of giants and use the technology to solve the problems that they're actually meant to solve? And that's exactly what we did.

Starting point is 00:56:10 Amazing. And just for context, again, who had the record before you? Who had the record last week? I think there were some open source labs and there's also been some other, you know, unpublished. reports that have claimed, you know, around 80%, but the 86.8% that we're claiming has, is unprecedented in the highest number we have seen. I think ByteDance was the most recent Trey model to be at the top there. So, you know, we can rebrand this U.S. versus China if you want to.

Starting point is 00:56:39 Every time I hear 80-something percent, 90-something percent, I'm thinking that these benchmarks are getting super saturated, right? And so where does this go next? I mean, what are you going to measure when you're at 100 percent? Yeah, we talk about this in the white paper. Like, this is, we certainly need new benchmarks, but the, the reason we didn't do street bench verified for a long time is it's just not representative of the scale of problems that most people use splits and forth.

Starting point is 00:57:07 So the typical pull request size is like 100 lines of code, and the largest repository is a million lines within this. So we really need a set of benchmarks that's on Linux and VS code, which is 20 million and four million lines, respectively, with holdouts against those, trying to do larger scale work to ultimately show how far we can push the bounds on autonomy. Yeah, and in terms of Peter's saturation question, the paper does a really, really good job of describing the landscape of benchmarks and Swee Bench in particular and the need for a new benchmark.

Starting point is 00:57:42 But one of the points it makes is that when you score 86% on this benchmark, that's effectively very close to 100% because the remaining subset of questions are just flawed. They're not harder. They're just not, they're not structured well. And so you've basically capped out this benchmark now. So if folks want to look up the paper, what's the name of the paper? Do you have that? We, Alex is urging us to retitle it. So I'll give you the subtitle, which is a domain-specific context engineering paired with extended inference sign validation breaks and barriers of LLM-driven software development. So that's really what we're talking about.

Starting point is 00:58:25 That rolls off the tongue and onto the floor. It is a technical favor. You know, you can search Blitzy and Sid's name. He has a searchable name. Brian Elliott, there are thousands of Brian Ellis. It's a turn to ask. But Sid Pardeschi plus Blitzy will get you to the paper. Okay, perfect.

Starting point is 00:58:44 Alex, where do you want to take this next? What have you found important and fascinating about what Blitzie is doing? What's the implications in the long term, buddy? It's so interesting, so many different directions to go in. Maybe just to go back to this idea of great projects, because I think Blitzy has the potential to be sort of an embodiment of an era when we just, again, turn all the AI agents loose on all of the problems and a discipline. So what I heard, I think, Brian, you say a few moments ago,

Starting point is 00:59:13 was something like 100x price difference per line of code or profile between blitzie with its, you know, again, congratulations state-of-the-art performance announcement and other competing tools. When I hear you say 100x price difference, I immediately internally say, oh, well, that's just two years worth of cost hyper deflation. So you're sort of two years more expensive than the competition is the way that I heard that. So if you project forward two years, three years, four years, do you think we will find ourselves in a world where AI really has, the great refactor has been completed? And we've rewritten all of our foundational systems with Blitzy or maybe copycats of Blitzy. Do you think we find ourselves in a near future like that?

Starting point is 01:00:02 I would say pontificate first here. I think, you know, I think we will, we will see the models get significantly better at doing this and the cost go down. The key thing that I would like to underscore is a lot of the approach at some of the labs. And then what's happening right now is the labs aren't really making money on the inference that they're running, right, from the models. But what we're doing is because we're using the labs and we're able to charge a premium, right, for the work that Plixie does and also provide the validations with it, we're not losing money on the code that we're writing, right? So as this equation improves over a period of time, the difference that Blitzie is able to create

Starting point is 01:00:49 is going to also grow. So I think somewhere in that double negative, I heard the answer is that, yes, as hyper deflation kicks in, call it an order of magnitude cost reduction per year, maybe more, not only does Blitsey become very profitable, but also becomes very feasible to start to tackle these solve everything level grand challenges in software engineering. Yes. So I've been attempting to kick the tires on Blitzy myself. My first project with Blitzy was I wanted to rewrite Python, the very popular programming language. And I gather you, Brian,

Starting point is 01:01:29 and Sid, you've had access to your own product longer than I have, which is only been two days. Have you tried to take some large-scale project? I think, Brian, you mentioned Linux a few minutes ago. Have you tried to take some large project and either say, gosh, I want to ask Blitzie to improve performance by 10% on some relatively mature code base or add some crazy transformative new feature. Have you tried that? Yeah, we did a fun project. We've done a number of these acts. But the, we did a lot of One of the most fun thing we did, we onboarded VS code, and we said, hey, add a chat experience to VS code. At the time, there was, you know, Cursor, I still is right, one of the biggest tools out there.

Starting point is 01:02:20 We tried, and we did, we built one of a subset of the features of Cursa using FITC, and we tried using that internally, right? So anytime we consider SaaS products at this point of time, we're trying to first see if we can replicate that internally using Plitzy. And if we're, you know, a few months out or a few years out, we'll say, hey, let's just use the SaaS products at the starting point and consider rebuilding it later on. I think a lot of enterprises will do this. Have you thought, I mean, so sort of free marketing advice before the public, have you thought about taking all of these open source projects that are in many. cases, starved of core development team members and Hungary for human capital or human capital equivalents, taking these projects and just aggressively setting loose the AI agents to submit very friendly, very polished poll requests to these projects to launch improvements.

Starting point is 01:03:19 We've actually done that for MS flow, I believe, right, Brian? Yeah, yeah, we've done this, especially for one of the enterprises specifically rely on. It's quite possibly the best BDR, which is just sending pull request to open. source libraries. And we, there's an open source one right on the homepage of our website that I think you'll find fascinating Alex, which is, it goes all the way back we're talking at the beginning of the episode. So AWS invented this or created this repository specifically to be incredibly messy mainframe

Starting point is 01:03:47 code, right? So all different styles to represent, you know, different decades of people working on it. And ultimately, say, like, how would one use code generation to be able to move this from target cobal to target Java? Oh, cool. We ran that through Blitzy. I guess mainframe is such a big problem in all these large, even government organizations. And we moved it from COBOL to Java completely autonomously with the ability to compile

Starting point is 01:04:11 right out of the box. And now there's sort of like remaining development to work on some runtime stuff. But this is a multi-year-long project to move mainframe from a target messy coal into Java. And the results of that have been probably one of the best business development tools that we've ever created. How fast did you do it? How long did that take? A couple days? Yeah, if you caught everything from start to end,

Starting point is 01:04:37 it was a week's worth of inference. In Justin, yeah. Amazing. So if we project that going, maybe, if I made back to Peter's question about what the human equivalent of this is. Do you have any metrics that you can point to for either cost savings relative to humans for a given unit of code, a line of code, or per file, or how much faster than humans this is in general?

Starting point is 01:05:02 We typically see from a speed perspective, a 5x philosophy difference when this is brought into the enterprise. And the biggest challenge is really like operational deployment, right? And so you're used to sort of starting your work the same day that you pick, pull up your ID. And so what we're having these organizations do is sort of start to sprint for the next development work the week prior. And so instead of developers starting with tickets and tasks,

Starting point is 01:05:25 they're starting with code that's mostly written. and a project guide with all of the human tasks to begin. So we really focus on this 5X. Anytime we engage in enterprise to work with us, we say pick a real world project that you have upcoming next quarter. Let us prove a 5X difference and we should do the remaining development work

Starting point is 01:05:41 so we have an end-to-end solution. And if we do that, then you do dot puts it across the org. And it's incredibly successful because people don't realize the cost of coordination in development and all of this sort of requirements to actually get a piece of full software out that when you can offload a significant chunk of that to agents, the velocity gains are honestly unbelievable.

Starting point is 01:06:01 That's a really cool insight because a lot of the younger teams are going into enterprises and then they've never been in enterprises before and they're saying, look, the raw code generation is 1,000 X, 10,000 X. You're like, yeah, but what's it going to do for me in my enterprise? And there are very few that are credible in saying, well, we actually have done it and we know the final answer and right now it's 5X apparently. But they don't know like the overhead of the, you know, The organization and the documentation and all of the things that happened before you can even start code generation.

Starting point is 01:06:35 And so it's really nice, actually, to have at least one vendor that understands how to get the real thing. We actually need this to work in the end. It can't just be a hypothetical 1000X. It's three years from now. We've got digital superintelligence has landed. It's come out of, you know, I'm going to put my bets on Google, but we'll see. What does Blitzy look like? Yeah, I think Blitzie is the core system of record and system of action for software development in the enterprise environment, right?

Starting point is 01:07:05 And so the source of truth moves from documentation and code, which is sort of like this hybrid today, to organizations relying on Blitzie's hybrid graph vector database that understands a core functionality, and organizations are going to be able to move incredibly quickly from a software perspective, and the source of value is going to be sort of core idea. that's not easy to replicate. To add more into that, you know, Peter, there's always going to be some tasks where it's better to have a human in the loop and do them sequentially, right? You're solving a problem that has never been solved before and you need quick feedback from AI, right?

Starting point is 01:07:42 That's always going to be there where you use the co-pilots, but there's always going to be this other category of tasks where you can automate them away, right? Build the code, run it, deploy to production, and execute maintenance. Blitzie now, you know, gives you the code. The code is the final output. but we're going to go into autonomously maintaining, deploying,

Starting point is 01:08:00 and keeping the applications running. So you're not going to need humans for specific sections of the entire enterprise. It's all going to be driven by AI. I think Alex was about to paint kind of a two-year view. He asked a question about, you know, what's the force multiplier today? But then I think we were going to next segue into, okay, but there's 100x and another 100x coming. So I would love to finish that thought, Alex.

Starting point is 01:08:23 Totally. Absolutely. So there's a lot of, to Dave's point and Brian and Sid, I think you were starting to gesture in this direction as well. There's a lot of interest in the benchmark out of meter that's measuring the effective time of autonomy, the characteristic time scale over which AI systems, including AI co-gen systems, can basically operate without a human intervention, sort of like a disengagement with a driverless car. how far can it drive without a human needing to take the wheel, as it were? So I'm curious, have you thought about the characteristic time scale over which Blitsey is able to do autonomous co-gen, or the human equivalent, really, of autonomous coding before which the human needs to step back into the loop and be involved. Right now, if I remember correctly, the current state of the art is something like one to three

Starting point is 01:09:17 hours. There's a nice, very clean on a, at least on a semi-log plot expectation that, and I think we've discussed this previously, if you projected out a decade or two, we get to many, many years and perhaps hundreds of millions of years and in a few decades, where does Blitzy fall in this apparent exponential trend towards exponentially increasing times without humans needing to be in the loop? I think this is a X-Ex project. If you think, if you think, Think about all the pieces needed to achieve this, right? Let's take a very small example.

Starting point is 01:09:50 Let's take the ADWS example, right? There's the part where you identify the requirements, decide what you need to do, get the code, and there's a part where you get it all the way to production, and if you look at those parts, each of them have already been automated in isolation. For example, CICD, how do you deploy the code to production? You have automations for that.

Starting point is 01:10:08 Debugging, security analysis, monitoring in production, tracing, and viewing the logs, ensuring that the system is not doing anything malicious. All of these items exist today, and there are, you know, these blue layers that we are now seeing, like, for example, MCP, A2A, right, that allow agents to form this mesh and automate work. The only thing that's left to be done, really, in my opinion, for these projects, is to just connect the dots. And that's exactly what we're working on. So do you answer your question, you know, how far out?

Starting point is 01:10:36 I would say we are months out, actually, from delivering projects completely autonomously, as long as they meet a certain set of criteria and conditions. So what I just heard you say, Sid, correct me if I'm wrong, is that the documentation writers, the spec writers, are the new limiting factor for the speed of software developments. Is that correct? That is correct. How do you think about automating that process, if at all? That is also automated. So if you go to chat GPT, right, and you ask it to write documentation at will following your criteria. The reason we have document, you know, writers in the first place is to have quality, right? We have concerns that models. lose context or a period of time and they skip and omit things or they can be gamed into,

Starting point is 01:11:21 you know, adding things that you don't want. And we're really concerned for quality control, this is why we have humans. But as you can see, we've solved the context problem for large code basis. And there's nothing really stopping anyone for that matter from effectively adding in the right safeguards and, you know, layers of protection to ensure that we minimize the need for humans. I think it's a matter of us becoming comfortable with AI doing that. And I definitely see that happening over the coming months. To try to be the downside. Begging for a follow-up white paper because Alex's question is infinitely recursive, right?

Starting point is 01:11:54 If you said, okay, well, then that's not the constraint, then what, you know, because there's always going to be a constraint is turtles all the way down. You've got to just ask, okay, what's the next? Speed of light, buddy, speed of light. Yeah. And also, yeah, Douglas Adams, right, that famously pointed to that the problem is far more, far harder to pose than the solution and to the extent that the new limiting factor is the spec writer or the prompt engineer, whatever we end up calling it in the future, I really

Starting point is 01:12:26 would like to press you sit on this. Like when do we get our automated program manager, product manager, spec designer, documentation writer, if that really is the limiting factor for the speed of software engineering in the near future. Automation all the way to do. I'll tell you something, Alex. You know how the Blitzie platform works? We have these thousands of patients. And each of these agents has a persona. There is a product manager agent. There is a software architect agent. There is a QA agent.

Starting point is 01:12:51 And there is an agent that writes the prompts for the other agents. So all of the challenges that you're describing are live today in production with the Plytzee panel. This is such an important question, though, because, you know, I know it sounds very hypothetical. But, you know, look at this timeline on Sweebench here. This is only 18 months ago that you got like 12%. And now it's saturated. It's only been 18 months. So, you know, like, what we think of as the distant scientific future, you know, it's all science fiction.

Starting point is 01:13:21 It's only a year, a year and a half in the future. Star Trek's coming, buddy. It's really, really hard to anticipate. Yeah. Now, I love this question. Listen, I want to, I want to wrap. This is another way paper waiting to happen. I want to wrap this episode with a conversation amongst all of us on a particular topic.

Starting point is 01:13:38 We opened up talking about trillion dollar pay packages, trillion dollar investments. you know, numbers that are extraordinary. And the sovereign funds, the venture funds, family offices are just supporting this with massive capital inflows. And so the question is that I put to all four of you, think about competing in the long term with the MAG7, who've got this incredible access to capital. How should founders consider going about that, right? What's your advice to others who are getting in here during this period of exponential growth in the AI economy?

Starting point is 01:14:21 How do you compete? How do you think about that? I think you want to be a large customer of those folks as well. I mean, we are major customers of all the AI frontier lines. And so they're quite excited that we're going to continue to push the bounds of autonomy. And their market paths are going to continue to grow probably dramatically in line with that return on investment. and Blitzie is going to ride those waves as well. And so if you're happy when they're successful

Starting point is 01:14:47 and they're happy when you're successful, that I think you're in a pretty good strategic position. But there's one more thing, you know, to add on to that what I'd like to go back to what Dave said, right? Mercor, for example, it was able to do that because it went deep. I think that's also the case for us, right? We've seen the enterprise,

Starting point is 01:15:03 by the enterprise side of the challenge and the enterprise perspective and the security roadblocks and the product roadblocks and the process gaps that stop them from taking the full advantage of the products. So if you're an entrepreneur or a founder, and you've seen this personally, and you've struggled with this problem, and you think you have a solution that addresses the core of that,

Starting point is 01:15:23 and you've been able to test that with the actual enterprise and demonstrate effectiveness, I think you're holding on to something that is core. So you're saying understand the problem deeply. Yes. Understand the problem. Because, look, you have these Macs there, and they're giants. They have all the money.

Starting point is 01:15:39 That's fine, right? you're an entrepreneur, you're nimble, you can find the right investors. We were grateful to find, you know, Dave who believed in us the moment we pitched it. And we were able to get just a right amount of capital to get started. And that's really all you need. If you have the right talent, the right amount of capital, and you have the right problem that you're going after that you're convinced about because you've experienced and solved it, then you're going to be so nimble and make these moves and get a product out that is

Starting point is 01:16:05 significantly better than anything that the Max 7 can put together because they're struggling with their own challenges. like bureaucracy and all of the hurdles that they have to go through to actually put out politics struggling with what to say over dinner with Donald Trump exactly while they distracted all of that you can build a cake-ass product get it to market solve real world problems and you've changed the world effectively Alex what's your thought how do you how do founding entrepreneurs compete with companies I mean I remember famously you know Amazon was out there as a platform for people to sell their products

Starting point is 01:16:40 But then when Amazon saw a product, they had incredibly high, you know, margin and growth, they would clone the product and compete directly. How do you keep from that happening? Two words. Solve everything. In the name of this episode, solve everything. The world is filled with so many problems that a startup standing on the shoulders of the trillions of dollars of Kappex that are being invested in cloud, AI chips, fabs, energy are now poised to solve so many problems, thousands of problems.

Starting point is 01:17:17 I think Brian and Sid, and again, congratulations on the benchmark announcement, are well-poised potentially to solve the problem that we face of decades of civilizational software croft legacy code that's just piled up without enough human capital to invest in reinventing it. And now I think we're arguably on the verge of doing that. That's one of thousands of problems, entire domains that could be solved. Protein folding was solved by AlphaFolt, essentially overnight transforming a subset of structural biology. So many more opportunities. Before I go to you, Dave, I just want to remind people, you know, I define an entrepreneur

Starting point is 01:17:56 as someone who finds a juicy problem and solves a juicy problem. And the more entrepreneurs in the world, the more problems that get solved, the better the world is. That's why we're going to hit on this over and over again. I think the career of the future is being an entrepreneur, finding a problem, falling in love with the problem, not the solution, not the tech. Because if you understand the problem deeply, as the tech evolves and continues, you're going to use the newest version to go and solve that problem. And again, some of my favorite lines, the best way to become a billionaire is help a billion people. And the world's biggest problems are the world's biggest business opportunities. So that's what entrepreneurship means.

Starting point is 01:18:33 Dave, you see hundreds and thousands of companies. You've got how many companies right now in the Link Studios? 28 in the building and about 50 total. Amazing. What are you, when you're looking to invest in a young entrepreneurial team like Brian and Sid or like the founders of Mercore or, again, some of the incredible unicorns that we've backed out of Link exponential ventures, what are you looking for to make sure that that company isn't going to get? get disrupted in the wake of a, you know, open AI or Google slight jog to the right. You know, it's funny, Kevin Wheel, we asked that exact question in that podcast we did two weeks ago, and he answered exactly the way I had hoped he would answer, which is in a world where

Starting point is 01:19:25 the foundation model companies get to AGI and can do virtually anything, are you just going to take over the world? And, you know, Kevin was really clear that maybe we can do that, maybe we can't we probably can't anyway but even if we could we don't want antitrust to come in here and break us up you know that's the last we want a huge thriving ecosystem of partners that give us money you know it's blitzie one of those companies that gives us money yes therefore they're our best friend go conquer the world i love that take over change the entire foundation of all legacy code base make a trillion dollars and give us half of it we'll all be happy i took that's what they want i took a two hour walk yesterday with a dear friend of mine here who runs a large venture fund

Starting point is 01:20:05 And we're talking about the notion that his bet was, you know, Google had so much more capability than they unleashed. And they said, look, it's an open AI, go and do as much of this as you can because we need someone out there competing with us. Otherwise, we'll get broken up for antitrust reasons, which is a fascinating idea. You need viable competition to help you price, to help you remain on the edge, to help you not be, you know, sort of broken. down by the government. And be a good partner. That was when Google was growing like crazy and we had all these portfolio companies. We made a ton of gains.

Starting point is 01:20:44 But be a good partner to Google while they're growing like crazy. And now it's the foundation. Just be a good partner. Talks them all the time. Make sure you know where they're going and they'll love you. Amazing. I have a selfish question. I don't know if we're running out of time here.

Starting point is 01:20:58 No, that's fine. It'll close with your selfish question. Okay. Okay. Well, this is always looking for traits. Like this has obviously been one of our best investments. and the sky's the limit from here. And I'm always looking for trades of success.

Starting point is 01:21:11 And the morale at Blitzy is like nothing I've ever seen. You know, which is not a no-brainer. When you're doing video generation for a movie studio or whatever, it's easy to keep high morale, but when you're doing, you know, five million lines of code, core, cobal conversion. But yet you guys have just this crazy thriving culture. And Sid mentioned, you know, we were first money in.

Starting point is 01:21:29 I don't remember why we loved the deal so much. I do remember we absolutely was a no-brainer to invest in you guys. So two things jump out at me. One of them is BITS, which is just the hardest place in the world to get into. And in video, which is, you know, you've seen growth. The other one is Brian, I think you had Army Ranger. Bangalore Institute of Technology? That's the BILA.

Starting point is 01:21:48 BILA Center of Technology. Yeah. It's a cool name, though, BITS. It's like MIT, BIT, but it's BITS. And it was, you know, it was, by the way, MIT you designed the curriculum for BITS. So that statement was actually true. Oh, that's cool. The other one, though, was, you know, Brian, I think you had not just West Point, but Ranger training, which is freakishly hard.

Starting point is 01:22:12 And then first boots on the ground in Syria, so literally the first people touching a war zone. So I got to feel like there's something in those experiences that puts you a cut above in terms of building a team, managing logistics, building morale. So any clues there that other founders can pick up on and build on? Yeah, I think some evidence of being incredibly mission-driven and ambitious is what you would see if you were an anthropologist looking at both of our backgrounds. But if you take me, for instance, and if you fast forward or I guess rewind to 2017 when I was serving in the 75th Ranger Regiment, the mandate was going to Syria. There's about 2,000 ISIS fighters in that old Raqa.

Starting point is 01:22:55 We're going to send you with 100 guys, recruit everybody. else and take back the city, right? And oh, by the way, we can't let anybody in the United States know we're here because we're there covertly, right? And to be able to sort of go in and solve that problem, like that's a very ambitious undertaking where we're like conquering cities isn't something that like most people have spent their time doing. So when you look at for the level of ambition of the company, everybody here at the business, at the business, let's see, has that ethos, right? And the very first thing we do when we interview is we screen for ambition and the ability to invent and create. We have those core values. And if you

Starting point is 01:23:33 if you talk to any single person that sits in this building right here, they will tell you and they're right that what we're doing is one of the most important things they will do in their lifetime. Because the economic expansion that the globe gets, the GDP expansion that you get from automating software development or at least huge chunks of software development, there's almost no better incremental use of energy than driving towards that goal. Wow. That's a beautiful thought. Are you guys a 996 or 997 shop? It's Saturday today, we'll be an estimate tomorrow.

Starting point is 01:24:04 Yeah, we're a 997 kind of crew. Oh, my God. Just to back Dave up on his 997 job. That was a mistake. Well, I hope my dance was the last leader on Street Bench Verified, so we can work longer than them and beat them on the leaderboards. Oh, my God. Guys, listen, congratulations on hitting that new benchmark.

Starting point is 01:24:24 But more importantly, thank you for the work that you're doing from the companies that will benefit, from our government that will benefit, from the world that will benefit. You know, this is you're upgrading the DNA of industries and of our planet. So grateful for you. Alex, Dave, any closing thoughts here? I'm just super excited to see what you can get, what you guys can bring to the future. I would, very few things would excite me more on. the software engineering front than a few years from now to learn that the entire software stack that I run, that companies that I work with run has been 99% rewritten by Blitzie, by Blitzies

Starting point is 01:25:10 agents to remove all the vulnerabilities, improve all the performance. I think it's the sort of challenge before you guys that sets us on the road to recursive self-improvement and abundance and also solving everything in in software solving everything that's my that's my phrase for the day let's solve everything it's better it's better to dave's 997 than tang ping uh the opposite of 996 line flat in response to overwork yeah my closing thought definitely everybody read the white paper the title may sound very complex but the paper itself is very very readable uh so please read it And then your takeaway will be, wow, okay, now we need a new benchmark. Inside baseball, Blitzie's already working with MIT to create the next generation benchmark.

Starting point is 01:26:01 But catch up to what they did right here by reading the paper. To all our subscribers, thank you for following moonshots and WTF episodes. We're grateful for your time. We hope that, you know, in spending the time with us, you're able to understand how incredibly powerful this technology is for transforming our world, our lives, creating a future of abundance. I hope this counters all the dystopian news you get on the 6 and 7 o'clock news. That stuff I don't watch. This is the stuff I focus on.

Starting point is 01:26:29 I hope you do too. I'm grateful to my Moonshot partners, AWG, Dave Blundon, Saleem, wherever you are, transiting the Atlantic to come back here to the U.S. And again, Brian and Sid, congratulations on your epic wins, excited for your future success. Every week, my team and I study the top 10 technology metatrends that will transform industries over the decade ahead. head. I cover trends ranging from humanoid robotics, AGI, and quantum computing to transport,

Starting point is 01:26:56 energy, longevity, and more. There's no fluff. Only the most important stuff that matters, that impacts our lives, our companies, and our careers. If you want me to share these metatrends with you, I writing a newsletter twice a week, sending it out as a short two-minute read via email. And if you want to discover the most important meta-trends 10 years before anyone else, this reports for you. Readers include founders and CEOs from the world's most disruptive companies and entrepreneurs building the world's most disruptive tech. It's not for you if you don't want to be informed about what's coming, why it matters, and how you can benefit from it. To subscribe for free, go to Demandis.com slash Metatrends to gain access to the trends 10 years before anyone else. All right, now back to this episode.

Starting point is 01:27:46 Thank you. Thank you. Thank you.

Moonshots with Peter Diamandis - The State of AI: Elon’s $1T Package, Apple’s $600B for Trump & How Startups Win w/ Dave, AWG & Blitzy Founders Brian Elliott & Sid Pardeshi | EP #193

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.