TBPN - Will AWS Buy Google’s TPUs, Remembering Claude The Gator, Ricursive Raises $35M | Diet TBPN

Starting point is 00:00:00 Today I wrote about will AWS buy TPUs from Google in the front page of the Wall Street Journal's business and finance section. They're singing the Traneum chips praises. Amazon's chips pose risk to Nvidia. The whole week we've been talking to people. Is that clickbait? I don't know. What we're going to find out. We'll see.

Starting point is 00:00:19 It certainly doesn't seem, you know, good to have more competition in the market. There's a lot of losers if Google winds up winning with TPU. The losers came out to fight, apparently. Amazon.com is the latest big tech company to muscle in on NVIDIA's turf. Give me a sound cue from the fall sound. Muscle in? How about this? There we go.

Starting point is 00:00:42 That's right. On Tuesday, Amazon Web Services announced the public launch of its Training 3 custom AI chip, which it says is four times as fast as its previous generation of artificial intelligence chips. 4X speed up. That's actually very significant. That's great.

Starting point is 00:00:58 3 produced by AWS is Annapurna Labs. Fascinating company acquired a decade ago for around 350 billion, or 350 million. So it's pretty small acquisition actually, 350 million. In AI, you never know. But back then, you start a custom silicon company. You could barely clear nine figures on the way out the door. Inapurna Labs has been working on custom silicon for Amazon for a long time.

Starting point is 00:01:23 They actually do have a custom CPU at AWS to accelerate CPU-based workloads, then for the last few years, they've been working on GPUs or A6 for accelerated workloads. And so this custom chip design business, Anapurna Labs, can reduce the cost of training and operating AI models by up to 50% compared with systems that use equivalent GPUs. The chips are meant to provide a stronger backbone of computing power for software developers like Dean Leiters, Lighterzorf, the co-founder and chief executive officer of the startup, Descartes, who we had on the show. And Descartes is valued now at $3.1 billion.

Starting point is 00:02:09 Let's go. So if you don't remember Descartes came on, and Dean was doing live AI video generation while he was doing the interview with us. It was really crazy. Yeah, he basically, yeah, it was real time. He looked like he was in a video game, but it was happening with little to no delay,

Starting point is 00:02:27 a really, really cool demo. He said his company had a breakthrough enabled by a Traymium 3 chip, by the Traneum 3 chip, after trying out several other competitor chips, including Nvidia's processors. Dozens of programmers and AI researchers from his San Francisco-based company

Starting point is 00:02:44 had been trying four months to train a version of Descartes' flagship AI-powered video-generation application known as Lucy, that would be able to render footage in real time without bugs or hiccups. A.D.B.S. gave Descartes early access to Training 3. After meeting with the startup and being impressed with the founders, the company was two weeks into a marathon coding session in a rented house in Silicon Valley, which I think he took us on a tour of while he was in Wizardland

Starting point is 00:03:12 and AI-generated sci-fi world. It was very fun. That a few of his employees were celebrating wildly behind him. The moment that I saw it worked, I saw four people just start jumping up and down, said Dean. The next question was how fast can we get it to market and start changing industries with it? The launch of Traneum 3 is the latest broad side against Nvidia, which dominates the GPU market. A flurry of deals in recent months have caught the attention of investors indicating that more AI firms are seeking to diversify their suppliers by buying chips. Meta Platforms is in talk with Google to buy billions of dollars worth of advanced AI processors. known as TPUs, and OpenAI has struck deals with

Starting point is 00:03:54 RIVAN, NVIDIA-Rival AMD, as well as Broadcom. Very exciting that Descartes got good results out of the Traneum chip. That's awesome, obviously. I'm sure everyone over at Amazon has been working very hard on that. At the same time, we've heard that Anthropic maybe didn't have that great of an experience with Traneum, and that's why maybe they're moving over to TPU a little bit more.

Starting point is 00:04:12 Even though Amazon remains a major fairholder in Anthropic. And so my question is, will AWS buy TPU from Google? I asked Matt Garman that question. You asked me that question. Yes. I said they will be mocked. They would be mocked. Which is ridiculous.

Starting point is 00:04:29 Where I just like please my arch rival, can I please get some chips for my data center to compete with your data center? Okay, well let's actually go to what Matt Garman, the CEO of AWS said on TBPN yesterday because I asked him, will you be buying TPUs? And he said, hey look, we're very excited about Traneum And we think it has enormous potential.

Starting point is 00:04:51 And we absolutely think there's a benefit to optimizing every layer of that stack. People were joking on the timeline. You know, oh, there's this new Traynium chip. And somebody was like, all five people using Traynium are ecstatic. You know, that there's this new news. Amazon's so bad at hype. Traynium is used by 500 million people through Bedrock. But their marketing team just can't.

Starting point is 00:05:12 AWS is undervalue, blah, blah, blah. And he's obviously a bull on the stock. But what's interesting is that like, it is. He says, I'm out some of their GTM staff today. Let's just say you'll have years to accumulate stock at cheap prices. Very funny. Yes, there obviously is value. Even if Traneum winds up being for a particular niche, like maybe it's for real-time video.

Starting point is 00:05:33 Maybe that's what it gets really good at. The thing with real-time video that's interesting, something that Descartes is focused on is working with live streamers, specifically on Twitch. Amazon owns Twitch. Oh, that'd be cool. That makes that kind of partnership more interesting. I like that. So obviously there is value to saying, hey, if you go to AWS, you can get bedrock

Starting point is 00:05:54 and some services that have been fine-tuned specifically for Traneum. You go all the way down. You're going to get very good performance because we have a stack from top to bottom that's very efficient. But at the same time, if you're trying to do something that's sort of like not within the training ecosystem, you might have a rough go. You might wind up on a different chip. But he did say something.

Starting point is 00:06:13 He said, we are going to support choice for our customers as well. And so we'll continue to offer GPUs from Nvidia as an example. And we'll have, and we have a very tight partnership there. So this idea of customer choice, I think, is important. And if you go back to Jeff Bezos, he said, we're not competitor obsessed. This idea that Google is there arch rival, that's not in Amazon's DNA. Jeff Bezos said, we're not competitor obsessed. We're customer obsessed.

Starting point is 00:06:42 We're customer obsessed. And so if the customer says, look, it's great that you acquired Annaperno. labs for $350 million. I'm really happy with what you've done with Traneum 3. It doesn't work for me. I'm the customer and I want you to give me an Nvidia GPU in your server or in your data center or I want you to give me a TPU in your server. They might do that because that's actually in Amazon's DNA. Yeah. And then the follow-up question is, is there any world where Google sells TPU to Amazon? Already they are partnering. Like this was another partnership that came out. Separately, there was an announcement of an AWS partnership with Google Cloud. Now they aren't buying

Starting point is 00:07:20 TPUs but what they're doing is they're enabling customers to establish private high-speed links, links between the two companies computing platforms in minutes instead of weeks. And so the general idea here is that Google has some amazing AI capabilities that customers are just struggling to match on AWS at this point and the same thing's happening on Microsoft as well because on Azure you have access to open AI models that you might not have have access to on ABS. And so even though your whole infrastructure

Starting point is 00:07:51 might be on AWS, you might be going back and forth to GCP constantly. Companies used to think about AI as a special piece of their application. So it would be fine to bounce around to another cloud to get the best possible results. But if the next generation of companies, I'm sure we'll talk to some of the AI-focused YC Demo Day

Starting point is 00:08:09 companies today about this. I hope there's at least one. I hope there's at least one company that's doing something with AI. That would be a real treat. So it used to be fine to bounce around. now the next generation companies, they're maybe making their entire infrastructure decision

Starting point is 00:08:22 based on who has the best AI products. What do you laughing at? I'm laughing because I texted Simon. They have a turbo puffer has a booth at AWS. I said, how's it going at Reinvent? And he says, I'm not there. I just make it seem like I'm there as a joke because the VCs keep going to the booth

Starting point is 00:08:41 and then our growth intern is like, oh, Simon, I don't know. I think I saw him over there. Just continuing to Mogg while ARR Skyrog hits. Amazon needs to fight back against this and allowing high-speed interconnect between ABS and GCP. Solves a piece of that, but will they go further? NVIDIA has an insane amount of power right now. They've just ramped full-year revenue from $27 billion in 2023 to $60 billion in 2024 to $130 billion in 2025.

Starting point is 00:09:14 That's like one of the greatest revenue range. amps at scale in history. And then also, they grew their net profit margin from 16% to 56%. That's insane. Insane. Yes, Goat. That's why Jensen Wong is on Joe Rogan, and I'm sure it's going to be a fantastic episode. All the hyperscalers and Open AI, but that creates problems, right? Because all the hyperscalers and Open AI are now sort of incentivized to form a bit of an anti-NVIDIA alliance to commoditize the accelerator market and drive down those margins a bit. So, 56% net profit margins on 130 billion of revenue. People are just sitting there and they're like,

Starting point is 00:09:53 there's 50 billion dollars of profit over there. Like, that's a lot of acquisitions. And that's our- And that's our costs. Yeah, that's our costs. Like, you're just eating a lot off of these plates. How much do you think it hurts Amazon that they don't have a dedicated podcast guy?

Starting point is 00:10:09 Like they don't have a Sholto, they don't have a Sam, they don't have a Satya. You know how much that hurts because they definitely have someone in that role. You just don't know them. That's what I'm saying. Yeah, they might have the title, but they're not really in the driver's seat, right? Yeah, they don't have a run.

Starting point is 00:10:23 They don't have a roon. They don't have a sholto. Yeah, they should step it up. They should definitely get someone. Fortunately, I mean, the semi-analysis crew was over there, taking pictures, sharing photos in the timeline of the Traneum 3, ultra-server liquid-cooled with a lot of hard eyes. That's a glowing endorsement from the semi-analysis crew. And look at this. Very purple.

Starting point is 00:10:45 I wonder if that's, like, intentional. Google is having this kind of success with TPUs. What about Amazon's Traneum? Traneum is new and underpowered. Just 667 T-flops BF-16. It has lots of HBM, but the bandwidth is lower than the H-100, TPUV-V-6E. This is competitive with H-100, not on HBM or bandwidth. And Ironwood is competitive with Blackwell on Flops, bandwidth, and HBM capacity.

Starting point is 00:11:08 I expect Ironwood to quickly gain market share as it ramps up. As you can see from throughput slash TCO, Nvidia versus Traynium, Rubin Maw. Trainium 3 harder than Blackwell versus Trinium 2 on TCO training flops and reduces the gap by 5% on TCO MEM bandwidth. So the gap between Nvidia and Traneum is actually increasing rather than decreasing. By the way, this math was done before CPX was introduced. I won't be surprised if CPX plus Rubin is cheaper than Traynium for inference. So I do think that there's a world where there's something specialized like what's going on with Descartes, some sort of special model that's that thrives in what Traneum is good at and they can further niche down.

Starting point is 00:11:50 But we'll see. I mean, maybe they come from behind and they just destroy TPU, and we're all talking about training next year. We're going to say a little rest in peace. Rest in peace to Claude. San Francisco's beloved albino alligator has passed away at age 30. That's a good age. I don't know how long alligators typically live,

Starting point is 00:12:09 but I'm glad. Looking it up. It feels like. 30 to 50 years for the American alligator. Cut a little bit short, but Claude was of course often reaching 70 years or more. Yes. You know, obviously people started speculating immediately.

Starting point is 00:12:25 Anthropic, of course, was the sponsor of Claude. And, you know, people were wondering, was there foul play involved? Was it possible this poor dinosaur, not dinosaur, alligator, passed the day that it got announced that they've hired, IPO lawyers. Some people were speculating could, is it possible, Claude was sacrificed to the capital markets gods and some type of ritual. But anyways, he, look at this expression he has on his face. Can we zoom in a little bit? What, uh, what a, what a cool guy. And he will be remembered. Yeah. Trump administration will invest $150 million into a lithography startup called

Starting point is 00:13:13 X-Light, its first Chips Act Award, chat this morning with X-Light CEO. There's a few lithography companies now. We've had some on the show. It's a very interesting tier of investment, like $150 million from the government that feels like a series B. They did raise a series B this past summer, led by Playground Global, makes sense that the government's investing in Intel. Pat Gelsinger, of course, former Intel CEO. Now he's getting involved in X-Light, marshaled 40 million of capital, went and got $1,3,000. 50 from the government. There's also another AI startup that wants to remake the $800 billion

Starting point is 00:13:48 chip industry. This one's in the Wall Street Journal, founded by ex-Google researchers, recursive intelligence, raised 35 million with backing from Sequoia to automate chip design. Obviously, this is not lithography. This is the design process, but still... This is AI for AI chip design. Oh, that's right. Yes. On a quiet residential street, a few blocks from Stanford University, two former Google researchers are launching a startup. They hope will remake the 800 billion dollar chip industry, trying to build software that can automate the design of cutting edge chips, a prospect that would allow every company to build their own chips from scratch, working from the top floor of a suburban home. The duo recently raised 35 million

Starting point is 00:14:25 to kickstart recursive intelligence with funding from Sequoia Capital. The recursive, we got to add that. We got to add that to the name. We got to add that to the list of, because there's standard capital, modern capital, standard intelligence, modern intelligence. Raw intelligence. Wow, the company, 35 million for evaluation of 750 million. That's very low dilution. What, 5% or something like that? VCs were mocked. High. Yeah, I would have assumed this would be a very capital-intensive business, but I suppose if it's just a software that they're developing, maybe they have more control here. Thoughts on AI progress. He says he's moderately bearish in the short term, but explosively bullish in the long

Starting point is 00:15:10 term. Well. Very interesting. So he says he's confused why some people have short timelines. They say AGI is coming soon. But at the same time, they're bullish on RLVR, which is reinforcement learning with verifiable rewards. He says, if we're actually close to a human-like learner, this whole approach is doomed.

Starting point is 00:15:30 Currently the labs are trying to bake in a bunch of skills into these models. Through mid-training, there's an entire supply chain of companies building RL environments, which teach the model how to use Excel to write financial models. For example, I think we're actually talking to an AI Excel analyst for Excel power users called crunched at 1250 YC company. In the context of when does AGI arrive, when does superintelligence arrive, I understand, to our question, to our question, either these models will soon learn on the job in a self-directed way, making all of this pre-baking pointless, or they won't, which means AGI is not imminent. Humans don't have to go through a special training phase where they need to rehearse every single piece of software we might ever use.

Starting point is 00:16:09 When we see frontier models improving at various benchmarks, we should not, we should think not just of increased scale and clever ML research ideas, but billions of dollars spent paying PhDs, MDs, and other experts. One counter argument I've heard from the takeoff within five years, crew is that we have to do this Clue GRL in service of building a superhuman AI researcher, and then the million copies of automated Ilya can go figure out how to solve robust and efficient learning from experience. This gives the vibes of, we're losing money on every sale, but we'll make it up in volume. This automated researcher is somehow going to figure out

Starting point is 00:16:49 the algorithm for AGI, something humans have been banging their heads against for the better part of a century while not having the basic learning capabilities that children have. That seems super implausible to me. You've been asking about economic diffusion, what is the rate that we're diffusing. Let's see what Dwar Keshe has to say about economic diffusion. He says that economic diffusion lag is cope for missing capabilities. I'm very sympathetic to this because when I go to the doctor's office and they hand me a piece of paper, I know that a web form is good enough. The capabilities of the digital form are complete. No, it's just a diffusion problem. There's just someone who runs that doctor's office is like, I like doing it the old way,

Starting point is 00:17:37 right? And that's the economic diffusion lag problem that I think is real in a lot of scenarios. Right now, AI is great at generating text, right? It's great at kind of analyzing a piece of content and then generating text based on that. And yet we still have multiple people on the team at TPPN whose job is to find interesting moments of the show and then create captions around that and share it to X and Instagram and YouTube and other platforms. And Dwork Keshe's parents that too, where he was trying to find the most interesting pieces of a full podcast with one big Gemini prompt.

Starting point is 00:18:13 And he was trying all the different models and couldn't get it to actually find like the most salient and viral points. The other thing that stands out is like one of the seeming missing capabilities is like ability to like identify humor or even something like it's almost emotional. So Ilya and Dwar Keshe talked about this, where I think Ilya was giving the example of scientists studied people who had had various brain injuries that limited their ability to experience emotion. And when they took out emotion, it took them, it can take somebody two hours to figure out which pair of socks to choose. And they were kind of stunned. It's just a pair of socks.

Starting point is 00:18:53 You know what's going on in your day. Why do you need emotion in order to make that kind of decision? And so it seems like at least in AI, a missing capability is like, okay, finding out like what's an interesting moment of a podcast, right? Is it something that makes the audience member feel something, right? The other thing that's notable is like on WOP, one of the best, one of like the top jobs that people do on WAP or way they make their first dollar online is just like clipping for various content creators and media companies. And some of the clips that they make are so sloppy. Like it's literally just like a random segment of the show and they're blasting it out from like 20 different accounts.

Starting point is 00:19:34 And the fact that we're still paying humans to do that still, I mean, it just feels notable. Well, let's read Dwar Keshe's take on economic diffusion lab lag being coped from missing capabilities. Sometimes people will say that the reason that AIs aren't more widely deployed across firms and already providing lots of value outside of coding is that technology takes a long time to diffuse. Dorcasch thinks this is cope. He says, people are using this cope to gloss over the fact that these models just lack

Starting point is 00:20:07 the capabilities necessary for broad economic value. It says new technologies take a long time to integrate into the economy. Well, ask yourself, how do highly skilled, experienced and entrepreneurial immigrant humans manage to integrate into the economy immediately? Once you've answered that question, note that AGI will be able to do those. things too. If these models were actually like humans on a server, they'd diffuse incredibly quickly. In fact, they'd be so much easier to integrate and onboard than a normal human employee. They could read your entire Slack and drive in minutes and immediately distill all the skills

Starting point is 00:20:42 that your other AI employees have. Yeah, I agree with that. The one thing that I don't necessarily agree with here, he says, well, ask yourself, this quote from Stephen Burns, how do you highly skilled experience and entrepreneurial immigrant humans manage to integrate into the economy immediately. I mean, they do sort of integrate into the economy immediately, but like the immigration flow is like a slow process. Like it doesn't just happen immediately. It's not just like, you know, the amount of immigration went from like zero to like, I don't know, a million people or something. Like it's like people move around. There is like a, there is a bit of a drag. But I understand what he's saying here. It does make sense.

Starting point is 00:21:19 Silicon Valley is rallying behind a guy who sucks. It's like, what does that mean? Pure ad hominin. It's rage bait. It's going to go hard. It already got a thousand likes. On a linked article, the Verge is not putting up

Starting point is 00:21:34 a thousand likes per link. So this is outperformance. And it's heavily paywall. You cannot learn how David Sacks sucks without subscribing to that thing. They did a good job. You got to pay. You want to know why he sucks.

Starting point is 00:21:48 That'd be really funny if behind the paywall is like, we're just kidding. He's actually awesome. We think the New York Times missed on this one. The startup told me that one of their investors didn't like that they were selling to newly founded startups and wanted them to sell the bigger companies who have more money. If investors tell you this, write them off as idiots.

Starting point is 00:22:07 Selling to startups is the best thing you can do. I'm sure many of the companies we're talking with today will be selling to other companies in the batch. A lot of people say that's bad. They try to say like YC is a circular economy. But you have to ignore the hundreds of very real businesses that have been created through YC and gone on to work with every kind of company in the world.

Starting point is 00:22:36 Yeah. Even if there's some sort of insular circular economy in the startup ecosystem, like there's a pretty immense amount of pressure to actually deliver something that's valuable. Yeah, they're being rational. It's not like I'm sure there's been small instances where companies were actually, you know, had somewhat bad behavior. But in general, it's like if I'm going to pay for the SaaS tool or the beta that you're running,

Starting point is 00:23:01 it has to be good. So there's a $1.5 billion judgment against Anthropic for including $480,000 books in training their AIs. Five of my books are among them. Where it is, there might be $1,500 payout per book, according to my agent Max Brockman. I wrote to my agent Max, the following. If any payment comes to me, please send it back to Anthropic with my thanks for including my books and their AIs. The judgment website offers a way to opt out of the payment, but I found it cumbersome. So I didn't.

Starting point is 00:23:34 I'm principled, but too lazy to be highly principled. The secondary markets are rife with fraud and bad actors, and it pains me to see these bottom feeders profiting off of Anderil's growth while fleecing retail investors through unreasonable or opaque fee structures. In this week's episode of Nonsense, Ignite VC, a fund we've never taken a meeting with or had any contact with whatsoever, founded by Brian, who we've never met a soliciting investors via public Google Doc to invest in an SPV that will in turn invest in another SPV that will in turn potentially enter into a forward contract with a supposedly, though, unnamed early and a role employee. A few problems here. First off, so-called forward contracts are notoriously hard to settle in private companies, and counterparty risk is extremely real. What about the many

Starting point is 00:24:18 complicated corner cases like acquisitions where shares don't trade or merit? divorces, or deaths where ownership of the underlying shares is complicated. Just generally a risky structure to close that I don't think most folks actually understand. Yeah, if you enter into a forward contract and you basically buy the right to the future value of some shares and then somebody gets, you know, again, married or divorce or passes away or bankruptcy is another situation where you might not be actually able to collect even if the, even if your investment should have generated some return. Matt says, second, this deal memo includes basically no details about Anderil's performance, no revenue figures whatsoever, no product

Starting point is 00:24:56 specifics. I guess that's good, right, like if they were if they were just floating around information that they had acquired, almost as if it's soliciting investors to invest on hype and momentum and not fundamentals. Generally, I'd advise folks to be skeptical of any deal memo lacking basic details. Third, forward contracts are explicitly disallowed by Anderil's stock plan and bylaws, which means that Anderol will never consent to Team Ignite's SPV, actually taking possession of these shares while we are privately held. Zero chance. And finally, the memo spends most of its time talking about the structure and fees, which are insane. A double-layered SPV with all legal and admin costs pass through in addition to an 8% upfront fee, 3% annual fee for two years,

Starting point is 00:25:38 20% carried interest, and the craziest part, an implied price per share that is completely insane. In this case, the implied PPS is 115% higher than the most recent preferred raise from nine months ago. flattered, I suppose, but also puts these investors in an almost absurd position by paying more than double the price per share of our most recent transaction. As stated, at the top, I don't know Brian or Team Ignite at all. Maybe they're kind of wholesome people, and this is all a big misunderstanding. But if I were an investor looking at this, quote, I'd run for the hills. And I believe the founder, the founder replied and said, appreciate the heads up. The document reference was an internal draft prepared for discussion with an existing LPN was not intended

Starting point is 00:26:17 for public circulation. It appears someone shared it without authorization, and we're looking into how that happened. But do you see what... There's like seven people that share a screenshot of like a direct email we got with this exact memo. Okay, and the other thing is, they say not soliciting investment

Starting point is 00:26:32 for any anderil-related vehicle. Matt says, really? The draft was written by your founder and managing partner. I literally watched him edit the dock in real time. And he has a screenshot of like the founder's name in Google Doc. What a mess. Don't do this.

Starting point is 00:26:49 Don't do it. Instead, why don't you start a company and apply to Y Combinator? Build an actual business instead of going around hustling SPVs and companies that don't want to sell shares. We will talk to you later. Cheers. Goodbye.

TBPN - Will AWS Buy Google’s TPUs, Remembering Claude The Gator, Ricursive Raises $35M | Diet TBPN

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.