Catalyst with Shayle Kann - AI scaling pathways: on grid, on edge, off grid, off planet

Starting point is 00:00:02 Latitude Media covering the new frontiers of the energy transition. I'm Shail Khan, and this is Catalyst. But if you just do the simple math on a single gigawatt scale space-based data center, you end up with a radiator the size of a small town, right? Like between the radiator and the solar panels require, and I think you end up with a four-square kilometer orbiting asset. And that's obviously complex to manage, but it's also a target. Coming up, a unified framework for all the crazy data center stories you've already heard and will inevitably keep hearing.

Starting point is 00:00:45 When utilities need flexible capacity they can count on, they turn to Energy Hub. Energy Hub works with more than 170 utilities, coordinating over 2.5 million devices to manage 3.4 gigawatts of flexibility, built for the moments when utilities can't afford uncertainty. Energy Hub builds and operates virtual power plants that utilities actually stake their grid planning on, coordinating EVs, batteries, thermostats, and more through a single platform built for utility scale. Predictive, verifiable, and designed to perform when it counts. Learn more at energy hub.com. Trillions of dollars are flowing into clean and critical infrastructure, but those investments aren't driven by technology alone. They're shaped by markets, by policy, by capital,

Starting point is 00:01:27 and by the institutions that connect them. I'm Alfred Johnson, CEO of Crux, and host of a brand new podcast, Critical Capital Capital. Each episode, I talk with people deploying capital, shaping policy, and building the clean economy. Tune in as we unpack how progress is actually made. Listen to critical capital on Spotify, Apple, or wherever you get your podcasts. Catalyst is supported by Fishtank PR, an award-winning PR firm focused on climate and energy tech, renewables, and sustainability. Fishtank is known for generating prominent and effective media coverage for the brands they work with. If you want a PR partner that's thoughtful, shoots straight, and gets results, you'll like Fish Tank PR. To learn more about Fish Tank's approach, visit fish tankpr.com. That's F-I-S-C-H-Fish-Tankpr.com.

Starting point is 00:02:19 I'm Shale Khan. I lead the early-stage venture strategy and energy impact partners. Welcome. So, massive data centers on the grid, massive data centers off-grid, small data centers on the edge, huge data center clusters in space. Each of these might get built. Actually, each of them probably will get built, but how much and when? When you boil it down, I think there are actually two basic questions at play here.

Starting point is 00:02:48 The first is the amount of demand for compute in the future, and the second is how to deliver the energy required to meet that demand. I don't really personally have anything insightful to say about the first one, but boy do I spend a lot of time thinking about the second one. And it has occurred to me that I've never really seen anyone attempt a cohesive framework to think through all of these different pathways. You see proponents of one or another adopt kind of a maximalist approach to any given one, but I haven't seen anybody try to think through how they weigh against each other. So I've been trying to organize this in my head. And in doing so, I've realized that each of these different configurations, sightings, types of data centers has a core constraint or two.

Starting point is 00:03:31 But each also has its strengths. So to talk through all of them with me, I brought on my colleague Jake Elder. Jake works with me at EIP, and he leads our research practice that is focused on the built environment, which these days increasingly means a lot of data centers. One more thing. Coming up on April 13th in San Francisco, we're going to do a live episode of this podcast at the Transition AI Conference. It should be actually a really interesting conversation. My guest is going to be Amin Vahat, who's Google's chief technologist for AI infrastructure.

Starting point is 00:04:01 So obviously relevant to this conversation as well, I rarely do these in person. So if you're in San Francisco or you want to be on April 13th, go sign up for transition AI. Register at latitudemedia.com slash events. Here's Jake. Jake, welcome. Thanks. Excited to be here. All right.

Starting point is 00:04:19 So the premise here is let's just, in the interest of not adjudicating this question, let's just assume compute demand continues to scale. Let's assume superintelligence, AGI, maybe neither of those things, but that, like, the demand for compute and actually the demand for watts to deliver that compute, right? Let's also assume that there is no massive energy efficiency gain that comes and, like, totally changes the paradigm. So if it is true, and let's assume that's true for the next, I don't know, five years, ten years, whatever we want to talk about. Let's assume that's true. I think the thing that we want to talk about here is what are the various options to deliver as much of that demand, as possible. What are the options on the supply side? And so we're going to talk about the incumbent

Starting point is 00:05:05 solution, which is large hyperscale grid-connected data centers. And then we're going to talk about each of the alternatives that I think are currently being proposed, some of them already being developed, some of them being talked about on X a lot. And I think we'll compare and contrast, right? I'm going to talk about the constraints on each of them. But why don't we start with the incumbent thing? The thing we are doing right now, where all the data centers are, which is like large hyperscale data centers connected to the grid. What do you think of as being like the core constraint to just delivering 10X the compute in that way? Yeah, no, great question.

Starting point is 00:05:40 I think it's going to make for a great conversation as we look across the different options here. I think the constraints on the grid side, right, are fairly well known at this point. It's a speed issue, in particular on the transmission side. How long can we, how much time will it take to build out the transmission capacity necessary to interconnect these mega sites, gigawatts scale sites, to new power supply, ideally, you know, carbon-free power supply. And in many markets, right, that's running five to seven years now, which is a pretty massive timeline for data centers,

Starting point is 00:06:09 given the speed of power and speed of deployment on the AI buildout that we're trying to drive. Maybe the other couple issues that we should at least be mindful of here are power quality, right? These large data centers, especially as they cluster in certain locations, can have bigger impacts on the grid writ large, and the extent to which society regulators and utilities are willing to serve those customers if they have bigger grid impacts, I think is still a bit to be determined in a space I'm watching pretty closely. And then, of course, maybe the third vector from my side would be, let's call it, you know, social license to operate. And we're seeing in many states, right, just blanket bans on new data center developments. We're seeing some developments get pulled, you know, years after announcement because of community pushback.

Starting point is 00:06:51 And, you know, if you listen to Elon, for example, in his best cases for going off the, planet with compute infrastructure, that's really his argument that at the end of the day, society is not going to move at the pace that the AI buildout requires, and therefore at some point we're going to have to abandon the planet and go to the stars. Yeah, we're going to get to the orbital compute thing a little bit later. I think that is a good point, right? People, so you mentioned three things. There's the capacity, actual physical capacity on the grid and deliverability. There's the power quality thing, which of the three, I think, is probably the most manageable, honestly.

Starting point is 00:07:28 It's like an engineering problem. And then there's a social license to operate, which we're already seeing kind of burst at the seams in some locations, despite being kind of in the early days of this trend. So I think that third one is underappreciated in the question of, like, are we going to be able to deliver all the compute capacity that we need via the current paradigm? On the first one, I would say, I do think people conflate the transmission problem and the generation capacity problem.

Starting point is 00:07:54 And the thing is, they are both problems. I mean, you said five to seven years. The five to seven years is the timeline to get new gas turbines, if you're ordering them, and it's close to the timeline to get two transformers and other, and switch gear and stuff like that. Like, we're all in the, like, three to five or maybe seven-year timeline for that kind of thing at this point. And maybe the timeline also to get, like, a substation upgraded, which is part of the deliverability thing. But I want to say the timeline to get a new transmission line built, especially if it's like interregional or across state lines or whatever, is not five to seven years. It is essentially infinite years in the United States, at least in recent history.

Starting point is 00:08:35 Like we just aren't doing it. So there's a limitation there that might be even more intractable than just the generation thing. Totally. And I think it's the unique constraint to the grid-connected pathway, right? If you wanted to go towards some of the other options, we'll explore down the road and you're going to go off grid, for example, you're still stuck with the timeframes for transformers, the timeframes for generation assets, et cetera. And I know we'll talk about some ways to shortcut that. But the transmission side is really unique to this first scenario and certainly makes the case that, you know, if you want to run around that, you need to think about some amount of on-site power as the only way to avoid having to build more poles and wires to rub power from elsewhere to a new site. And so, you know, that on the capacity side is one thing that I think gets lost a little bit in this grid-connected conversation is it tends to be an all-or-nothing conversation around how we power these data centers.

Starting point is 00:09:24 And I do think there's, like, you know, a hybrid option here where you're still grid-connected, but the data center brings some of its own power for a few hours in the day, specifically to overcome that transmission bottleneck. And to be clear, that's what's happening now, a lot of that. This concept, in fact, I've seen people get confused about this because there was some, I don't remember who put it out, but there's some report that came out that saw. like there's like 50 gigawatts of behind-the-meter generation and development at data centers, right? And some people have interpreted that to be, oh, 50-gawatts of off-grade data centers getting built. It's actually close to zero of those that are true off-grid data centers. They are all either grid-connected but have some behind-the-meter generation, or the behind-the-meter generation is a bridge, and they ultimately intend to be grid-connected.

Starting point is 00:10:08 So that is true. There's a hybrid there. But, okay, so this is the least interesting one, because this is the way we do things now, And it's going to be the way that we do things as much as we possibly can. Like, I think you and I agree that, like, the first thing that's going to happen, this is already happening, is that developers are going to find as many sites as possible that can handle hundreds of megawatts or gigawatts of load. They're going to develop those into data centers. So, like, we just assume that happens, and we should just assume it's not enough.

Starting point is 00:10:39 Or maybe it doesn't happen because of community pushback. But either way, we'll assume it's not enough. Now let's talk about the other, I think, three categories of ways of configurations to get a lot of new compute online. The first one is maybe the least distant, which is you still grid connect data centers, but they're smaller and you put them at the edge. So I've talked a little bit about edge compute on the podcast before. You and I have spent a lot of time thinking about it separately. First of all, define what you think of as edge compute because it is sort of malleable. and then, like, what is your latest thinking on what role that plays in the market?

Starting point is 00:11:16 Yeah, so this is a really tricky question, right? Edge computing has been around for a while. Historically, it evolved to serve certain use cases like telecommunications, and more recently, video streaming, for example, is something that happens much closer to the edge than other hyperscale data center activities. But moving forward, I think there's a school of thought that says that AI inference in particular might move to the edge. And I think the, you know, first principles argument that folks tend to be.

Starting point is 00:11:42 to make is that latency is going to matter more. And so citing compute infrastructure closer to demand just has a performance benefit that can't be met via large central sites in West Texas, for example. As we've dug in a little more, I think that's a little bit of a red herring. And so let's come back to that in a second and talk about why you would actually pursue edge data centers and edge computing. But latency has certainly been one of the reasons historically. That said, edge computing can mean a few different things, to your point, right? So in the extreme scenario, I think, you know, as you move out, you know, 10 plus years, more and more is going to happen on device. We already know that, you know, like Waymo cars, for example, have a lot of their day-to-day or all of their day-to-day navigational tools and driving decisions get made in the car directly.

Starting point is 00:12:26 And increasingly, as we have models that can operate on a phone, for example, you might have a version of ChatGPT or Gemini that just operates natively on your phone and doesn't need to go out in the world at all to get access to basic inference, you know, results. On the other end of the spectrum, we've seen a few folks announced larger-scale projects, really think about 20-Megawatt-style data centers, maybe 15 to 30. And those folks are basically building many hyperscale sites, but they're trying to build them in locations where they think they can get power sooner, and perhaps in a regional node where they could serve some more latency-sensitive applications. But from a design perspective, and a deployment perspective, they kind of look like much of what we're building today, just small-scale relative to the gigawatt scale.

Starting point is 00:13:10 assets. My suspicion is that's probably the most economic piece here. And so if this becomes a cost play, that that's the space that becomes most interesting. But again, let's come back to that. And then I think there's this third category, which is really more kind of true, you know, what we might have thought about as edge computing, where you've got a, you know, 100 kilowatts at a given site or a couple of megawatts at a given site. You could think about these being located at the utility substations or in a commercial real estate, you know, office basement. And the, reason to pursue that, right, is probably cost at the end of the day. We know across the folks that we know well, right, that there are a number of individual parcels of land that were, you know,

Starting point is 00:13:50 provision for five megawatts of power and are only using two. And so I think the theory to pursue that is probably more around speed, where you could probably suck up a bunch of assets relatively quickly and start to build out a network. But if you end up in a cost game and you're trying to be the cheapest form of inference, strikes me that that probably struggles because your subscale relative to bigger sites. Yeah. You said a couple things that resonate with me based on what I've learned. The first is that latency is a bit of a red herring, the latency benefit of being edge.

Starting point is 00:14:22 Not for zero applications, but for very few does it seem that you need such low latency that Edge has a big benefit over, you know, the sort of like regional hyperscale model that we have today. And people use the example of things like autonomous vehicle. That was like a classic case people would talk about it. So, well, you need Edge computing for autonomous vehicles. But as you said, most of what a Waymo needs is inside the car. And so, as I understand it, they can operate with compute inside the car.

Starting point is 00:14:51 And then when they need to go pull something from the cloud, it's generally not so latency sensitive that they can't handle the hyperscale. So this concept of Edge being necessary for latency purposes, I'm yet to have that proven to me. I'm waiting for it, but does seem unlikely. Secondly, it's hard to imagine. imagine it's cheaper. Now, people do make the argument that you might get free land, right? And that could be true. Like, if you're taking land that's already getting paid for because it's at a commercial

Starting point is 00:15:18 property or whatever, it's in a parking lot, it could be any of those places, any utility substation that's already a substation, the land could be pretty cheap. But if you look at the fully loaded cost of a data center, land is not a big portion of it. It's a very, very small portion of it. the cost is actually in the GPUs, obviously, in the building, in the labor, all those kinds of things. And as you said, being subscale is tough. Maybe you can make some modularization argument. You know, you have the standardized shipping container, and the shipping container is, like, super cheap and easy to deploy. You just plug it in. But as is true in many other sectors, my guess is, you know, your 300 megawatt data center on a fully loaded, levelized cost of flop

Starting point is 00:16:01 is just going to be cheaper. So it's probably not a cost thing either, which means it's a speed thing, right? And speed is the name of the game right now. But I think what remains to be proven in Edgeworld is that it can actually be faster at the same scale. This is what we need to find out. Yeah, I think that's right.

Starting point is 00:16:22 And I do think at some point the speed game is going to slow down and cost is going to matter, especially in the inference world. I don't know exactly when that happens. And in our future scenario where we're in some kind of relatively quick takeoff around AI capabilities, maybe speed matters for longer because models continue to improve kind of indefinitely. But at some point, when we have agentic employees in most Fortune 500 companies and this kind of future, right, like the cost of those workers matters. And so I do think at some point, if there's an edge buildout and you're looking at two or three different edge deployment models, while speed matters, the cheapest of those models might be the one that ends up winning at scale. Yeah, but I think speed remains a question mark.

Starting point is 00:17:03 Like, in principle, if you have an existing interconnect, as you said, there's some commercial site that has like a 5 megawatt interconnect and is using two megawatts, you put three megawatts on there, that should be much faster than waiting for an upgrade in the system. But of course, to match the speed with which you need to go, you're going to go deliver your 300 megawatt data center, you then need to go find 100 of those sites and develop 100 of them. And like, in principle, I can understand how that,

Starting point is 00:17:30 that could be faster, but I'm waiting for somebody to show me that that is true. Yeah, and certainly requires a lot more conversations and turning over rocks and, you know, dead leads as you try to build it out, right? You've got to have 100 success outcomes in terms of site evaluation, not just one. Right. Okay, so that's edge. So both of those are grid connected. Let's assume the grid becomes the constraint. It just is the constraint, right? Okay, so now we're either going to, we're going to get into like increasingly distant in a and a metaphorical sense. Well, let's start with the one that I think is maybe the least talked about

Starting point is 00:18:06 relative to how interesting I find it as an answer to this question, which is just off-grid. Like, and again, we're not talking about a hybrid version where you have behind the meter generation and you're still grid-connected. Let's just say put a data center anywhere. It has an amazing relaxation of a constraint. If you remove the grid as a constraint, we have plenty of land available, right? that is not the constraint here. And you can go where there's the cheapest labor.

Starting point is 00:18:34 You can go where there's the easiest permitting and citing. It does change the game in that manner, but it does have its own set of challenges and constraints, which is why it hasn't happened a lot historically. So what's your perspective on just straight off-grid? Yeah, I mean, you make a pretty good case. It should be pretty attractive, right? There was this foundational study that came out about two years ago

Starting point is 00:18:57 that was co-authored by Stripe. in paces and scale microgrids, and they found over a terawatt of opportunity in the American Southwest alone, with high levels of renewable development being able to support those assets, like 50% solar plus batteries at cost parity to using all gas and the ability to get up to, I think, 80 or 90% solar without a meaningful cost increase. So like from a land perspective and a resource perspective, it makes a lot of sense. And to your point, it can also move really quickly, you can avoid the places where the public really doesn't want data centers, right? You've got such geographic flexibility. It should be the opportunity if you just take a

Starting point is 00:19:36 first principles approach. And we certainly don't need to be thinking about going to space until we think about going to remote parts of the earth, right? But to your point, it's not happening at scale yet. And I think there's a couple of reasons for it. There are some projects that are happening that we can learn from, right? And we've got some manic data to support that. I think at the end of the day, the grids of market, the grid's a market. of humanity, and it does a lot of really good things, in particular being a giant shock absorber for any one individual asset. And so if you go off grid and you have to operate on an island, you have to build the whole shock absorber yourself, all of the inertia, the fault response,

Starting point is 00:20:11 the ability to black start the asset. And that's just not just expensive. It's really complicated. And there's not a lot of folks out there that know how to run a gigawatt scale grid, right, at all. And so when you think about the risks that these new data centers, developers are needing to take in the values of these assets, betting on a model where you can't be comfortable or can't guarantee that you're going to have 99% uptime is possibly a non-starter in some cases. And we've heard some of the early data from some of the off-grade projects that have been built so far, the ANIC data suggests they're not able to stay above even 90% uptime yet. Will they get there over time? Probably, right? This is a learning curve, and we know

Starting point is 00:20:54 that there are power quality solutions that can manage a lot of these issues, but it's a big risk if you're going to be a first mover for a $10 billion asset to design it in a way that you don't know how to manage and operate it and keep it running. Right. It strikes me as one of these things that, like, clearly that is, it should be solvable. It is a real engineering challenge, it appears, and I've, you know, you and I've looked at some of that same data. Like, it does appear that there are, there are actually projects that are mostly these ones that are bridge power projects, so they're currently off-grid intending to be on-grid eventually. But as they are operating off-grid, they are not operating at the normal five-nines of reliability or whatever. Now, interestingly,

Starting point is 00:21:31 you may or may not need that. In some ways, it's sort of a legacy of the cloud business, where AWS and Azure and Google basically promised in their SLAs to their customers that they would be able to offer really high uptime. And so they have this, you know, huge redundancy requirement and so on. And the new thing, world of AI, sometimes you do need that, sometimes you don't need that. And so there may be a class of data centers that can accept sub-99.99% reliability. There's an economic impact, of course, to lower up time. But again, in a world where we're so constrained on the grid side, it seems inevitable to be that that is going to happen to some degree and that the engineering

Starting point is 00:22:13 challenge is going to get at least partially solved. Yeah, I think that's right. And I think over time, we'll figure out better ways to, you know, have more and more checkpoints as you're doing model training runs and whatnot, such that you could tolerate a major outage. I think the key is you could make it work at 90% uptime if you know when that 90% is. I don't know whether you could handle total randomness with that 10% downtime. And if all the downtime happens to come in the middle of big, long, expensive model runs that it takes down, right? I don't know what that does to the economics of those projects. And, you know, I do think we'll learn a lot here. I think it's critical also to acknowledge that, you know, those that operate our larger grid don't yet know how to manage

Starting point is 00:22:53 these sorts of voltage swings and harmonic distortions that are coming from these data centers. And so if we can't solve the problem when the data centers are a small part of the overall load on the system, then it tells me that's probably going to take us some time to figure out how to solve it when they're the only load. And it's, you've got a much more constrained set of tools to manage the impact. Virtual power plants are becoming a reliable way for utilities to manage capacity. But enrolling devices is just the start. What really matters is confidence, knowing those resources will perform when dispatched,

Starting point is 00:23:25 and being able to prove it, from the control room to the living room. Energy Hub's platform handles the full picture, from near-real-time forecasting, locational dispatch, and the kind of rigorous verification that holds up when regulators, grid operators, or leadership ask, did it deliver? Easy enrollment creates momentum, proven performance builds trust. That's why more than 170 utilities rely on Energy Hub, to manage over 2.5 million devices delivering 3.4 gigawatts of flexible capacity. See what that looks like at energy hub.com.

Starting point is 00:23:57 We're living through a profound economic shift, and energy sits at the center of all of it. Trillions of dollars are flowing into power plants, transmission lines, battery factories, data centers, but the future of energy isn't shaped by technology alone. It's shaped by markets, by policy, by capital, and by the institutions that connect them. I'm Alfred Johnson, CEO of Crux, the capital platform for the clean economy.

Starting point is 00:24:22 Join me for my brand new show, Critical Capital, as I talk with people deploying capital, shaping policy and building projects. Together, we unpack how risk is priced, how incentives are structured, and how progress is actually made. Listen to Critical Capital on Spotify, Apple, or wherever you get your podcasts. Are you tired of overpaying for big-name PR firms, but not really knowing what they're delivering? Is your comms team wasting time reviewing lengthy messaging briefs and decks instead of engaging journalists or producing content? Are you wondering why your competitors are getting press and you aren't? Fishtank PR is an award-winning climate and energy tech, renewables, and sustainability-focused

Starting point is 00:25:00 PR firm dedicated to elevating the work of both early stage and established companies. Whether you need to position yourself as a thought leader in between project announcements or translate complex ideas and technologies into tangible, compelling stories that resonate with the media, fish tank can help. Check out fish tankpr.com. That's f-is-c-h-fish-tankpr.com. Yeah, and though cost is not the determinate factor in this stuff right now, it's not nothing. And, you know, the way to engineer yourself into five-nine's reliability off-brid right now is to over-invest in both capacity and sort of, and storage. And you can do that, but it does come at a significant cost. it starts to actually matter for your economics.

Starting point is 00:25:46 And back to your point, like, who finances your $10 billion asset if it is, you know, at the top end of the cost curve, essentially? Yeah. And then you start getting into, you know, do you need two different fuels, right? If you're going to use, you know, some kind of base load resource or if it's just gas, you need two separate gas pipelines, and that constrains sites, adds costs. And, oh, by the way, we've just jumped in assuming that location doesn't matter in this world, right? and that you can do everything in remote parts of the country, for example.

Starting point is 00:26:15 I'm curious for your take there. My suspicion is that at least to date, folks are still generally sensitive to where they're being cited for not all projects, but for most projects. And if there were lots of off-grid opportunities in Virginia, for example, I think we'd see them being pursued more quickly than we're seeing some of the stuff move forward in West Texas, New Mexico, et cetera. I think that's changing in real time. Like, historically, you know, there were these tier one markets like Northern Virginia or something. Chicago or Phoenix or whatever, Atlanta, and they were where 90% of the demand for new data centers was going to be, and there's still that. But it is broadening out quickly, right? And you see all this development in West Texas, for example, so many data centers going into Texas. And I think that's

Starting point is 00:26:59 just because of speed to power and availability and scale, right? And so I think that the constraint of, like, you need to be in certain locations, it still matters from a, is there a workforce, is there, you know, can you get enough labor, electricians, and construction workers, and water and all that kind of stuff. But I think apart from that, my sense is that it is not the most important thing. The one thing I do want to say, though, about the off-grade thing, and you mentioned this before, but let's reiterate it, your fundamental, assuming you can solve for sufficiently advanced engineering to get to whatever reliability you need, your constraints then on scaling, predominantly become power generation and delivery. So you're still, you still need to, because you're probably going to need some gas,

Starting point is 00:27:48 you still need turbines. Or if you're doing a lot of solar and storage, you need solar and you need batteries, you need transformers, you need switchgear, you need whatever, all that kind of stuff. And that you're now still in that supply chain problem. And I want to mention that because if that is the constraint on really massive scale off-grid,

Starting point is 00:28:06 in a minute we're going to talk about orbital. And so we can compare and contrast Like, which is the more challenging constraint between those two? Yeah, yeah, no, that's a great, great reminder. You're still stuck with all the generation and supply chain issues, maybe with one possible exception, which is that your gas infrastructure is going to likely be smaller and more modular, right? Like, you're not going to have a 500-magawatt, you know, combined cycle turbine.

Starting point is 00:28:29 That's your sole generation asset for a massive data center just because of the redundancy issues. And so you can get a lot of one-meagot reciprocating engines today. I know you can find some smaller air derivative turbines or, all the repurpose jet engines if you want to get a little bit crazy. But I do think the off-grid option in some ways maybe shortcuts the actual supply chain bottlenecks on the generation equipment side, at least to some extent relative to the other options.

Starting point is 00:28:53 But agree with you, there's still a bunch of other pieces of equipment, transformers, et cetera, that you're stuck waiting for. Okay, so let's shift to the most fun one. We talked about off-grid. Let's go off-world and talking about orbital data centers. There's such a long conversation to be had about orbital data centers here, but I want to frame it in the context of these other things. Again, I think the premise here, and certainly the way that Elon talks about it,

Starting point is 00:29:17 as the most prominent proponent of orbital data centers, is this is going to be the only way. It's a scalability thing. I mean, he says, too, okay, let's dispense with the premise. He says he thinks orbital data centers are going to be the cheapest way to get compute in three to four years. Correct. I do not believe that. Do you believe that?

Starting point is 00:29:36 I do not believe that. I think we need to start this conversation with a bit of the acknowledgement, moving off planet for lots of reasons is a crazy proposition, right? And if you listen to Elon talk through it, it starts to sound like a logical endgame in a world where we're building hundreds of gigawatts of compute infrastructure a year, and Elon asserts that that's going to start happening in three or four years, right? I don't think it is going to be the cheapest source of new compute capacity in three or four years,

Starting point is 00:30:05 Nor do I think that we're going to be building hundreds of gigawatts of compute infrastructure per year in the U.S. alone in three or four years. But in a world where we're assuming that we're somewhere between, you know, AGI and some, you know, more super intelligent, you know, computing infrastructure, it's kind of the end game, right? It's kind of the only place you could go to build, you know, infinite amounts of compute capacity. Whether that's in five years or 500 years, you know, I'm not quite sure, but I agree it's not before 23rd. As this has become a bigger conversation, people have talked about lots of things that they think are going to be the killer of the idea of orbital data centers. I think we should dispense with them because despite what you and I just said, which is both like fairly skeptical on the cost side, I think we both think it's not like totally insane. And it doesn't seem like the tactical challenges are insurmountable. So people talk about like heat transfer as one of the big problems.

Starting point is 00:30:59 I think it doesn't seem like actually that is likely to be, it's not nothing, but it doesn't seem likely to be the thing that killed. orbital data centers. Agreed. I think the heat transfer conundrum, right, is that space is a vacuum and it's very, very hard to dissipate heat in a vacuum. I think the whole international space station, for example, rejects less than 100 kilowatts of heat in total, and they have irradiator the size of a soccer field, right? And when you think about the compute infrastructure we're building out, like a single

Starting point is 00:31:27 Nvidia, high-density rack could soon be more than 100 kilowatts. It may already be, in some cases, more than 100 kilowatts. On the flip side, of course, heat dissipates to the fourth power of temperature. And so it turns out that the hotter and hotter you run chips and the denser and denser you run chips, the better your heat rejection gets on its own. And so it does seem like as we move to a world of denser and denser computing infrastructure, it gets easier and easier to reject chips. But if you just do the simple math on a single gigawatt scale space-based data center,

Starting point is 00:32:00 you end up with a radiator the size of a small town, right? Like between the radiator and the solar panels required, I think you end up with a four-square-kilometer orbiting asset. And that's obviously complex to manage, but it's also a target. I saw this really great piece of analysis this morning from an analyst called Thunderset Energy. And I think the stats on the odds that a Starlink system gets hit today by a piece of space debris is like a couple percent maybe per year.

Starting point is 00:32:30 If you scale that up to a single floating thing that's four square kilometers large, you can basically expect to have a piece of space debris hitting that, you know, data center every hour. And I don't know how you operate something that's going to, you know, get knocked off forward and or destroyed just every hour, like by a piece of space debris every single hour. That sounds really complicated. Yeah, I mean, to me the thing that seems, this is sort of related to it, the thing that seems like the hardest to solve, it's all hard. But the thing that is the hardest to solve with orbital data centers is O&M.

Starting point is 00:33:04 Because actually, data centers on land require a lot of maintenance, and you can't really do a lot of complicated maintenance to a satellite. Right? And so, either we solve that with some robotics that's going to be very clever. That seems difficult for me to imagine,

Starting point is 00:33:20 or it's an economic thing. You lose a bunch of... You just have some loss rate, and you have to account for that. Yeah, I mean, you know, in a hyperscale data center today, right? Like, there's a meta-engineer or a Google engineer that is going to replace every CPU or GPU as it breaks more or less in real time. And in space, if it breaks, at least today,

Starting point is 00:33:38 you're kind of stuck with it broken. And to your point, maybe in 20 or 30 years, if we're really in some super-intelligent future, there's, you know, robotic replacement and ways to update chips in real time and whatnot. But until then, it just adds economic drag on the overall project. And, you know, we kind of skipped over costs, but it's not clear that there's a real economic advantage here. I mean, the economic reason to do this, right, is free power. You can effectively get 95% capacity factor on the solar panels at a space-based data center because you put it in kind of permanent sun, right, from an orbital perspective. And then there's much better solar irradiance. So you get somewhere between 5 or 10x, the energy output per panel over the

Starting point is 00:34:16 life of the panel than you would on Earthbound panel. And so, you know, power is really cheap. But as you mentioned earlier, you know, total cost-wise, energy is only, you know, five to 15 percent of a AI-focused data center, and chips and maintenance are the rest. And you're stuck with the same chip cost, whether you put the thing in space or on Earth, and the maintenance piece gets much more expensive. And so I kind of have a hard time seeing it being a cost play, even in a world where launch costs go way down. And if you buy Elon's view of the world, that the starship's going to get super reusable and be able to launch it 100 bucks a kilogram. And so I kind of come back to like, it just has to be the sort of thing that we pursue from a physics perspective, because we can't

Starting point is 00:34:57 built at the pace needed for AGI on Earth. I think that's right. Okay, so but that gets us then, maybe to close it out, into what I think is the interesting comparison that I don't hear people making very much, which is orbital data centers versus off-grid data centers. Let's just compare those two. As we said, the rate limiter, we have plenty of land.

Starting point is 00:35:19 I mean, you know, in the long arc of history, to build many terawatts, sure, we're going to run out of land. But like, to a first order, for the next decade. I don't think we're running out of land. So we've got land. And then the rate limiter is all of the other stuff we talked about, you know, turbines or whatever, power grid infrastructure and so on. And we certainly don't have enough of that today to go build hundreds of gigawatts a year of off-grid data centers.

Starting point is 00:35:45 The rate limiter on orbital data centers is, sure, there's going to be some like solar first space, right? Elon is saying that XAI or I guess now now SpaceX is going to develop 100 gigawatt solar manufacturing presumably for space. There are also Tesla is going to do it for land. But let's say that that's the lesser constraint. The bigger constraint is Starship. Starship has to launch a lot, like a lot, a lot to get that kind of capacity into space. And they've got a ways to go. So as I think about it, I'm like, okay, if you're if you're binding constraint is like capacity of Starship launch on one side versus. ability to scale up the supply chain for power generation and delivery on land.

Starting point is 00:36:30 It's not clear to me that, like, spaces eminently more scaled. On that measure of the problem, like, can we not, as a planet go develop, you know, 200 gigawatts a year of new turbine manufacturing capacity? Seems possible. Yeah, I think that piece we could. Maybe the question back to you is, do you think that society over time, is supportive of us building, you know, 200 plus gigawatts of incremental gas infrastructure year over year for the next 20 years. And I know that's one of the other concerns that Elon raises,

Starting point is 00:37:03 right, is at some point, you know, the conversation around carbon-free energy will shift back in a different direction and do we get stuck in a world where we can't build that? But then, right. But so, but then be a maximalist on solar and storage. Be a maximalist on geothermal. Be a maximalist on new nuclear. Like, are those things all so much crazier than like five starship launches a day. When you hear him talk you through it, and it's basically the ship lands and then takes off again within, you know, a few minutes, that certainly does sound pretty crazy. And, you know, solving fusion might even be easier than cracking that code.

Starting point is 00:37:35 Yeah. Again, I think for me, it's not that, like, it's totally insane to do orbital data centers. That's not my takeaway here. It's just, I think if we're going straight to space, I'm surprised that we're not making, stopping at a waypoint along the way of doing a lot of off. I'm surprised that hasn't happened. Agreed. And I think the other constraint that obviously exists across both scenarios, and we've

Starting point is 00:37:56 kind of washed over in the decision to talk about a world where we continue to see massive AI progress is just the chip supply chain, right? And in a world where we're building a couple hundred gigawatts a year, I don't know how many chips that actually turns into, but I know that we don't have the semiconductor fabrication space today to build at that level. And so we probably end up bottlenecked by chips before we're really in a world where we're really in a world where we can't build everything on the ground, for example, and probably before we're in a world where Starship launch costs are so cheap that space becomes the cheapest option. So,

Starting point is 00:38:30 you know, if you take that as a fundamental constraint, then I think you probably do bet on the off-grid stuff moving materially faster. But yet, same as you. I think, you know, we shouldn't dismiss the orbital option. And I think in a world where, you know, compute buildout does rapidly accelerate in 20 years or 30 years, there's going to be a lot of AI models being trained in particular in space. And that's maybe just the one last topic. We didn't quite hit on is latency in space, right? If you've got latency concerns building in West Texas, then you're certainly going to have latency concerns building a few miles north of the, above the South Pole. And so I do still think in that world, right, we're still going to have to build a lot of our infrastructure here,

Starting point is 00:39:09 even if we're training the brain that is a thousand times as smart as a human in the atmosphere. All right. So I'm going to put you on the spot to wrap up here. Ten years from now, you've got a fixed pie of all the global compute that exists. We have four categories here. Hyperscale grid connected, edge, let's define it as like some 50 megawatts or something like that. So a broad definition of edge, off grid, off world. Ten years, all compute infrastructure that's operating. What is your best guess?

Starting point is 00:39:39 If I were to look forward about 10 years and assume we're talking about all compute infrastructure that's operating, I still think the majority of it's going to be in hyperscale data centers, and that's probably, you know, 50 to 60 percent of the total. Let's assume that on top of that, there's another 10 to 15 percent that gets built off grid in a similar hyper-scale-like format, but never connects. And so that puts us at, you know, 65 or 70 percent that's built in more of a traditional way, whether grid-tied or not. I suspect that the bulk of the rest comes in the edge markets for certain use cases or applications, call it 15 percent or so there. And I do think we'll see a couple, you know, efforts to really build out some infrastructure in space, and we know

Starting point is 00:40:18 SpaceX and Google in particular are going to take their shot there. And so I wouldn't be surprised if we're training some models. We've got 5 to 10% of our overall compute capacity out there over time. I'm curious, which of those are you buying or selling? That's interesting. It's so hard. Okay, so again, it comes down to this, like, how bullish are you on compute demand? Yep.

Starting point is 00:40:37 Like, if you told me that the total number, the size of the pie in 10 years, is 10 terawatts, I have a very different answer from if the size of the pie is 300. Right. Agreed. That like dictates the shares to me, so it's really hard to know. I would say, I generally agree with you. And to be clear, that's actually like a fairly bullish statement on, it's, I would say what you're saying is bullish on off grid and bullish on orbital, just because you're starting

Starting point is 00:41:05 from zero in both of those. And so getting to 5% even, 5% of hundreds of gigawatts is going to be a big number to do in 10 years for orbital. So it's actually like a fairly bullish statement on. all of them, again, depending on how big the size of the pie is, I'm filibustering because I'm trying to figure out which one of these I disagree with the most. I guess maybe where I currently said I'm a little bit even more bullish on off-grid. It just, it has the scalability. I think it can have the cost. There are challenges, engineering challenges, but if we're really going to be

Starting point is 00:41:39 in this world where we're that heavily constrained, like it just seems inevitable to me. Do you think that comes from the grid-tied large sites, or where do you think that that comes from. Where the... From my view of the world, which of those categories do you see losing market share, let's call it,

Starting point is 00:41:56 if Moore's going to go off grid? Oh, I see. I'm still having... I mean, you didn't put a lot into the edge category in the first place, but where I currently said, I don't know why we're going to have a lot of edge.

Starting point is 00:42:07 In the grand scheme, we'll have some, but like, as a portion of overall compute, I don't know why that's going to be a lot. Which is frustrating because it's the least... In many ways, it's the most obvious and theoretically fastest way to deploy compute, right?

Starting point is 00:42:21 Like, this is why you and I have spent a lot of time thinking about this over the last three or four months is... Totally. It should be the right answer, but I agree with you. And I reserve the right to change my mind, right? Like, I think you and I have spent a few months trying to, like, convince ourselves of Edge, and I think we haven't done so yet, but that's a matter of time.

Starting point is 00:42:37 In fact, if a listener wants to convince us of Edge, I would welcome it, Jake and I both. But, yeah, we're struggling to find the, like, it's going to happen, and then here's why for all these reasons. Anyway, I would maybe take a little bit away from Edge, and I guess I'd take a little bit away from grid-connected hyperscale, but I agree with you that that's like most of what we're going to do is just build more grid-connected hyperscale.

Starting point is 00:43:01 All right, Jake, all the time we've got. Thank you so much. This was fun as always. This was a pleasure. Thanks for having me. Jake Elder is a senior vice president of research and innovation at Energy Impact Partners. This show is a production of Latitude Media. You can head over to Latitude Media.com for links to

Starting point is 00:43:17 today's topics. Latitude is supported by Prelude Ventures. This episode was produced by Max Savage Levinson, Anne Bailey, and Sean Marquand. Mixing and theme song by Sean Marquand. Stephen Lacey is our executive editor. I'm Shale Khan, and this is Catalyst.

Catalyst with Shayle Kann - AI scaling pathways: on grid, on edge, off grid, off planet

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.