The a16z Show - The 2045 Superintelligence Timeline: Epoch AI’s Data-Driven Forecast

Starting point is 00:00:00 People are spending a lot on these models. They're presumably doing this because they're getting value from them. You can maybe argue like, oh, well, I don't think that value's real. I think people are just playing around, whatever, but like, whatever, they're paying for it. That's a pretty solid sign. We're almost giving you here the useful answer of like, I don't think it's a bubble because it's not burst yet.

Starting point is 00:00:20 When it's burst yet, then you'll know it's a problem. People often make the case, oh, AI hasn't been profitable yet, and they're spending more to make it profitable. In reality, they'll have paid off the cost of all of the development they've done in the past very soon. It's just that they're doing more development for the future. Will they regret that spending? How much are they spending?

Starting point is 00:00:39 You can look at Nvidia and how much they're selling each year and you can see whether it keeps on growing and you can see whether stuff is kind of looking good to continue. Math team is unusually easy for AI. I'm going to be honest. People often make claims about it being like this, you know, intuitive deep thing that it would mean that AI has achieved something, some huge level of intelligence for its association.

Starting point is 00:00:59 solve. I think in practice this is just like, you know, making a piece of art. It turns out to be farther down the capability street than people might have guessed. We sort of had this with chess decades ago, right? Like, computers solved chess very well. And everyone was thinking at this as the pinnacle of reasoning and everyone as a result kind of concluded by, oh, well, of course computers can be chess. The like interesting scenario to think about, you know, 20% chance, 30% chance, something like this will happen. And the next decade is like, you know, a 5% increase unemployment over a very short period of time, like six months due to AI. The public's reaction to this will determine a lot.

Starting point is 00:01:36 There will be very, very strong feelings about AI once this happens. I think there will be a bunch of very strong consensus on what to do on things that we don't normally think of as things that people are considering. I know when this happened with COVID, there was a several trillion dollar stimulus package. In a matter of weeks to days, it was breakneck speed. I don't know what that will look like for AI, but I think it's like every single, Everything else in AI, it's exponential, which means it will pass the point of people sort of care about it, to people really care about it, quite fast.

Starting point is 00:02:07 I just expect wherever we end up there will be this certain thing, which we would have considered unimaginable a year ago. Are we building towards the biggest economic boom in human history or the fastest collapse? Right now, AI labs are burning billions on compute. Anthropic just built a data center that uses as much power as Indiana State Capital and Microsoft's planning one that really, rivals New York City. The bet? The AI will eliminate entire categories of work before the money runs out. David Owen and Yafa Edelman from Epoch AI have done something unusual. They've actually measured what's happening. They track down permits, analyze satellite imagery, and calculated exactly how fast these data centers are scaling. Their conclusion challenges both the skeptics and the true

Starting point is 00:02:52 believers. They don't see a bubble. They see revenue doubling every year with inference already profitable. But they also don't see the software-only singularity that some predict, where AI recursively improves itself overnight. Instead, they forecast something stranger, a world where AI solves the remand hypothesis before it can reliably fold your laundry, where 10% of current jobs vanish, but unemployment might barely budge, where we hit artificial general intelligence not with a bang, but through a series of increasingly surreal milestones that keep moving the goalposts. Along with A16Z partner Marco Mascooro, we cover their timeline predictions, what stops or doesn't stop the scaling, and why the political response might happen faster than anyone expects. Guys, there's a lot of conversation about the macro. Are we in a bubble? How should we even think about this question? We're going to get into forecasting later on.

Starting point is 00:03:45 But why don't you just take your first stab at how you approach such a big general question? Yeah, I mean, for me at least, the way that I thought about this a little bit is I look at kind of the big indicator, being how much people are spending on stuff like compute. And I guess maybe some sense of will they regret that spending? That's relevant. But how much are they spending thing? Like you can see. You can look at Nvidia and how much they're selling each year and you can see

Starting point is 00:04:11 whether it keeps on growing and you can see whether stuff is kind of looking good to continue. Will they regretted side? I mean, that's just 2B.C. Right? Like we'll actually have to wait and see. It does seem as if most compute gets spent on inference that companies don't, so far regret, like, using to offer their products. So, I mean, on that side, I'm, like, thinking not too bubbly yet.

Starting point is 00:04:35 But, yeah, I low confidence on those other stuff to think about. Right now, the amount of money companies are actually earning in profit, not including the cost to develop the models initially, seems to be, like, very positive, such that if they stop developing bigger and bigger models and just stick with the ones they've had, they'd have earned a profit pretty quickly at the current margins. And in this sense, it doesn't seem, On the other hand, at any given time, they're investing in building even larger and larger models.

Starting point is 00:05:02 And if that goes well, then they'll learn more money. And if that doesn't go well, then no matter how profitable they are right now, it'll be a small amount of money compared to how much they would have spent. So I think right now there are not financial signs that there's a bubble. A lot of people worrying about bubbles just aren't necessarily used to the level of spending and just like the level of success that sort of happened and like scaling. but if there is a bubble, it could happen very suddenly and be pretty bad. Yeah, I think we're almost giving you here the useful answer.

Starting point is 00:05:33 I don't think it's a bubble because it's not burst yet. When it's burst yet, then you'll know it's a bubble. Yeah, yeah. I do think, like, you could imagine a world, which is all the spending, and the current level of success does not, like, people often make the case, oh, AI hasn't been profitable yet, and they're spending more to make it profitable. But right now it's not making anything. And in reality they're making, they'll have people.

Starting point is 00:05:54 paid off the cost of all of the development they've done in the past very soon. It's just that they're doing more development for the future. So I think there is this underlying financial success so far that I wouldn't expect to see if there are the very least an obvious bubble. Yeah, that does seem very relevant. People are spending a lot on these models. They're presumably, like, you know, users to use them. They're presumably doing this because they're getting value from them.

Starting point is 00:06:21 You can maybe argue like, oh, well, I don't think that values, I think people are just playing around, whatever, but like, whatever they're paying for it. That's a pretty solid sign. I guess one quick question related to this is like you're talking the report of the AI in 2030, basically that you haven't seen signs of basically these models kind of plateauing or like the capabilities keep increasing. And you have the benchmarks. You have the amount of data that is going, the amount of compute. Do you think faces or parts of the models are plateauing though?

Starting point is 00:06:49 Like, for instance, pre-training, are we seeing some sort of plateauing in that? or do you think people are still exploring some innovations in that stage? And Kirsten, what do you think about that? Yeah, I think this gets a bit harder to look at. Like, we get to an area where there isn't as much public data to say a lot, right? It seems as if pre-training is comparatively less of a focus than it was before, partly because, like, you have this exciting new direction of, well, new-ish direction of place training where they've done so much about reasoning, whatever.

Starting point is 00:07:23 But then I don't necessarily take that as evidence of like, oh, no, and that means pre-training, you couldn't scale further or whatever. It seems as if there is meaningfully more data out there, it seems as if plausibly, like, even a lot of this stuff is quite synergistic. You develop a better model. You like use post-training stuff to make it better. You get a load of data of the model actually being used successfully or not. A lot of that can probably go into pre-training next time.

Starting point is 00:07:50 You aren't projecting a software-only singularity, where AI is able to automate AI research, but it's an automated feedback loop. Why not? Yeah, I mean, I guess like I'm answering this and I'll have to say more. And it's like, to me, it's like that report, it's no one person's kind of, oh, this is like the forecast. This is the prediction, right?

Starting point is 00:08:11 This report very specifically looks at what are the current trends? Are there reasons that they clearly couldn't continue or might not? And if they do continue, where do they lead? I think whether you see this self-improvement thing, that's very hard to do from a sort of trend extrapolation basis, right? Like, currently, AI stuff does help AIR&D at least a little in terms of stuff like coding or selecting your data sets and creating those, whatever. But it's quite hard to actually measure, and it's not really helping in some big way,

Starting point is 00:08:43 like this kind of self-improving thing would suggest. There are reasons that you might think it could be very hard. people have discussed before how possibly, you know, if stuff just depend a lot on scaling up compute, then maybe automating a load of the R&D isn't that helpful. I find that somewhat compelling, but I think it's also just, it's pretty uncertain. It's hard to speculate about something that's quite out of regime like that. One thing that needs to happen in order for a software-only singularity to occur is you need to be in this world where scaling up the amount of researcher R&D time, basically, allows you to, like, improve AI enough that it makes up for the

Starting point is 00:09:26 lack of being able to scale experimental compute or pre-training. I think that something you would expect to see if this were the case is maybe not that much experimental compute being used in practice, and instead all of the money is going towards researchers. Now, there's a very good case that there's a very large amount of money going towards researchers, but as far as we can tell, experimental compute, which you seem to need to do research, is receiving a similar amount of money and that in fact, it's receiving many times more money than the final training runs

Starting point is 00:09:54 of the models that are actually being released. I think this is, in my mind, is a strong update towards, oh, you need to do very large-scale experiments to do research, and that we don't really have good evidence that researchers and just researchers would be able to speed things up

Starting point is 00:10:10 without doing more experiments. However, there are like pretty good arguments on either side of this. I tend to lean towards, no, you actually need to do more experiments, and that means you can't get this software only singularity. But I don't think the people who claim otherwise are, like, crazy. I think they're making some, like, they have, like, very reasonable differences,

Starting point is 00:10:28 and we're both speculating on something where the data is currently pretty sparse. Actually, related to that, like, what do you think on... So if you have, like, some of the exploration that researchers are trying, I mean, obviously, like, people are exploring a lot with RL, trying to go beyond verifiable domains, And what do you think about the argument, for instance, that grading descent is really good on learning and the current data set that you're giving, right? And if you keep training this over and over, it's going to start forgetting things that it was trained before, right? Like, catastrophic forgetting.

Starting point is 00:11:01 And there's this argument, right? Like, well, kids don't learn that way. Like, maybe there's some imitation learning that kids do. Maybe there's some sort of exploration that they do. And I wonder what you think about it. And it sounds right. Like, if kids really would just learn on it. limitation learning, I think parents would have a great time just raising kids, but it seems like

Starting point is 00:11:20 the reason why they have such a hard time raising kids is because they explore all these different things. What do you think about it? In terms of the algorithms and the things we need to keep improving these models over and over beyond the data and the compute? I am cautious about comparing how AI is learned to how humans learn, not because I don't think they are comparable, but because I think we know a lot more about how AI is learn right now than we know about how humans learn, and people like making sort of assumptions about how human learning works and saying, oh, yeah, I doesn't do it that way. And I don't know, maybe that's true.

Starting point is 00:11:51 Maybe human kids learn via RL. I'm not very, I think that, yeah, I don't have strong opinions on whether or not, like, you know, you need to change to a method that's more like what we think kids do right now. I suspect people will find some method that works to use the computer available because they've been able to do this in the past. Yeah, I'm also sort of reluctant. I guess as well, it's one of those things where when we point to particular issues, like the example of catastrophic forgetting, it's sort of, well, okay, but as we've scaled up,

Starting point is 00:12:31 we have managed to do quite well at having models that remember more and more things. This isn't to say that hence the problem is solved, hence we're done, Hence, no more other mitigation is necessary or anything like that, but I'm not exactly going to write it off. Yeah, I definitely don't think we've seen any slowdown yet in capabilities from any of these concerns people have. I think that people always have these sorts of concerns. I'm reluctant to believe any given one of them

Starting point is 00:13:02 until this actually shows up in numbers I can see on a graph, which I just don't think has happened yet. Dario Anthropic has said, he said in March 2025 that within six months, AI will write 90% of a code. And of course, that hasn't happened yet. He also said, we have, you know, we could have AI systems equivalent of a country of geniuses in a data center as soon as 2026 or 27. How do you evaluate why Anthropic is so bullish or what is the crux of difference between

Starting point is 00:13:30 what would they believe and perhaps what you believe? My model, at least, which I don't know if it's right. But what it is is that they think a bit more like the people who believe in you automate R&D, and that gives you very quick takeoff. So they see it as like, yep, we're working on these AIs that are great for kind of research engineering type coding. And at some point, they're going to be useful, and that's going to rapidly accelerate us to develop the next ones. And then it's going to be quick progress. Yeah, I think that it's hard to tell

Starting point is 00:14:07 the extent to which I don't think we've gotten a lot of evidence that there's sort of views of this software-only takeoff are wrong insuffar as like they were taking a little bit longer to get to like the minimum level of competence for AI to get you there definitely seems to be the case but it I don't know it's hard to tell the extent to which we've actually had

Starting point is 00:14:31 significant updates on this. Dario often qualifies what he says by like saying as soon as or something like this. So this is like maybe more so the faster timelines he gives, although I'm not sure. Yeah, there has also been, I think, sort of, you know, Talmud-style commentary where people are carefully looking at his exact wording and then at wording of other people's discussion of how many lines of code that are generated by some teams at Anthropic are generated by Claude Code. and whether this does or doesn't satisfy what you said. So it gets a bit tricky.

Starting point is 00:15:08 I remember there was the paper from the uplift paper that was claiming that actually models would slow you down. But I think it matter a lot what models they were using at the time because I think they were pretty outdated by the time the report came out. And I mean, in my personal experience, you definitely become way faster. And it just saw so much more for you. Like you're just having the whole context on your code base.

Starting point is 00:15:29 That's such a huge advantage that I think. for humans just would be really hard to do. I mean, far more than 90% of the code I write is written by AI these days. But I know I'm not like the average coder at all. But it's definitely, I don't think it's like a wild prediction at this point that 90% of code is going to be written by AI. I mean, for all I know, somewhere at OpenAI, there's someone just, you know, or that, you know,

Starting point is 00:15:57 with alpha code doing evolutionary algorithms on having tons and tons of trials, trying to, you know, million shots some hard problem. But it's just like, it's really unclear how many lines of code are actually being written by AI right now. I don't think it's such a wild. It's by a lot of, like, people's intuitive sense in terms of like, oh, is 90% a job of a programmer being done by AI's? Definitely not. But there's this more complicated sense of, like, how much is being written by AI. Probably not 90%, but it's hard to tell.

Starting point is 00:16:32 Yeah, and I think that is a very meaningful distinction. Yeah, like if you were to measure how many lines of code are being written, quote, unquote, by like, tab completion, then it's probably quite high. But you don't necessarily expect that that's taking on that much of the programmer's really hard work. That uplift paper that you mentioned, like, I find it really interesting and really good. And it's also surprisingly recent in a way. Like, you know, you mentioned, ah, the models are outdated. I mean, this was early 2025.

Starting point is 00:17:01 So these were models that people actually did think were helping them. And in the paper, they even got them to say ahead of time, like, how much do you think this will speed you up? And they said, yeah, I think however much. They then asked them afterwards, how much do you think this sped you up? And they're like, yeah, yeah, it sped me up. And I feel it does reveal actually, like, it might be hard for us to judge whether we were sped up or not.

Starting point is 00:17:24 Yeah, one thing that might be happening here is that a lot of the code that's getting written by AI is code that wouldn't have been written otherwise. so it's not really speeding off things that would normally happen. But, you know, there's a lot of simple graphs or simulations I run that might have not gotten written otherwise. And so it's hard to tell exactly what's going on here in terms of the impacts. I think at the end of the day, the most reliable indicator here

Starting point is 00:17:50 is going to be how much money these people are making from programmers and from, you know, subscriptions in general. And it's a lot of money. I think there's definitely indications that people are finding a use for them. and probably a decent amount of that uses for coding, but not exactly for the metric of doing 90% of an existing coder's job. Yeah. Biology is this phrase that's been being used a lot,

Starting point is 00:18:12 which is AI is an end-to-end, it's middle-to-middle, and which is meant to imply that, you know, we're going to need a lot more human involvement than some people typically think. What is your mental model of what AI is going to do for labor markets, either on the sort of lower end and or the higher end in the next, you know, decade, let's say. Oh, in the next decade, like, on the higher end, I'm definitely like, you know, probably I expect new jobs to be created. Everyone could still be influencers. But

Starting point is 00:18:46 on the higher end, it's like, there are not very good individual things that you can point to where it's very obvious that AI can't automate that job at this point. Now, you can't, could argue, okay, but there's some unknowns. And I think it's, like, pretty reasonable. But those unknowns, we sometimes, you know, AI gets up against its limits and we figure out what they are, and then it learned surpasses that. And I don't know, at the higher end, it definitely seems plausible that it could just automate all of the, basically all of existing jobs, with the exceptions of ones that require

Starting point is 00:19:20 manual labor, that people actually care about being done by a human. It's just like, does not seem at all implausible to me that that kind of. happen or that that could happen very fast, with the caveat there being like there's probably some regulatory pushback if that happens. On the lower end, I don't know, could just, you know, could be a bubble and doesn't have any impact. The thing I talk about when I'm talking about, the interesting scenario to think about, which I don't know, you know, 20% chance, 30% chance something like this will happen in the next decade is like, you know, a 5% increase in unemployment over a very short period of time, like six months, due to AI being released,

Starting point is 00:20:03 to something that I think will have a very substantial impact on the world, both in terms of how people think about AI, and sort of how much attention it gets and seems plausible to me, but, you know, far from guaranteed. Yeah, I think I strongly agree with being just highly uncertain. It seems very plausible to me that you end up more, or less kind of, you know, this generation actually is exactly where we run out of progress. It would be kind of crazy, but it could happen.

Starting point is 00:20:37 And then it's like, oh, okay, everything is very much just generating more jobs for technical people to try to integrate it into doing kind of useful but janky things for all the existing work people do. The stuff where it kind of becomes a crazy runaway thing that you can, yeah, really automate large swaves that promote work with. I mean, my timelines are, I guess, pretty a bit longer than the others. But yeah, I mean, it seems hard to rule out that something really big happens in a decade, a decade's quite a long time.

Starting point is 00:21:11 I think I would be surprised if there were not 5% of jobs that exist now, which AI has automated away over the course of the next decade. Honestly, I'd be surprised if it's not 10% of the jobs that exist now, I think. how fast that happens and the extent to which those people find other jobs is something which I don't think I have seen compelling evidence for

Starting point is 00:21:34 either way and probably depends on how fast various things go in exactly what jobs are automated. I think the 10% of current jobs seems like a pretty reasonable lower, it's not quite my lower bound but you know a pretty reasonable number over the next decade but this might not show up an overall employment number.

Starting point is 00:21:54 This is interesting. I mean, definitely, like, to the extent there is a mainstream economics view of this stuff. It would probably be that automation happens at the level of tasks rather than occupations. And occupations can, as a result, you know, go down quite a bit. But a lot of the time you're automating these like similar tasks across lots of jobs. I think this is compatible with what you're saying. It's just that some jobs get really hit by it. I don't know.

Starting point is 00:22:26 I find it, yeah, quite hard to think about. I'm not sure what the, even the historic base rate for kind of jobs ceasing to exist is, I know there are problems with this, like the historic employment data series. There is actually quite a high, I believe, base rate of just the tasks in a job changing, jobs themselves changing, jobs kind of going away, coming in. So, yeah, even this 5% thing, I don't know what to think, yeah, that would be like a big effect. or kind of, yeah, that's actually roughly the size of the fact you've already sent from something like software, I don't know.

Starting point is 00:22:59 Yeah, probably 5% of jobs that existed before software no longer exist. It seems pretty reasonable. But I'm not confident to this. It's definitely something which, like, I don't know, I expect, especially if revenue trends continue, I expect to know a lot more about this in a couple in a year or two, probably within the next year, because it will just be the case that, okay, we will have AI's earning enough to be a substantial part of the economy.

Starting point is 00:23:30 If it's not showing up in unemployment, then we've learned something about what it's doing. We've learned that it's able to do this without showing up unemployment numbers. Or maybe it will show up in unemployment numbers and we'll see exactly what. There's been some early work looking at indicators of this. There's a lot of things that complicate looking into this because interest rates also. have effects on like the sort of things you might care about or just like normal churn or also it's possible that tech companies you know maybe they'll lay off a bunch of programmers so that they have the capital to build data centers and are those programmers being laid off because of AI?

Starting point is 00:24:07 I don't know maybe. If you had a kid that was a freshman in college and they were asking, hey, you know, what should I major in if I want to have a great career? You know, what may you tell them? And if they asked you about, you know, computer science or math or, you know. Trump engineer. Yeah, exactly. Yeah, what would you say?

Starting point is 00:24:25 I mean, I'd probably say not prompt engineer, I think, in general. People get better at using AI is very easy to use. Yeah, I think it's a good question. I think they should probably measure in something where if they're majoring and programming, the thing that they should be, or computer science, the thing that they should be looking for is not being a person who's going to like, like the skills that are going to be useful are not going to be knowing a programming, language. It's going to be more general purpose skills, ability to work with other people,

Starting point is 00:25:01 communication skills, this sort of thing. I don't really know entirely if this points to a particular major. Most majors are probably not majors that are actually relevant for your job. Yeah, I guess I'd sort of be like, well, there's not too much that you can do to plan around the super crazy futures. So I guess go for something that you're passionate about, that's useful in the worlds, but don't go crazy in that way. I actually think that, yeah, computer science, maths,

Starting point is 00:25:29 if you're passionate about them, they're very good because you'll learn interesting things that are valuable in many worlds. But I don't know. I gave advice to a younger relative recently and they chose to study drama instead. I do think that, you know, one of the things that

Starting point is 00:25:47 if you have a better time in college, that's like four years of your life you've had a better time during. And at the end of the day, like, you know, if it's a crapshoot, which of those things is actually going to give you a better time in the future, planning for the present is a lot easier. I mean, it's definitely becoming really hard to know, right?

Starting point is 00:26:06 I remember, like, the problem engineer was obviously a joke because everyone believed two years ago that that was sort of, some sort of viable thing. And obviously, models are phenomenally better at, like, just, being great prompters. So obviously, that's kind of like one thing that has been happening. It's really hard to predict what's happening as these models keep better and keep being better.

Starting point is 00:26:27 One question that I have related to this is obviously code is such a big market and it has had such a big impact. One that I'm very excited about, but still much earlier, I think, is computer use, right? It's basically automating all the digital tasks that you're doing in your computer. And there's very few benchmarks around this, like whether it's Webberina or always, world, and you talk a little bit on your report about benchmarks. Curious and like, what do you think is missing in that space? Like, why we haven't seen yet that moment where the moment, for example, when Sonnet 3.5 came out or cloud code or codex, where we saw significant improvement on coding in general, we haven't

Starting point is 00:27:05 had that moment for computer use. What do you think is missing there? Interesting. I mean, there have been improvements on computer use for sure. I do have, I mean, this, maybe I'm going out on a limb here slightly, but also I do think that there is a sense in which models are a little bit artificially hobbled by their vision capabilities. Like it does seem as if a common pattern you see when you try to get models to do stuff with a GUI is they kind of get a bit confused about manipulating it. And, you know, in a way where it's like, okay, this is interacting with your general propensity. to get infused in long, as you would in like difficult long coding problems.

Starting point is 00:27:50 But it's kind of exacerbated because like you're not able to just easily look back on the thing and see kind of, huh, I was wrong. You instead go down like some awful dead end of just, I'm just going to click this again and again and again. So I think that's part of it. I think there is something here also probably about kind of long context coherence stuff. like those tokens to represent the GUI are pretty big and then you're filling up your context window as you go with like, oh yeah, well, I had all of this stuff

Starting point is 00:28:21 that's happened before and you seem to just run into a kind of spiral of increasingly less sensible outputs. So I feel like these are two of the big things, but I don't know if that answers your question. I found computer use. I don't know. This was the first year I found computer use actually useful. We use CHATGPT agent in our data center research

Starting point is 00:28:43 because a lot of what we have to do is find permits, which are all going to be on Janky, county by county databases of air permits for, you know, the county that Abilene, Texas is in. And I don't know what databases exist for every county in the U.S. ChatGPT does.

Starting point is 00:29:05 Normal chat GPT can't search them because it's these, you know, these actual user interfaces you can't just search them with, you know, URLs because they definitely don't work that well. And it's able to navigate this, such that I can just ask it to find me permits on a data center in a particular city, and it will come back with air pollution permits and, like, tax abatement documents and all of this stuff that let me learn a huge amount. And this is just like, because of the improvements we've seen in computer use over the past year or so, I'm excited to, yeah, I think it's just going to get better from there, but I've definitely found it starting to,

Starting point is 00:29:41 to the point where it's actually useful. What's your mental model more broadly for what is going to happen to productivity or or just sort of economy statistically in general? Are you, some people say GDP growth would be, you know, 5%. I think it's a Tyler Cowan view. I think some people would say, no, no, you should get up to 10% growth or maybe even higher if we truly have AGI in terms of how we understand it. What's your model of what happens to the productivity?

Starting point is 00:30:09 I think my kind of baseline guessing would be, you know, I forecast out kind of if revenue keeps growing the way it has, in theory, for it to be worth spending that much on that, you know, those chips to do that inference, you should be getting something kind of similar to that value after those chips by then. So then you could just draw from that kind of like, oh, okay, so extrapolating to 2030 you need. And I think for there it was. in the report, I don't know, I calculated it. I think it was on the order of like a percent kind of GDP increase. That's in a few years, right? That's not presuming AGI. That's presuming like if Nvidia stock revenues keep like growing as they sort of previously have and you assume that

Starting point is 00:30:56 they make roughly as much compute from it as before and so on. If you actually get something, I mean, AGI is like, yeah, people use it to be emptying different things. I think if you actually get something that can do any tasks that humans can do remotely, then presumably you see a lot of growth. It feels sort of difficult to guess exactly what kind of a lag you're going to see. I think there's reasons to think, oh, well, maybe people will be slow to adopt stuff. How do they learn to trust it? Whatever, there's other reasons to think, well, they're already using these technologies. A lot of it might actually be quicker than most growth. And indeed, adoption's been quicker for LMs than for many previous technologies.

Starting point is 00:31:43 So, yeah, I think it sort of gets hard at that point to model. At some point on our site, we had some rough numbers where it was stuff like, what if you doubled the virtual labor force? What have you 10 times? Whatever. Then you see these crazy GDP boosts. I don't know whether that's the most reasonable way to think about it. I think a lot of it comes down to whether you imagine that, like, yeah, you really get something that can do everything versus you get something first that can do a meaningful fraction of remote tasks, but maybe can't do like an entire bucket of burn the minute bottle next you more.

Starting point is 00:32:22 So I guess it's again this thing of like, my best guess on current trends is this fairly well-defined, you know, few percent of GDP in 2030 thing, which is already pretty crazy. by economic standards. But then once you go much further, it's like, God, my predictions are just going to be even crazier. I'm reluctant to make them. I am going to be slightly less reluctant. Assuming in the next 10 years, we get AI that is capable of doing any remote job as well as any human,

Starting point is 00:32:56 I think, you know, 30% GDP growth seems like a lower bound on something that's reasonable. Assuming you get, this is a big assumption, And a lot of people are going to, that, you know, there's a lot going on in that assumption. But assuming that happens, I think you either are going to get like 30% GDP growth or, you know, negative 100% GDP growth because everyone's dead. It's just like, you know, it's just like at the end of the day,

Starting point is 00:33:23 it seems like you're going to have AI that can scale, that if you have AI that can scale there, you can probably have AI that scales even farther. And right now, I think the like economic models, I have seen of what happens if you get this sort of full replacement, you can automate a job. Are, you know, I either show this sort of an extremely fast, wild takeoff, or with a couple of it, or, you know, you have some people attempted to do this, who then say, and then you, like, look down through paragraphs, and it's like, assuming current levels of,

Starting point is 00:33:57 assuming AI is as capable as GPT3. I think the smaller number is just like you know they're either neuroterm predictions or predictions that aren't looking at like the full the more the upper end of what sort of capabilities you might see in the next 10 years yeah I mean it does seem hard to imagine a world where you have this supply of virtual labor

Starting point is 00:34:22 that literally can do any stuff that humans can do and then it doesn't need to crazy things I definitely agree with that I guess perhaps maybe some sort of a, I don't know, a heavy regulation situation. There are, yeah. I think there exist worlds in which things don't go crazy after that. It does seem like those worlds are not in an indefinite stable state. But, you know, it's not impossible, but it does seem like the default there is

Starting point is 00:34:53 you either go crazy up or you either go crazy down. And it's probably going to be one of those two. if you get to a world where it's like, genuinely AI can do any job as well as any human. I think people, I don't know, it seems wild to be to claim that, you know, given that your default case should be, you know, not super ridiculous changes.

Starting point is 00:35:12 It's just like, that's a lot of things that your AI can do right there. And that's like, yeah, it just like seems like it should have fundamentally changed the economy in one direction or another. My intuition is a lot of the disagreement. I mean, probably some of it does come down to sort of cashed,

Starting point is 00:35:28 beliefs people already have. But I do also think some of it is that when people talk about like, oh yeah, AGI, AI that can do a remote job, whatever, even though we feel like we're talking about the same thing, maybe sometimes we're not. I don't know. I've certainly had examples of conversations that it's like, yeah, AI can do any remote job. And then they discuss stuff that it can't do and the stuff that it can't do. It's like, well, no, like, that's also a remote job. Like, that's the kind of thing people currently do. So I think there is some of this. What do you think like, I mean, you talk about benchmarks on your report, but I wonder like 2007, 2008, what are going to be the right benchmarks to measuring the progress, more than

Starting point is 00:36:07 the economic growth, more of the capabilities on the model, like intelligence on the model. Like we had in 2012 AlexNet, obviously that, that got solved long ago. But that was probably not a measure of AGI by any means. Do you think the same would happen with the current benchmarks we have? So SweetBinge, MLU, let's say we maxed out on those benchmarks. What comes after the, how do we measure that is sort of like GDP growth with these models? Is it sort of breakthroughs in science? How do you think is the right measure going forward?

Starting point is 00:36:40 Yeah. I mean, I think most of what we have is likely to be solved. And indeed, the examples you gave are like pretty close already. Like, I don't know, it is basically solved sweepbench as like possibly close. depends a bit on how ambiguous some of the questions are. There's some details, but it's really getting there. I mean, I think some directions are obvious. You kind of do similar things, but harder and a bit better,

Starting point is 00:37:08 and trying to make them a bit more realistic, and people are doing this. There are harder software benchmarks that people have made more of an effort to try to curate and that cover larger tasks, for example. I think there's also some question of kind of budgets involved. I do think there's this kind of thing where like obviously if you just burn money, it doesn't intrinsically make the benchmark better. But probably you are going to see something where you're just going to have to devote more

Starting point is 00:37:35 resources on average to them. Like if you're trying to prove a sort of higher level of capabilities to a higher standard of proof, probably it's going to involve kind of more effort in developing them. I do also think, though, you're going to see examples of, you know, relatively small, kind of small numbers of things that are just very impressive. And these are also a valuable signal. Like when you see Alans being able to do things like, oh yeah, it just refactor this entire code base and it was really useful, then this is going to be useful.

Starting point is 00:38:10 And even if it's not yet formalized into a benchmark, if you've seen it for yourself, it's going to be kind of useful for you as evidence. And then people are probably going to make benchmarks that cover things like this to try to systematize them. I want to go back to our question on timelines. And I want to ask you about a few different sort of milestones and get your perspective on timelines there. So first is what is a rough timeline for a major unsolved math problem being solved by AI?

Starting point is 00:38:39 I actually wondered, yeah, because you had a few of these that you said, trust the look at. When you say that it solves this, I mean, is this unassisted entirely? it's or is it kind of a news report or someone tweets that hey like I dump this at GPT and it solved it and what counts is major something that we would all agree like a substantive you know version of it not not a you know just an anecdotal you know person describing it hmm that does it have to solve it on its own yeah let's go with that sure yes honestly Oh yeah, because I mean there's already cases, it seems, of LMSB, you know, like people are debating a little bit,

Starting point is 00:39:21 but mathematicians who seem trustfully are saying, like, wow, I used this and it was really helpful during my proof. I would not be surprised if AI solves, like, a major unself math problem, like the Raymond hypothesis, so similar in the next five years. I'm not going to say that, like, that's my, you know, median case necessarily, but I definitely wouldn't be that surprised. It's like, right now, it doesn't look like math is that hard for AI. It's just like some things turn out to be hard and some things don't. And math is just like one of the domains where it's all seems to work pretty well. And where it's most other domains, it's not at the point where it's like useful to a full professor. To the same extent I think it is for math or getting very close to for math.

Starting point is 00:40:08 And also it's like very unclear to what extent certain capabilities that it has unusually well might actually turn out to be very, very useful. Like maybe it'll turn out that there's like four papers out there that it knows about that have obscure results in them that when combined solve some big conjecture, which is the sort of thing that it like might be much more feasible to figure out with AI than for a human to figure out

Starting point is 00:40:33 or something similar. There's a lot of uncertainty here, but just like does not currently seem like something that AI is actually going to struggle with. People often make claims about it being like this, you know, intuitive deep thing that it would mean that AI has achieved something, some huge level of intelligence for it to solve. I think in practice this is just like, you know, making a piece of art. It turns out AI could just do that before it could do a lot of other, before it can, you know, remember things for more than a couple of days or whatever. Yeah. It turns out to be further down the capabilities tree than people might have guessed.

Starting point is 00:41:11 Yeah, I think I'm also bullish, though I do think that, yeah, it's one of those things where it's tricky and you really probably do need to define it quite well to get a good forecast on it, to hope to get a good forecast on it. Like, I don't know, we've had this experience that with benchmarking mathematics, you know, we got mathematicians to cut with problems that I think aren't as difficult as the kind of problems you're talking about. But nevertheless, they're like, yeah, AI could solve this. It would be like a big deal for AI progress. It would mean something to me. And then AI has solved them. And usually their response has been kind of like, oh, yeah, that updates me a bit. Although, man, when I look at it, I just realize, like, yeah, you can kind of brute force this.

Starting point is 00:41:52 You can kind of cheese this. You can get through. And it's a bit like, oh, okay. I mean, what if there's a problem that for humans we consider sort of, oh, this would be quite big. And then, yeah, AI solves it. like, oh, well, it solved it, whatever. We sort of had this with chess decades ago, right? Like, computers solved chess very well,

Starting point is 00:42:12 and everyone was thinking of this as the pinnacle of reasoning, and then they did, and everyone as a result kind of concluded, like, oh, well, of course computers can do chess. So, yeah, I don't know. I suspect that math is quite nice for AI to do. I'm reluctant to go out and assert, like, oh, yeah, definitely air is going to like

Starting point is 00:42:35 solve some of the millennium price problems in the next few years but it would not at all surprise me if it solves quite impressive seeming things in the next few years

Starting point is 00:42:46 to that what about a breakthrough in biology or medicine and we've already seen some of that with the what's it called alpha

Starting point is 00:42:57 alpha fold it math team is unusually easy for AI I'm going to be honest So to the extent where I'm like, oh, is it going to do the same exact level of like, oh, on its own did this huge thing? That seems a much bigger stretch to me.

Starting point is 00:43:11 It definitely seems plausible. But there's a lot of other concerns there where it needs to be able to actually do experiments and get data and interact with the real world for a lot of these. In a way that does not need to happen at all for math, in particular for certain. Yeah, it's just, they in fact seem farther off. What seems more plausible to me is that we see, like, you know, it become ubiquitous that sums tools, like, of using AI in some sort of aspect of, like, biology or chemistry or something useful like that, that it, like, certain aspects of it are enhanced. It also is possible that AI will, you know, make incredible strides without,

Starting point is 00:44:00 yeah, I think, without humans, but it's harder. Yeah, I think, again, it's a bit tricky for where you draw the line. I mean, I think you're not counting tools like alpha-fold, because if you were, then probably you'd argue for that, right? The inventors co-won the shared Nobel Prize. But, yeah, I mean, I guess there's kind of different directions. In biology, you could have AI being able to predict quite, you know, specific things like that, or you could have something that's more general purpose, this so-called, like, co-scientist,

Starting point is 00:44:31 or whatever they want to call it approach, or it's more about, like, oh, it was able to look through the literature and have good ideas, and there's different extents of human involvement. There already seem to be some results where impressive stuff is happening. I've not vetted them enough to really have a sense of, like, would this already count as having satisfied, yeah, the sort of level of impressiveness you're looking for. I sort of assume that finding things that end up

Starting point is 00:45:01 being meaningful will happen pretty soon if it hasn't already happened. But then maybe there's a question of kind of, okay, but is it doing as well as human researchers are actually like prioritizing the best few ones to work on? I think most of these co-scientist results have probably had pretty involved humans prioritizing. Though again, I've not looked enough to say. Lastly, how about for real superintelligence, your definition of super intelligence? I have, I think I am on the record as saying that the median timeline I discussed, or the modal timeline, sorry, I think it's modal, yeah, which might be on the early side compared to where my median is, is, you know, 2045 was where when I did the podcast with Jaime,

Starting point is 00:45:55 We discussed our forecasting, breaking down, and everything going bananas is the terminology I have used. And that looks like superintelligence. I, you know, I think that it's like the case that if we get AI that can do every single job that a human can do as well as any human can do that job in the near future. And this is, you know, means that scaling just works to get things much, much better and probably means that you are not that many steps that you are just a bit more scaling away from getting AI that could do anything that humans, sorry, two things vastly better than humans. Yeah, it gets hard to predict it. I think as well it gets to be one of these things where the predictions get a bit unmoored from the stuff that you can,

Starting point is 00:46:52 properly model. Like my sort of, you know, guesses, my like judgmental forecasts to use the fancy term for just kind of can do any remote work tasks, probably have a median of about like 20, 25 years. I kind of struggle to imagine a world where that happens and people are like deploying it and doing research and yet they're not making further progress

Starting point is 00:47:20 to being able to do stuff much. better. So I guess they have to be like not too much longer after that for some definition of superintelligence. But yeah, all very uncertain and yeah, it seems to break down a bit. You talk a lot about the progress in data centers, benchmarks, biology. And there was one interesting part that I noticed just in the field that is robotics is making a lot of progress with, let's say, world models and like the physical space a little bit. Curious and like, what is your take here? Like, what do you think it's, it seems like a lot of the problems in robotics can be solved purely with imitation learning. You might not need a lot of sort of like breakthroughs in math or whatever.

Starting point is 00:48:00 Like you can just basically learn it from a lot of data. And I think in the last couple of years has been remarkable using in robotics and world models overall. Curious on your take a little bit on this and if you did some kind of research in the space. So we've looked into what sort of amount of compute is actually being used to like do these training runs. and what we found is that like compute the training runs that are being used for robotics are like a hundred times smaller than the training runs that are being used for

Starting point is 00:48:31 than the training runs that are being used for like frontier models and so there's a lot of skill you can do there I don't think that until plausibly until very very recently there have been serious attempts to gather data for robotics at a massive scale it's just the case that you can hire a bunch of people to move around in motion capture suits if you need to. And there have been a lot of attempts to do that,

Starting point is 00:48:52 although I think this might be changing. I think of robotics is mostly a hardware problem. A hardware and, like, economics problem of, if it costs $100,000 to build a robot, then, you know, it's not necessarily better than a human who could work for $20,000 a year, or a very cheap human in certain countries, or something, like, sort of minimum wage in some countries

Starting point is 00:49:19 that you might be able to afford labor for. It's just not obvious to me that there is a software problem here. The hardware, it does seem like unclear, it's very unclear to me how much of a hardware problem is left. In particular, there's certain tasks with robots might be able to do,

Starting point is 00:49:39 but are they actually the tasks that you care about a robot being able to do? If you want your robot to be able to, like, nimbly walk around while lifting up heavy things and moving fast and react, And that's hard. That's a hardware problem that I don't think they've seen solutions for yet. Yeah, I think my impression roughly matches this.

Starting point is 00:49:58 It's sort of, I don't know, people fairly often talk about this distinction between remote work and physical work. I think because there's this perception of robotics progress lagging behind a bit, and there even is some intuition that maybe, maybe this physical manipulation stuff is actually just harder. But I wouldn't conclude that with much certainty. Like Jaffa's said, it feels like you'd kind of also want to see, well, okay, what happens if it gets scaled up in a similar way to even get a sense of like, oh, okay, was it actually

Starting point is 00:50:34 harder versus was it just deprioritized? Is there anything we didn't get to that you feel is important that we leave our audience with? We did discuss the data centers really suggested. I'm not sure if there's a good way to leave the audience with that. Yeah, let's get into it. Okay, so you guys just did a, you know, released a Dennis and a Project. What did you talk a little bit about what you were trying to achieve there and what you hope people take from it?

Starting point is 00:50:56 Yeah, so we took 13 of the largest data centers we can find. These includes a few from each of the major labs in the U.S. And we found permits. We took satellite images, including new satellite images, of all these data centers. We figured out how to determine how much compute is in them based off the cooling infrastructure at their building, as well as when they're coming online and their future timelines. So we understand this like real world data, and it's all available online on our website for free.

Starting point is 00:51:28 This like to give insight into this giant infrastructure buildup that's happening and the pace of it, there's some things about it that surprised me a lot. For instance, we learned that the most likely candidate to have the first gigawatt skill data center is in Anthropic, which would not have been my pick, but Anthropic Amazon's New Carlisle Project Rainier development seems on track to come online in January, followed shortly thereafter by Colossus 2. We also learned a lot about what the largest concrete plans are rather than just like marketing plans. Some people will throw around numbers, but the one we found that's actually seriously underway and has permits and is, you know,

Starting point is 00:52:10 setting up the electrical infrastructure for is one by Microsoft, which is going to be used by OpenAI, at least in part, in Mount Pleasant. They're calling it Microsoft Fairwater. And that one's going to be use a size,

Starting point is 00:52:25 use not quite as much power as New York City, but I think more than half. What's stopping us from significantly increasing the cluster size? Is it the, Is it cost? Is it supply lead times? Are there any other engineering breakthroughs required power? I think that people are approximately wrong that there's something stopping us and we are scaling

Starting point is 00:52:48 up as fast as there is money to scale up approximately. I suppose they could want there to be all of the clusters literally today, but they're scaling up really quite fast. You're seeing these data centers which are using, I think the one I mentioned for Anthropic Amazon is using about as much power. nearly as much power as the state capital of Indiana, which is where it's located. And the timelines on some of these, like the Colossus 2,

Starting point is 00:53:16 are, you know, two years or less, which is just an insane thing to build this thing that's using as much power as a city. I think that plausibly, you know, you don't want to buy chips now. You want to wait for there to be better chips. I think that people think of, there's a lot of noise about things being difficult and scaling up. And I think this is because people are having to spend a little bit more

Starting point is 00:53:41 than they would ordinarily have to spend. You can't use the ordinary sort of power pipeline, which is designed to deliver this affordable infrastructure at a slow pace. You have to, you know, buy things that you wouldn't ordinarily have to buy and spend more than you would ordinarily have to spend, but not buy enough to slow it down. All of these things pale in comparison to the cost of your GPUs. So my actual takeaway from a lot of this has been, oh, we're not having too much trouble scaling up.

Starting point is 00:54:13 But just like these plans are going really quite fast. And it's not obvious that people would actually have the finances and desire to do them faster. When people are talking about energy as a as a major potential bottleneck or as having to increase our capabilities significantly, you're not worried that that's going to be a sort of durable, sustainable bottleneck. that that's not working. I think people like complaining because they can't just use the traditional tug into the grid for cheap affordable power four years down the line pipeline. At the end of the day, there are expensive technologies that exist right now. You could pay for solar power plus batteries. This is fairly small lead times. It might cost twice as much as normal power, but that's still

Starting point is 00:54:59 way less than your GPUs. So you're going to do it if you have to. And you see people doing these sort of emergency things that cost them a bit more, you know, starting up their data centers. A common thing we see is people starting their data centers before their data centers are connected to the grid. I think Abilene was an example. X-A-I Colossus 1 is a prominent example of just finding ways around this that are expensive. And you complain about it because, you know, it would be nice if you could do the cheaper way. And no one's used to having to do it this expensive way.

Starting point is 00:55:29 At the end of the day, though, it's just like does not, there seem to be enough solutions. especially if you are as willing to pay as people are in AI, that I don't really expect it to be a significant bottleneck. Maybe we'll close with this. If these systems get as powerful as we're discussing, as we're discussing, I'm curious to how the sort of political system is going to respond. I'm curious if you're sympathetic to the Ashton Brenner view, that there's some potential nationalization that occurs.

Starting point is 00:56:00 But how do you expect governments to respond? It's kind of remarkable of how not in the political discourse it is, given how powerful it is already. I'm curious how you think about that. I expect, so the thing I, calling back to what I mentioned earlier, this concept of, you know, the potential for 5% unemployment increase in like six months. I think that the public's reaction to this will determine a lot. There will be very, very strong feelings about AI once this happens. I think there will be a bunch of, you know, very strong consensus on what to do.

Starting point is 00:56:31 on things that we don't normally think of as things that people are considering. I know when this happened with COVID, there was a several trillion dollar stimulus package passed at like, you know, in a matter of weeks to days, it was breakneck speed. I don't know what that will look like for AI. But I think it's like everything else in AI, it's like, you know, exponential, which means it will pass the point of, you know, people sort of care about it to people really care about it.

Starting point is 00:57:01 quite fast if things keep going. I just don't know where we're going to end up. I just expect, you know, wherever we end up, there will be, it will look like, oh, everyone suddenly agrees that why, that's to do this certain thing, which we would have considered unimaginable a year ago. And I don't know what that will look like. It might look like nationalization.

Starting point is 00:57:20 It might look like pausing. It might look like, I don't know, going faster, guaranteeing better unemployment benefits. Who knows? I just think there's going to be some sort of, like, strong response of some sort, and it's going to happen very fast. Yeah, I mean, you know, you make the point that governments are maybe less interested than you'd expect now, but, I mean, the current impacts, I think, aren't really that large.

Starting point is 00:57:49 I feel like the attention is getting larger, but it's not the day I as of right now is that powerful. And yet, governments are already talking about it a lot, right? and you have people meeting with heads of state from various hardware manufacturers and AI companies and countries talking about their AI strategy, stuff like this. So I feel clearly country, national governments are going to be quite involved. It's just a question of how.

Starting point is 00:58:16 And yeah, I also am a bit unclear on that. I think that right now we've seen this thing in revenue and finances where it's been doubling or tripling every year. And my default assumption is that attention that AI gets from policymakers and governments is going to follow a similar trend, where it will double and triple every year. This means that in the future, if trends continue, there will be a huge amount of attention, and it means that right now there's a lot more attention than last year. But you don't suddenly skip from very little attention to all of the attention,

Starting point is 00:58:49 although you do move quite, we are moving, I think, quite fast. I think we made enough predictions that we'll have to have you back next year and check in and see where we're at and then make it for next year. David, thank you so much for coming to the podcast. Thank you. Thanks so much for having us. Thanks for listening to this episode of the A16Z podcast.

Starting point is 00:59:12 If you like this episode, be sure to like, comment, subscribe, leave us a rating or review and share it with your friends and family. For more episodes, go to YouTube, Apple Podcast, and Spotify. follow us on X, A16Z, and subscribe to our substack at A16Z. com. Thanks again for listening, and I'll see you in the next episode. As a reminder, the content here is for informational purposes only. Should not be taken as legal business, tax, or investment advice, or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any A16Z fund.

Starting point is 00:59:47 Please note that A16Z and its affiliates may also maintain investments in the companies discussed in this podcast. For more details, including a link to our investments, please see A16Z.com forward slash disclosures.

The a16z Show - The 2045 Superintelligence Timeline: Epoch AI’s Data-Driven Forecast

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.