Limitless: An AI Podcast - AI DEBATE: Runaway Superintelligence or Normal Technology? | Daniel Kokotajlo vs Arvind Narayanan

Starting point is 00:00:03 All right, everyone, I've been looking forward to this conversation all month. Maybe for many months, I've got to say. So the central question on today's episode is, is AI normal technology? Maybe it's something like the Industrial Revolution, we've been through that before. Or is it very, very abnormal? Is there incredible variance? Is it something between utopia and annihilation? It could go either way.

Starting point is 00:00:25 It's like nothing we've ever dealt with in the past. We've got two guests on today that are going to discuss, or I might use the word debate, this subject. We've got Arvind Narayanan. He is a Princeton computer science professor. He is the professional hype slayer, including some hype slaying he's done in crypto in the past. He is the writer of AI snake oil, and he is a believer that AI is normal technology, kind of like electricity or the internet. Arvind, welcome to the podcast. Thank you. I'm really excited. Thank you for having me. We also have Daniel Kokitello. He is an ex-open AI researcher. He's the author of an incredible report called AI 2027, which is like a month by month.

Starting point is 00:01:03 month forecast of AI progress all the way to AGI, which I bet you can guess the year in which he predicts that's coming. He thinks AI is very, very much not normal and might become super intelligent by the time you upgrade your next iPhone. So pretty soon. Daniel, welcome to the podcast. Thank you for having me. Excited to discuss. All right, guys, this is going to be a lot of fun. We've got seven topics to go through and we'll see if we get to all seven. We're going to do this in a compare and contrast way. I'm going to see the conversation. David's going to help. And then just fire you right into the topics. You guys ready? Let's do it. Okay. The first topic on the agenda, the question, is AI normal technology

Starting point is 00:01:42 or is AI very much not normal like nothing we've seen before? Is this kind of a tool that we can control or is this a new species destined for its own autonomy? Arvin, I've got a quote from you to kind of set the scene, maybe in your perspective. You said this, AI belongs with electricity and the internet. It's transformative, but yes, it's still a tool. Daniel, on the other side, you said this. This isn't another general purpose tech, talking about AI. It's a new species about to outthink us. Arvin, I'm going to start this with you on the topic. AI, normal tech. You put out a report entitled, like with that as the title. What's your case for AI being normal tech? We break down the way in which AI or really any technology is going to impact society into a kind of four-step process.

Starting point is 00:02:29 The first step is improvements and capabilities, new models coming out, AI is able to do new things, but there's no direct line from there to the impacts that we really care about. The next step after the first step of invention, as we call it, is innovation, turning that into product development. And from there to users learning how to use these new capabilities and ways that make sense for their workflows, what they want to do and their jobs. and the last and final and slowest step is adaptation. We talk about many reasons why business models need to change, laws and norms need to change for AI's impact on the world to really be felt.

Starting point is 00:03:08 And we can already see this process playing out in the last two years where, you know, we had this big leap in capabilities with chat GPT and other releases around that time frame. But when you look at the extent to which that has transformed people's work, flows, I think we're starting to see that that's going much, much slower than people initially predicted, and we have some numbers in the paper to back this up, and we predicted this trend broadly will continue. Daniel Arvin thinks it's going slow.

Starting point is 00:03:38 What's your take? I mean, I also think it's going slow right now. For example, you said people, it's gone slower than people predicted. It hasn't gone slower than I predicted, in fact. It's gone about the same speed that I predicted. I know you're moderating this debate, so you're in charge, but I wondered if I could also ask some questions to help the stage. So, Arvind, if you've read AI 2027,

Starting point is 00:03:59 I would love it if you could give your own sort of alternative story, basically. Like, if you had been in charge, like, suppose that someone gave you a whole extra year of life and said you have to spend your year writing your own counter scenario to AI 2027. What would that be? What would you depict? And maybe one way of putting it is, like, at what point would it start to diverge from what we predict in AI 27? And then, like, what would do it?

Starting point is 00:04:24 how it would gradually diverge more and more as time goes on. Please just, like, paint that picture for me because I think it's a useful way to sort of, like, set the stage for everything else that we're going to be talking about. Sure, yeah. So, I mean, our fundamental point is that in 2027, the world is going to look more or less like the world in 2025. And like you said, yeah, we think gradually things are going to look different,

Starting point is 00:04:46 and that divergence, we say, you know, is roughly on the time scale of decades. But qualitatively, what the diversions would look like for us, there's perhaps a greater role of human agency and policies and so forth in determining what kind of future we end up in, as opposed to predictions stemming directly from the capabilities of the technology. And so that's the reason why, you know, picking one scenario would be kind of sampling from a very broad range of possible futures. So it's a little difficult for me to paint a scenario, but broadly I can, I can. I can say that over, let's say, the next couple of decades roughly, we expect to see, just for concreteness to take one industry like software engineering. You know, people are going to figure out what it even means to be a software company or a

Starting point is 00:05:38 software engineer at a time when AI is much more capable than it is today at writing code. Today we can vibe code, create simple apps. What if we can create, you know, enterprise scale apps in the future with AI and minimal human involvement. So one scenario for what software might look like in that situation is it'll be silly to create software once and expect hundreds of different companies to use it because you're forcing everyone to be boxed into the abstractions that are built into the software. Instead, enterprise software will mean creating software for one company or one team or even one individual user. And the role of the software engineer of the software company will be primarily to talk to

Starting point is 00:06:20 that client understand the specifications. So creating software will primarily be, you know, that communication role as opposed to typing code into a keyboard. So that's one scenario for how one industry might change. Again, it's hard to give an overarching scenario for how the world overall might change in the next couple of decades. Thanks. So I agree that the future is uncertain, lots of branching possibilities. And so there's a lot of uncertainty that we sort of have to collapse down like the wave function, collapse down into like a particular path. for purposes of discussion. And so I totally understand, you know, we dealt with that a lot when we were doing a

Starting point is 00:06:55 2027 is like we're not like confident that this particular path is what's going to happen. But it sounded like you were saying that like, yes, the AIs are going to get a lot better at coding. They might be better at autonomous coding so that they can basically like do the full stack development process without human intervention. But then the humans might go to other roles such as interfacing with the customer to understand their requirements and so forth. And you think that might happen in the next few years, but that it'll take. like decades to sort of like gradually transform the actual industry to to upend the way things

Starting point is 00:07:26 are going now. So that's that's definitely a disagreement between us because I think that I mean to put it in like a little potted summary, the AI 2027 story is first you automate the coding, then your research speeds up so that you build new AIs that can automate the whole research process. Then your research speeds up even more so that you can get to what you might call superintelligence, which means it's like cognitively. superior to humans in every well-event way. It still has to learn from real-world data. It still has to like, you know, deal with physical bottlenecks, but it's, it needs less data than humans do, and it like can, like, work through the bottlenecks faster than humans can. And there's a

Starting point is 00:08:04 million of it. And it thinks, you know, 50 times faster, you know. And so then we depict that sort of process happening in 2027. And then in 2028, we depict it sort of exploding into the economy and actually transforming things. And so it was interesting. You said earlier, like, your first thing that you said in response to my question was, I think 2027 will look much like 2025. And I kind of wanted to say, I also think 2027 will look much like 2025. AI 2027 depicts, you know, the economy looking basically the same in 27 as it does today, because a lot of what's going on is that the capabilities are advancing rapidly within the companies, but they're not like being, they're not like transforming the world yet.

Starting point is 00:08:45 And then in 2028, that's when the world transformation happens. That's when the new factories start going up, et cetera. So, you know, you talk a lot about the bottlenecks and the frictions and so forth. And I think, I guess I'd want to say something like, yes, those things exist and they're extremely important. And that's part of why, like, there's so much of a gap between what AI systems are currently capable of and the actual transformations of the irrelevant industries. Like, there's just a time lag to, like, build new companies based on the new technology and to integrate it into everything. so that's why I think things won't look that different up until around the time you get superintelligence.

Starting point is 00:09:21 And then after you get superintelligence, they'll also be bottlenecks and there'll also be time lags. However, I think that they won't take decades to overcome. I think that when you've got the army of superintelligence is we're looking at something more like a year to get to a fully automated economy rather than something like 20 years. So having just like, you know, recapped sort of my position there, I would then ask you, do you think that we will get to superintelligence anytime in the next decade? Yeah, thank you, Dan. That's very helpful. Yeah, so let me pinpoint exactly where some of my disagreements are.

Starting point is 00:09:56 So in the paper, AI is normal technology, we dispute the whole premise of superintelligence. We don't disagree that AI capabilities on many individual dimensions are going to keep increasing and there are going to be superhuman AI capabilities, just like there are today in things like chess. However, we think that most tasks are like writing and not like chess. So the key difference between these tasks is that when it comes to chess, human performance is limited by our computational abilities. So it's very natural to see how AI can be dramatically superhuman

Starting point is 00:10:30 and, you know, has been for the last 20 years. In a task like writing, our abilities are not computation, Our abilities are limited by a few things. One is hard to agree on what constitutes good writing and what doesn't. It's limited by our knowledge of the world. And we talk in the paper about why there are going to be bottlenecks to acquiring knowledge that humans haven't been able to acquire because it requires real world interaction. It requires experimenting on people and so forth. And finally, our writing abilities are limited because, let's say you want to write to persuade some one, we think there's only so much that you can make a piece of text persuasive. There are fundamental limits there. And so computational limits are relevant only in a narrow set of tasks. In most cases, it's these intrinsic limits and knowledge limits. And we do think that over and over AI is going to run up against those. So this idea that you can run AI at 50x speed and have a million copies of it that make things go a million times faster, we dispute that premise.

Starting point is 00:11:35 Well, I certainly don't think it will go a million times faster. We give our quantitative estimates of how much faster it will go, and it never gets close to a million. Oh, okay. Sorry if I misremembered. I agree that the parallel thing, there's massive diminishing returns there and things like you said. But I think I want to push you again on the point about superintelligence. The way that I would define it is it's an artificial AI agent that is better than the best humans at every important or relevant cognitive task. And so it's fine. And so according to this definition, if there's a hard ceiling on how good you can be at writing that's only slightly better than the best writers that are humans, it's still possible for there to be superintelligence.

Starting point is 00:12:15 It's just that the superintelligence would be running up against that ceiling, but only on an absolute scale would still only be slightly better than the best humans. But as long as you think it's possible to be like ever so slightly better than the best humans, then it's possible to have an AI that's meeting that definition for what I'm saying. And so then the question is like on all the relevant domains. So, so, you know, obviously in chess, they're already superhuman, but like, you know, enumerate all your favorite domains, all the domains that you think are important. Will there be AI systems that are better than humans at all of those ones? And it doesn't have to be like massively better,

Starting point is 00:12:45 just like somewhat better than the best is basically what I'm asking. So if they're only a little bit better than humans, what leads to this dramatic acceleration? You know, I mean, if humans got a little bit better at something, we wouldn't think that would lead to, you know, a 10x of GDP or whatever. Well, let's talk about the different types of acceleration. So first, the research acceleration is the one that we've thought the most about. So on the website, we've got this appendix where you can go read the way we did our little

Starting point is 00:13:11 hasty back-of-the-envelope calculations to try to get the numbers that we used in the scenario. And we estimated something like a 25X speed-up for the AI research stuff when you hit the superhuman AI researcher milestone, which is kind of like relevant to this discussion because it's basically like slightly better than the best team in researchers. And the way we get that 25x, we did a couple different methods and then sort of aggregated them. And then we like shaded it downwards just to be conservative. So, well, I guess I won't get into that here. But I'll try to briefly summarize. One way of thinking about it is we pulled researchers at companies, how much slower their research would go if they had less compute. And then we sort of flipped it on its head and thought, well,

Starting point is 00:13:58 like if you're running these AIs at 50 times human speed, or say 30 times human speed, I think, which is the number we actually used, that's kind of like having a company going 30 times faster, but with 1.30th the compute. And so applying the numbers from the pole, we get some sort of slowdown factor of like, well, we're going this much slower

Starting point is 00:14:18 because you have 1.30th the compute. But then we're going 30 times faster because we're running things 30 times faster. That was one of the methods we used. We also use a more like granular method of just thinking through the research process and looking for ways you can be more efficient here and there

Starting point is 00:14:33 and adding up all the multipliers. Anyhow, so that's how we got the 25X number for the overall speed of research. We haven't done a similar analysis for every other field of human endeavor, like, you know, rocketry or plumbing. We just did it for AI research. When it comes to those other things

Starting point is 00:14:50 like rocketry and plumbing, I think I would say the general heuristic that I would use is something like think about the gap between people who are the best in the world at it and people who are merely good professionals. And then look at the shape of the distribution. Oftentimes we see a sort of like heavy-tail distribution

Starting point is 00:15:10 where like most people are around the level of a professional, but then like some really good people are like able to get things done in half the time. And then like some really, really, really good people, the best in the world are able to get things done in like a quarter of the time that it takes a typical professional or something. So you can sort of plot this distribution. And then you can try to speculate about where the sort of limits are. You know, where is that fundamental limit

Starting point is 00:15:34 to how fast you can build new rockets, for example? And the insight that we're relying on is basically that if you have this distribution that looks heavy-tailed, then you should expect that the fundamental limit is not right above the highest data point that you've observed. Like probably it keeps going for a little bit before we're hitting limits. Otherwise, it'd be the strange coincidence that, like, the fundamental limit is, like, right above what we just happened to have observed.

Starting point is 00:15:59 It would be different if the distribution looked different. Like, if the distribution was sort of clumping up, then that would be evidence that we're sort of, like, asymptotting towards some sort of limit. Anyhow, all this to say, the way that we got our headline figure for 2028 in the scenario was looking at historic examples of transformations of economies, such as what happened in wartime, when, like, the U.S. repurposed all their factories to build planes instead of, cars. And so we're imagining something similar where we're imagining a political content. This is another possible disagreement we have with you, is that you're talking a lot about the like the bottlenecks, the frictions, the regulations, the market being cautious to adopt new technology, etc. We're imagining a lot of that still happening, but to a much lesser extent than perhaps you think due to the race dynamics where they were projecting where like

Starting point is 00:16:45 the US is afraid of losing out to China. China's afraid of using out to the US. So the, so the president is basically partnering with AI companies to, like, cut regulations as fast as they can and, like, you know, smooth things over in the special economic zones and so forth. In that environment, we're thinking it's similar to World War II, where they also cut regulations, cut red tape, you know, did this sort of massive pivot from things. So we were thinking basically, like, again, to use the distribution analogy, we were thinking, like, how fast have good humans been able to do this in the past? How much faster are the best humans able to do it compared to, like, typical professional humans.

Starting point is 00:17:21 It seems like there's a decent gap there. It seems like the Elon Musks of the world are able to like, you know, get SpaceX off the ground much better than Boeing and the various like legacy companies. And so then that suggests that if you do have the army of superintelligences that are better than the best humans, you should be able to go faster still than the best humans. And how much faster? Well, maybe another factor of three, maybe another factor of five, something like that. And so instead of taking like a World War II length of time to do this transformation,

Starting point is 00:17:49 mission. Maybe it takes like one-third that or one-fifth that. That was sort of roughly how we got those numbers. But yeah, obviously there's lots of uncertainty. Cool. Thanks. So there's lots of things in there if I can respond to a few of them. So first, let me start with the policy implications. So I think there's a little bit of a self-fulfilling prophecy effect here. Right. So in our view, AI as normal technology is not just a prediction, but a prescription for how we should approach it. We should not approach it as if we're in wartime, and that's for all the reasons that we describe. And my worry is that with AI 2027, it's leading to the very same prescriptions that will lead to the arms race dynamics that you're so concerned about. So, you know, if we think that we have to do this because China is going to do this, it inevitably kind of leads you to that spiral.

Starting point is 00:18:40 I do agree that if it's not just cut regulations, we're talking about civil liberties, right? if we were to suspend civil liberties and suspend democracy in a very real sense and act as if we reoriented the whole economy toward a kind of military production, except here it's improving AI industrial capacity. You know, things could go qualitatively very differently. But why should we? I think it's the only reason we would do that is if we assumed that each side assumed that the other one is going to do that. So that's the first point. The second point is when it comes to the acceleration, this idea that the fastest humans are much faster than the median, and so we can achieve that speed up. That's a little tricky to me. So when we look at past technologies,

Starting point is 00:19:28 you know, when the internet was starting to become a thing, people thought that this was going to lead to a 10x improvement in all kinds of knowledge work, because instead of going to the library to look up a fact, you can look it up instantly on the internet. But that turned out that was only one step in a larger process. Right. So whatever step of a process we're looking at, you know, let's say one surgeon is much faster than another surgeon, but when you look at the entire context of a hospital, that surgeon's sufficiency is only one small part of it, and there are so many other bottlenecks. Same as with the internet. When we remove one bottleneck, other parts of the process become the bottleneck. Within each job, I think the slowest task is going to become the

Starting point is 00:20:08 bottleneck. And for the whole economy, the slowest sector, the least productive sector, is going to become the bottleneck. And we have some citations from economic works that talk about this. And third and last point, I would say, is in our earlier conversation, I think you acknowledge that there might be limits, most tasks being like writing, where AI is only slightly better than the best humans. And so that maybe seems a little bit at odds with the claim that there is this long tail and therefore AI can be far out in the extremes of the distribution, far better than the best human. Thank you.

Starting point is 00:20:42 I guess I'll take them in reverse order. So I agree that it's task-dependent or field-dependent. The way I like to think about it is you keep improving capability, but at some point you sort of asymptote towards whatever the hard limits are in terms of actual effects on the world. And it's going to be different from field to field. So I think in a lot of research-y-type fields, it seems like we have evidence that it's an extremely like fat-heavy-takes,

Starting point is 00:21:08 distribution and the very best researchers are just like orders of magnitude more productive than the typical researchers in terms of actual effects, like in terms of like how much like actual scientific progress they produce. Whereas in other field, like perhaps in plumbing, you know, the world's best plumber is only able to do, you know, twice as fast as like a typical plumber, maybe. I don't actually know that number, but like I wouldn't be surprised if something like that were the case, right? I think that for things like the transformation of the economy, we're sort of taking the zoomed out perspective of like, yeah, that's going to involve a lot of specific things, that will run into bottlenecks,

Starting point is 00:21:39 but then also sometimes you can sort of route around that bottleneck by finding a different way of doing things. It doesn't depend on that. Zooming out and being like, what about this whole transformation task as itself, the thing that we are measuring, well, it seems like there have been humans trying to do something like this in the past. Like the human is in charge of the transformation of the economy during Rollo 2

Starting point is 00:21:59 and stuff like that. Also, similarly, I think economic production, like I think that like what Tesla and SpaceX and Boeing and so forth are doing is kind of similar, where you want to like produce more, cars that are better and that are cheaper than your competitors, but you have to design them and you have to test them and you have to produce them. You have to purchase the relevant equipment and install it and debug it and so forth. And there's bottlenecks every step of the way through that

Starting point is 00:22:22 whole process. And there's ways in which like, I mean, that's why it takes so long is that like your assembly line is not going to work correctly on the first try and you're going to need to like purchase new equipment and like integrate it and so forth. But you can sort of zoom out of all of that and be like, okay, so how much better is SpaceX at doing that sort of thing compared to Boeing? And the answer is, like, a factor of five or something. Like, they seem to be able to, like, actually make progress in the aggregate, like, five times faster or something. So similarly, we're thinking that once you have the superintelligences, and you've given them the legal authority and, like, the money and the financing and the support

Starting point is 00:22:57 to basically be in charge of all of these factories and to be calling the shots, we're imagining a similar sort of gap between that situation and, you know, SpaceX being in charge as we see between SpaceX being in charge and like Boeing being in charge, you know. And that's where we got the sort of one-year takeoff, one-year transformation sort of thing. If I can also talk about some of the other things we're mentioning, so the self-fulfilling prophecy thing is like, it gets me to the core because I don't want to create this self-fulfilling prophecy.

Starting point is 00:23:28 And this is actually my number one concern about AI-2020. in the ways, like, like, if it turns out that I regret this on my deathbed, it'll probably be because of this. I understand that, like, these companies benefit from hype about their products and that, like, they have an incentive to hype it all up so that they can get more investors and so that they can, like, get the government to, like, wave red tape for them and stuff like that. However, I think that, like, well, this is what I think is going to happen, and it's what the companies are gunning for. Like, like, AI-227 is not that new. to the people who've been at Anthropic and Openia and GDM,

Starting point is 00:24:04 it's sort of like what they're aiming for and what they expect. And like, I think that they're going to get it if we don't raise up a storm about it. Like, like, I think that we can, I don't know.

Starting point is 00:24:17 Like, I think it would be different if, like, we were the only people. I think, I think basically, like, I'm torn about this, but I, but I've, like, made the,

Starting point is 00:24:24 made the gamble that, like, since it seems like this is where we're headed anyway, we need to, like, talk about it. and raise awareness about it and hopefully use that as steering towards something better rather than sort of just hoping this doesn't happen

Starting point is 00:24:39 because this is what the companies are trying to make happen and they are very powerful and they have lots of money and lobbyists and so forth. And so if I don't tell the president now, like what about China? You know, like if I don't raise these concerns,

Starting point is 00:24:51 the company is totally will a year from now, you know? And like when they have automated the AI research, they're totally going to be like saying, oh, you need to cut the red tape for us and you need to like, I don't know. So I guess I'm making the gamble that like AI 227 has a relatively small effect on the self-fulfilling prophecy direction

Starting point is 00:25:10 and that it has a bigger positive effect in the sort of like getting people to wake up and pay attention and then hopefully steer things in a better direction. But I'm not entirely confident that that's true and I sure hope it is. Yeah, I definitely agree that that's something we should all hope for. And just to put a little bit more clarity on perhaps where we agree and disagree. And this gets to some of the other.

Starting point is 00:25:31 items in the outline that Ryan and David shared. I think for us, if some of the things that I think you kind of take as inevitable happens, which is that there will be this capability of improvement, and then we will have kind of no choice but to put AI in charge of the factories, maybe give it legal authority. And for us, and then they're kind of red line as AI owning wealth and so forth. If all those things happen, to me, we've already lost. We've massively lost from a policy perspective. Even if some catastrophic or existential event doesn't happen, this is all, you know, not the kind of thing we want to be doing in a democracy, not the kind of thing that's compatible with civil liberties. So for us, those are where a lot of the intervention points are, as opposed

Starting point is 00:26:18 to, you know, alignment, international treaties and so forth. Those concerns only come up at a stage where for us, a lot of mists have already been made. So that's perhaps one point of difference. Another one is, oh, sorry, Ryan. Yeah, I agreed. And I'm wondering if we could zoom in on that for a second, then get back to kind of the second. But this is something I read very much in your work, Arvinida, and I wanted to get Daniel to really respond to this.

Starting point is 00:26:43 So throughout your work and kind of the case for this being normal tech, you make this distinction between capability and power, and you sort of view those as decoupled things, right? So you say this, capability is intrinsic, power is the permissions that we grant the AI, right? So we can keep power away from the capability of AI's and do things like maybe don't give it legal status, don't give it the ability to collect wealth, don't give it the ability to lobby governments, right? And you're imagining that we can control power and separate that from capability. I have actually not heard Daniel's take

Starting point is 00:27:21 on the decoupling of capability and power. And so I'd like to hear that, but I'm wondering if Daniel thinks that you can't really decouple these things. Capability leads to power. And so, Arvin, maybe we could get Daniel to respond there on your first point. So Daniel, this idea of capability and power being separate, what do you think about that? I agree that they are separable in principle. Like, there are different things for the reasons that Arvins has said. I do think that in practice, they're going to tend to come together for reasons illustrated in a 20227. I mean, I think just concretely, if you let the companies automate their research and get super intelligences, I mean, like Irvin said, you don't want to be in that situation in the first.

Starting point is 00:28:03 Like, if you get to that position, maybe you've already messed up because now they're sitting on this incredibly capable, incredibly valuable thing, and they're going to be able to credibly lobby the government and say things like China's building their own version and China is going to let it out into the economy and have it transform things and, you know, rapidly build up military capabilities and so forth. And so if you don't want to lose to China, then you have to let us do it over here on our side of the Pacific. Like this is a prediction about not the technology, but about the social dynamics. We are predicting politically that that's what the companies are going to be saying, and that's what the government is going to agree to, at least by default. And so that

Starting point is 00:28:40 means they will end up with the power once shortly after they have the capabilities, if that makes sense. And to what Irvin, you were saying earlier, like, I think, yeah, like in a sense, I agree with you that like if you're relying on alignment, you've already messed up as a civilization. Like, if you're getting to the point where you're like, yeah, we're going to have the army of super intelligences and it's going to be like autonomously doing all this stuff in the economy and the government's going to be supporting it rather than like rather than trying to fight it. And so that's why we have to make sure that it has the right values because otherwise it could go very badly for us. Like in some sense, you've already taken a ton of risk by the time you're putting yourself in that point

Starting point is 00:29:18 and you've sort of played the board game into like a very difficult to win position. I think I agree with you there. And here's maybe the thing to say is like A.227 is a prediction. It's not a recommendation. Like this is not what we think should happen. Yeah.

Starting point is 00:29:34 Yeah. I totally agree that the AI companies are going to be lobbying for all kinds of things that again, I want to keep using this language, are incompatible with a democracy. And so I think that's the thing we should be trying to stop and we should not in any way take it for granted. And I don't think

Starting point is 00:29:52 that AI companies are as powerful as we make them out to be. When we look at the numbers, especially in the U.S., on the extent to which people are apprehensive about AI as opposed to excited about AI, I think it's very clear that if the companies make a play for this kind of, you know, authoritarian partnership between big tech and the government, we will be able to mobilize the sufficient public backlash to that and we should build the infrastructure for potentially doing that now. Those are some of the things we should be focusing on. And for me, this difference between capability and power is not even primarily a matter of the AI companies themselves.

Starting point is 00:30:30 So we make this really big distinction between development and deployment of AI. So the entities that are going to be in charge of those limits on power are going to be the everyday organizations, you know, schools and educational institutions. government agencies, banks, you know, hospitals, legal firms, every other firm in the economy who are going to be using AI in their various sectors in ways that are either with oversight and compatible with the principles we want, or are going to be handing over crazy amounts of power to AI systems in the pursuit of, we think, small increases in efficiency. And once again, you know, it doesn't matter if the AI companies themselves.

Starting point is 00:31:16 themselves are hyping things up and are power hungry, we can ensure that the broad base of organizations and institutions in our society don't fall for this hype and abide by the principles that we think are important. I would like to try and take a stab at really defining the line that separates you to because there's a lot of agreeing going on about many, many things. And I can't quite put a pin in exactly where I feel there is disagreement. On Daniel's side of things, it feels like there is this, like, demon in a Pandora's box that we need to not allow out of the box. And it is this truth, this object that's there that we need to make sure is contained. And I think on Arvin's side of things, there's just declining of the existence

Starting point is 00:32:03 of the demon in the first place. And that's an okay way of articulating where this line is. But, like, Arvin's fundamental premise, I think, is like, this isn't a demon. inside of a Pandora's box. This is a regular Web 2 tech company. This is the next iteration of Facebook, meta, Twitter, Instagram, just now in the AI age. And they're going to have power and control and influence like we've seen from all the current cohort

Starting point is 00:32:29 of AI companies. But it's not fundamentally different from any of the power that we've seen, like Mark Zuckerberg, a crew. And then Daniel is like, no, this is an organism that exists, that it will have life. It will have a mind of its own. It will escape. And that will change the face of Earth,

Starting point is 00:32:46 forever. And these are our two extremes. And so it's easy for me to, like, identify when they're polar opposites. But I'm still struggling to find this, like, line where you guys actually can't cross each other's, cross into others' territory. Well, it sounds like, Arvind, perhaps you just don't think that there's going to be superintelligence that soon. And you also separately think that even if there was, it wouldn't transform the economy that fast. Is that correct? Yeah, I agree. And there we go. That's the answer. Again, I mean, I'm, I'm, I'm skeptical. of superintelligence ever existing for some definitions of superintelligence. And the reason why AI is slightly outperforming humans at most tasks to me is not superintelligence is that there's

Starting point is 00:33:28 kind of inversion of cause and effect here in the sense that I think if we ensure our policies and practices are such that humans remain in control and use AI as tools, then those increases and capabilities are going to be effectively increasing what Zayash and I think of as human intelligence, as opposed to an unated, technologically unated biological human versus AI, which we think is the wrong comparison in the first place. So, Daniel, where is Arvin wrong here? So the definition of superintelligence that I was describing in this conversation is not slightly better than humans at most tasks, but rather slightly better than the best humans at all relevant tasks. And I do think that if you achieve that milestone, then things are going to be quite transformative. I think that

Starting point is 00:34:14 in particular, one of the relevant tasks is operating autonomously as an agent in the world, you know, doing whatever it takes to steer towards your goals. And that's where the analogy, that's where, I mean, I didn't, I think it's your words, not mine to say it's like this alien demon or whatever. Those are my words. But, like, that's why that analogy is relevant is that, like, that something that meets the definition of superintelligence as I've described it is a agent. It's, it's not just a tool is something that can instead operate fully autonomously in the world and, like, you know, learn as it goes and pursue its goals and so forth. And in fact, is better at doing that than the best humans at that skill, you know? And so it is like an agent that is, you know, comparable to humans

Starting point is 00:34:56 in that sense and in fact, it's superior to humans in that sense. And I think that even if we, even if we keep it capped at just slightly better than the best humans at everything, already that would be quite enough to cause this sort of analogy to be correct. Why? Why? Why? Why? Well, one way of thinking about it is think about the, it would be sort of like an alien species landing that was like basically like humans except just a bit better and not just better than like the average human but better than the best humans, you know? If this alien species landed on Earth, even if they weren't like dramatically better at anything, but if they're just a little bit better than the very best humans at everything and there's a million of them, already that would be like, whoa, like they're going to create like a colony somewhere. It's going to be like a really economically productive colony. It's going to be probably militarily powerful, too. And then you realize like, oh, wow, like their population doubles every six months.

Starting point is 00:35:46 What? Like, whoa, let's play that forward a bit. And in a few years, they'll have more population than humanity. Holy cow. You know, like, like that's sort of what you get. That's the sort of thing that you'd imagine. Basically, like, if it was literally aliens with like, you know, tentacles landing on Earth and they had that level of capability and that population doubling time,

Starting point is 00:36:07 it would be like, wow, like, in a couple years, they're going to be a competitor species to humanity. And then when you add in the factor that, like, actually, I think they're going to be not just like a little bit better than the best humans at everything, but like that as like a floor, but then also much better than humans at many things, you know, then that like only strengthens it. And then I think an additional thing as well is that like this whole like political angle too is that like I think if aliens landed on Earth, I think there'd be a natural coalition of like humans against the aliens. you know, and people would sort of be naturally hesitant to, like, put the aliens in charge of the U.S. military effectively and to, like, you know, basically have them run all sorts of companies and things like that. Although actually, now that I say that historically in colonialism, analogous things happened all the time, so whatever. But, but like with AIs, it's like, well, I think that the companies are going to tell everybody and tell themselves that, like, actually they're under control and that, like, the AIs are just doing what they want them, what the humans want them to do. This is so different. The depiction of aliens landing on Earth being super intelligent, kind of doing whatever they want, is so different, Arvin, than normal tech. Like, help us square this.

Starting point is 00:37:19 Sure. I mean, that's very helpful to me. It helps me pinpoint exactly where I disagree. I think this alien analogy, population doubling, et cetera, all of that is a choice. And I think it's a choice that we will not make. We don't have to make, at least. Wait, Arvin, so you can see it's possible. You can see it's possible, but you think.

Starting point is 00:37:37 that humans can essentially opt out of that to not let it happen? It's exactly the capability is not equal to PowerPoint. So we're talking about capabilities, again, that I think are going to be slightly better than unaided humans and equal to humans operating with the help of AI. And in that scenario, it's entirely a choice to say we're going to treat this as an alien colony. And so what that means is that not just let one AI system operate in the world unsupervised, but let entire collaborators of AI systems operate unsupervised, you know, give them this power, give them control over resources. And more than that, the humans versus AI framing to me, I think, is a deeply dangerous one, dangerous in the sense that it will bring about a lot of

Starting point is 00:38:23 the safety concerns that we are worried about. And the reason for that is, when you talk about it as humans versus AI, you're assuming that all of these AI systems are going to act toward the same goal in collusion with each other. But in fact, we should be designing all our AI systems so that the best defenses against a malfunctioning AI system is another AI system. So we should never get to the point where it's even human versus AI. Arvin, with my metaphor of like there's this demon in the Pandora's box, this demon is super sentient AI that's very powerful. And if it escapes, then it's game over. I think the thing that you contest is this notion that this Pandora's box is confusing and uncontrollable to humans. And I think what you would say is like, actually, no,

Starting point is 00:39:08 it's a totally reasonable box that we can think about and discuss and reason about. And we do have control over that. And I think Daniel would say it's like, we actually have less control over the nature of the box that constrains AI than we think that we do. Would you agree with that characterization? Yes. So, sorry, let me try to figure out what this analogy actually means in terms of the technical specifics that I'm more used to thinking about. So, yeah, I mean, there's a sense in which AI is unexplainable and uninterpretable. And sorry to take this to a more technical discussion, but it helps me make sure that I'm not misunderstanding the analogy

Starting point is 00:39:48 and misrepresenting my own views because of that. And there are concerns like deceptive alignment. AI might pretend to be aligned when we're testing it, but then when it's deployed, it might act very differently. But what we're seeing is that, yes, those concerns are already coming up with, even with, you know, current far below super intelligent AI systems. We don't even have to wait for the future. But actually, our best defenses against that are other AI tools in order to tease out this kind of behavior. Right. And so even if we're not the ones building the demon, if you will, neuron by neuron, you know, it's emerging out of the training process. We are in control of the box. and we are in control of the box through the help of other AI systems, and we can ensure the balance of power stays that way, and we can ensure that as AI capabilities improve,

Starting point is 00:40:37 it's making not just the demon more powerful, but the box itself more powerful. So that's one way of bringing it back to your analogy. Daniel, what do you think when you hear that? A couple things. First of all, A.22027, this is going back to something you said earlier. Air 2027 doesn't actually depict the AI as escaping. So, I mean, I think that could be, could happen.

Starting point is 00:40:58 AIs could escape, but also that's not actually what I was predicting. I want to distinguish between two risks, which I just want to put on the table so that for future parts of this discussion, one is the loss of control stuff, which is like, what if the AIs are misaligned, what if they pretend to be aligned, that sort of thing. But then entirely separate from that, even if you're not worried about that at all, there's the concentration of power stuff of like who's in charge of all the AIs, you know, and who gets to tell them what to do, and who gets to put values into them, you know? And, okay, having put those on the table, now to answer your question, I think control research is great.

Starting point is 00:41:32 And it sounds like that's kind of what Arvin's talking about, where basically you should build your system on the assumption that an individual AI system might not be trustworthy. And you should in fact assume that it's untrustworthy and that it's, you know, trying to kill you if it can. And then you should build a system of checks and balances and security permissions and so forth so that it can't. You know, that seems like a reasonable way to design things. And so there's this whole like sub-literature, which sort of grew out of the alignment literature called AI control, which is people working on like that aspect of things. And I think that's great. And I think that's important.

Starting point is 00:42:10 And there I would just be like, well, the companies aren't going to do this and they're not going to do it well enough and they're going to lose control, is my prediction. And I think that like in practice, like, I think it's possible in principle to have this whole system set up. But it's just not like the companies aren't really investing that hard. into making sure that this whole system of checks and balances is ready and has been debugged and has been like robustified to superintelligence or to anything close to that level of intelligence by the time we have those things.

Starting point is 00:42:36 And instead I think that, well, it's kind of like it parallels the whole alignment thing. Like with alignment, there's all sorts of techniques that you can use that maybe will result in more aligned AIs, things like scalable oversight and like interpretability and so forth. But then like the important thing to ask is like, okay, let's like what techniques are we actually going to use in practice and how confident are we that they're actually going to work in practice. And similarly with control, I'd be like, yes, impossible you can have this whole system of checks and balances. But like in practice, during the intelligence explosion, when the companies are automating all their research and so forth, I don't expect them to like invest that heavily

Starting point is 00:43:12 in this compared to what they would need to do. So can I say one thing to this before you move on to the next point? So this actually highlights another big worldview differences between us. Because for us, the role of the, what you call AI companies, we call model developers, is much, much less than what it is for you. It doesn't matter to us what the AI companies do or don't do. A lot of this research is happening anyway. It's happening in academia. We're actually compiling a big list of the many and varied kinds of AI control methods. So that's one thing.

Starting point is 00:43:44 The companies are not that powerful. And secondly, for us, these control methods have to have to. happen closer to the deployment point. So it's going to be the law firms and the hospitals who are finding these ways of AI control that make sense for their own industries and their own deployment scenarios. So that's another reason. It doesn't matter that much what Open AI does or doesn't do. And to give just one practical example of that, even in the simplest case of deployment of a relatively really innocuous kind of bot compared to the kinds of things we're talking about, we see many cases in the real world of exactly how hard it is for deployers to get away with deploying AI systems without

Starting point is 00:44:26 oversight if they're not innovating on these kinds of control methods. And great example is, I think, from last year, Air Canada deployed a customer service chatbot. Again, super, super innocuous thing. You know, it didn't have abilities to actually book people's flights or anything like that, just answer questions. But this bot was not very accurate, as you might expect, and gave people incorrect information about loyalty programs or something like that. And the customer lost money because of that.

Starting point is 00:44:55 They actually sued the airline. And Canada's courts concluded that the airline is actually responsible to make whole the monetary losses that the customer suffered. So this kind of thing is not an isolated example, has been such a serious liability issue for companies that even deploying, again, the simplest, dumbest thing, like a chatbot, has been going so slowly. Right.

Starting point is 00:45:17 And therefore, our prediction, and this is consistent with the observation so far, is that deployers, if they start to do anything meaningful with these systems, are going to face such serious liability risks, even under existing law, which I think should be buttressed by future AI regulation, that they're going to be forced to innovate on control methods before they get anywhere with serious deployments. And one example of such innovation. Some of it is technical innovation, but a lot of it is going to have to be non-technical.

Starting point is 00:45:45 there is a startup now that's offering insurance for exactly this kind of mistake that might be made by AI chatbot. So for us, insurance is an interesting AI control innovation. So those are the kinds of things that are going to be needed and that are actually happening on a pretty massive scale. People don't necessarily seem to realize how much because, as Daniel said, control originally started as a subfield of like AI alignment, but it's not anymore. If you look at all the control stuff that's happening in various disciplines like

Starting point is 00:46:12 human computer interaction and computer security and in the industry, It's actually bigger than alignment, and I think that story deserves to be told, and we hope to do that in a follow-up to our paper. Daniel, Air Canada chatbot, like, you know, it does seem like Arvin is saying every time something goes wrong with AI, basically our existing institutions will hold, you know, slap it back into place. We'll keep the alien species in the zoo if we want to use analogies and metaphors and have it live there. Like, why is that so wrong? I mean, that is what we've seen in the past. I know earlier in the conversation you were saying, well, of course, things are going to look like 2025 all the way till 2027, and then something happens.

Starting point is 00:46:53 And then we get this AGI moment. That's maybe another subject for discussion if we want to get into it. It seems very much like Arvind doesn't think the AGI milestone matters at all. Yeah, Arvin, you've said this. AGI is not a milestone. There's no sharp capability threshold that yields instant impact. Whereas Daniel, that seems to be part of the premise of AI 2027, right? The AGI line, March 27, we hit it, we cross it, and we start the intelligence explosion clock.

Starting point is 00:47:24 I mean, I forget if we actually use the term AGI somewhere in the text of A. 2nd27, but we basically don't. Like, I think instead we break it down into different milestones, and then we say it's continuous progress that we're just going to summarize as this series of milestones. The first milestone being automated coder, second milestone being automated AI researcher, eventually you get to super intelligence. But we agree it's continuous progress. There won't be any particular sharp, bright lines. And we depict that progress happening over the course of 2027 in the scenario. First, I want to just focus on understanding Arvind, your position.

Starting point is 00:47:56 So it sounds like you're saying companies are mostly going to be properly cautious about integrating AI into their stuff. And some of them will be incautious, such as O Canada, and then we'll pay for and then the industry will sort of learn from their mistakes. And this is going to result in, A, market pressure on the AI developers to make more aligned and more controlled AIs, but then B, separately, even if they're not aligned and controlled by the developer, like, the industry will sort of control them and, like, only deploy them in ways that, like, they can't actually cause that much harm.

Starting point is 00:48:33 So I'm definitely, there's definitely a huge disagreement with me. So we've identified a crux. I think that this is like kind of true right now. Like I do agree that like right now people are a lot they're rightly cautious about integrating AI into everything and this is creating this sort of like market force. However, as you yourself have said, there's plenty of companies that aren't cautious

Starting point is 00:48:55 and that are integrating things anyway and we'll see how that plays out. The main thing that I want to say though, which is maybe a disagreement, is that I think that once you've got the army of superintelligences, like I'm imagining, let's suppose that things go exactly as AI 2027 predict, which of course isn't going to happen. But suppose it's literally exactly that way.

Starting point is 00:49:12 Then, like, in 2028, Professor Arvind will be writing op-eds about how we shouldn't trust these super intelligences with all this power and responsibility and how they shouldn't be, like, integrated into the economy. And we shouldn't be, like, letting them autonomously run factories and design new types of robots and so forth. But then you'll be shouted down by, like, the president and all these other powerful people who will be saying we have to be China. And, you know, like, if we don't do, like, look at what's happening in China.

Starting point is 00:49:39 In China, they're deploying their AI everywhere. And, like, you know, and, like, it just feels like a political fight that you're going to lose. And I don't, I'm not saying you should lose the fight. Like, I agree, you know, like, it's dangerous to deploy superintelligence everywhere. But I'm just saying, like, politically speaking, I think the point to intervene is before you have the army of superintelligencies, not after it's already been built. And the companies are already, like, using it to lobby the government and so forth. like personally, I'll probably just retire at that point. And I'll probably just like, I'll probably just like officially give up and just like go spend

Starting point is 00:50:12 some time with my family or something because I think it's basically hopeless once you've gotten to like, you know, middle of, once you're at like March of 28 in our scenario, like I feel like it's basically hopeless at that point. Whereas it sounds like you instead have quite a lot of hope and think that like even if you let things get that. Well, okay, but now this contradicts what you're saying. Yeah, tell me what you think. I'm confused about where your position on this is. Sorry, I don't know what contradiction you identified, but I think one crux of disagreement I can point to

Starting point is 00:50:43 is that for us deploying AI with adequate oversight is not this losing business proposition. I think we are innovating in ways that we're able to ensure AI control in ways that are not simply a human in a loop, rubber stamping or second-guessing every decision. A lot of people intuitively think of AI control like that. And if you do, it seems clear that the more efficient thing is to deploy AI autonomously. But for us, those are, that's not the only option. And secondly, again, this whole army of superintelligence is framing. Again, I keep coming back to most tasks are like writing. And outside of certain areas like AI research itself, which is a purely computational problem, and then maybe, you know, purely robotic tasks, most sectors of the economy, again, if you take

Starting point is 00:51:35 law and medicine and things like that, there are such inherent limitations to performance that whatever army you build is only going to give you a slight efficiency improvement. And I think it will be a losing idea for most companies to trade off huge safety risks for those slight efficiency improvements. Well, I definitely also disagree about the slightness. I think that, like, it differs some field to field. And perhaps in some fields, it'll be slight, but I think in many fields, it will be massive, and that will cause this huge profits for the AI companies. And, like, it'll be this crazy thing. Not everywhere. Like, I agree in some fields, it'll be only a slight advantage. But, yeah, I think, trying to give back to the main thing that you were saying,

Starting point is 00:52:15 oh, yeah, you were more optimistic about the control stuff on the technical level. You're basically, like, it's not just going to be, like, trying to put humans in the loop. It'll, like, actually be, it sounds like you're saying that, like, by the time we have, you know, by the time 2027 rolls around, we will have control techniques that allow us to get the benefits of these really smart super intelligences without the risks as long as they're properly applied, basically, like as long as, is that, is that you're saying? Yeah, I mean, our timelines are longer, right? We don't think it's by 2027, but we also don't think we're going to have the potential for, you know, super intelligent AI without oversight giving us massive economic efficiencies by 2027.

Starting point is 00:52:54 Right. But like hypothetically, if the timelines were shorter than you think, and we do get these very powerful AIs by the end of 2027, you would be more optimistic than me that we could deploy them in a way that's sufficiently controlled that they don't need to be aligned in order for it to be safe. Well, I mean, if it is 2027, then we're massively wrong on the timelines, right? Then we would also have to be massively wrong on the timeline of controlled research. A lot that control research is happening, we see that accelerating over the course of the next decade or so. If capability massively accelerates but control research doesn't accelerate, then yeah, then we're in a bad place.

Starting point is 00:53:29 Okay. In that case, that's another nice point of agreement is that, like, I also think that if we had another, like, you know, 15 years before we got to superintelligence by the definition that I described, I would feel much more optimistic in general because I think that things like control research and alignment research would have just generally had a lot more time to cook. And in a bunch of ways, I think the world would be off to a better start if things were happening in 2037 instead of, you know, 2027 or whatever. Yeah. Yeah, so we agree there.

Starting point is 00:53:59 Throughout the course of this discussion, we've been covered in a number of places where the two of you disagree and a few where you degree, where you agree. I'm wondering if you could, you know, maybe start to summarize this for our audiences as we move to a close. And one way to do that is to kind of illustrate, I'm going to move us beyond 2027. Let's move to something more interesting, another date, which is like 2030, let's say, five years from now, okay? And so I'm going to ask you both this question through your lens of prediction in terms of how you're viewing AI and kind of the world and how this all plays out. But basically, what does AI look like by 2030 and what does the world look like? Arvind, first you in 2030. What are we looking at?

Starting point is 00:54:40 Sure. So let's start with the economy. I think some jobs are likely to be radically transformed. perhaps will face massive job displacement. But I think for most jobs, we're going to have task-by-task, gradual automation, along with lots of innovations in how to deploy these AI systems safely. We are likely to have a ratcheting up of the geopolitical competition. But I predict and also hope that we are not going to buy in wholesale to the,

Starting point is 00:55:17 to the arms race dynamics, because look, if I can make a slight digression here, those arms race dynamics are present even without AI. And that's the reason that war has always happened, where each side is fearful of the other side. And, you know, if that's the approach we're going to take to geopolitics, that's a very risky approach, even if you take AI out of the picture, right? So whatever diplomatic methods we've been using throughout the history of, you know, war and geopolitics, to be able to tamp down those tensions and to realize that escalation is not in anyone's best interest,

Starting point is 00:55:54 we have to apply all of that to the current moment as well, perhaps even more skillfully. And again, I'm cautiously optimistic that that's something we will be able to manage. And then finally, predictions in terms of other aspects of AI-related disruption, I think challenges to the education system, challenges to people's notion of identity, challenges to what art means. A lot of these things that are not really about the labor market, those are perhaps going to happen on a shorter time scale. And right now, they have been happening to a greater extent than economic disruption has actually been happening. So I would kind of maybe flip. I don't know if Daniel agrees with this perspective or not.

Starting point is 00:56:39 but the disruptions that we're going to see quicker are not really related to economic efficiencies, but rather people adopting AI in their personal lives in various ways, whether as companions that they form social relationships with or, you know, in the education system, and then you have a crisis of what the heck does college even mean? Are we teaching our kids the meaningful things? Similarly, what does art mean? Even back in 2023, I think Hollywood had to contend with what does this mean for, you know, for movies and for art? And then there was a movement to push back against the use of AI and art. So those changes happen very quickly and those are perhaps going to accelerate by 2030.

Starting point is 00:57:21 But I did not foresee large-scale job replacement, for instance, by 2030. Daniel, how about you? What does AI look like? What does the world look like, 2030? Well, if you go to AI-227.com, which depicts in hundreds of pages of detail, this sort of thing. And to be clear, I actually would guess I'd get on, I've updated towards slightly longer timelines since about end of last year

Starting point is 00:57:46 when we were writing the core story that became Mayah 2027. So now I would aim more towards 2028 instead of 27, but like, it's still basically the same. So, so like the short answer to your question is like, I expect the world to be utterly radically transformed by 2030, probably. I'm not confident.

Starting point is 00:58:03 You can ask me what I would, what I think it would look like if that's not true, you know? And then perhaps I would agree with Arvind that like I think that the I could imagine things being taking longer than I expect. I could imagine the current sort of like wave kind of petering out and resulting in lots of downstream applications

Starting point is 00:58:18 and lots of cool apps that use large language models but but the core dynamo of AI automated AI research just sort of like never really materializing. I could imagine that happening. It's not what I'm betting on though. What does that look like? Can you get for people who haven't read 2027? I know there's a lot more detail there

Starting point is 00:58:36 and there's like multiple paths we might take but like for people who haven't gotten into that detail yet, what does 2030, when you say radically different by 2030, I mean, some people are envisioning, yeah, Daniel, I mean it's radically different after the internet. And then we got, we got phones, smartphones, everything is radically different. This goes back to the start of this whole thing, like normal technology versus not. Yeah. Which, by the way, slight tangent, like, I think that, like, it's going to be normal in some

Starting point is 00:59:00 ways and not normal in other ways, you know, like, nuance. Perhaps you agree with that. But anyhow, if things go the way that AI-20s, you know, I think. 2027 predicts, which I think they probably will by the end of this decade, if not literally in 27, maybe afterwards, maybe before, whatever. If things go roughly like that, then basically it's like, first you get the automated coders, then you get the automated AI researchers, then you get the superintelligences by the definition just described.

Starting point is 00:59:25 Then you get even more powerful AIs that are not just like a little bit better than the best humans, but like maybe only a little bit better in some domains, but like a lot better in other domains. You continue to improve them, make them more efficient, faster, etc. you've got millions of them running, et cetera. They start assisting in the creation of new fabs to produce more chips so that you kind of have. And at that point, it is starting to look like the sort of aliens thing. The analogy becomes relevant because it's sort of like you've got this new species

Starting point is 00:59:52 that's better than humans in a bunch of ways and at least economically is competing with humans, if not militarily. And it's got a population size that's growing really fast because they're producing more chips and more factories and so forth. How long does it take to get to the point where, The next milestone I would mention would be the sort of like fully autonomous AI robot economy. And what that means is that, like, yeah, there'll still be humans around. But like the collection of factories and automated mines and automated this and automated that taken as a whole is self-sustaining.

Starting point is 01:00:26 Like it doesn't, it doesn't like crucially depend on humans for any crucial components of it, such that, like, hypothetically, if all the humans were just sidelined, it could continue to grow as an economy, and it could continue to build more factories and mine more things and do more research and so forth. And my guess is that that happens, like, maybe a bit less than a year after we achieved the superintelligence milestone. Maybe it'll take longer,

Starting point is 01:00:51 but probably by 2030 it will have happened. So probably by 2030, there will be all of these special economic zones full to bursting with all of these newly built mines and factories and chip fabs and data centers. And they might, still be humans working there, but the humans will be gradually being replaced by robots of various kinds. And, like, as a whole, it'll be as if there is, like, this new species that is now,

Starting point is 01:01:16 is now just, like, superior and is sort of self-sustaining. Now, if we're lucky, that new species will itself be sort of subordinate to humans and, like, we'll be following human orders and so forth. And that's also possible. Can I just say one quick thing? Sorry, Danielle, I didn't be to interrupt. I thought you were finished. I mean, so one thing I'll see here is that I think, for a lot of AI technologists, and I'm not necessarily saying for you, Daniel, you know, all of this might happen and that it might go in the direction of kind of either utopia or dystopia, utopia where these AIs are safe and they have provided for all our material needs and we can spend all our time on leisure and then the dystopia is catastrophe. But like, that's only in the

Starting point is 01:01:59 Silicon Valley bubble for the vast majority of people. Both of these possibilities are dystopian. almost no one actually wants this. So whatever the economic arguments are, this is only going to be possible if there is a complete collapse of people's democratic ability to influence the direction of things. Because the public is simply not willing to accept either of the branches of this scenario. Yeah, let me let's talk about that. So have you heard of pause AI? Yeah, of course. Yeah.

Starting point is 01:02:28 So like, I mean, they're basically, I'm not sure if they agree with you, but they at least agree on that. They're basically like, we just shouldn't do this. Why don't we just stop all this from happening? And it sounds like you're saying at least sort of something vaguely similar, that like most people in the world, if it was put to them in a vote, would basically be like, how about we just don't build all this crazy robot stuff? Is that right you're just saying? Well, I mean, how about we don't use taxpayer money to massively subsidize AI companies

Starting point is 01:02:54 and create special economic zones, right? So there's a big asymmetry here between intervening in the default course of action and pausing things versus, affirmatively, you know, having government work towards this future by doing all the enabling things that I think you agree needs to happen before this can be realized. Yeah, so excellent. So zooming in on that a little bit, this gets back to what I was saying earlier about the like, when will I just give up?

Starting point is 01:03:19 And I'm saying, like, politically speaking, we could imagine like an alternate version of AI-227 in which the government basically is immune to the lobbying from the companies and is like, no, of course we're not going to make special economic zones for you. Yes, we of course we're going to regulate you. Blah, blah, blah, blah, blah. Like, you can imagine that sort of alternate trajectory. And then it would take more than a year to transform the economy, you know? And it would, like, all that sort of stuff would be spread out and much more gradual. However, just as a matter of, like, politics, I think it's, like, going to be really tough to, like, win that political fight. If you let it get to the point where there's already this crazy race to superintelligence between,

Starting point is 01:04:01 U.S. and Chinese companies, and they've already built superintelligence, and they're already like networked heavily with the government, and like the government has been helping them with security to beat China. Like, like, at that point, I think it's just like very likely that the government will be like, yes, make the special economic zones, yes, we'll cut the red tape, yes, we'll subsidize you. You know, like, like, I kind of am expecting the government to be captured basically by the companies at that point, possibly literally with the help of the AIs. After all, If they're super intelligent, that means they're better at lobbying than the best lobbyists, and they're better at, like, you know, persuasion, they're better, like, that certainly can't hurt, you know.

Starting point is 01:04:38 I feel like every time I interact with chat GPT-O-3, right, it's persuading me as something. I've just got a lot of good ideas. I mean, you know, it could be bottom up like that. Well, whatever. The point is that I'm hoping to intervene before then. Like, I'm hoping to, like, not let it get to the point where they've already got the superintelligences and then we're asking the government to, like, make this be a slow transition, you know? Yeah, yeah.

Starting point is 01:04:59 Yeah, I totally agree that if even before Super Intelligence, if we get to the point where national policy is oriented around the race to AGI, then, yeah, I will retire at that point. I'm glad you guys are both not retired yet and able to give us some insights today. Thank you so much. I think the audience will benefit from this. The entire AI community will as well. Maybe I want to just end with this kind of like bonus question because this has been in the back of my mind.

Starting point is 01:05:28 I can't really figure this out. I mean, both of you are genuine, informed, intelligent, earnest people, right? Like, I very much believe that about the two of you and many in the space. And what I am perplexed, this is kind of the meta question going to this debate, is like, how is there such wide variance of opinion here? Like among informed, intelligent people. On your side, Arvin, you're like, hey, this is normal tech. We've been through this before.

Starting point is 01:05:57 We've seen it. on Daniel's side, he's like, this is a new alien species that's going to be super intelligent. We've just unleashed it on the world. How is there such a variance? I don't think I've ever seen a debate like this in anything I've ever done where you have like all of these very smart informed people reasoning about this and coming up with widely different conclusions. And the different P. Dooms and P. Utopias range from like literally Yon Lacoon, 0.01% all the way to

Starting point is 01:06:27 but L.E. Z. Yucowski and beyond. It's, like, going to happen. It's 100%. I don't understand this. Have you guys been able to make sense of this? It's not so surprising for me. I was trained as an academic philosopher, and there's a saying that, like, for all P, there was a philosopher who says P

Starting point is 01:06:43 and a philosopher who says not P. So, like, it's, I think that, like, it's actually just kind of normal for human discourse for this to be the case, that there's, like, intelligent, smart people who disagree with each other. when the stakes are this high, Daniel? Why would the stakes being high change that or something? Like, it's the reason why the philosophers disagree is not because, well, they also don't think

Starting point is 01:07:07 the stakes are that high. But, like, I don't think that, like, charging them up with lots of emotion would, like, suddenly make them sing kumbaya. You know? Fair point. Yeah, if I can also take a tab at this, we actually try to say a few words about this in the concluding section of the paper. We talk about what a worldview is and why worldviews can't be so different from each other. One, I think different predictions about the future arise from different interpretations of the present. So I think that's one big factor. So fundamental questions like, is AI currently being adopted rapidly and is it faster than previous waves of tech adoption like PCs when they first came out in the 70s or whatever? People disagree radically on that, right? So even on the things that

Starting point is 01:07:51 are in principle, empirically testable today, there is a big disagreement. And that's at least disagreement that you can potentially, you know, minimize by looking more carefully at the data, talking to each other, and so forth. But then when you get to really stuff about the future, there are assumptions that I think are essential to even start to think about these questions and those assumptions are different. A bigger one is our epistemic tools are different. So what weight should we put on which kinds of historical analogies? What is the role of probability estimation, and various other epistemic tools. And then values also, I think, start to play into this.

Starting point is 01:08:31 And all of these factors kind of reinforce each other. And people are in different epistemic communities. I think a couple of observers have talked about how these are kind of West Coast versus East Coast views. And for me, you know, they're, I mean, I was in Silicon Valley for three and a half years. I was doing startups in a literal reason why I wanted both physical and intellectual distance from that community was because my modes of thinking were different and I felt more

Starting point is 01:08:56 at home here. And so we kind of segregate ourselves into physical and intellectual communities based on our assumptions, values, epistemic tools, and so forth. And so it's not, I think, too surprising that you should get divergences as a result of all of that. Well, Arvin, Daniel, thank you so much for airing these divergences publicly so we can all come to our own conclusions. Whatever happens next, it's going to be interesting. Let's just say that. We appreciate you coming on. Yeah, thanks. Good luck, guys. Take care.

Limitless: An AI Podcast - AI DEBATE: Runaway Superintelligence or Normal Technology? | Daniel Kokotajlo vs Arvind Narayanan

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.