Hard Fork - ‘Hard Fork’ Live, Part 3: Differing Visions of an A.I. Future

Starting point is 00:00:00 This episode of Hard Fork is brought to you by our Hard Fork Live 26 sponsors. Premier sponsor, IBM, Associate Sponsors Everpure, Pure, Pure Leaf, and the University of Notre Dame, and supporting sponsor at Lassian. Well, Casey, we are still on our annual summer vacation, and can you believe there is yet more amazing stuff from Hard Fork Live that we have not shared with our podcast listeners. There is, in particular, we had a really fun to. discussion at the event between Daniel Kokatelo and Syesh Kapoor, who have somewhat different views

Starting point is 00:00:53 of how fast the AI conversation is going to go. We've heard them debate before. We wanted to sort of have an updated discussion with them now that it's been, like, you know, getting close to a year since the last time they had it. So I think you'll really enjoy hearing what they have to say about that. We also had the great podcaster, Dwarkesh Patel, stopped by and hanging out with a bit. Tell us a little bit about what is on his mind. And just to round it out, we took some live Q&A and heard what was on the minds of our audience after a spectacular hard forklift live too. So these are all conversations that I would classify in sort of the same bucket of like insider sense making, people who are deeply enmeshed in the AI scene in San Francisco, trying to

Starting point is 00:01:31 understand and explain what is going on, the pace of progress, the trajectory of these models to the outside world. And Syash Daniel and Dwar cash are among the three most gifted people I have ever heard, try to explain this stuff to an outside world that doesn't always know exactly what's going on. It's a great set of conversations. We think you'll really enjoy it. This is our final installment of our episodes from Hard Forklive 2. We will be back in two weeks to our regularly scheduled hard fork programming. In the meantime, enjoy your summer. Wear your sunscreen. So this next segment I am so excited for because we're going to have a conversation with two

Starting point is 00:02:20 people who have very different views about how AI is going right now. Yes, we have Daniel Kokatelo with us tonight. He is the co-author of AI 2027, a report that many of you, I'm sure, have read. This came out in 2025 and laid out a vivid scenario or account of how AI could fundamentally upend the world, achieving tasks like autonomous coding and R&D. He's since updated that prediction a few times. We'll ask him about that. And he'll be joined by Syash Kapoor, who is an AI researcher at Princeton, with a very different view of the future. He's the co-author of AI as normal technology, which looks at evidence that AI is much like previous technologies

Starting point is 00:02:58 that have upended the economy that take a long time to diffuse through society. We've invited them both here tonight because we saw last year a very interesting debate that the two of them had at an AI conference called The Curve. We thought it was so interesting that we decided to bring them back tonight and hear how their views have evolved since then and where they continue to disagree and where they might agree now. So please give a warm welcome to Daniel Kokatelo and Saas Kapoor.

Starting point is 00:03:30 Hey, Daniel. Hey, Syash. All right. So Daniel, Kevin mentioned this up top. You have updated your timelines a few times since you first published AI 2027. Give us the most up-to-date view of your thinking. What's your best estimate for when we will achieve AI models that can do their own AI R&D? Probably 50% by late 2028.

Starting point is 00:03:54 Okay. That's soon. I'm thinking about the calendar. That's two years. Yeah, that's like a little bit later than Anthropic expects, I think. Things take longer than you plan for, you know. Which is a point that Syash makes sometimes. Syash, can you summarize where your views are today?

Starting point is 00:04:13 My sense is that you do not believe in the sort of sudden takeoff scenario that some other observers believe in. That's exactly right. I think the main reason for that is basically this is just a situation. The agreement boils down to whether the bottlenecks to this intelligence explosion, the bottlenecks to automating R&D are all computational or whether they rely on real-world bottlenecks that will be really hard to automate away. So I guess this is one place where we disagree. I think that in a lot of domains, making these advances won't be as easy as it had been encoding. And to really get to sort of artificial superintelligence, you need to cover all of these different domains. You need sample efficiency across the board, which is much easier to do in a field like programming very.

Starting point is 00:04:54 you have these simulators, these virtual environments, but much harder to do in the real world. And, you know, some evidence bears this out. Adoption of AI systems has indeed far slower in other domains as opposed to coding. So give us an example of what these bottlenecks are. I talk to a lot of AI research for folks. And to them, at least the way they make it sound to me is, look, eventually the model just gets good enough and then it's game over. You're saying that there's something that exists called a real world, and I'd like to hear more about it. I mean, look, I mean, to be honest, I think these are just two independent, self-consistent world views about the future of AI.

Starting point is 00:05:30 And the reason that Daniel and I have had such productive conversations is that we are basically trying to figure out where these worldviews differ. Now, speaking of, I think, like Daniel's actions and Daniel's predictions are entirely self-consistent with the worldview that we'll get to AI systems at this point. Unfortunately, in order to get evidence we're going one way or the other, we need to actually carry out lots of evaluations. We need evaluations to be of a much higher standard than we have today. So to give you one example of a bottleneck, the other day I was talking to a lawyer friend of mine, and he uses these tools, he's very bullish about them.

Starting point is 00:06:05 But what has turned out to be the case is, as he started using these tools for bigger and bigger tasks, the rate of hallucinations, the rate of unreliable outputs, has sort of remained the same, right? It's not because the AI systems haven't gotten better. They indeed have. They are so much better today than they were. just a year ago. But the fact is that the tasks that you can do with these systems actually

Starting point is 00:06:25 are bounded by the rate of hallucinations or the reliability. And that's one place where AI systems continue to struggle. And in a domain like software engineering where you have this instant feedback where you can actually run the code and see what the output would be, it's a much easier bottleneck to address as opposed to something like the law where even the right answer is not obvious to a domain expert. Domain experts can reasonably differ in the approach that they take. So this is just one example of a bottleneck in a domain where the right answer can be a bit more subjective than encoding. Daniel, I think when AI 2027 first came out, there were some people who dismissed it as sort of

Starting point is 00:06:59 speculation or scary science fiction was a term that some people were throwing around a lot. I reported on this. I talked to you and your co-authors then. I know that you grounded this in like real like forecasting work, like months of trying to figure out what would happen as the technology got better. And I will say like a lot of that has come true already. So you predicted in your AI 2027 that we would start to see large parts of coding become automated. That much has come true.

Starting point is 00:07:26 I was reading today. Someone was copying and pasting something that you had written about Frontier Labs, sort of restricting the use of their models for Frontier LLM development, something that has happened this week with Claude Fable. So what are the things that you think will happen if your scenario continues to mostly hold for, let's call it the rest of 2026? What are we going to see this year? So we're not going to see an intelligence explosion this year in the scenario that happens next year.

Starting point is 00:07:55 That's close. So that's nice. I think... Intelligence explosion being recursive self-improvement leading to sort of out-of-control, runaway superhuman AI. Yeah, or to put it another way, just fully automating the AI research process, causing AI research to happen even faster than it currently happens. And it's currently happening at a very fast rate compared to many other technologies. But yeah, I would say the coding agents are just going to get better and better, and that maybe a year from now, maybe two years from now,

Starting point is 00:08:25 they will be good enough that you can sort of say they've automated coding fully. They haven't fully automated coding yet, but maybe in a year or two they'll have fully automated coding, at which point the bottleneck will be research taste and management and all the other aspects of the AI research process besides the actual coding. And then the companies are going to turn towards resolving those bottlenecks and teaching their AIs to do those skills as well. And that's going to take some time, but it's going to go by faster than you might think when all the coding has been automated. Once they've finished doing those things, they won't have super intelligence immediately.

Starting point is 00:09:03 The first AI system that can do the complete AI research process probably won't be able to do various other things. But once they've fully automated the AI research process, things will probably go faster and faster. and then the type of system that can do absolutely everything is probably not far off. Sayash, do you believe this sort of recursive self-improvement is possible? I mean, in some sense, I think the process of recursive self-improvement started like six decades ago. In fact, the entire history of computing has been one

Starting point is 00:09:32 where we develop tools that then aid us in the development of better tools. We've developed compilers that have allowed us to be like two orders of magnitude better at programming. We've developed frameworks on top of that. we've developed entire systems and libraries that allows to do things that would, frankly, take an experience of software engineer years or decades of time if they were using assembly language. So I think in some sense, this loop has been kick-started already. This loop is something that the entire history of computing bears out.

Starting point is 00:09:59 What I disagree with in terms of, like Daniel's predictions, is whether this process will naturally lead us to a point where we develop the automated AI R&D researcher, or whether humans will continue to have this edge and teams of humans with AI will continue to outperform AI alone and whether this process will lead to artificial superintelligence. I actually think that it's a very plausible scenario for me that we get this sort of recursive self-improvement that AI systems do indeed continue performing better and better at AI research tasks. But the end process of that need not be ASI. The end process of could just be far more capable models than we have they are perhaps following the trend of previous technologies,

Starting point is 00:10:39 and yet not the point where we have, these systems that outperform humans, the top human experts on everything, which is, I believe, the definition of ASI. Perhaps you should talk about the point of agreement. Yeah, yeah. That's the point of agreement. Yeah, so we wrote this blog post together,

Starting point is 00:10:54 the authors of AI as a normal technology and AI 2027, where we talked about the things that we agree on. And correct me if I'm misstating it, but roughly speaking, we talk about what you might call strong AGI or like humans in the cloud, like AIs that can sort of do all the cognitive tasks or the tasks you can do at your computer as well as professional humans or as well as the best professional humans. And I guess the headline is, I agree that AIs that aren't that powerful are still normal technologies. And they agree that AIs that are that powerful are not

Starting point is 00:11:25 normal technologies. Exactly. Or like the normal technology thesis sort of stops being accurate or helpful in a world where we have like humans in the cloud, let's say. Yeah. The reason that we spend this time talking about recursive self-improvement is that, you know, RSI is kind of the moment that observers believe is like kind of the scariest moment in the development of AI, right? It's like it becomes ever harder to control. And so how far away are we from it and is it possible, I think, are probably two of the most important questions that we will ever ask on the podcast. You know, having heard your, you know, what sounded me like very, you know, sensible objections to why it may not be possible anytime soon. And I, and understanding, Daniel, you know, why you do think

Starting point is 00:12:04 as possible. I'm curious, like, if at the very least, like, you hope Syosh is right. You know, like, would you bring a sigh of relief? Yeah. I would love it if you're right. Okay. Yeah. Okay. Thank you. But what do you see that makes you think that he's not right? So I think that, like, I've tried to spend some time thinking about, like, what are the barriers, what are the bottlenecks that could block Anthropic from succeeding in their stated plans? And none of them really seemed that strong to me, basically. Yeah. So we can, we can go through them like bit by bit, like data efficiency, you mentioned. It does seem like AIs currently are less data efficient than humans, but that also seems like something that companies could probably

Starting point is 00:12:44 make rapid progress on if they tried. And also separately, it may not actually be that important for automating the AI research process. It might be that you can sort of like 99% automate the AI research process without getting that data efficiency to human level. And then even though that's not like quite there, even a 99% automation would speed things up quite a lot, which would then allow you to do, you know, a decade or two decades worth of research in a year, perhaps, you know. So those are, I think, my two arguments for why it seems like we're bringing pretty close. Another argument, a sort of meta argument that I would make is that I feel like there's been a long history of AI scientists and other commenters making claims about what AI's can't do,

Starting point is 00:13:28 like various walls that deep learning is going to hit. and they just keep getting smashed through almost as soon as people are making the claims. And I just feel like that's probably what's going to happen with data efficiency, for example. Yeah, let's pause there because that actually seems really important to me, because that's been my observation as well. And it's why I am more inclined to believe the labs when they make grand pronouncements, right? So, Sajic, I'm curious, like, what is your relationship to that? Because you've also seen these models come along and blow away the benchmarks and see the evals get saturated,

Starting point is 00:13:54 and we have to make new. In fact, you've been making your own evals because the old ones got saturated. Yeah, I mean, we've worked on several evals that, example, Anthropic has used and were saturated with the release of Opus 4 or 5, we were the first ones to say that, look, this is like solved now. And I think this progress will continue. I think as long as we can specify things well enough, we'll continue to build AI systems that can solve those tasks. Where I differ perhaps is whether the natural endpoint of this process is something like, you know, we solve data efficiency. I'm skeptical about that for a couple of reasons. First, you know, sample

Starting point is 00:14:25 efficiency or data efficiency is not the only bottleneck to getting what we call humans in the cloud earlier. And the past sort of, if you look at past progress in AI, we've continued to develop these more general systems. But at any given level of generality, we've been really bad at predicting what the bottlenecks to the next level are. We've been really bad at knowing when we solve those bottlenecks and what underlying transformative breakthroughs are needed to solve them. And, you know, like as evidence of that, perhaps we can take the transformer moment. And before that, we can take all of the skepticism about neural networks that pervaded the research community in AI. And, you know, it took a matter of like a few years until the community pivoted.

Starting point is 00:15:06 And now everyone is all in on transformers. But perhaps that's not the right architectural choice either. Perhaps you're sort of yet to discover these new architectures that would allow us to make these data efficient AI systems. And perhaps those will still not be enough to get us to the point where we have the sample efficiency of humans in the cloud. So that's sort of the broad stroke of things. I think the AI community in general has been really accurate about near-term predictions, about things that are within the event horizon, so to say, and has been really bad at predicting transformative shifts that sort of change the entire research paradigm.

Starting point is 00:15:38 And maybe like credit where credit is due, I think Daniel was one of the few people who got some things right in his report from 2021, was it about what 2025 looks like? But in general, I would say the community has a very poor track record. Well, see more about like what's the prediction that they made that just wasn't true at all? Come again? Like, what is that prediction that the AI industry made that just was not true at all? Hmm. I guess, like, the entire skepticism about neural networks. So from the 1990s to the 2010s, the entire AI community has dismissed neural networks as a joke.

Starting point is 00:16:07 Basically, you could count the number of researchers who took you seriously if you worked on neural networks on, like, two hands. And it was only through the persistence of a few people like Fei-Fali, who released this big data set that led to the deep learning revolution. and Joshua and Jan and Jeffrey Hinton, who later went on to win the Turing Award for their work on deep learning, that this sort of subfield persisted and eventually was able to disprove claims of skeptics. And, you know, in the same way, I think the AI community might be hurting too much around,

Starting point is 00:16:37 let's say, transformer-based models right now, and perhaps at the expense of other transformative improvements, that are breakthrough improvements, that are sort of being sidelined because of this community, single-minded focus on it. I think an experience, that you both have in common and that Casey and I also share is

Starting point is 00:16:54 writing things that we think are very measured and careful and precise and then just having people interpret them in the wildest possible ways. You both published your sort of breakout essays, scenarios, and it was immediately, both of them were sort of seized

Starting point is 00:17:11 on by these polarized camps. David Sachs, the former White House advisor, was posting things about AI being a normal technology and sort of agreeing with you and taking issue with you for changing your forecast. And oh, my God, the Dumers are, you know, are backed into a corner now. Gary Marcus and J.D. Vance and all Bernie Sanders and all kinds of people have used your arguments in support of kind of whatever they already believed.

Starting point is 00:17:35 How has that been to watch your work ripple out in maybe these ways that aren't what you expected? Well, I guess I'll go first. It's been a sort of a leap of faith, faith in humanity. You know, at Open AI, I was doing scenario forecasts like this, too, much smaller, you know, low effort versions. But they were just for internal use only. Like, I wouldn't be allowed to publish them. And it seemed to me that the world really needs to wake up to AI and what's coming and start thinking more seriously about it. And, you know, the discourse is not necessarily so great. And there's lots of terrible people and lots of terrible takes. And, you know, it's very chaotic and confusing. But, you know, we at our futures project are sort of making a bet that like, well, we should say what we think is coming. We should be clear. We should be articulate.

Starting point is 00:18:29 We should expand our reasoning. The discourse will get rolling. Lots of people will say lots of things. Hopefully in the end it will converge towards the truth. Hopefully in the end it will converge towards better decision making on average. And we'll see what happens. I have faith. Saj.

Starting point is 00:18:44 I guess the biggest surprise for me was how few people. read things in depth. I mean, it was honestly shocking. Like in the first line of the essay, we compare AI to the internet or perhaps the electricity, like electrical revolution. We talk about AI's impacts as sort of being at par with perhaps the first industrial revolution. And people put us in the same camp as Gary Marcus sometimes, which is just honestly shocking. But, you know, like one level deeper, I think it has been really nice to see sort of these intellectual communities use these essays to advance their intellectual thinking. I think perhaps the biggest surprise to me was the fact that our essay and perhaps both of our essays were sort of taken

Starting point is 00:19:32 so seriously by people who are thinking deeply about the future of AI, and that was really hard-forming. Looking back, have you ever had second thoughts about using the adjective normal to describe AI? Because I read your writing, and I think it's beautifully argued, and I share it widely with folks to sort of help them explore, you know, reasons why AI may diffuse more slowly than other folks think. And yet I have never really thought that AI was all that normal. You know what I mean? I do understand that. I mean, I guess part of it is the fact that we have been in these cycles of discourse where at least the people who are thinking seriously about AI take it for granted that AI is transformative. And we do too. Now, within that discourse as well, there's this huge

Starting point is 00:20:16 spectrum of opinions, right? Like, even just between the two of us, I think here will be as impactful as the internet. Daniel perhaps thinks this is the most important invention in the history of humanity. And, you know, how do you put yourselves on that spectrum? So this was the debate that we felt was really worth having. Like, we're not interested in the takes of people who think there's nothing to see here. Like, we actively sort of distance ourselves from that, let's say, in the first paragraph of the essay, in a lot of our writing. And I think this is the debate that's worth having. So within the context of this debate, I don't know. Like, I feel like, it's a fair description of where we lie on the spectrum.

Starting point is 00:20:50 And I don't know if you agree, Daniel, but I think it's also been helpful between us to clarify where we stand at this technology. And to just say that, you know, today's AI is normal technology, I think it's like a really powerful statement. And of course, this doesn't discount the importance of the technology. It does not discount the importance of taking its societal impact seriously. But it does sort of put things into perspective

Starting point is 00:21:13 compared to the view that Daniel perhaps has about the future of AI. So AI 27, because it warns us that these sort of very disruptive changes are coming very soon, has a sort of natural set of policy responses that we might want to see in response to that. What is the right policy response to AI's a normal technology, and it's going to take longer than Daniel says? I mean, one thing that I don't know if you'll find surprising, but maybe many people here will find surprising is that Daniel and I share a lot of common ground when it comes to policy responses. I think both of us value transparency immensely. Both of us

Starting point is 00:21:53 value the ability of external third parties to be able to see what's going on inside companies. In fact, I mean, we were just talking backstage about Anthropics release of Fable 5 and the fact that the model purposefully is decreed for tasks involving AI R&D. And I think I speak for both, Both of us, when I say that this is a very dangerous precedent, we shouldn't be fine-tuning our models in such a way that they lie to the customers. Companies shouldn't be allowed to do this. They should act in good faith. And so that's the sort of thing where we have a lot of policy agreement.

Starting point is 00:22:27 I do think there are areas where we diverge. For example, there might be sort of the more, in these more aggressive scenarios, you might want a conditional slowdown. You might want companies to pause. Whereas when you consider AI as normal technology, the benefits of diffusion of the of AI and the development of more capable AI systems perhaps outweigh the risks a little bit more. But at least in the near term, and it was funny

Starting point is 00:22:51 when I spoke to Thomas, who was another one of the co-authors of AI 2027, we spent hours trying to figure out where it is on the timelines that we actually disagree. And it was funny because we couldn't find any near term disagreements. I mean, we wrote this blog post together where we say that, you know, I agree completely with the events of AI 2027, or at least find them plausible until the end of 2026,

Starting point is 00:23:15 which is a long time we read this last year. And so in some sense, I think there is much more common ground in terms of policy than you might think. You guys are being much too agreeable. Daniel, what is something you are worried about more than Siyos is? And then I'll ask the same question of Sias. What is an AI risk that concerns you more than you think it concerns Siyos? Well, in general, you know, strong AGI or superintelligence,

Starting point is 00:23:39 that sort of thing. main one would be loss of control. Number two would be concentration of power. There's a whole bunch of other ones besides that, but I'll stop there. I can elaborate if you like. Those seem pretty bad. Syash, what about you? Actually, this is another thing we were just talking about backstage.

Starting point is 00:23:53 I mean, I was surprised to hear that we disagree far more, or like I'm far more concerned about military uses of AI than Daniel is. I mean, it's on the list. It's just a couple notches down. But I mean, like, as you both know, in the SAV, explicitly carved out military air because he said like we weren't the right people to comment on it. And, you know, people who are experts on this like Michael Horowitz have used our frame to argue that military air, at least today, is a normal technology in his view as well.

Starting point is 00:24:23 But frankly, the actions that are being taken by, like, countries worldwide by nation states are pretty, pretty alarming. I mean, I think we shouldn't take it for granted that companies or countries can use killbots. And that is not something that requires further technological investment either. It's not something where we have any technical bottlenecks we can use, like off-the-shelf, computer vision libraries, to basically build killer robots today. It is actually something where we need to exercise a lot of agency. And I'm not really positive about where things are going right now on that front. Well, I truly believe that whatever is about to happen to us lies somewhere in between the views of these two people. So we will

Starting point is 00:25:00 continue to pay very close attention to your work. Thank you so much, Daniel and Syach. Thank you for joining us. Thanks, guys. That was fun. Thank you. Thank you. Thank you. Thanks, thank. We'll be back with more Hardfolk Live after these messages.

Starting point is 00:25:37 One thing we know for sure is that no matter what happens with the future of AI, it will be extremely fun to talk about robots. Yes. So we have already shown you, I think, more than 10 robots tonight, including members of our robot choir. But we have one more very special robot guest.

Starting point is 00:25:55 tonight. We are about to bring on George Ekes. He is the director of engineering at ToberLife AI, a robotics company in Silicon Valley that is one of the leading distributors of humanoid robots, specifically these unitary robots from China. And we are going to be joined by George and Toby the robot. George and Toby, come on out. Thanks for having me. Good to see you. George. You're a very convincing humanoid. Oh, no, wait.

Starting point is 00:26:27 That's Toby. Do we shake hands? Okay. Let's try it here. Hi. What? Short King. It's great. I appreciate the weak grip strength. It gives me comfort. Yeah, it's sort of like a dead fish handshake. Yeah. Now, he is advancing on me. All right. Oh, okay. Wow. Now, we're going to talk about all the things that Toby and his brethren can do, but we heard that Toby can actually dance. Is that true? That is the case. Okay. Can we see that? Toby, can you dance for us? Dan, will you help us out?

Starting point is 00:27:09 Hit it, DJ. Right. Listen, we've all been there. Sometimes you just dance to the drop. All on the dance floor, ladies and gentlemen. Could have been an operator there. Thank you. Thank you, Toby, for your sacrifice.

Starting point is 00:27:42 You will not be forgotten. We'll add you to the end memoriam next year. Now, is Toby capable of standing up? Okay? Yeah, probably just a misclick on the controller. Oh, okay. We're so back. He's absolutely fine. They're quite durable. Yeah. Oh, my God. That was not in the script. Yeah. No. Now, I'm sorry. We've traumatized our audience here tonight. I'm so sorry. Now, George, we the choreographer on that or?

Starting point is 00:28:17 Nope. Okay. Well, it was great. So, George, what is the use case for these other than doing dance demos and sometimes following over? Who is buying and renting these humanoid robots from your company? And what are they doing with them? Well, right now, the early market for the humanoids is the research market. People want to collect a lot of data. You guys had the neo folks on specifically burnt, right? And they're deploying the humanoids into households to try to collect a lot of data in the households.

Starting point is 00:28:45 People with the unitary robots are also targeting different use cases. different companies are pursuing different verticals with them and trying to get big data sets and train models on these humanoids. There are also a set of robots that we also sell, which are more reliable, more industrial right now, called quadrupeds. And probably easier just to remember them as the dog robots. You can put LiDAR on them. You put different sensors. Mark Zuckerberg or Elon Musk on them. We saw that earlier tonight, yes. I forgot about that. Somehow, somehow I forgot about that. But they are practical for like inspection use cases or security patrols.

Starting point is 00:29:24 So those are kind of being pushed out into industry and applications more. And these are on the edge of research and acquiring data to build policies. How much is one of these costs? They range in cost if you want one to just dance around. I don't remember the exact figure on the low-level dancing ones, but they're less than the ones that you could put dexterous hands on. and then go and collect manipulation data with on tasks. So you collect data from doing tasks with them.

Starting point is 00:29:58 So like more or less than $10,000? More. More. Okay. That's a great question. The ones I was getting to are like in the 50 to 70 range. The ones with the hands. So like a mid-range sports car?

Starting point is 00:30:13 Yes. Yeah. All right. I have to say, it did not inspire a lot of confidence in me to learn that the primary use case for these robots is data. collection. I mean, I think the vision is that these things, as we saw when we talked with Burt from 1X about their robot, as we're hearing about these unitary robots, the dream is that these things will just be in your house and will be doing chores for you, folding laundry,

Starting point is 00:30:36 doing the dishes, cleaning the house. What is the timeline for that? Do you think that is realistic? Should people be pre-ordering now in hopes of automating their chores forever? Where are we on the chore spectrum? I think Burns very optimistic. I'd put it a few more years out than he would on terms of being in your house, but in terms of maybe operating an industrial setting or they can maybe load up a fabricator or something with a material or a part, I think that's in the next couple of years. And there's actually early implementations of that and by like figure in unitary and unitary in their factory, figure in the BMW factory. So people are doing that with these. But the widespread adoption, I believe, in the next couple of years

Starting point is 00:31:16 will happen in those settings. Let me ask one question about the data collection. Some security researchers have claimed that Unitary robots might have a backdoor that could allow remote users to control or monitor what they're seeing. Can Toby send the data to China? So they do send logging data to China just like every other Chinese thing that you can own like a computer or any other computer chip-based thing that connects to the Internet that sends logging data. They send that, but they don't actually... like there hasn't been an established thing that sends camera data or telemetry data of the joints to China. So there are things that people will be like, oh, it sends data to China. It's like, yeah, and your computer sends data to Microsoft.

Starting point is 00:32:06 And it's because your computer crashed and it needs to send data to Microsoft. Right. I think the difference is in this case that Unitary is a Chinese company and some members of Congress have become very worried about the fact that these are now being sold in the United States. some of you propose banning the importation of these specific unitary robots. How likely do you think that is? And would that be a big hit to your business? What's your plan if they ban these?

Starting point is 00:32:28 Certainly be problematic. They're not a lot of American alternatives. Yeah, if they're going to ban all Chinese humanoid robots, I wouldn't be too stoked on that. So I don't know how much more to say. Well, much to consider. Before we let you go, does Toby maybe have, you know, one more cool routine he could show us? Yes, he does.

Starting point is 00:32:59 Take it easy. All right. DJ Dan, will you help us out again? That's great. This is like what happened the last time Casey had a Long Island iced tea at the club. All right. Fascinating. George and Toby.

Starting point is 00:33:25 Thank you. Thank you. stretch, but we had one more Friend of the Pot who we just wanted to bring on and have a little bit of fun with before the end of the show. Yes, our next guest is Friend of the Pod and

Starting point is 00:34:01 YouTuber and podcast sensation Dwarkesh Patel, Dwarkash. Come on out. What's up, guys? Good to see you. Hello. All right. How am I supposed to follow a robot dancing? You can fall over.

Starting point is 00:34:18 Yeah. You can just face plant. That would be great. Larkash, it's been a hell of a year for you. You are firing on all cylinders doing interviews with Jensen Huang and other tech luminaries. You've got a new Blackboard series that teaches people, you know, extremely dense and esoteric concepts in AI. You also got profiled in the New York Times in April, and they made a big deal of you and your media empire that you are building here. I don't really have a question about that.

Starting point is 00:34:50 I'm just kind of like in awe of what you have managed to build. I'm curious, like, what you hear when you hear the conversation about AI 2027 versus AI and normal technology, where are you on the spectrum of like everything is changing, the scaling laws are holding to maybe things are slowing down and we don't quite have the breakthrough ideas yet to get to AGI? I think fundamentally the scary thing is we realize just how far we are from human intelligence, yet these models are so powerful. And so that raises the obvious question is when they not only have the current advantages

Starting point is 00:35:26 that they do, that they can think, you know, thousands of times faster, they have greater ability to absorb knowledge across a wide variety of domains. If anybody's used these models that, like, coding work or any sort of like computer use work, you must have experienced this. And then you think, well, there's this huge overhang where humans are able to learn about new things literally a million times faster. how much information you see from birth to adulthood versus what these models see. We're capable of retaining information across sessions. We're learning on the job.

Starting point is 00:35:57 We're not just like first day on the job the way these models are experiencing things. And so I think that the really scary thing really is that like we know that there's a big difference between where these models are currently and where human intelligence lies. We're making really fast progress towards human intelligence. Already these things are so capable. What happens when they not only have their inherent advantages because they're digital minds but also have all our advantages? You've written and spoken before about how you've tried and failed to automate parts of your own production process with your podcast and your YouTube show and how hard it's been to sort of get rid of some of the sort of sticky human processes there. Are you having better luck with newer models? Like, is your operation more AI than it was six months ago?

Starting point is 00:36:38 So most of the tokens I see in a given day are produced by AI. And so I can't really come here and say, like, no, AI is not making me more productive or I'm not using it in six months. I do, I think people underrate how hard just to automate jobs. Like, people underrate how much it takes to do every single thing, a human, even white collar worker might be doing. At the same time, you guys must be finding this as well. Just the ability to free out use a bunch of information, which is a large part of my job, has just gotten way better. Yeah, how are you guys been finding these models? I mean, sort of the same. I do feel like with each of the big leaps and model capability, it becomes better at tasks that are quite useful in, for example,

Starting point is 00:37:19 like the preparing for a podcast, right? If we're sitting down with a guest that I'm not that familiar with saying, hey, go out and, you know, prepare a briefing document for me about this person and give me some interesting directions to maybe take the conversation based on things they've said in public in the last three months. I mean, that's absolutely a job that I could have hired for. And now, you know, I can get in about like four minutes on my computer. So that's really useful. Does it make me more productive? Yes. But do I like, work less or use the computer less? No. I'm finding something similar.

Starting point is 00:37:50 I want to use these models to automate a lot of my life, and I've been very successful at doing some pieces of it. But there are just things that... Now, the primary feeling I had, like, I got access to Claude Fable yesterday, and the primary feeling I had was, like, I am too dumb to use this thing. Like, I actually don't know what I would prompt it to do that a previous model would not have been able to do.

Starting point is 00:38:12 But I'm not building RL environments. I'm not overseeing training runs. So what is the use for you as a media figure and podcaster? What is the thing that you wish the models could do that they can't currently? I think because we're so, first of all, every time I say something you're worse about the models, I put it in the context that we're living in like an absurd timeline. And I am reacting to my close friends who are just like,

Starting point is 00:38:43 well, you just had some of them on, and we're talking about, like, the singularity in two years. But I feel like we're so used to what these models are capable of currently, that we ask these questions, like, well, what is it that they can do? Aren't they clearly already AGI? It's like, no, we all have jobs. That wouldn't happen in a world with the AGI. Right? Like, just to get them to do something pretty, okay, so, for example, I'm negotiating with a sponsor for next season or something. And, like, they ask for, you do, like, the back and forth there with the relevant context about how we think about our,

Starting point is 00:39:13 business and stuff. It's like probably a one-hour horizon task for me or my general manager. The models couldn't do it at all. Or like, let's say book a show in another city, like book an event like this, right? There's a lot of people who are involved in this. What part of it could the models do reliably? It's like, anyways, all this to say, I think people really underrate what the range of human, even white-collar work is. I mean, it seems to me like it might be very helpful in a negotiation, though, like particularly, I mean, you know, you're not in this position, but maybe you're just starting a new podcast and you have some interest from a sponsor and you say, go tell me something about this market and what's sort of the best place to get started?

Starting point is 00:39:48 I could see it compressing that into a much smaller problem, but to your point, somebody still has to do the rest of the job. Yeah, that's right. I mean, they can't like do something on a computer you might want them to do, right? And it's actually quite interesting. Why are they so bad a computer use given that it's an extremely verifiable domain? And I think that it actually goes to show you that it's not just about verifiability. It's about like the ability to the environment has to be,

Starting point is 00:40:11 which allows you to deterministically run many parallel rollouts at the same time. And, like, if you try to do that on Amazon, Andy Jassy will just shut your ass down. And so, you know, they have to build clones every single website because it takes a ton of data in the relevant domain in order for these models to become competent, like learning how Amazon works or Slack works. So you got to build clones of those things. That's very labor-intensive. Yeah, so I think we'll make progress on that as well. But, yeah.

Starting point is 00:40:36 One of the issues that you really brought to the forefront of the industry's conversation, I would say, over the past year has been the failure of these models when it comes to continuous learning, right? So, you know, it's often observed that, like, a good LLM might be better on day one than an intern, but the intern is almost always better, like, after two weeks because they've been able to, like, learn. Are you still as convinced that, like, this is going to be a major hiccup to getting us all the way to AGI, or has, have recent developments, maybe any new models change the way you think about that? So there's a big crux in how people think about how these models will evolve. And one side of the discussion says, you need some way in which between sessions for a given user, the weights themselves are updating.

Starting point is 00:41:21 Because if you think about the way humans learn, there's not like, you know, you're way better at your job than you were the first day you were on your job. Like people often say an employee is not net productive until six months on the job. What is happening at that time? It's not like you're building up this intensely accurate episodic recall of every single thing that has happened to over the six months, which is just what, in common. context learning is like, that just grows linearly in size as you spent more time on the job. It's like, you know, there's some distillation back in like a higher level abstraction that's happening over time. And so does there need to be an updating that happens back in the way it's a real question? Some people say, well, no, you just basically you'll get to a

Starting point is 00:42:00 point where these models are spending six months on the job and that six months is happening in context. And we're going to train them in such a big variety of RRL environments that they'll learn how to adapt to any given situation you put them in. My question with something like this is, I think that might be enough to get these laps to like a trillion dollars in revenue or something, like truly ludicrous outcomes. I'm concerned about, or also interested in what,

Starting point is 00:42:21 well, do we get to super intelligence or something like that? And you know, one question you can ask is, how would you build something that is as good as Henry Kissinger at politics? The relevant, there's no relevant training environment for that you can run into a data center. And so you do need something that can learn that on the fly. And maybe just by doing enough RLBR, you build something that can just pick up

Starting point is 00:42:45 whatever Kissinger picked up through his life that through interacting with the world. Maybe not. You know, the headline coming out of this talk is going to be Dwarka says Henry Kissinger is good at politics. So I'm just preparing you for that. LBJ or whatever. The example doesn't matter. You know what I'm saying. Interesting. You have a very old soul. All your references are to mid-20th century. You live in San Francisco with Sholto Douglas, a researcher in Anthropic, and Dylan Patel of Semi Analysis, a very influential semiconductor newsletter. You guys are...

Starting point is 00:43:21 Have you seen the rent, man? I've got to split it. Well, that's my question. Semi-analysis is reportedly making something like $100 million a year in revenue. Anthropic is obviously very valuable. At what point are you guys rich enough to not need roommates? The problem is everybody else in SF is also getting so rich. And so the housing is increasing at the same rate that our net worth is increasing.

Starting point is 00:43:45 We're never escaping this. One knock that I sometimes hear on the sort of San Francisco AI scene is that it's all very clubby and insular, that there aren't a lot of people who are doing the work of holding people to account or being appropriately skeptical. You know, one detail in the New York Times profile of you was that you sometimes invest in companies, who's CEOs or leaders you interview. do you think that journalists and other sort of more conventional media people have the wrong sort of framework for thinking about conflicts of interest, or do you just think you're doing

Starting point is 00:44:17 something different? I totally see the rationale for journalistic policies that say you're not allowed to have any sort of financial entanglement with the company that you're covering or whatever. I think at the end of the day, I hope the product speaks for itself and that if you watch an interview I do with a CEO or an executive, you hopefully feel like I ask the relevant questions that at least, I'm not, look, I also don't try to steal man some objection that I don't have. But when I do think that they're not making sense, I try to say so. And I hope that that in and of itself speaks for the interview. Who's your white whale? Who's the guest that you wish you could book that is not agreed to come on? Robert Caro. Can you make this happen? Robert Carroll? Okay,

Starting point is 00:45:00 Robert, if you're out there, go on to our cash. I will say, I will say that Robert was also famously Conan O'Brien's white whale, and Conan O'Brien never got him on the show. No, he got him on. Did he? Yeah, Conan O'Brien needs a friend. All right. He just fact-checked my ass. Yeah. Well, Dorcas, the podcast and the show is amazing. I learned so much from it. I listen to every episode, and I understand about 80% of it now, which is up from a, you know, 20, 25, about 20%. So I'm learning along with your audience, and we thank you for all the work you do. It's a great show. Thank you, guys. Great to see you guys. Thank you. Thank you. All right. Okay, well, friends, we are almost there at the finish line,

Starting point is 00:45:43 but before we go, we wanted to take some questions. If any of you have questions for us, we will spend a few minutes answering them. We have mic runners, upstairs and downstairs. So raise your hand. Someone will approach you with a mic. Anything, we're an open book. You can ask us about it all. It's like a YouTube comment section, but in real life. right here. Hi, my name, Son, can you hear me? Okay. Hi, my name is Dallan. I'm here with my brother from Utah.

Starting point is 00:46:11 What happened to the Fedaverse? Great question. The Forkiverse, I should say. The Forkiverse was, of course, our effort to build a social network in a federated way, sort of show people what it would be like to be part of a social network that wasn't owned by a giant corporation. And I think it just ran into the challenge that any social product does, which is that if you're not constantly bringing a. in new users, it's like default state is to just kind of shrink.

Starting point is 00:46:36 And so, you know, we've been in discussions recently about like what is the future of it. I think it was a fun experiment. But, you know, we didn't really have that strong of an idea of what was going to happen after we started it. And so we're now sort of living with the consequences of that. Balcony. Do we have anyone in the balcony? Yes.

Starting point is 00:46:57 Hi, Kevin and Casey. I was wondering why we're not hearing more from executives like Zatia. and other tech leaders who are restructuring their companies around the premise of AI, they just don't seem to want to engage with that premise when you ask them. What do you think that's about? I mean, I think there's a lot of conflicting incentives here, right? There are some companies that really want you to know how much they are using AI and how much more productive they are getting and how many workers they are laying off.

Starting point is 00:47:26 And sometimes that's real and sometimes it might just be covering for some overhiring they did a couple years ago, I think that's going to flip at some point where companies will not want to advertise the fact that they are restructuring around AI. Right now, there is still sort of this weird market premium for that. And so I think that will continue for as long as the market premium lasts. And then it'll be like, we're just going to sort of sweep it under the rug and hide it. And if we're going to lay people off to replace them with AI, we're going to call it something else because we don't want to deal with the backlash. But I think that really hasn't happened yet, which has been a surprise to me. What about you?

Starting point is 00:48:03 No, I agree with that. And in the interest of answering as many questions as possible, I think we should move on to the next one. We're right here. Hi, my name is Ena. I work at Quizlet. If you've gone to school in the last 20 years, you've heard of Quizlet. If you haven't, what? Anyways, education is being obviously, like, radically changed, but, like, what people need to learn and kind of the fact that you need to learn doesn't really change. So I'm curious if Quizlet were to just, like, start everything from the ground up tomorrow. What do you think we should build?

Starting point is 00:48:30 I mean, that's, that is really challenging. I mean, you know, Kevin and I get a chance to go speak in schools, um, from time to time. And I think what we find is people who are like doing their absolute best to introduce like fairly incremental change and kind of see what happens. There's just tremendous uncertainty right now, you know, school is typically trying to educate you for like a fixed target. You know, like when I went to journalism school, it was like, well, if I get these skills, then, you know, I can have this kind of job. I think, you know, like, we're not able to ask any guests on this stage. And I think, you know, like, we're not able to ask any guests on this stage. about anything longer than a two-year timeline because none of them have credibly anything to say about that. So, you know, how do you, like, educate a five-year-old so they'll be prepared for the world when they're 18? Like, you know, good luck. What an inspiring message. Thank you.

Starting point is 00:49:23 All right. Let's take a couple more. Yes, up there on the balcony. Okay. Can you hear me? Okay, great. Please introduce yourselves. Oh, hi.

Starting point is 00:49:32 I'm Liz. Hi, Liz. Okay, so two real legitimate questions. Number one, what are we wearing now that all birds is under? Okay? And two, so I work as a regulator. I work for the state of California. I do privacy regulation.

Starting point is 00:49:49 And so my question is on, so if you were to take a stab at what would be in the AI, the new world for privacy, like how are you going to protect your digital cells, either your sons or your friends? Like, what are we going to do when it's all owned and one? one-walled universe. Yeah, I mean, my hope is just that that is not the case. You know, we sort of asked Cindy about that tonight. Like, I think there is a lot of logic in having some kind of privilege-like system

Starting point is 00:50:17 that protects certain kinds of conversations that you would have with a chatbot the same way, you know, that a conversation with a lawyer might be protected. But I also think there's a lot of wisdom about what she said is, you know, what systems can we build that would ensure that that sort of data never makes it into the hands of a big corporation. And I think we should outlaw data brokers. Next question. Oh, yeah.

Starting point is 00:50:35 Outlaw data brokers. That's a good one. What's that? Oh, yeah, and where do you get your shoes, Kev? These are from Quince. Yeah. That was not sponsored content. They just are.

Starting point is 00:50:49 Yours are better, though. I got these from like online unspecified. I honestly don't remember, but I can look into it. I'll figure it out by the reception. How's that? All right. Just a couple more. So I'm a software engineer, so I take this for what it's worth.

Starting point is 00:51:04 There's been some talk about, you know, like lots of people are afraid of jobs going away, and then you hear other people saying, oh, there's tons of hiring going on. That's what I see. I say a lot of hiring going on. But it's all for senior engineers who know how to fact-check the models or how to, like, architect and combine the things that they can do really fast. What's happening with the entry-level folks? It seems like that is a real problem.

Starting point is 00:51:32 Yeah. So I've talked to a couple of labor economists about this. within the past couple weeks, and they have sort of said, like, believe it or not, things were actually just, like, much worse during the great financial crisis, and that, like, the circumstances that we're seeing today, like, don't approach that at all. Now, maybe they will eventually, the one labor economist I talked to, Catherine Ann Edwards was telling me, like, some people sometimes forget that, like, your first job just sucks and has nothing to do with what the thing you actually want to do. And so she's sort of, like, encouraging younger

Starting point is 00:51:59 folks to manage their expectations, which is also not a very inspiring message. I think we can do one more question. So let's have the last question. Yes. Hey there. My name is Kevin. Oh. Great name.

Starting point is 00:52:15 Good. Yes, my name is Kevin. And what is your optimistic view over here in the middle, if you're looking out? What is your optimistic view on AI for about three years out, two to three years out? Just curious to get y'all's take. Yeah. My optimism is around the acceleration of science and medicine. This is really a place I care a lot about.

Starting point is 00:52:43 I don't know if any of you saw the cheering at the conference the other week where they announced that they had created a new breakthrough therapy for pancreatic cancer. I want there to be like many, many more of those very soon. And I want the, yeah, thank you. So that is my case for optimism, is that we sort of muddle through the transition from the old jobs to the new jobs. We deal with the safety risks that are really extreme. And then we just accelerate the hell out of the things that make people's lives healthier and longer and allow us to flourish. Yeah.

Starting point is 00:53:20 I mean, that's my number one. But two more I would throw in there is like AI is amazing for learning and AI is amazing for building. And it's fun to learn and it is fun to build. Like if I were in school right now, like, I frothed at the mouth thinking of what it would have been like to tell my AP exams in a world where I could have, you know, chat, UPT, generate infinite quizzes for me to do. And, you know, like Kevin and I've talked a lot on the show about vibe coding in the past year. I've been like making new projects this week and annoying my fiance and making them come see them, even though they're just pure slop. But it is fun to make things. It is fun to annoy your partner with random AI stuff that you build.

Starting point is 00:53:55 All right. We're going to stop it there so that we can get to the reception. We'll see you all at the reception. Thank you so much for coming. Thank you. We love you. We love you. Hard of Work is produced by Rachel Cohn and Whitney Jones.

Starting point is 00:54:49 We're edited by Viren Pavich. We're fact-checked by Caitlin Love. Today's show was engineered by Alyssa Moxley. Original music by Alicia B. YouTube, Marian Lazzano, Diane Wong, Rowan Nemist Doe, Alyssa Moxley, and Dan Powell. Video production by Soya Roque, Jake Nichol, and Chris Schott. Special thanks to the New York Times Live event team who helped us put on Hard Fork Live this year. Hillary Kuhn, Beth Weinstein, Caitlin Roper, Chantal Renei, Melissa Tripoli, Natalie Green, Kirsten Bermanagh-Safrinya,

Starting point is 00:55:22 Jennifer Feeney, Morgan Singer, Dana Praskowski, Haley Duffy, Yenway Liu, Matt Kaiser, Sarah Cheever, Johnny Marolla, Victoria Kim, and SV Productions. Thanks also to everyone, at the Yerba Buena Center for the Arts and the Blue Shield of California Theater where we held the event. They were so fantastic to work with. And a special thanks to Paula Schumann, Pueing Tam, and Dahlia Haddad.

Starting point is 00:55:46 You can email us, as always, at Hard Fork at NYTimes.com.

Hard Fork - ‘Hard Fork’ Live, Part 3: Differing Visions of an A.I. Future

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.