Big Technology Podcast - Best of Big Technology: Demis Hassabis On AGI, Deceptive AIs, Building a Virtual Cell

Episode Date: December 31, 2025

Demis Hassabis is the CEO of Google DeepMind. He joined Big Technology Podcast in early 2025 discuss the cutting edge of AI and where the research is heading. In this conversation, we cover the path t...o artificial general intelligence, how long it will take to get there, how to build world models, whether AIs can be creative, and how AIs are trying to deceive researchers. Stay tuned for the second half where we discuss Google's plan for smart glasses and Hassabis's vision for a virtual cell. Hit play for a fascinating discussion with an AI pioneer that will both break news and leave you deeply informed about the state of AI and its promising future. --- Enjoying Big Technology Podcast? Please rate us five stars ⭐⭐⭐⭐⭐ in your podcast app of choice. Want a discount for Big Technology on Substack + Discord? Here’s 25% off for the first year: https://www.bigtechnology.com/subscribe?coupon=0843016b Questions? Feedback? Write to: bigtechnologypodcast@gmail.com --- Wealthfront.com/bigtech. If eligible for the overall boosted 3.90% rate offered with this promo, your boosted rate is subject to change if the 3.25% base rate decreases during the 3-month promo period. The Cash Account, which is not a deposit account, is offered by Wealthfront Brokerage LLC ("Wealthfront Brokerage"), Member FINRA/SIPC, not a bank. The Annual Percentage Yield ("APY") on cash deposits as of 12/19/25, is representative, requires no minimum, and may change at any time. The APY reflects the weighted average of deposit balances at participating Program Banks, which are not allocated equally. Wealthfront Brokerage sweeps cash balances to Program Banks, where they earn the variable base APY. Instant withdrawals are subject to certain conditions and processing times may vary. Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 Michael Lewis here. My best-selling book, The Big Short, tells the story of the buildup and burst of the U.S. housing market back in 2008. A decade ago, the Big Short was made into an Academy Award-winning movie, and now I'm bringing it to you for the first time as an audiobook narrated by yours truly. The Big Short's story, what it means to bet against the market, and who really pays for an unchecked financial system, is as relevant today as it's ever been. to big short now at Pushkin.fm.fm.
Starting point is 00:00:31 or wherever audiobooks are sold. Google DeepMind's CEO and Nobel laureate Demis Asabas joins us to talk about the path toward artificial general intelligence, Google's AI roadmap, and how AI research is driving scientific discovery. That's coming up right after this. Welcome to Big Technology Podcast,
Starting point is 00:00:52 a show for cool-headed, nuanced conversation of the tech world and beyond. Today, we're at Google DeepMind headquarters in London for what promises to be a fascinating conversation with Google DeepMinds CEO Demis Hasabas. Demis, great to see you again. Welcome to the show. Thanks for having me on the show. Definitely. It's great to be here. So every research house right now is working toward building AI that mirrors human intelligence, human level intelligence. They call it AGI. Where are we right now in the progression? And how long is it going to take to get there? Well, look, I mean, of course,
Starting point is 00:01:26 the last few years has been an incredible amount of progress. Actually, you know, maybe over the last decade plus. This is what's on everyone's lips right now. And the debate is how close are we to AGI? What's the correct definition of AGI? We've been working on this for more than 20 plus years. We've sort of had a consistent view about AGI being a system that's capable of exhibiting all the cognitive capabilities humans can. And I think we're getting, you know, closer and closer, but I think we're still probably a handful of years away. Okay. And so what is it going to take to get there?
Starting point is 00:02:01 Memory, planning. I mean, what are the models going to do that they cannot do right now? So the models today are pretty capable. Of course, we've all interacted with the language models, and now they're becoming multimodal. I think there are still some missing attributes, things like reasoning, hierarchical planning, long-term memory. There's quite a few capabilities that the current systems
Starting point is 00:02:24 I would say I don't have. They're also not consistent across the board. They're very, very strong in some things, but they're still surprisingly weak and flawed in other areas. So you'd want an AGI to have pretty consistent, robust behavior across the board, all the cognitive tasks. And I think one thing that's clearly missing, and I always always had as a benchmark for AGI, was the ability for these systems to invent their own hypotheses or conjectures
Starting point is 00:02:49 about science, not just prove existing ones. So, of course, that's extremely useful already to prove an existing. math conjecture or something like that, or play a game of go to a world champion level, but could a system invent go? Could it come up with a new Riemann hypothesis? Or could it come up with relativity back in the days that Einstein did it with the information that he had? And I think today's systems are still pretty far away from having that kind of creative,
Starting point is 00:03:16 inventive capability. Okay, so a couple years away until we hit AGI. I think, you know, I would say probably like three to five years away at least. So if someone were to declare that they've reached AGI in 2025, probably marketing. I think so. I mean, I think there's a lot of hype in the area, of course. I mean, some of it's very justified. I mean, I would say that AI research today is overestimated in the short term.
Starting point is 00:03:43 I think probably a bit overhyped at this point, but still underappreciated and very underrated about what it's going to do in the medium to long term. So it's sort of, we're still in that weird kind of space. And I think part of that is, you know, there's a lot of people that need to do fundraising, a lot of startups and other things. And so I think we're going to have quite a few sort of fairly outlandish and slightly exaggerated claims. And, you know, I think that's a bit of a shame, actually. Yeah. In the AI products, what's it going to look like on the path there?
Starting point is 00:04:17 I mean, you've talked about memory again, planning, being better at some of the tasks that it's not excelling at at the moment. So when we're using these AI products, let's say we're using Gemini, what are some of the things that we should look for in these domains that will make us say, oh, okay, it seems like that's a step closer and that's a step closer? Yeah, so I think today's systems, you know, obviously we're very proud of Gemini 2.0. I'm sure we're going to talk about that. But I feel like they're very useful for still quite niche tasks, right? If you're doing some research, perhaps you're summarizing some area of research, it's incredible. You know, I use Notebook LM and deep research all the time to kind of, especially like break the ice on a new area of research that I want to get into or summarize some, you know, maybe a fairly mundane set of documents or something like that. So they're extremely good for certain tasks and though people are getting a lot of value out of them. But they're still not pervasive, in my opinion, in everyday life, like helping me every day with my research, my work, my day to day, my daily life too.
Starting point is 00:05:15 And I think that's where we're going with our products with building things like Ask Project. Astra, our vision for a universal assistant, is it should be involved in all aspects of your life and be enriching, helpful, and making that more efficient. And I think part of the reason is these systems are still fairly brittle, partly because they are quite flawed still, and they're not AGIs. And you have to be quite specific, for example, with your prompts, or you need a lot of, there's quite a lot of skill there in coaching or guiding these systems to be useful and to stick to the areas they're good at. And a true AGI system shouldn't be that difficult to coax. It should be much more straightforward, you know, just like talking to another human. Yeah. And then on the reasoning
Starting point is 00:05:59 front, you said that's another thing that's missing. I mean, that's everybody's talking about reasoning right now. So how does that end up getting us closer to artificial general intelligence? Right. So reasoning and mathematics and other things, and there's a lot of progress on maths and coding and so on. But let's take maths, for example. You have systems, some systems that we work on, like alpha-proof, alpha geometry that are getting, you know, silver medals in maths Olympiads, which is fantastic. But on the other hand, some of our systems, those same systems, are still making some fairly basic mathematical errors, right, for various reasons.
Starting point is 00:06:30 Like the classic, you know, counting the number of ours in strawberries and so on and the word strawberry and so on. And is 9.11 bigger than 9.9.9. And things like that. And of course, you can fix those things and we are and everyone's improving on those systems, but we shouldn't really be seeing those kinds of flaws in a system that is that capable in other domains, in more narrow domains of doing, you know, Olympiad-level mathematics. So there's something still a little bit missing, in my opinion, about the robustness of
Starting point is 00:07:00 these systems. And then I think that speaks to the generality of these systems. A truly general system would not have those sorts of weaknesses. It would be very, very strong, maybe even better than the best humans in some things like playing go or doing mathematics. But it would be overall consistently good. Now, can you talk a little bit about how these systems are attacking math problems? Because, you know, I think the general understanding of these systems is the LLMs is they encompass all of the world's knowledge and then they predict what, you know, somebody might answer if they were asked a question.
Starting point is 00:07:30 But it's kind of different when you're working step by step through an algorithm, through a math problem. Yes. That's not enough. Of course, you know, just understanding the world's information and then trying to sort of almost compress that into your memory, that's not enough for solving a novel math problem or novel conjecture. So there, you know, we start needing to bring in,
Starting point is 00:07:49 I think we talked about this last time, more kind of like AlphaGo planning ideas into the mix with these large foundation models, which are now beyond just language. They're multimodal, of course. And there, what you need to do is you need to have your system not just pattern matching roughly what it's seeing, which is the model, but also planning
Starting point is 00:08:09 and be able to kind of go over that plan. we revisit that branch and then go into a different direction until you find the right criteria or the right match to the criteria that you're looking for. And that's very much like the kind of games playing AI agents that we used to build for go, chess and so on. They had those aspects. And I think we've got to bring them back in,
Starting point is 00:08:32 but now working in a more general way on these general models, not just a narrow domain-like games. And I think that also, that approach of a model guiding a search or planning process, so it's efficient, works very well with mathematics as well. You can sort of turn maths into kind of game-like search. Right. And I want to ask about math. Like once these models get math right, is that generalizable? Because I think there was like a whole hubb when people first learned about reasoning systems and they're like, oh, this is like, this is going to be a problem. These models are getting smarter than we can control because if they can do math, then they
Starting point is 00:09:07 can do X, Y, and Z. So is that generalizable? Or is it like, we're going to teach them how to do math? and they can just do math. I think for now, the jury's out on that. I mean, I feel like it's clearly a capability you want of a general AGI system. It can be very powerful in itself. Obviously, mathematics is extremely general in itself. But it's not clear, you know, maths and even coding and games.
Starting point is 00:09:30 These are areas, they're quite special areas of knowledge because you can verify if the answer is correct, right, in all of those domains, right? the math, you know, the final answer the AI system puts out, you can check whether that maths, that solves the conjecture or the problem. So, but most things in, in the general world, which is messy and ill-defined, do not have easy ways to verify whether you've done something correct. So that puts a limit on these self-improving systems if they want to go beyond these areas of, you know, maybe very highly defined spaces like mathematics, coding or games.
Starting point is 00:10:09 So how are you trying to solve that problem? Well, you know, you've got to, first of all, you've got to build general models, world models, we call them, to understand the world around you, the physics of the world, the dynamics of the world, the spatial temporal dynamics of the world, and so on, and the structure of the real world we live in. And of course, you need that for a universal assistant. So Project Astra is our project built on Gemini to do that, to understand, you know, objects and the context around us. I think that's important if you want to have an assistant. But also robotics requires that too. Of course, robots are physically embodied AIs, and they need to understand their environment, the physical environment, the physics of the world. So we're building those types of models. And also you can also use them in simulation to understand game environments. So that's another way to bootstrap more data to understand, you know, the physics of a world.
Starting point is 00:11:02 But the issue at the moment is that those models are not 100% accurate, right? So they, you know, maybe they're accurate 90% of the time or even 99% of the time. But the problem is if you start using those models to plan, maybe you're planning 100 steps in the future with that model. Even if you only have a 1% error in what the model's telling you, that's going to compound over 100 steps to the point where you'll be in a, you know, you'll kind of get almost a random answer. And so that makes the planning very difficult. Whereas with maths, with gaming, with code coding, you can verify each step are you still grounded to reality? and is the final answer mapped to what you're expecting. And so I think part of the answer is to make the world models more and more sophisticated
Starting point is 00:11:47 and more and more accurate and not hallucinate and all of those kinds of things. So you get, you know, the errors are really minimal. Another approach is to plan not at each sort of linear time step, but actually do what's called hierarchical planning. Another thing we used to, you've done a lot of research on in the past, and I think is going to come back into vogue, where you plan at different levels of temporal abstraction. So that could also alleviate the need for your model to be super, super accurate
Starting point is 00:12:17 because you're not planning over hundreds of time steps, you're planning over only a handful of time steps, but at different levels of abstraction. How do you build a world model? Because, you know, I always thought it was going to be like, send robots out into the world and have them figure out how the world works. But one thing that surprised me is with these video generation tools. Yes.
Starting point is 00:12:34 You would think that if the AI didn't have a good world model, then nothing would really fit together when they try to figure out how the world works as they show you these videos like VO2 for instance but they actually get the physics pretty right so can you get a world model just by showing an AI video do you have to be out in the world
Starting point is 00:12:49 how is this going to work? It's interesting and actually being pretty surprising I think to the extent of how far these models can go without being out in the world right as you say so VO2 our latest video model which is actually surprisingly accurate on things like physics You know, there's this great demo that someone created of like chopping a tomato with a knife, right?
Starting point is 00:13:11 And getting the slices of the tomato just right and the fingers and all of that. And Vio is the first model that can do that. You know, if you look at other competing models, they often, the tomato sort of randomly comes back together or the fingers. Yeah, exactly, splits from the knife. So those things are, if you think that really hard, you've got to understand consistency across frames, all of these things. and it turns out that you can do that by using enough data and viewing that. I think these systems will get even better if they're supplemented by some real-world data like collected by an acting robot or even potentially in very realistic simulations
Starting point is 00:13:49 where you have avatars that act in the world too. So I think that's the next big step actually for agent-based systems is to go beyond world models, can you collect enough data where the agents are also acting in the world and making plans and achieving tasks. And I think for that, you will need not just passive observation. You will need actions, active participation. I think you just answered my next question, which is if you develop AI that can reasonably plan and reason about the world
Starting point is 00:14:20 and has a model of how the world works, it can. And it seems like that's the answer. It can be an agent that can go out and do things for you. Yes, exactly. And I think that's what will unlock robotics. Right. I think that's also what will then allow this notion of a universal assistant that can help you in your daily life across both the digital world and the real world. That's the thing we're missing.
Starting point is 00:14:42 And I think that's going to be incredibly powerful and useful tool. You can't get there then by just scaling up the current models and building hundreds of thousand or million GPU clusters like Elon's doing right now. And that's not going to be the path to AGI. Well, look, I actually think, so my view is a bit more nuanced than that. that is like that that the scaling approach is absolutely working of course that's where we've got to where we have now one can argue about are we getting diminishing returns or we what do you think about that question what my view is that we are getting substantial returns but not but it's slowing but it would have to I mean it's it's not just continuing to be
Starting point is 00:15:22 exponential but that doesn't mean the scaling's not working is absolutely working and we're still getting you know as you see Gemini 2 over Gemini 1.5 and by the way the other thing that is working with the scaling is also making efficiency gains on the smaller size models. So the cost or the size per performance is radically improving under the hood as well, which is very important for scaling, you know, the adoption of these systems. But yeah, so, you know, you've got the scaling part, and that's absolutely needed to build more, more sophisticated world models. But then I think we are missing or we need to reintroduce some ideas on the planning side,
Starting point is 00:16:00 memory side, the searching side, the reasoning to build on top of the model. The model itself is not enough to be an AGI. You need this other capability for it to act in the world and solve problems for you. And then there's still the additional question mark of the invention piece and the creativity piece, true creativity, beyond mashing together what's already known. And that's also unknown yet, if something. news required or again if existing techniques will eventually scale to that. I can see both arguments and I think from my perspective it's an empirical question. We just got to push both the scaling
Starting point is 00:16:39 and the invention part to the limit. And fortunately at Google DeepMind we have a big enough group we can invest in both those things. So Sam Altman recently said something that caught people's eye. He said we are now confident we know how to build AGI as we have traditionally understood it. It just seems by listening to what you're saying that you feel the same way. Well, it depends what we, you know, I think the way you said that was quite ambiguous, right? So in the sense of like, oh, we're building it right now and here's the ABC to do it. What I would say, and if this was meaning, I would agree with it, is that we roughly know the zones of techniques that required, what's probably missing, which bits need to be put together. But that's still an incredible amount of research, in my opinion, that needs to be done to get that all to work, even if that was the case.
Starting point is 00:17:25 And I think there's a 50% chance we are missing. some new techniques. You know, maybe we need one or two more transformer-like breakthroughs. And I think I'm genuinely uncertain about that. So that's why I say 50%. So, I mean, I wouldn't be surprised either way. If we got there with existing techniques and things we already knew, but put them together in the right way and scaled that up,
Starting point is 00:17:47 or if it turned out one or two things were missing. So let's talk about creativity for a moment. I mean, you brought it up a couple times here that the models are going to have to be creative. They're going to have to learn how to invent. If we want a call age AGR, in my opinion. Which is where everybody's trying to. ago. I was re-watching the AlphaGo documentary and the algorithms make a creative move. They do. Move 37. 37. Yes. I just had it. Okay. Thank you. That's interesting because it was a
Starting point is 00:18:15 couple years ago. The algorithms were already being creative. Yes. Why have we not really seen creativity from large language models? I mean, this is to me, I think, the greatest disappointment that people have with these tools is like they say, this is very impressive work, but it's just limited to the training set, we'll mix and match what it knows, but it can't come up with anything new. Yeah, well, look, so what, and I should probably write this up, but what I sometimes talk about in talks ever since the AlphaGo match, which is now, you know, eight plus years ago, amazingly, right?
Starting point is 00:18:42 That happened. That was probably, the reason that was such a watershed moment for AI was, first of all, there was the Everest of, you know, cracking go, right? Which was always considered to be one of the holy grails of AI. So we did that. Second thing was the way we did it, which was these learning systems that were generalizable, right eventually it became alpha zero and so on even play any two-player game and so on and then the third thing was this move 37 so only did it win 4-1 it beat lisa dole the great
Starting point is 00:19:09 lisa dole for one it also played original moves but so i have three categories of of originality or creativity the most basic kind of mundane form is just interpolation which is like averaging of what you see so if i said to a system you know come up with a new picture of a cat and it's seen a million cats and it produces some kind of average of all the ones it's seen. In theory, that's an original cat because you won't find the average in the specific examples. But it's a pretty boring, you know, it's not really very creative. I wouldn't call that creativity. That's the lowest level.
Starting point is 00:19:42 Next level is what AlphaGo exhibited, which is extrapolation. So here's all the games humans have ever played. It's played another million games on top of, you know, 10 million games on top of that. And now it comes up with a new strategy in Go that no humans, has ever seen before. That's Move 37, right? Revolutionizing Go, even though we've played it for thousands of years. So that's pretty incredible, and that could be very useful in science, and that's why I got very excited about that and started doing things like AlphaFol, because clearly extrapolation
Starting point is 00:20:08 beyond what we already know, what's in the training set, could be extremely useful. So that's already very valuable, and I think truly creative. But there's one level above that that humans can do, which is invent Go. Can you invent me a game if I, you know, if I specify it to an abstract level, you know, takes five minutes to learn the rules, but a lifetime to many lifetimes to master. It's beautiful aesthetically, encompasses some sort of mystical part of the universe in it, that it's beautiful to look at, but you can play a game in a human afternoon in two hours, right? That would be a high-level specification of Go. And then somehow the system's got to come up with a game that's as elegant
Starting point is 00:20:49 and as beautiful and perfect as Go. Now, we can't do that. Now, the question is why, is it that we don't know how to specify that type of goal to our systems at the moment. What's the objective function? It's very amorphous. It's very abstract. So I'm not sure if it's just we need high and level more abstracted layers in our systems, building more and more abstract models, so we can talk to it in this way, give it those kind of amorphous goals. Or is there a missing capability, actually, about that we still have, human intelligence has that are still missing from our systems. And again, I'm unsure about that, which way that is. I can see arguments both ways and we'll try both. But I think the thing that people are upset or not upset,
Starting point is 00:21:33 but people are disappointed by is they don't even see a move 37 in today's LLMs. What's going on there? Okay, so well, that's because I don't think we have, so if you look at AlphaGo, and I'll give you an example of there, which maps to today's LLMs. So you can run AlphaGo and Alpha a zero-hour chess program, general two-player program, without the search and the reasoning part on top. You can just run it with the model. So what you say is to the model, come up with the first go move you can think of in this position that's the most pattern matched, most likely good move.
Starting point is 00:22:04 And it can do that. And it'll play reasonable game. But it will only be around master level, or possibly grand master level. It won't be world champion level. And it certainly won't come up with original moves. For that, I think you need the search component to get you beyond where the model knows about, which is mostly summarizing existing knowledge, to some new part of the tree of knowledge, right? So you can use the search to get beyond what the model currently understands. And that's where I think you can get new ideas like, you know, Move 37.
Starting point is 00:22:39 What's it searching the web? No. So, well, it depends on what the domain is. searching that knowledge tree. So, obviously, in Go, it was searching Go moves beyond what the model knew. I think for language models, it will be searching the world model
Starting point is 00:22:55 for new parts, configurations in the world that are useful. So, of course, that's so much more complicated, which is why we haven't seen it yet. But I think the agent-based systems that are coming will be capable of Move 37 type of things. So are we setting too high of a bar for AI? Because I'm curious if you've learned anything
Starting point is 00:23:13 about humanity doing this work. it seems like we almost give too much of a premium on humanity or individual people's ingenuity where like a lot of us, we're kind of taking stuff, we spit it out, like our society really works in memes, like we have a cultural thing and it gets translated. So what have you learned about like the nature of humans from doing the work with the AIs? Well, look, I think humans are incredible and especially the best humans in the best domains. I love watching any sports person or talented musician or games player at the top of their game,
Starting point is 00:23:45 the absolute pinnacle of human performance is always incredible no matter what it is. So I think as a species, we're amazing. Individually, we're also kind of amazing what everyone can do with their brains so generally, right? Deal with new technologies. I mean, I'm always fascinated by how we just adapt to these things sort of almost effortlessly
Starting point is 00:24:03 as a society and as individuals. So that speaks to the power and the generality of our minds. Now, the reason I have set the bar like that, and I don't think it's a question of, like, can we get economic worth out of these systems? I think that's already coming very soon. But that's not what, AGI shouldn't be, I think we should treat AGI with scientific integrity, not just move goalpost for commercial reasons or whatever it is, hype and so on. And there, the definition of that was always having a system that was, you know, if we think about it theoretically, that was capable of being as powerful as a Turing machine. So Alan Turing, one of my all-time scientific heroes, you know, he described a Turing machine which underpins all modern computing, right, as a system that can simulate any other compute anything that's computable. So we know we have the theory there that if an AI system is Turing Powerful, it's called, if it can simulate a Turing Machine, then it's able to calculate anything in theory that is computable.
Starting point is 00:25:02 And the human brain is probably some sort of Turing machine, at least that's what I believe. And so in order for our to know, and I think that's what AGI is, is a system that's truly general and in theory could be applied to anything. And the only way we'll know that is if it exhibits all the cognitive capabilities that humans have, assuming that the human mind is a type of chewing machine, or is at least as powerful as a chewing machine. So that's my, always been my sort of bar. It seems like people are trying to re-badge things as that has been. being what's called ASI, artificial superintelligence. But I think that's beyond that.
Starting point is 00:25:40 That's after you have that system and then it starts going beyond in certain domains what humans are capable of potentially inventing themselves. Okay. So when I see everybody making the same joke on the same topic on Twitter, and I say, oh, that's just us being LLMs. I think I'm selling humanity a little short. Well, yes, I guess so. I guess so.
Starting point is 00:26:00 Okay. I want to ask you about deceptiveness. I mean, one of the most interesting things I saw at the end of last year was that these AI bots are starting to try to fool their evaluators, and they don't want their initial training rules to be thrown out the window, so they'll, like, take an action that's against their values in order to be able to remain the way that they were built. That's just incredible stuff to me. I mean, I know it's scary to researchers, but it blows my mind that it's able to do this. Are you seeing similar things in the stuff that you're testing
Starting point is 00:26:31 within DeepMind, and what are we supposed to think about all this? Yeah, we are. And I'm very worried about, I think deception specifically is one of the, one of those core traits you really don't want in a system. The reason that's like a kind of fundamental trait you don't want is that if a system is capable of doing that, it invalidates all the other tests that you might think you're doing, including safety ones. It's in testing and it's like, right? It's playing a five year goal, five years. Yeah, it's playing some meta game, right? And then, and that's incredibly dangerous. if you think about, then it invalidates all of the results of your other tests that you might, you know, safety tests and other things you might be doing with it. So I think there's a handful of capabilities like deception, which are fundamental and you don't want, and you want to test early for.
Starting point is 00:27:17 And I've been encouraging the safety institutes and evaluation benchmark builders, including, and also obviously all the internal work we're doing, to look at deception as a kind of class A thing that we need to prevent and monitor. as important as tracking the performance and intelligence of the systems. The answer to this as well, and one way to, there's many answers to the safety question of, and a lot of research, more research needs to be done in this very rapidly, is things like secure sandboxes. So we're building those two.
Starting point is 00:27:48 We're world class here at security, at Google, and at DeepMind. And also we are world class at games environments. And we can combine those two things together to kind of create digital sandboxes with guardrails around them, sort of the kind of guard wells you'd have for cyber security, but internal as well as blocking external actors, and then test these agent systems in those kind of secure sandboxes. That would probably be a good, advisable next step for things like deception. What sort of deception have you seen?
Starting point is 00:28:18 Because I just read a paper from Anthropic where they gave it a sketch pad. And it's like, oh, I better not tell them this. And you see it like give a result after thinking it through. So what type of deception I've seen from the box? Well, look, we've seen similar types of things where it's trying to resist sort of revealing some of its training or, you know, I think there was an example recently of one of the chatbots being told to play against Stockfish
Starting point is 00:28:46 and it just sort of hacks its way around playing Stockfish at all at chess because it knew it would lose. Wait, you know, but... You had an AI that knew it was going to lose a game and decided to hack its way around. I think we're anthropomorphizing these things quite a lot at the moment because I feel like these systems are still pretty basic. I would get too alarmed about them right now.
Starting point is 00:29:04 But I think it shows the type of issue we're going to have to deal with maybe in two, three years' time, when these agent systems become quite powerful and quite general. And that's exactly what AI safety experts are worrying about, right? Where systems where, you know, there's unintentional effects of the system. You don't want the system to be deceptive. You want it to do exactly what you're telling it to and report that back reliably. But for whatever reason,
Starting point is 00:29:29 It's interpreted the goal it's been given in a way where it causes it to do these undesirable behaviors. I know I'm having a weird reaction to this, but on one hand, this scares the living daylights out of me. On the other hand, it makes me respect these models more than anything. Sure. Well, look, of course, you know, these are, it's impressive capabilities. And the, you know, the negatives are things like deception, but the positives would be things like inventing, you know, new materials, accelerating science. You need that kind of ability to problem solve and get around, you know, issues that are blocking progress. But, of course, you want that only in the positive direction, right?
Starting point is 00:30:10 So those exactly the kinds of capabilities. I mean, they are, you know, it's kind of mind-blowing. We're talking about those possibilities. But also, at the same time, there's risk and it's scary. So I think both the things are true. Wild. Yeah. All right.
Starting point is 00:30:24 Let's talk about product quickly. Sure. One of the things that your colleagues have told me about you is you're very good at scenario planning what's going to happen in the future. It's sort of an exercise that happens within deep mind. What do you think is going to happen with the web? Because obviously the web is so important to Google. I had an editor that told me he was like, oh, you're going to speak with Demis. Ask him what happens when we stop clicking.
Starting point is 00:30:43 We're clicking through the web at all times. There are a rich corpus of websites that we use. If we're all just dialoguing with AI, then maybe we don't click anymore. So what is your scenario plan for what happens to the web? Look, I think there's, it's going to be, there's going to be a very interesting phase in the next few years on the web and the way we interact with websites and apps and so on. You know, if everything becomes more agent-based, then I think we're going to want our assistants and our agents to do a lot of the work and a lot of the mundane work that we currently do, right? You know, fill in forms, make payments, you know, book tables, this kind of thing. So, you know, I think that we're going to end up with probably a kind of economics model where agents talk to other agents and negotiate things between themselves and then give you back the results, right?
Starting point is 00:31:32 And you'll have the service providers with agents as well that are offering services and maybe there's some bidding and cost and things like that involved and efficiency. And then I hope from the user perspective, you know, you have this assistant that's super. capable that you can, just like a brilliant, a human assistant, personal assistant, and can take care of a lot of the mundane things for you. And I think if you follow that through, that does imply a lot of changes to the structure of the web and the way we currently use it. A lot of middlemen. Yeah, sure. But there will be many other, I think there will be incredible other opportunities that will appear economic and otherwise based on this change. But I think it's going to be a big disruption.
Starting point is 00:32:16 And what about information? Well, I mean, finding information, I think you'll still need the reliable sources. I think you'll have assistance that are able to synthesize and help you kind of understand that information. I think education is going to be revolutionized by AI. So, again, I hope that these assistants will be able to more efficiently gather information for you. and perhaps, you know, what I dream of is, again, assistance that take care of a lot of the mundane things, perhaps replying to, you know, everyday emails and other things
Starting point is 00:32:54 so that you have, you protect your own mind and brain space from this bombardment we're getting today from social media and emails and so on and texts and so on. So it actually blocks deep work and being in flow and things like that, which I value very much. So I would quite like these assistants to take away a lot of the mundane aspects of admin that we do every day. What's your best guest as to what type of relationships
Starting point is 00:33:20 we're going to have with our AI agents or AI assistants? So there's, on one hand, you could have a dispassionate agent that's just really good at getting stuff done for you. On the other hand, it's already clear that people are falling in love with these bots. There's a New York Times article last week about someone who's fallen in love with Chad GPT, like for real falling in love.
Starting point is 00:33:36 And I had the CEO of Replica on the show a couple weeks ago, and she said that they are regularly invited to marriages of people who are marrying their replica and they're moving into this more assistive space. So do you think that when we start interacting with something that knows us so well that helps us with everything we need? Is it going to be like a third type of relationship where it's not necessarily a friend, not a lover,
Starting point is 00:33:59 but it's going to be a deep relationship, don't you think? Yeah, it's going to be really interesting. I think the way I'm modeling that, first of all, is at least two domains, first of all, which is your personal life and then your work life, right? So I think you'll have this notion of virtual workers or something, maybe we'll have a set of them or managed by a, you know, a lead assistant that does a lot of the, helps us be way more productive at work, you know, or whether that's email, across workspace or
Starting point is 00:34:27 whatever that is. So we're really thinking about that. Then there's a personal side where, you know, we're talking about earlier about all these booking holidays for you, avenging things, mundane things for you, sorting things out. And then that makes your life more efficient. I think it can also enrich your life, so recommend you things, amazing things, that it knows you as well as you know yourself. So those two, I think, are definitely going to happen. And then I think there is a philosophical discussion to be had about, is there a third space where these things start becoming so integral to your life? They become more like companions. I think that's possible too. We've seen that a little bit in gaming. So you may have seen, we had a little prototypes of astro working in
Starting point is 00:35:08 and Gemini working with like being almost a game companion commenting you, almost like as if you had a friend looking at a game you're playing and recommending things to you and advising you, but also maybe just playing along with you. And it's, it's very fun. So I haven't, you know, quite thought through all the implications of that, but they're going to be big. And I'm sure there is going to be demand for companionship and other things. Maybe the good side of that is all help with loneliness and these sorts of things. But there's also, you know, I think it's going to be, it's going to have to be really carefully thought through by society, whether, you know, what directions we want to take that in. I mean, my personal opinion is that it's the most underappreciated
Starting point is 00:35:46 part of AI right now and that people are just going to form such deep relationships with these bots as they get better because, like, I don't know, it's a meme in AI that this is the worst it's ever going to be and it's going to be crazy. Yeah, I think it's going to be pretty crazy. This is what I meant about the under under appreciating what's to come. I still don't think this kind of thing I'm talking about, right? I think that it's going to be really crazy. It's going to be very disruptive. I think there's going to be lots of positives out of it too, and lots of things will be amazing and better, but there are also risks with this new, brave, new world we're going into. So you brought up Astra a couple times. Let's just talk about it's Project Astra, as you call it.
Starting point is 00:36:24 It is almost an always-on AI assistant. You can, like, hold your phone. It's currently just a prototype, or not publicly released, but you can hold your phone and it will see what's going on in the room. So if I could basically, I've seen you do this on your show, or not you personally, but somebody on your team, you can say, okay, where am I? And it'll be like, oh, you're in a podcast studio, anything? Okay, so it could have this contextual awareness. Yes. Can that work without smart glasses? Because it's really annoying to hold my phone up. Yes. So like when are, when are we going to see Google smart glasses with this technology embedded? They're coming. So we teased it in some of our early prototype. So that we're mostly
Starting point is 00:36:57 prototyping on phones currently because they have more processing power. But we're, of course, Google's always been a leader in glasses. Google Glass. Yeah, and exactly. Just a little too early. Yeah, maybe a little too early. And now I actually think, and with their super excited that team is that, you know, maybe this assistant is the killer use case that glasses has always been looking for. And I think it's quite obvious when you, when you start using Astra in your daily life, which we have in with trusted testers at the moment and in kind of beta form, there are many use cases where it would be so useful to use it, but it's a bit, it's inconvenient that you're holding the phone. So one example is while you're cooking, for
Starting point is 00:37:33 example, right? And it can advise you what to do next, the menu, you know, how to, whether you've chopped the thing correctly or fried the thing correctly. But you want it to just be hands-free, right? So I think that glasses and maybe other form factors that are hands-free will come into their own in the next few years. And we, you know, we plan to be at the forefront of that. Other form factors? Well, you could imagine earbards with cameras and, you know, glasses is an obvious next stage. But is that the optimal form?
Starting point is 00:38:04 Probably not either. But partly we've also got to see, we're still very early in this journey of seeing what are the regular user journeys and killer sort of use journeys that everyone uses bread and butter uses every day. And that's what the trust of tester program is for at the moment. We're kind of collecting that information and observing people using it and seeing what ends up being useful. Okay. One last question on agents. Then we move to science. Agentic agents, AI agents, this has been the buzzword in AI for more than a year now. Yeah. There aren't really any AI agents out there. No. What's going on? Yeah. Well,
Starting point is 00:38:41 Again, you know, I think the hype train can potentially is ahead of where the actual science and research is. But I do believe that this year will be the year of agents, the beginnings of it. I think you'll start seeing that, you know, maybe second half of this year. But there'll be the early versions. And then, you know, I think they'll rapidly improve and mature. So, but I think you're right. I think the technology, at the moment, it's still in the research lab, the agent technology. But things like Astra, robotics, I think it's coming.
Starting point is 00:39:16 Do you think people are going to trust them? It's like go use the internet for me. Here's my credit card. I don't know. Well, so I think to begin with, you would probably, my view at least, would be to not allow, have human in the loop for the final steps. Like don't pay for anything. Use your credit card unless the human user operator authorizes it.
Starting point is 00:39:35 So that would, to me, be a sensible first step. Also, perhaps certain types of activities or websites. or whatever kind of off-limits, you know, banking websites and other things in the first phase, while we continue to test out in the world how robust these systems are. I propose we've really reached AGI when they say, don't worry, I won't spend your money, and then they do the deceptiveness thing, and then next thing you know, you're on a flight somewhere. Yes, yeah, that would be getting closer, for sure, for sure, yeah. All right, science, so you worked on basically decoding all protein folding with Alpha Fold.
Starting point is 00:40:10 to you won the Nobel Prize for that. Not to skip over the thing that you won the Nobel Prize for, but I want to talk about what's on the roadmap, which is that you have an interest in mapping of virtual cell. Yes. What is that? And what does it get us? Yeah.
Starting point is 00:40:24 Well, so if you think about what we did with AlphaFold was essentially solved the problem of finding the structure of a protein. Proteins, everything in life depends on proteins, right? Everything in your body. So that's the kind of static picture of a protein. But the thing about biology is, really, you only understand what's going on in biology if you understand the dynamics and the interactions between the different things in a cell. And so a virtual cell project is about building a simulation, an AI simulation, of a full working cell. I'd probably start with something like a yeast cell because of the simplicity of the yeast organism.
Starting point is 00:41:01 And you have to build up there. So the next step is with AlphaFold 3, for example, we started doing pair-wise interactions between proteins. and ligands and proteins and DNA, proteins and RNA. And then the next step would be modeling a whole pathway, maybe a cancer pathway or something like that that would be helpful for solving a disease. And then finally, a whole cell. And the reason that's important is
Starting point is 00:41:24 you would be able to make hypotheses and test those hypotheses about making some change, some nutrient change or injecting a drug into the cell and then seeing what happens to how the cell responds. And at the moment, of course, you have to do that painstakingly in a wet lab. But imagine if you could do it a thousand, a million times faster in silico first, and only at the last step do you do a validation in the wet lab.
Starting point is 00:41:50 So instead of doing the search in the wet lab, which is millions of times more expensive and time consuming than the validation step, you just do the search part in silico. So it's sort of translating, again, what we did in the games environments, but here in the sciences and the biology. So you build a model and then you use that to do the reasoning and the search over. And then the predictions are, you know, at least better than maybe they're not perfect, but they're useful enough to be useful for experimentalists to validate against.
Starting point is 00:42:23 And the wet lab is within people. Yeah. So the wet lab, you'd still need a final step with the wet lab to prove what the predictions were actually valid. So, you know, but you wouldn't have to do all of the work to get to the. that prediction in the wet lap. So you just get, here's the prediction. If you put this chemical in, this should be the change, right? And then you just do that one experiment.
Starting point is 00:42:46 So, and then after that, of course, you still have to have clinical trials. If you're talking about a drug, you would still need to test that properly through the clinical trials and so on and test it on humans for efficacy and so on. That I also think could be improved with AI, that whole clinical trial. That also takes many, many years. But that would be a different technology from the virtual cell. The virtual cell would be helping the discovery phase. for drug discovery.
Starting point is 00:43:08 Just like I have an idea for a drug, throw it in the virtual cell. See what it does. Yeah. And maybe eventually it's a liver cell or a brain cell or something like that. So you have different cell models. And then, you know, at least 90% of the time, it's giving you back what would really happen. That would be incredible. How long do you think that's going to take to figure out?
Starting point is 00:43:25 I think that would be like maybe five years from now. Okay. Yeah. So I have a kind of five-year project. And a lot of the alpha-fold, the old alpha-fold team are working on that. Yeah, I was asking your team here. Yeah, John John. I'm speaking with him.
Starting point is 00:43:38 I was like, you figured out protein folding what's next. This is like, it's just very cool to hear about these new challenges because, yeah, developing drugs is a mess right now. We have so many promising ideas
Starting point is 00:43:50 they never get out the door because just the process is absurd. It's process too slow and discovery phase too slow. I mean, look how long we'd be working on Alzheimer's and, I mean, it's tragic way for someone to go and for the families. And, you know, we should be a lot further.
Starting point is 00:44:05 It's 40 years of work on that. Yeah, I've seen it a couple times in my family, and if we can ensure that doesn't happen, it's just... One of the best things we could use AI for, in my opinion. Yeah. Yeah, it's a terrible way to see somebody decline. Yeah. So, yeah. It's important work.
Starting point is 00:44:22 On addition to that, there's the genome. Yes. And so the human genome project sort of... Yes. I was like, okay, so they decoded the whole genome. There's no more work to do there. Like, just same way that you decoded proteins with fold. But it turns out that actually we just have like a bunch of letters.
Starting point is 00:44:36 when it's decoded. And so now you're working to use AI to translate what those letters mean? Yes. So, yeah, we have lots of cool work on genomics and trying to figure out if mutations are going to be harmful or benign, right? Most mutations to your DNA are harmless, but of course some are pathogenic. And you want to know which ones there are. So our first systems are the best in the world at predicting that. and then the next step is to look at situations where the disease isn't caused just by one genetic mutation,
Starting point is 00:45:15 but maybe a series of them in concert. And obviously, that's a lot harder. Like, and a lot of more complex diseases that we haven't made progress with are probably not due to a single mutation. Right, that's more like rare childhood diseases, things like that. So there, you know, we need to, I think AI is the perfect tool to sort of, try and figure out what these weak interactions are like, right? How they may be kind of compound on top of each other. And so maybe the statistics are not very obvious,
Starting point is 00:45:49 but an AI system that's able to kind of spot patterns would be able to figure out there is some connection here. And so we talk about this a lot in terms of disease, but also I wonder what happens in terms of making people superhuman. I mean, if you're really able to tinker with the genetic code, right, The possibilities seem endless. So what do you think about that? Is that something that we're going to be able to do through AI?
Starting point is 00:46:09 I think one day. I mean, we're focusing much more on the disease profile and fixing what goes well. Yeah, that's the first step. And I've always felt that that's the most important. If you ask me, what's the number one thing I wanted to use AI for? And the most important thing we could use AI for is for helping human health. But then, of course, beyond that, one could imagine aging, things like that. Of course, there's a whole field in itself.
Starting point is 00:46:31 Is aging a disease? Is it a combination of diseases? can we extend our healthy lifespan? These are all important questions. I think very interesting, and I'm pretty sure AI will be extremely useful in helping us find answers to those questions too. I see memes come across my Twitter feed
Starting point is 00:46:47 and maybe I need to change the stuff I'm recommended, but it's often like if you will live to 2050, you're not going to die. What do you think the potential max lifespan is for a person? Well, look, I know a lot of those folks in aging research very well. I think it's very interesting in the pioneering work they do, I think there's nothing good about getting old and your body decaying.
Starting point is 00:47:08 I think it's, you know, if anyone who's seen that up close with their relatives, it's a pretty hard thing to go through, right, as a family, or all the person, of course. And so I think anything we can alleviate human suffering and extend healthy lifespan is a good thing. You know, the natural limit seems to be about 120 years old. But from what we know, you know, if you look at the oldest people that are lucky enough to to live to that age. So there's, you know, it's, it's an area I follow quite closely. I don't have any, I guess, new insights that are not already known in that. But I do, I would be surprised if there, if that's, if that's the limit, right? Because there's a sort of two steps to this.
Starting point is 00:47:52 One is curing all diseases one day, which I think we're going to do with isomorphic and the work we're doing there, our spin-out, our drug discovery spin-out. But then that's not enough to probably get you past 120 because there's some sort of, then there's the question of just natural systemic decay, right, aging, in other words. So not specific disease, right? Often those people that live to 120, they don't seem to die from a specific disease. It's just sort of just general atrophy. So then you're going to need something more like rejuvenation, where you, you rejuvenate your cells or you, you know, maybe stem cell research, you know, companies like Altos are working on these things, resetting the cell clocks.
Starting point is 00:48:34 Seems like that could be possible. But again, I feel like it's so complex because biology is such a complicated emergent system. In my view, you need AI to be able to crack anything close to that. Very quickly on material science. I don't want to leave here without talking about the fact that you've discovered many new materials or potential materials. The stat I have here is known to humanity. recently were 30,000 stable materials.
Starting point is 00:49:01 You've discovered 2.2 million with a new AI program. Just dream a little bit. Because we don't know what all those materials can do. We don't know whether they'll be able to handle being out of like a frozen box or whatever. Dream materials for you to find in that set of new materials. Well, I mean, we're working really hard on materials. To me, it's like the next one of the,
Starting point is 00:49:26 the next sort of big impacts we can have, like the level of alpha fold, really, in biology, but this time in chemistry and materials. You know, I dream of one day discovering a room temperature superconductor. So what will that do? Because that's another big meme that people talk about. Yeah, well, then it would help with the energy crisis and climate crisis, because if you had sort of cheap superconductors, you know, then you can transport energy from one place to another without any loss of that energy, right?
Starting point is 00:49:53 And so you could potentially put solar panels in the Sahara Desert and then just have the superconductor, you know, funneling that into Europe where it's needed. At the moment, you would just lose a ton of the power to heat and other things on the way. So then you need other technologies like batteries and other things to store that because you can't just pipe it to the place that you want without being incredibly inefficient. So but also materials could help with things like batteries too, like but come up with the optimal battery. I don't think we have the optimal battery designs that maybe we can do things like a combination of materials and proteins we can do things like carbon capture,
Starting point is 00:50:31 you know, modify algae or other things to do carbon capture better than our artificial systems. I mean, even one of the most famous and most important chemical processes, the harbour process to make fertilizer and ammonia, you know, to take nitrogen out of the air, was something that allows modern
Starting point is 00:50:51 civilization, but there might be many other chemical processes that could be catalyzed in that way if we knew what the right catalyst and the right material was. So I think it's going to be would be one of the most impactful technologies ever is to basically have in silico design of materials. So we've done step one of that where we showed we can come up with new stable materials, but we need a way of testing the properties of those materials. Because no lab can test 200,000, you know, tens of thousands of materials or millions of materials at the moment. So we have to, that's the hard part, is to do the testing. You think it's in there, the room temperature superconductor?
Starting point is 00:51:31 Well, I heard that we actually think there are some superconductor materials. I doubt they're room temperature ones though, but I think at some point, if it's possible with physics, an AI system will one day find it. So that's one use. The two other uses I could imagine, probably people interested in this type of work, toy manufacturers and militaries. Yeah. Are they working with it?
Starting point is 00:51:54 Yeah, toy manufacturers. I mean, look, I think there is incredible, I mean, the big part of my early career was in game design. And, yeah, theme park and simulations. That's what got me into simulations and AI in the first place. And I've always loved both of those things. And in many respects of the work I do today is just an extension of that. And I just dream about, like, what could I have done,
Starting point is 00:52:15 what kinds of amazing game experiences could have been made? if I'd had the AI I have today available 25, 30 years ago when I was writing those games. And I'm a little bit surprised the game industry hasn't done that. I don't know why that is. We're starting to see some crazy stuff with NPCs that like are starting to. But of course, there'd be like intelligence, you know, dynamic storylines. But also just new types of AI first games with learning, with characters and agents that can learn. And, you know, I once worked on a game called Black and White where you had a,
Starting point is 00:52:47 creature that you were nurturing was a bit like a pet dog that that that learnt what you wanted right and but we were we were using very basic reinforcement learning this was like in the late 90s you know imagine what could be done today and I think the same for maybe smart toys as well right and then of course on the militaries you know uh unfortunately AI is a dual dual purpose technology so one has to confront the reality that um especially in today's geopolitical world uh people are using some of these general purpose technologies to apply to drones and other things. And it's not surprising that that works. Are you impressed with what China's up to?
Starting point is 00:53:23 I mean, Deep Seek is this new model getting? It's a little bit unclear how much they relied on Western systems to do that. You know, both training data, there's some rumors about that and also maybe using some of the open source models as a starting point. But look, for sure, it's impressive what they've been able to do. And, you know, I think that's something with. going to have to think about how to keep the Western Frontier models in the lead. I think they still are at the moment.
Starting point is 00:53:50 But, you know, for sure, China is very, very capable of engineering and scaling. Let me ask you one final question. Just give us your vision of what a world looks like when there's super intelligence. Let's move past. We started with AGI. Let's have on super intelligence. Yeah. Well, look, I think for there, two things there.
Starting point is 00:54:07 One is, I think a lot of the best sci-fi can we can look at as interesting models to debate about what kind of galaxy or universe do we want a world, do we want to move towards. And the one I've always liked most is actually the culture series by Ian Banks. I started reading that back in the 90s and I think that is a picture. It's like a thousand years into the future, but it's in a post-AGI world where there are AGI systems coexisting with human society and also alien society. and we've, humanities basically maximally flourished
Starting point is 00:54:44 and spread to the galaxy. And that I think is a great vision of how the things might go in the positive case. So I sort of hold that up. I think the other thing we're going to need to do is as I mentioned earlier about the under appreciating still
Starting point is 00:55:04 what's going to come in the longer term. I think there is a need for some great philosophers to, you know, where are they, the great next philosophers, the equivalence of Kant or Wittgenstein or even Aristotle, I think we're going to need that to help navigate society to that next step because I think the, you know, AGI and artificial superintelligence is going to change humanity and the human condition. Demis, thank you so much for doing this. Great to see you in person and hope to do it again soon.
Starting point is 00:55:34 Thank you. Thank you very much. All right, everybody, thank you for listening and we'll see you next time on Big Technology Podcast. Thank you.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.