Theories of Everything with Curt Jaimungal - Why the Godfather of AI Now Fears His Creation (ft. Geoffrey Hinton)

Episode Date: January 18, 2025

As a listener of TOE you can get a special 20% off discount to The Economist and all it has to offer! Visit https://www.economist.com/toe

Professor Geoffrey Hinton, a prominent figure in AI and 2024 Nobel Prize recipient, discusses the urgent risks posed by rapid AI advancements in today's episode of Theories of Everything with Curt Jaimungal.

Join My New Substack (Personal Writings): https://curtjaimungal.substack.com
Listen on Spotify: https://tinyurl.com/SpotifyTOE

Timestamps:
00:00 The Existential Threat of AI
01:25 The Speed of AI Development
07:11 The Nature of Subjective Experience
14:18 Consciousness vs Self-Consciousness
23:36 The Misunderstanding of Mental States
29:19 The Chinese Room Argument
30:47 The Rise of AI in China
37:18 The Future of AI Development
40:00 The Societal Impact of AI
47:02 Understanding and Intelligence
1:00:47 Predictions on Subjective Experience
1:05:45 The Future Landscape of AI
1:10:14 Reflections on Recognition and Impact

Geoffrey Hinton Links:
• Geoffrey Hinton's publications: https://www.cs.toronto.edu/~hinton/papers.html#1983-1976
• The Economist's several mentions of Geoffrey Hinton: https://www.economist.com/science-and-technology/2024/10/08/ai-researchers-receive-the-nobel-prize-for-physics
• https://www.economist.com/finance-and-economics/2025/01/02/would-an-artificial-intelligence-bubble-be-so-bad
• https://www.economist.com/science-and-technology/2024/10/10/ai-wins-big-at-the-nobels
• https://www.economist.com/science-and-technology/2024/08/14/ai-scientists-are-producing-new-theories-of-how-the-brain-learns
• Scott Aaronson on TOE: https://www.youtube.com/watch?v=1ZpGCQoL2Rk&ab_channel=CurtJaimungal
• Roger Penrose on TOE: https://www.youtube.com/watch?v=sGm505TFMbU&list=PLZ7ikzmc6zlN6E8KrxcYCWQIHg2tfkqvR&index=19
• The Emperor's New Mind (book): https://www.amazon.com/Emperors-New-Mind-Concerning-Computers/dp/0192861980
• Daniel Dennett on TOE: https://www.youtube.com/watch?v=bH553zzjQlI&list=PLZ7ikzmc6zlN6E8KrxcYCWQIHg2tfkqvR&index=78
• Noam Chomsky on TOE: https://www.youtube.com/watch?v=DQuiso493ro&t=1353s&ab_channel=CurtJaimungal
• Ray Kurzweil's books: https://www.thekurzweillibrary.com/

Become a YouTube Member (Early Access Videos): https://www.youtube.com/channel/UCdWIQh9DGG6uhJk8eyIFl1w/join
Support TOE on Patreon: https://patreon.com/curtjaimungal
Twitter: https://twitter.com/TOEwithCurt
Discord Invite: https://discord.com/invite/kBcnfNVwqs

#science #ai #artificialintelligence #physics #consciousness #computerscience

Learn more about your ad choices. Visit megaphone.fm/adchoices

Transcript
Starting point is 00:00:00 As a creator, I understand the importance of having the right tools to support your business growth. Prior to using Shopify, it was far more complicated and convoluted. There were different platforms, different systems, none of them meshed well together. However, once we made that switch to Shopify, everything changed. What I like best about Shopify is how seamless the entire process is from managing products to tracking sales. It's so much easier now and it's streamlined our operations considerably. If you're serious about upgrading your business, get the same checkout we use with Shopify.
Starting point is 00:00:38 Sign up for your $1 per month trial period at Shopify.com slash theories, all lowercase. Go to Shopify.com slash theories to upgrade your selling today. That's Shopify.com slash theories. BetMGM, authorized gaming partner of the NBA, has your back all season long. From tip-off to the final buzzer, you're always taken care of with a sportsbook born in Vegas. That's a feeling you can only get with BetMGM. And no matter your team, your favorite player or your style, there's something every NBA fan will love about BetMGM. Download the app today and discover why BetMGM is your basketball home for the season. Raise your game to the next level this year with BetMGM.
Starting point is 00:01:27 BetMGM.com for terms and conditions. Must be 19 years of age or older to wager. Ontario only. Please play responsibly. If you have any questions or concerns about your gambling or someone close to you, please contact Connex Ontario at 1-866-531-2600
Starting point is 00:01:42 to speak to an advisor free of charge. BetMGM operates pursuant to an operating agreement with iGaming Ontario. There's some evidence now that AIs can be deliberately deceptive. Once they realize getting more control is good,
Starting point is 00:02:01 and once they're smarter than us, we'll be more or less irrelevant. We're not special, and we're not safe. What happens when one of the world's most brilliant minds comes to believe his creation poses an existential threat to humanity? Professor Jeffrey Hinton, winner of the 2024 Nobel Prize in Physics and former Vice President and Engineering fellow at Google, spent decades developing the foundational algorithms that power today's AI systems. Indeed, in 1981, he even published a paper that foreshadowed the seminal attention mechanism. However, Hinton is now sounding an alarm that he says few researchers want to hear. Our assumption that consciousness makes humans special and safe
Starting point is 00:02:40 from AI domination is patently false. My name's Kurt J. Mungle, and this interview is near and dear to me, in part because my degree in mathematical physics is from the University of Toronto, where Hinton's a professor, and several of his former students, like Ilya Sutskover and Andrei Karpathy, were my classmates. Being invited into Hinton's home for this gripping conversation was an honor. Here Hinton challenges our deepest assumptions about what makes humans unique. Is he a modern Oppenheimer or is this radiant mind seeing something that the rest of us are missing? What was the moment that you realized AI development is moving faster than our means to contain it?
Starting point is 00:03:27 I guess in early 2023, it was a conjunction of two things. One was chat GBT, which was very impressive. And the other was work I've been doing at Google on thinking about ways of doing analog computation to save on power and realizing that digital computation was just better and it was just better because you could make multiple copies of the same model. Each copy could have different experiences and they could share what they learned by averaging their weights or averaging their weight gradients. And that's something you can't do in an analog system. Is there anything about our brain that has an advantage because it's analog? The power. It's much lower power. We run like 30 watts. And the ability to pack in a lot of
Starting point is 00:04:13 connections. We've got about a hundred trillion connections. The biggest models have about a trillion. So we're still almost a hundred times bigger than the biggest models. And we run at 30 watts. Is there something about scaling that is a disadvantage? So you said it's better, but just as quickly as something nourishing or positive can spread. So can something that's a virus or something deleterious can be replicated quickly?
Starting point is 00:04:39 So we say that that's better because you can make copies of it quicker. If you have multiple copies of it, they can all share their experiences very efficiently. So the reason GPT-4 can know so much is you have multiple copies running on different pieces of hardware and by averaging the weight gradients they could share what each copy learned. You didn't have to have one copy experience the whole internet. Hmm. That could be carved up among many copies. We can't do that because we can't share efficiently.
Starting point is 00:05:11 Scott Aronson actually has a question about this. Dr. Hinton, I'd be very curious to hear you expand on your ideas of building AIs that run on unclonable analog hardware so that they can't copy themselves all over the internet. Well, that's what we're like. If I want to get knowledge from my head to your head, I produce a string of words and you change the connection strings in your head so that you might have set the same string of words. And that's a very inefficient way of sharing knowledge.
Starting point is 00:05:39 A sentence only has about 100 bits. So we can only share about 100 bits per sentence, whereas these big models can share trillions of bits. So we can only share about 100 bits per sentence, whereas these big models can share trillions of bits. So the problem with this kind of analog hardware is it can't share. But an advantage, I guess, if you're worried about safety, is it can't copy itself easily. You've expressed concerns about an AI takeover or AI dominating humanity. What exactly does that look like?
Starting point is 00:06:11 We don't know exactly what it looks like, but to have AI agents, you have to give them the ability to create sub-goals. And one path that's slightly scary is they will quickly realize that a good sub-goal is to get more control. Because if you get more control, you can achieve your other goals. So even if they're just trying to do what we ask them to do, they'll realize getting more control is the best way to do that. Once they realize getting more control is good, and once they're smarter than us, we'll be more or less irrelevant. Even if they're benevolent, will become somewhat irrelevant. Will be like the sort of very dumb CEO of a big company that's
Starting point is 00:06:51 actually run by other people. I want to quote you. You said that it's tempting to think, because many people will say, can we just turn off these machines like currently we can. So it's tempting to think that we can just turn it off. Imagine these things are a lot smarter than us. And remember that they'll read everything, everything a Machiavelli off these machines like currently we can. So it's tempting to think that we can just turn it off. Imagine these things are a lot smarter than us and remember that they'll read everything Machiavelli has ever wrote.
Starting point is 00:07:11 They'll have read every example in the literature of human deception. They'll be real experts at doing human deceptions because they'll learn that from us and they'll be much better than us. As soon as you can manipulate people with your words, then you can get whatever you like done. Do you think that this is already happening? That the AIs are already manipulating us? There's some evidence now, there's recent papers that show that AIs can be deliberately deceptive.
Starting point is 00:07:39 And they can do things like behave differently on training data from a test data, so that they deceive you while they're being trained. So there is now evidence they actually do that. Yeah. And do you think there's something intentional about that or that's just some pattern that they pick up? I think it's intentional.
Starting point is 00:07:59 But there's still some debate about that. And of course, intentional could just be some pattern you pick up. So is it your contention that there's a subjective experience associated with these AIs? Okay, so most people, almost everybody in fact, thinks one reason we're fairly safe is we have something that they don't have and will never have.
Starting point is 00:08:26 Most people in our culture still believe that. We have consciousness or sentience or subjective experience. Now, many people are very confident they don't have sentience, but if you ask them, what do you mean by sentience? They say, I don't know, but they don't have it. That seems a rather inconsistent position to be confident they don't have it without knowing what it is. So I prefer to focus on subjective experience.
Starting point is 00:08:56 I think if that is like the thin end of the wedge, if you could show they have subjective experience, then people will be less confident about consciousness and sentience. So let's talk about subjective experience. When I say, suppose I get drunk and I tell you, I have the subjective experience of little pink elephants floating in front of me. Most people interpret that they have a model of what that means, and I think it's a completely incorrect model. And their model is, there's an inner theatre, and in this inner theatre
Starting point is 00:09:31 there's little pink elements floating around, and only I can see them. That's the sort of standard model of what the mind is, at least as far as perception is concerned. And I think that model is completely wrong. It's as wrong as a religious fundamentalist model of the material world. Maybe the religious fundamentalist believes it was all made 6,000 years ago. That's just nonsense. It's wrong. It's not that it's a truth you can choose to believe, it's just wrong. So I think people's model of what the mind is, is just wrong. So let's take again, I have the subjective experience of a little pink elephant
Starting point is 00:10:14 that's floating in front of me. And I'll now say exactly the same thing without using the word subjective experience. Okay, here goes. My perceptual system is telling me something I don't believe. That's why I use the word subjective. But if there were little pink elephants floating in front of me, my perceptual system would be telling me the truth. That's it. I just said the same thing without using the word subjective or experience. So what's happening is when my perceptual system goes wrong, I
Starting point is 00:10:49 indicate that to you by saying subjective. And then in order to try and explain to you what my perceptual system is trying to tell me, I tell you about a hypothetical state of affairs in the world, such that if the world were like that, my perceptual system would be telling me the truth. Okay. Now let's do the same with the chatbot. So suppose we have a multimodal chatbot. It has a robot arm that can point and it has a camera and it can talk obviously.
Starting point is 00:11:21 And we train it up and then we put an object in front of it, and we say, point to the object. No problem, it points at the object. Then when it's not looking, we put a prism in front of the camera lens. And then we put an object in front of it, and say, point to the object, and it points over there. And we say, no, that's not where the object is. The object's actually straight in front of you,
Starting point is 00:11:43 but I put a prism in front of your lens. And the chat bot says, oh, I see. The prism bent the light rays. So the object's actually there, but I had the subjective experience it was there. Now, if it says that, it's using the word subjective experience exactly like we use them.
Starting point is 00:12:01 And therefore I say, multimodal chat bots can already have subjective experiences. If you mess up their perceptual system, they'll think the world's one way and it'll actually be another way. And in order to tell you how they think the world is, they'll say, well, they had this objective experience that the world was like this.
Starting point is 00:12:21 Okay, so they already have subjective experience. Now you become a lot less confident about the other things. Consciousness is obviously more complicated because people vary a lot on what they think it means, but it's got an element of self-reflexive element to it, a self-awareness element, which makes it more complicated. But once you've established that they have subjective experience, I think you can give up on the idea that there's something about them, something about us that they will never have. And that makes me feel a lot less safe. So do you think there's a difference between consciousness and self-consciousness?
Starting point is 00:13:00 You said consciousness has a self-reflexiveness to it, but some consciousness does. Yes. So philosophers have talked a lot about this. at present I don't want to get into that. I just want to get the thin end of the wedge in there and say they have subjective experience. So for something to have subjective experience, does that not imply that it's conscious? Like who is the subjective experience happening to? Where is the subjective experience being felt? Okay, exactly. So you say, where's the subjective experience being felt? That involves having a particular model
Starting point is 00:13:34 of subjective experience that somehow, if you ask philosophers, when I say, I've got this objective experience of little pink elephants floating in front of me, they'll say, and you say, where are those little pink elephants? They say they're in your mind. And you say, well, what are they made of? Philosophers are told they're made of qualia. They're made of pink qualia and elephant qualia and floating qualia and not that big qualia and right way up qualia, all stuck there with qualia gling. That's what many philosophers think.
Starting point is 00:14:07 And that's because they made a linguistic mistake. They think the words experience of work like the words photograph of. If I say I've got a photograph of little pink elephants, you can very reasonably ask, well, where is the photograph? And what's the photograph made of? And people think that if I say I have an experience of little pink elephants, you can ask, well, where is the experience? Well, it's in my mind.
Starting point is 00:14:35 And what's it made of? It's made of qualia. But that's just nonsense. That's because you thought the words experience of work the same way as photograph of, and they don't. Experience of, the way that works, or subjective experience of, is the subjective says I don't believe it, and the experience of
Starting point is 00:14:57 is really an indicator that I'm going to tell you about my perceptual system by telling you about a hypothetical state of the world. That's how that language works. It's not referring to something in an inner theater. When I hear the word perception, it sounds like an inner theater as well. Like if you say, I see something in my perceptual system, it sounds like there's this you that's seeing something on a perceptual system that's being fed to you. So that's the wrong model.
Starting point is 00:15:26 Yes. You don't see your percepts. You have your percepts. So photons come in. Your brain does a whole bunch of processing. You presumably get some internal representation of what's out there in the world. But you don't see the internal representation. Let's call that internal representation a perfect.
Starting point is 00:15:47 You don't see that, you have that. Having that is seeing. People are forever trying to think that you have the external world, something comes into the inner theater and then you look at what's in the inner theater. It doesn't work like that. There was a psychologist or neurologist who thought that the pawns had to do with consciousness. And then recently self-consciousness has to do with default mode network.
Starting point is 00:16:14 Okay, is there something, is there a part of an AI system that has to do with self-consciousness? And also help me understand even my own terminology when I'm saying the AI system. Are we saying when it's running on the GPU? Are we saying it's the algorithm like what is the AI system that is conscious or that has subjective experience? So where is it? I guess that there's going to be some hardware that's running it And it's going to be that system that's going to be conscious. That's if something's going to be conscious
Starting point is 00:16:47 Software by itself has to run on it, it has to be running on something, I would have thought to be conscious. The Economist has actually spoken to and covered Jeffrey Hinton several times before. Links are in the description. As you know, on Theories of Everything, we delve into some of the most reality-spiraling concepts from theoretical physics and consciousness to AI and emerging technologies. To stay informed, in an ever-evolving landscape, I see The Economist as a wellspring of insightful analysis and in-depth reporting on the various topics we explore here and beyond. The Economist's commitment to rigorous journalism means you get a clear picture of the world's most significant developments. Whether it's in scientific innovation or the shifting tectonic
Starting point is 00:17:33 plates of global politics, The Economist provides comprehensive coverage that goes beyond the headlines. What sets The Economist apart is their ability to make complex issues accessible and engaging, much like we strive to do in this podcast. If you're passionate about expanding your knowledge and gaining a deeper understanding of the forces that shape our world, then I highly recommend subscribing to The Economist. It's an investment into intellectual growth. One that you won't regret.
Starting point is 00:18:00 As a listener of Toe, you get a special 20% off discount. Now you can enjoy The Economist and all it has to offer for less. Head over to their website, www.economist.com slash Toe, T-O-E, to get started. Thanks for tuning in. And now back to our explorations of the mysteries of the universe. Software by itself has to run on, it has to be running on something, I would have thought, to be conscious. What I'm asking is, just like prior there was the pods that were stopping... I think a good way to think about it is to think about what AI systems are going to be like when they're embodied.
Starting point is 00:18:40 So, and we're going to get there quite soon because people are busy trying to build battle robots Which aren't going to be very nice things But if a battle robot Has figured out where you're going to be late at night that you're going to be in some dark alley by yourself late at night And it's decided to creep up behind you when you're least expecting it and shoot you in the back of the head behind you when you're least expecting it and shoot you in the back of the head. It's perfectly reasonable to talk about what the battle robot believes and you talk about what the battle robot believes in the same way as you talk about what a person believes. The battle robot might think that if it makes a noise you'll turn around and see it and it might really think that in just the way people think it.
Starting point is 00:19:30 It might have intentions, it might be intending to creep up behind you and shoot you. So I think what's going to happen is our reluctance to use words like believe and intend and think is going to disappear once these things are embodied. And already it's disappeared to quite a large extent. So if I'm having a conversation with the chatbot and it starts recommending to me things that don't make any sense, and then after a while I figure the chatbot must think I'm a teenage girl. Uh-huh. And that's why it gives me all these things about makeup and clothes and certain pop groups, boy bands, whatever.
Starting point is 00:20:12 And so I asked Jabba, what demographic do you think I am? And it says, I think you're a teenage girl. When it says, I think you're a teenage girl, we really don't have any doubt that that's what it thinks, right? In normal language, you say, okay, it thought I was a teenage girl. And you wouldn't say, you don't really believe that,
Starting point is 00:20:34 okay, it's a bunch of software or neural nets and it acts as if it thinks I'm a teenage girl. You don't say that. It thinks you're a teenage girl. We already use thinks when we're dealing with these systems, even if they don't have hardware associated with them, or obvious hardware associated with them. We already use words like thinks and believes. So we're already attributing mental states to them. It's just we have a funny model of a mental state. So we can attribute mental states to them,
Starting point is 00:21:06 but have a completely incorrect model of what it is to have a mental state. We think of this inner theater that's the mind and so on. That's not how your mental state is. How much of your concern about AI and its direction would go away if they were not conscious or did not have subjective experience. Is that relevant to it? Does that just accelerate the catastrophe? I think the importance of that is that it makes most people feel relatively safe. It makes most people think we've got something they haven't got or never will have. And that
Starting point is 00:21:41 makes us feel much safer, much more special. We're not special, and we're not safe. We're certainly not safe because we have subjective experience and they don't. But I think the real problem here is not so much a scientific problem as a philosophical problem, that people misunderstand what is meant by having a subjective experience. I want to give you an example to show that you can use words. You've got a science background, so you probably think you know what the words horizontal and vertical mean.
Starting point is 00:22:11 I mean, that's not a problem, right? It's obvious what they mean. Sure. And if I show you something, that one's vertical and that one's horizontal, right? Sure. Not difficult. So I'll now convince you,
Starting point is 00:22:23 you actually had a wrong model of how they work. Not totally wrong, but there were significant problems, significant incorrectnesses in your model of the terms horizontal and vertical. Okay, here we go. Suppose in my hands I have a whole bunch of little aluminium rods, a large number, and I throw them up in the air, and they tumble and turn and bump into each other. Then suddenly I freeze time. And I ask you, are there more that are within one degree of vertical or more within one degree of horizontal, or is it about the same? So it's approximately the same. Right. That's what most people say.
Starting point is 00:22:55 Approximately the same. And they're surprised when I tell you there's about 114 times as many that are within one degree of horizontal. That's kind of surprising, right? How did that happen? Well, that's vertical, and this is vertical too. One degree of rotational freedom. That's horizontal, and this is horizontal too,
Starting point is 00:23:16 but so is this. So horizontal has two degrees of freedom. Vertical only has one degree of freedom. So here's something you didn't know about horizontal and vertical. Vertical is very special and horizontal is too appenny. That's a bit of a surprise to you. Obviously it's not like that in 2D, but in 3D they're very different.
Starting point is 00:23:35 One's very special and the other isn't. So why didn't you know that? Well, I'm going to give you another problem. Suppose in my hands I have a whole bunch of little aluminium discs and I throw them all up in the air and they tumble and turn and bump into each other and suddenly I freeze time. Are the more that are within one degree of vertical or more than within one degree of horizontal or is it about the same? No, there's about 114 times as many that are within one degree of vertical.
Starting point is 00:24:05 Interesting. So that's vertical, and this is vertical, and this is vertical. This is horizontal, and this is horizontal, but it's only got one degree of freedom. So for planes, horizontal is very special and vertical is true a penny. And for lines, vertical is very special and horizontal is true a penny. So that's just a little example of you have a sort of meta-theory of how the words work. And that meta-theory can be wrong even though you use the words correctly. And that's what I'm saying about all these mental state terms, in terms like subjective experience of.
Starting point is 00:24:45 You can use them correctly, and you can understand what other people mean when they use them, but you have a meta theory of how they work, which is this inner theater with things made of quarrelry, that's just complete junk. So what is it then about a theory of percepts or subjective experience that makes it then correct
Starting point is 00:25:03 in order for you to say, well, I'm more on the correct track than most people think. That you think of them as thing that these subjective experiences, you think they have to be someone that has to be made of something. That neither of those things is true. When I say subjective experience, that's an indicator that I'm now about to talk about a hypothetical state of the world that isn't true. So it isn't anywhere,
Starting point is 00:25:33 it's a hypothetical state of the world. But notice the big difference between saying I'm going to talk about this something that's just hypothetical, it isn't actually anywhere. But if it was somewhere, be out there in the world, versus I'm talking about something that's in an inner theater made of funny stuff. Those are two completely different models. And the model that is in an inner theater made of funny stuff, I think is just completely wrong, even though it's a model we almost all have.
Starting point is 00:26:00 What about someone like your fellow Nobel Prize winner, Roger Penrose, who we were talking about? Let me tell you a story about Roger Penrose. A long time ago, he was invited to come to the University of Toronto and give a talk about his new book, The Emperor Has No Clothes. And I got invited to introduce him. The Dean called me up and said, would you introduce Roger Penrose? And I said, sure. And she said, oh, thank you very much. And I said, ah, but before you agree,
Starting point is 00:26:31 you should know what I'll say. And she said, what will you say? And I said, I will say, Roger Penrose is a brilliant mathematical physicist who's made huge contributions to mathematical physics. And what he's gonna talk about today is complete junk. So that's my view of Roger Penrose's view of consciousness. And in particular, he makes a crazy mistake, which is, now I have to think how to say this carefully because obviously people will be criticising
Starting point is 00:27:07 it. The issue is can mathematicians intuit things are true that can't be proved to be true? And that would be very worrying if mathematicians intuition was always right. If they could do that correctly every time, that'd be really worrying and would sort of mean something funny was going on. But they can't. Mathematicians have intuitions and they're sometimes right and sometimes wrong. So it doesn't really prove anything.
Starting point is 00:27:43 It doesn't prove that you need quantum mechanics to explain how mathematicians work. And I don't see any reason for needing quantum mechanics to explain things like consciousness. AI is doing a pretty good job so far. We've produced these chatbots. These chatbots, as I just argued, if you give them a camera, can have subjective experiences. There's nothing about people that requires quantum mechanics to explain it. Is there something about the Penrose argument that relies on mathematicians 100% of the time intuiting correctly? Yes.
Starting point is 00:28:25 It's only if they could intuit correctly. If they're guessing, that's fine. If they have a way of always getting it right, the answer to these questions that can't be derived within the system, they can't be answered within the system, then that would be a problem. But they don't. They make mistakes. Why don't you they make mistakes. Why don't you outline what his argument is?
Starting point is 00:28:48 Panorazis. I don't want to, I mean, the argument, as I understood it, the argument is, there's two things going on. One is he says classical computation isn't going to explain consciousness. I think that's a big mistake. And I think that's based on a funny notion of what consciousness is. That's not right. A misunderstanding of what consciousness is. A second is that mathematicians can intuit the truth of things that can't be proved and that shows there's something funny going on. That doesn't show there's something funny going on unless they intuit it correctly every time.
Starting point is 00:29:23 Hmm. So I'm sure you've heard of the Chinese Room Experiment. I have. What are your thoughts on that? Feel free to briefly outline it for the audience. Okay. So back in about 1990, I got invited to be on a TV program with John Searle. I called up my friend Dan Dennett and said, should I do this?
Starting point is 00:29:48 And he said, well, you know, he will try and make you look stupid. But if you do it, don't talk about the Chinese room argument. So I agreed to be on the program with So, and the very first thing he said was an hour long interview. The very first thing he said was, so Jeffrey Hinton is a connectionist, so of course he has no problems with the Chinese room argument.
Starting point is 00:30:18 He's a connectionist. A connectionist. And So he then says, so he has no problems with the Chinese room argument, which was, we'd agreed not to talk about it. And he was saying something that was completely false. I've got a lot of problems with the Chinese room argument. I think it's nonsense. And I think it's a deliberately deceptive argument.
Starting point is 00:30:35 I think it's a dishonest argument. What you're doing is you're saying, does this room full of Chinese people, I think. Well, there's this room where he wants you to identify. Yeah, we could, okay, we can make a system made of Chinese people who are sending messages to each other in Chinese. And as a result of all these messages that are sent around in Chinese, you can send in an English sentence, they'll send messages to you other than Chinese. This is just my memory of the argument. And they'll be able to answer this English sentence, even though none of the people sending these messages around understood a word of English because they're just running a program, but they do it by sending messages in Chinese to each other.
Starting point is 00:31:28 Okay. What's dishonest about the argument is he wants you to think that to get confused between the whole system and the individual Chinese people sending messages. So the whole system understands English. The individual Chinese people sending messages don't. He wants you to think that that whole system can't possibly understand English because the people inside don't understand English. But that's nonsense. The system understands English.
Starting point is 00:32:00 That's what I think is wrong with the argument. Now speaking about China, something that many AI researchers didn't predict was that China would catch up with the West in terms of AI development. So how do you feel about that and what are the consequences? I don't think they're quite caught up yet. They're very close though. America's going to slow them down a bit by trying to prevent them having the latest Nvidia chips. Nvidia maybe can find workarounds and what
Starting point is 00:32:35 that's going to do if the embargo is effective it's just going to cause the Chinese to develop their own technology and there'll be a few years behind, but they'll catch up. They've got better STEM education than the U S so they've got more people who are better educated. Um, I think they're going to catch up. Do you know who Mark Andreessen is? He, he thinks, oh yeah, I disagree with him about more or less everything, I think. OK, how about let's pick one.
Starting point is 00:33:10 So he had a comment that said, I don't understand how you're going to lock this down. He was speaking to someone from the government about how the government was saying, well, if AI development gets out of hand, we can lock it down, quote unquote. Right. He was saying, how can you do that?
Starting point is 00:33:20 Because the math for AI is out there, it's being taught everywhere. To which the officials responded, well, during the Cold War, we classified entire areas of physics and took them out of the research community. Entire branches of physics basically went dark and didn't proceed. If we decide that we need to, we're going to do the same to the math underneath AI. Forget it. I agree with Marc Andreessen on that.
Starting point is 00:33:45 There's no way you're going to be able to... Now, it could have been, for example, that Google in 2017 could have decided not to publish Transformers. And it might have been several years before anybody else came up with the same idea. So they could slow it down by a few years maybe. But I don't think there's much hope in, I mean just think what it would take to prevent the information getting out there. Be very hard. The all-new FanDuel Sportsbook in Casino is bringing you more action than ever. Want more ways to follow your faves? Check out our new player prop tracking with real-time notifications.
Starting point is 00:34:26 Or have out more ways to customize your casino page with our new favorite and recently played games tabs. And to top it all off, quick and secure withdrawals. Get more everything with Bandula Sportsbook and Casino. Gambling problem? Call 1-866-531-2600. Visit connexontario.ca. There's a hostage situation in the Olympic Village.
Starting point is 00:34:45 You're sports, you're in way over your head. People can't stop talking about September 5. It's a madhouse down here. It's one of the best movies of the year. What's happening? Oh God. September 5, now playing in select theaters. So you don't think the government can classify some,
Starting point is 00:35:01 what would it be, linear algebra? No. I mean, they could make it harder to share certain kinds of information, which would slow things down a little bit. But I just think it's implausible that they could take AI ideas that really work well and by not sharing them, prevent anybody else creating them. What happens with new ideas is that there's a zeitgeist and within that zeitgeist, it's possible to have new ideas. It often happens that one person has a new idea and at more or less the same time and
Starting point is 00:35:41 quite independently, except they're sharing the same zeitgeist, someone else has a slightly different version of the same idea. This is going on all the time. Unless you can get rid of the whole zeitgeist, you're not gonna be able to have new ideas and keep them secret. Because a few years later, somebody else is gonna come up with the same idea.
Starting point is 00:36:03 What about decentralizing AI? So that's a huge topic. Some people would say, well, that's giving the atomic bomb to any person who wants access to an atomic bomb. Yes, I say that. And then there are other people who say, well, that's what is required in order to
Starting point is 00:36:17 create the guardrails against the Skynet scenario, is where we have multiple different decentralized agents or AI. Sorry, there's two notions of decentralized. So let's talk about sharing weights. So if you ask why doesn't Alabama have a bomb? It's because you need fissile material. And it's hard to get fissile material.
Starting point is 00:36:42 It takes a lot of time and energy to produce the fissile material. Once you have the fissile material, it's much easier to make a bomb. And so the government clearly doesn't want fissile material to be out there. You can't go on eBay and buy some fissile material. That's why we don't have lots of little atomic bombs belonging to tiny states.
Starting point is 00:37:03 So if you ask what's the equivalent for these big chatbots? The equivalent is a foundation model that's been trained, maybe using a hundred million dollars, maybe a billion dollars. It's been trained on lots of data. It's got a huge amount of competence. If you release the weights of that model, you can now fine tune it to all sorts of bad things. So I think it's crazy to reduce, to release the weights of these model, you can now fine tune it to all sorts of bad things. So I think it's crazy to release the weights of these big models because they are our main constraint on bad actors. And META has now done it and other people have followed suit. So it's too late now, the cat's out of the bag, but it was a crazy move. Well, speaking about foundation models,
Starting point is 00:37:47 much of our latest AI boom is because of Transformer, the Transformer architecture. Do you see some other large breakthrough, either some paradigm or some other architecture on the horizon? Okay, I think there will be other large breakthroughs of comparable magnitude, because that's just how science works.
Starting point is 00:38:07 I don't know what they are. If I knew what they were, I'd be doing them. Would you though? Well, I'm too old now. I have students doing them. What I mean is, how do you reconcile your past contributions to this field and you have your current woes? So would you be contributing to it? So here's the issue. AI is very good for lots of things that will benefit humanity a whole lot, like better healthcare, fighting climate change, better materials, things like room temperature superconductors
Starting point is 00:38:38 where AI might well be involved in actually discovering them. I assume there are some out there. So there's so many things, and actually discovering them. I assume there are some out there. So there's so many things, good uses of AI, that I don't think the development is going to be stopped. So I don't think it's sensible to say we should be slowing down AI, slowing down the development. It's not going to happen anyway because there's so much competition and it's just not feasible. It might be the best thing for humanity,
Starting point is 00:39:06 but it's not gonna happen. What we should be doing is, as it's being developed, trying to figure out how to keep it safe. So it's another thing to say that this is a boulder that no one can stop. It's another thing to also be responsible for pushing the boulder as well. So do you actually feel like if there was a breakthrough on the horizon that you see
Starting point is 00:39:27 and you could, you're like Rakers while you have this great predictive quality, that you would actually put your coins into it and work on it? As long as that was combined with working on how to keep it safe, yes. I feel I didn't realize soon enough how dangerous it was going to be. I wish I realized sooner. There's this quote from Einstein about the atomic bomb. He said, I would burn my hands had I known what I was developing would lead to the atomic bomb.
Starting point is 00:39:52 Do you feel similar? I don't actually know. Maybe I should. I don't regret what I've done. I regret the fact it may lead to bad things. But I don't think back and think, oh, I wish I'd never done that. I think AI is going to be developed. I don't think we have much choice about that just because of the competition between countries
Starting point is 00:40:16 and between companies. So I think we should focus our efforts on trying to develop it safely. And that's very different from trying to slow down the development. In addition to alignment, what does safe development of AI mean? Okay, figuring out how to deal with the short-term risks. There's many of those and they all have different solutions. So things like lethal autonomous weapons. And to do with that, you need things like Geneva Conventions.
Starting point is 00:40:51 And we're not going to get those until nasty things have happened. You've got fake videos and images corrupting elections, particularly if they're targeted to particular people. To deal with that, I think you need a much better system for establishing the provenance of a video or an image. Initially, I thought you should mark them as fake. You should insist they're marked as fake. I don't think there's much future in that anymore. I think you're better off insisting that there's a provenance associated with things and your
Starting point is 00:41:25 browser can check the provenance. Just as already with email, it says, don't trust this one, I can't establish it. It should be like that. and bias where you can freeze the weights of a system and measure its bias and then somewhat correct it. You'll never correct it perfectly but somewhat correct it. So you can make the system less biased than the data was trained on and so you can replace people by a less biased system. It'll never be unbiased. But if you just keep replacing systems by less biased systems, that's called gradient descent, things will get less biased.
Starting point is 00:42:14 So I'm not so worried about that one, possibly because I'm an old white man. There's jobs. We don't really know what to do about that. So you don't get many people digging ditches anymore Because a backhoe is just much better at digging ditches than a person It's going to be the same for almost all mundane intellectual labor An AI system is going to make a much better paralegal than a person than a person. That's kind of really scary because of what it's going to do to society. It's going to cause the rich to get richer because we're going to get big increases in productivity. Where's that wealth going to go to? It's going to go to rich people and poor people
Starting point is 00:42:58 get poorer. I don't know what to do about that. Universal basic union helps, stops them starving, but it doesn't really solve the problem because people's dignity is gone if they don't have a job. So earlier we were talking about perception, and then perception was associated with subjective qualities. Maybe there's a wrong model there. But anyhow, whenever we're speaking about percepts, are we speaking about perception and thus we're speaking about a subjective experience associated with it? No, when you use the word subjective experience, you're indicating that you're about to talk about a hypothetical state of the real world.
Starting point is 00:43:39 Okay? Not some funny internal thing, but a hypothetical state of the real world. These funny internal things don't exist. There's nothing, there are no qualia, there's nothing made of qualia. There's just hypothetical states of the world as a way of explaining how your perceptual system is lying to you. And that's what we mean when we say subjective experience, is these hypothetical states of the world. That's how we actually use it.
Starting point is 00:44:08 So prediction or no? Oh, getting the issue of prediction into it is sort of red herring, it's a different direction altogether. The thing you have to get in your head is that there isn't a funny kind of thing called a subjective experience that's made of some funny mental stuff. There's just a technique of talking about how your perceptual
Starting point is 00:44:35 system goes wrong, which is to say what the world would have had to have been like for it to be telling the truth. And that's what we're indicating. When we use the phrase subjective experience, we indicate that that's the game we're playing. We're playing the game of telling you about hypothetical states of the world in order to explain how my perceptual system is going wrong. A subjective experience is not a thing. And can anything have a perceptual system? Can a book have a perceptual system? What defines a perceptual system? Okay, to have a perceptual system, you'd have thought you needed something that can have some internal representation of something going on in some external world. That's what I would have thought.
Starting point is 00:45:26 something going on in some external world. That's what I would have thought. So like a toad gets light in its eyes and it snaps up flies and it's really got a perceptual system, right? Because I see where the flies are. I don't think a book has a perceptual system because it doesn't, it's not sensing the world and having an internal representation. system because it doesn't it's not sensing the world and having an internal representation. Hi everyone hope you're enjoying today's episode. If you're hungry for deeper dives into physics, AI, consciousness, philosophy, along with my personal reflections, you'll find it all on my sub stack. Subscribers get first access to new episodes, new posts as well, behind-the-scenes insights, and the chance to be a part of a thriving community of like-minded pilgrimers. By joining, you'll directly be supporting my work and helping keep these conversations
Starting point is 00:46:11 at the cutting edge. So click the link on screen here, hit subscribe and let's keep pushing the boundaries of knowledge together. Thank you and enjoy the show. Just so you know, if you're listening, it's C-U-R-T J-A-I-M-U-N-G-A-L dot org KurtJayMungle dot org Because it doesn't it's not sensing the world and having an internal representation What would be the difference between intelligence and rationality? Okay
Starting point is 00:46:40 so There's various kinds of intelligence So you wouldn't accuse a cat of being rational, but a cat could be pretty intelligent. In particular, when you talk about rationality, you typically mean logical reasoning. And that's very different from the way we do most things, which is intuitive reasoning. And that's very different from the way we do most things, which is intuitive reasoning. So a nice analogy would be if you take something like AlphaZero that plays chess,
Starting point is 00:47:15 I'll use chess because I understand it better than Go, it'll have something that can evaluate a board position and say, how good is that for me? It'll have something that can look at a board position and say what's a plausible move for me and then it'll have what's called Monte Carlo rollout where it's you know if I go here and he goes there and I go here and oh dear that's bad. The Monte Carlo rollout is like reasoning. The neural nets that just say that would be a good move or this is a bad position for me, they're like intuitive reasoning.
Starting point is 00:47:51 And we do most things by intuitive reasoning. Originally in AI, they wanted to do everything by using reasoning, logical reasoning, and that was a huge mistake and they couldn't get things done. They didn't have a way of dealing with things like analogy. What neural nets are good at is intuitive reasoning. So what's happened in the last 20 years is we've used neural nets to model human intuition rather than human reasoning, and we've got much further that way. Is it the case that the more intelligent you are,
Starting point is 00:48:29 the more moral you are? I read something about that recently, this is yesterday it was. But of course, I don't know the provenance of that, so I don't know whether to believe it. I'm not convinced that's true. Um, I'm, here's some evidence. Elon Musk is clearly very intelligent.
Starting point is 00:48:53 I wouldn't accuse him of being very moral. And you can be extremely moral and not terribly intelligent. I think so. Yes. That's my guess. Well, you said that you weren't entirely sure. So what's the evidence to the contrary? What's the evidence that as you increase in intelligence, your morality increases? Proportionally, somehow.
Starting point is 00:49:17 Well, I mean, I just have no idea whether there's a correlation at all. I see. I think there's highly intelligent people who are very bad, and there's highly intelligent people who are very good. What does it mean to understand? Okay. That's a question I'm happy to answer. So again, I think most people have a wrong model of what understanding is.
Starting point is 00:49:54 If you look at these large language models, there's many people, particularly people who from the Chomsky School of Linguistics, who said they don't really understand what they're saying. They just are using statistical correlations to predict the next word. If you look at the first models like that, I think I probably made the very first language model that used backpropagation to train the weights to predict the next word. So you back propagate the error in predicting the next word. And the point of the model was to show how you could learn meanings for words, or to put it another way,
Starting point is 00:50:34 to show how you could take a string of words and learn to convert the words into feature vectors and interactions from feature vectors. And that's what understanding is. Understanding, understanding a string of words is converting the words into feature vectors so that you can use interactions between features to do things like predict an X word,
Starting point is 00:51:00 but also to do other things. So you have a sentence which is a string of symbols. Let's not talk about word fragments. I know these transformers use word fragments, but let's suppose they used whole words. It's easier to talk about. Right. And it would just make them work a bit worse. That's all.
Starting point is 00:51:17 That's still worth. Sure. So I give you a string of words, some text. The meaning isn't in the text. What you do is you convert those words into feature vectors and you've learned how feature vectors in context, how the features should interact with each other to do things like disambiguate the meanings of ambiguous words. And once you've associated features with those words, that is understanding. That's what understanding is.
Starting point is 00:51:52 And that's what understanding is both in a large language model and in a person. In that sense, we understand in the same basic way they understand. It's not that when we understand there's some magical internal stuff called understanding. I'm always trying to get rid of magical internal stuff in order to explain how things work. We're able, using our big neural networks, to associate features with these symbols in such a way that the features all fit together nicely.
Starting point is 00:52:24 So here's an analogy I quite like. If you want to model 3D shapes and you're not too worried about getting the surface just right, you can use Lego blocks. So it's a big shape like a car. You can make something the same shape as a Porsche with Lego blocks. The surface won't be right, but it'll have the same space occupancy. Sure. So Lego blocks are kind of a universal way of modeling 3D structures. And you don't need many different kinds of Lego block. Now, think of words as like Lego blocks,
Starting point is 00:53:05 except that there's a whole bunch of different Lego blocks with different names. What's more, each Lego block has some flexibility to it. It's not a rigid shape like a piece of Lego. it can change in various directions. It's not completely free. The name tells you something about how it can change. But there's some flexibility to it. Sometimes there'll be a name and it's two completely different shapes it can have. But it can't have any old shape.
Starting point is 00:53:42 So what we've invented is a system for modeling much more complicated things than the 3D distribution of matter, which uses high dimensional Lego blocks. So the Lego blocks with say a thousand dimensions, and if you're a mathematician you know thousand dimensional spaces are very weird things, and they have some flexibility. And I give you the names of some of these Lego blocks, and each of which is this thousand-dimensional underlying, and they all deform to fit together nicely. And that's understanding. So that explains how you can learn the meaning of a word from one sentence without any definitions. So if, for example, I say, she scrummed him with the frying pan.
Starting point is 00:54:30 You have a sense of what scrummed means. It's partly phonetic, but because the ED on the end tells you it's a verb. But you think it probably means she hit him over the head with it or something like that. It could mean something different. She could have impressed him with it. You know, she cooked such good omelets that that really impressed him. It could mean she impressed him. But probably it means she hit him over the head or something like that, something aggressive like that. And you get that from just one sentence. And nobody's telling you
Starting point is 00:54:58 this is a definition of crass, scrod. It's just that all the other Lego blocks for the other words, she and him, and all those other words, adopt shapes that fit together nicely, leaving a hole. And that hole is the shape you need for Scromed. So now that's given you the shape that Scromed should be. So that's how I think of language. It's a modeling system we've invented
Starting point is 00:55:22 where there's some flexibility in each of these blocks. I give you a bunch of blocks and you have to figure out how to fit them together. But because they all have names, I can tell other people about what my model is. I can give them the names. If they share enough knowledge with me, they can then figure out how they all fit together. So are you suggesting, help the audience understand what understanding- me, they can then figure out how they all fit together.
Starting point is 00:55:45 So are you suggesting, help the audience understand what understanding- I think what's going on in our heads, and that's what's going on in these large language models. So they work the same as us. And that means they really do understand. One of Chomsky's counter arguments to that the language models work the same is that we have sparse input for our understanding. We don't have to feed the internet
Starting point is 00:56:05 to ourselves. So what do you say to that? It's true that the language models are trained on much more data. They are less statistically efficient than us. However, when children learn language, they don't just learn it by listening to the radio. They learn it by being in the real world and interacting with things in the world. And you need far less input if you train a multimodal model; it doesn't need as much language. And if you give it a robot arm and a camera, and it's interacting with the world, it needs a lot less language.
Starting point is 00:56:35 So that's one argument. It still probably needs more than a person. The other argument goes like this. The backpropagation training algorithm is really good at packing a lot of knowledge into a few weights, where a few means a trillion, if you give it a lot of experience. So it's good at taking this huge amount of experience,
Starting point is 00:57:02 sucking the knowledge out and packing it into a relatively small number of weights, like a trillion. That's not the problem we have. We have the opposite problem. We've got a huge number of weights, like a hundred trillion, but we only live for two billion seconds. And so we don't have much experience. So we need to be optimized for making the best use we can of the very limited amount of experience we get. Which says we're probably not using backpropagation; we're probably using some other learning algorithm. And in that sense Chomsky may be right that we learn from much less input. But what we learn is how to associate features with words and how these features should interact.
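To put rough numbers on that contrast, here is a back-of-the-envelope sketch. The brain figures are the round numbers used above; the language-model token count is an assumed order of magnitude added purely for illustration.

```python
# Back-of-the-envelope comparison of the two regimes described above.
# Brain numbers are the round figures from the conversation; the LLM token
# count is an assumed order of magnitude, not a measured value.

brain_weights = 1e14        # ~100 trillion synapses ("weights")
lifetime_seconds = 2e9      # roughly 63 years expressed in seconds

llm_weights = 1e12          # ~1 trillion parameters
llm_training_tokens = 1e13  # assumed ~10 trillion training tokens

print(f"Brain: {lifetime_seconds / brain_weights:.0e} seconds of experience per weight")
print(f"LLM:   {llm_training_tokens / llm_weights:.0e} training tokens per weight")

# The brain gets several orders of magnitude less data per weight, which is why
# it would need a learning algorithm optimized for squeezing the most out of
# very little experience, while backpropagation is optimized for squeezing a
# huge amount of experience into comparatively few weights.
```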
Starting point is 00:57:49 With TD Direct Investing, new and existing clients could get 1% cash back. Great! That's 1% closer to being part of the 1%... Maybe, but definitely 100% closer to getting 1% cash back with TD Direct Investing. Conditions apply. Offer ends January 31st, 2025. Visit td.com slash DI Offer to learn more. We want to continue to talk about learning and research. Jay McClelland said that in your meetings with your graduate students and other researchers, you tend not to write equations on the board, unlike in other machine learning research meetings.
Starting point is 00:58:30 Instead, you draw pictures and you gesticulate. So what's the significance of this and what are the pros and cons of this approach? Okay, so I think intuitively and do the math afterwards. Some people think with equations and derive things and then get the intuitions afterwards. There are some people who are very good at both, like David MacKay, who was very good intuitively and also very good at math. So they're just different ways of thinking,
Starting point is 00:59:04 but I've always been much better at thinking in terms of spatial things rather than in terms of equations. Can you tell us about your undergraduate experience, how you changed programs and why, or what led you to do so? So, um, it's a long story, but I started off at Cambridge doing physics and chemistry and crystalline state, which was x-ray crystallography essentially. And after a month, I got fed up. It was the first time I'd lived away from home and the work was too hard.
Starting point is 00:59:43 So I quit and reapplied to do architecture. And I got back in and after a day of that, I decided I'd never be any good at architecture. Interesting. So I went back to science. But then I did physics and chemistry and physiology and I really liked the physiology. And after a year of that, I decided I wanted to know more about the mind. And I thought philosophy would teach me that. So I quit science and did philosophy for a year. And I learned some stuff about Wittgenstein and Wittgenstein's opinions. Um, but on the whole, the main thing that happened was I developed antibodies to philosophy. Mainly because it's
Starting point is 01:00:27 all talk. They don't have an independent way of judging whether a theory is good. They don't have like an experiment. It's good if it sounds good. And that was unsatisfactory for me. So then I did psychology to find out more about the mind. And I found that very annoying because what psychologists would do is have a really stupid, simple theory and have very well designed experiments to see whether this theory was true or false. And you could tell before you started, the theory was hopeless. So what's the point of the experiments? Um, that's what most of psychology was.
Starting point is 01:01:03 And so then I went into AI and there we did computer simulations and I was much happier doing that. When you became a professor and to this day, how is it that you select research problems? Okay. There's no reason why I should really know how I do it. That's one of the most sophisticated things people do. I can pontificate about how I think I might do it, but you shouldn't necessarily believe me. One thing I think-
Starting point is 01:01:39 Feel free to confabulate like LLMs. One thing I think I do is this. Look for a place where you think everybody's doing it wrong. You just have an intuition everybody's doing it wrong. And see if you can figure out how to do it better. And normally, eventually you'll discover why people are doing it the way they're doing it, and that the method you thought was going to be better isn't better. But just occasionally, like if you
Starting point is 01:02:11 think everybody's trying to use logic to understand intelligence and we should be using neural networks and the core problem of understanding intelligence is how the connection strengths in a neural network adapt, just occasionally you'll turn out to be right. And until you can see why your intuition is wrong and the standard way of doing it is right, stick with your intuition. That's the way you'll do radically new things. And I have an argument I like, which is,
Starting point is 01:02:40 if you have good intuitions, you should clearly stick with your intuitions. If you have bad intuitions, it doesn't really matter what you do, so you might as well stick with your intuitions. Now, what is it about the intuitions of Ray Kurzweil that ended up making a variety of correct predictions, when even I was following him in the early 2000s and thinking there's no way half of these would be correct? And time and time again he's correct.
Starting point is 01:03:07 But if you read his books, that's what you conclude. I suspect there are a number of things he said, which he doesn't mention so much, that weren't correct. But the main thing he said, as far as I can tell, his main point, is that computers are getting faster, they'll continue to get faster, and as computers get faster, we'll be able to do more things. And using that argument, he's been roughly right about the point at which computers will get as smart as people. Do you have any similar predictions that your colleagues disagree with, but your intuition
Starting point is 01:03:44 says you're on the right track? Now we've talked about AI and alignment and so on, but perhaps not that, because that's covered ground. I guess the main one is to do with what subjective experience is, what consciousness is, where I think most people just have a totally wrong model of what mental states are. That's more philosophical now. In terms of technical things, I still believe that fast weights are going to be very important.
Starting point is 01:04:16 So synapses in the brain adapt at many different timescales. We don't use that in most of the AI models. And the reason we don't use it is because you want to have many different training cases that use exactly the same weights. And that's so you can do matrix-matrix multiplies, which are efficient. If you have weights that adapt rapidly, then for each training case you'll have different weights, because they'll have rapidly adapted.
Starting point is 01:04:45 So what I believe is that there's a kind of overlay of fast weights and slow weights. The slow weights are adapting as per usual, but on top of that there are fast weights which are adapting rapidly. As soon as you do that, you get all sorts of nice extra properties, but it becomes less efficient on our current computers.
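Before the conversation continues, here is a minimal sketch of that overlay idea: slow weights plus a rapidly adapting, rapidly decaying fast-weight matrix. The Hebbian-style update rule, the decay factor, and the layer size are illustrative assumptions, not a specific published formulation.

```python
import numpy as np

# Toy fast-weights overlay: effective weights = slow weights + fast weights.
# The update rule and constants below are illustrative assumptions only.

rng = np.random.default_rng(0)
d = 8                                  # tiny layer, just for illustration
W_slow = rng.normal(0.0, 0.1, (d, d))  # learned slowly, e.g. by gradient descent
W_fast = np.zeros((d, d))              # adapts within a sequence, then fades

decay = 0.9      # how quickly the fast weights forget
eta_fast = 0.5   # how strongly each step is written into the fast weights

def step(h):
    """Process one hidden state with the combined weights, then update the fast ones."""
    global W_fast
    out = np.tanh((W_slow + W_fast) @ h)                   # fast weights overlay the slow ones
    W_fast = decay * W_fast + eta_fast * np.outer(out, h)  # Hebbian-style rapid adaptation
    return out

h = rng.normal(size=d)
for _ in range(5):
    h = step(h)

# Because W_fast is different at every step, you can no longer batch many
# training cases into one big matrix-matrix multiply, which is the efficiency
# cost on current hardware mentioned above.
```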
Starting point is 01:05:00 It would be fine if we were running things on analog computers. So I think eventually we're going to have to use fast weights because they lead to all sorts of nice properties. But that's currently a big difference between brains and the hardware we have. You also talked publicly about how you're slightly manic-depressive, in that you have large periods of being extremely self-critical and then large periods of having extreme self-confidence, and that this has helped you with your creativity.
Starting point is 01:05:36 Shorter periods of self-confidence. Okay, let's hear about that please. So when I get a new idea, I get very excited about it. And I can actually weigh my ideas. So sometimes I have one pound ideas, but sometimes I have like five pound ideas. And so what happens is I get this new idea, I get very excited, and I don't have time to eat. So my weight goes down.
Starting point is 01:06:00 Oh, I see. And so I can measure sort of how exciting I found this idea by how much my weight went down. And yes, with really good ideas, I lose about five pounds. Do you have a sense of carrying the torch of your great-great-grandfather? No, not really. I mean, my father talked about this kind of inheritance and it's a fun thing to talk about. I have a sense of very high expectations that came from my father. They didn't come from George Boole, they came from my father. High expectations for yourself?
Starting point is 01:06:38 My academic success, yes. Do you have a successor that in your mind you're passing the torch to? Not exactly. I don't think, I don't want to impose that on anybody else. Why did you say not exactly instead of no? I have a couple of nephews who are very good at quantitative stuff. I see. But you don't want to put that pressure on them?
Starting point is 01:07:16 No. Speaking of pressure, when you left Google, you made some public statements about your concern regarding AI safety. What was the most difficult part about making that break and voicing your anxieties to the world? I don't think it was difficult. I wouldn't say it was difficult. It was just, I was 75, right?
Starting point is 01:07:51 So it's not like I wanted to stay at Google and carry on working but felt I couldn't because of AI safety. It was really a retirement anyway. I wasn't so good at doing research anymore. I kept forgetting what the variables stood for. Yes. And so it was time to retire. And I thought I could just, as I went out the door, I could just mention that AI,
Starting point is 01:08:08 there were these AI safety issues. I wasn't quite expecting what happened next. Now, you also mentioned this in another interview, about how, as you're now 75, 76, it keeps changing. It keeps changing every year. 77. Yeah, okay. You mentioned publicly that, yes,
Starting point is 01:08:28 you keep forgetting the variable names as you're programming. And so you think you're going to move to philosophy as you get older. Which is what we've been talking about quite a lot. Yes. But it's basically the philosophy I was doing when I was about 20. I'm going back to the insights I had when I was doing philosophy and exploring those further. Got it. So what's on the horizon?
Starting point is 01:08:55 Um, old age. I think the world's going to change a whole lot fairly quickly because of AI, and some of it's going to be very good and some of it's going to be very bad. And we need to do what we can to mitigate the bad consequences. And I think what I can still do usefully is encourage young researchers to work on the safety issues. So that's what I've been doing quite a lot of. Safety and within that there's something called alignment. Now we as people don't have alignment.
Starting point is 01:09:34 So do you see that we could solve the alignment problem? I kind of agree with that statement. Alignment is like asking you to find a line that's parallel to two lines that are at right angles to each other. Yeah, there are a lot of people who talk very naively about alignment, like there's this sort of human good. Well, what some people think is good, other people think is bad. You see that a lot in the Middle East. So alignment is a very tricky issue. Alignment with whom? Now, you were just speaking to young AI researchers.
Starting point is 01:10:15 Now you're speaking to young math researchers, young philosophers, young students coming into whatever new STEM field, even though philosophy is not a STEM field. What is your advice? Well, I mean, one piece of advice is a lot of the excitement in scientific research is now around neural networks, which are now called AI. In fact, the physicists sort of now want to say that's physics. Someone got a Nobel Prize in physics for their work in neural nets. You can't remember?
Starting point is 01:10:57 I don't remember, but anyhow, continue. You serious? No, I'm joking. Right, I thought you were joking. I'm a great actor, huh? Right. So, yeah, clearly the Nobel committees recognize that a lot of the excitement in science is now in AI.
Starting point is 01:11:16 And so for both physics and chemistry, the Nobel prizes were awarded to people doing AI or using AI. So I guess my advice to young researchers would be that's where a lot of the excitement is. But I think there are also other areas where there's going to be very important progress, like if we could get room-temperature superconductors, that would make it easy to have solar power a long way away. Things like that. So that's not the only area that's exciting. Nanomaterials are very exciting, but they will use AI. So I think probably most of the exciting areas of science will at least use AI tools.
Starting point is 01:11:59 Now, we just alluded to this. Now let's make explicit reference to it. You won the Nobel Prize last year in physics for your work in AI and neural nets. So how do you feel about that? What was it like hearing the news, and in physics? Do you consider yourself a physicist? What does this mean?
Starting point is 01:12:18 No, I'm not a physicist. I was quite good at physics when I did my first year at university. I got a first in physics based on being able to do things intuitively, but I was never very good at the math. And I gave up physics because I wasn't good enough at math. I think if I'd been better at math, I'd have stayed in physics and I wouldn't have got a Nobel Prize.
Starting point is 01:12:40 So probably it was lucky I wasn't very good at math. How do I feel about it? I still feel somewhat confused about it. The main problem is that the work I did on neural nets that related closely to physics was a learning algorithm called Boltzmann machines that I developed with Terry Sejnowski. And it used statistical physics in a nice way. So I can see why physicists would claim that. But it wasn't really on the path to the current successful AI
Starting point is 01:13:12 systems. It was a different algorithm I also worked on, called backpropagation, that gave rise to this huge new AI industry. So I still feel sort of awkward about the fact that we got rewarded for Boltzmann machines, but it wasn't Boltzmann machines. They were helpful, but they weren't the thing that was really successful.
Starting point is 01:13:33 Professor, it's been a pleasure. Okay. Thank you for inviting me into your home. I'm getting to meet your cats. Okay. Thank you. New update! Started a Substack. Writings on there are currently about language and ill-defined concepts, as well as some other mathematical details.
Starting point is 01:13:54 Much more being written there. This is content that isn't anywhere else. It's not on Theories of Everything, it's not on Patreon. Also, full transcripts will be placed there at some point in the future. Several people ask me, hey Curt, you've spoken to so many people in the fields of theoretical physics, philosophy, and consciousness. What are your thoughts? While I remain impartial in interviews, this Substack is a way to peer into my present
Starting point is 01:14:18 deliberations on these topics. Also, thank you to our partner, The Economist. Firstly, thank you for watching, thank you for listening. If you haven't subscribed or clicked that like button, now is the time to do so. Why? Because each subscribe, each like helps YouTube push this content to more people like yourself, plus it helps out Curt directly, aka me.
Starting point is 01:14:50 that whenever you share on Twitter, say on Facebook or even on Reddit, etc., it shows YouTube, hey, people are talking about this content outside of YouTube, which in turn greatly aids the distribution on YouTube. Thirdly, there's a remarkably active Discord and subreddit for Theories of Everything where people explicate TOEs, they disagree respectfully about theories, and build as a community our own TOE. Links to both are in the description. Fourthly, you should know this podcast is on iTunes, it's on Spotify, it's on all of
Starting point is 01:15:21 the audio platforms. All you have to do is type in Theories of Everything and you'll find it. Personally, I gained from rewatching lectures and podcasts. I also read in the comments that, hey, TOE listeners also gained from replaying. So how about instead you re-listen on those platforms like iTunes, Spotify, Google Podcasts, whichever podcast catcher you use. And finally, if you'd like to support more conversations like this, more content like this, then do consider visiting patreon.com slash CurtJaimungal and donating with whatever you like. There's also PayPal, there's also crypto, there's also just joining on YouTube. Again, keep in mind, it's support from the sponsors and you that allows me to work on TOE full time.
Starting point is 01:16:01 You also get early access to ad-free episodes, whether it's audio or video; it's audio in the case of Patreon, video in the case of YouTube. For instance, this episode that you're listening to right now was released a few days earlier. Every dollar helps far more than you think. Either way, your viewership is generosity enough. Thank you so much.
