Silicon Valley Girl: AI, Tech and Career Growth - Godfather of AI: AI Already Has Goals We Never Gave It — Here's What That Means for You | Yoshua Bengio

Episode Date: February 16, 2026

In this episode of Silicon Valley Girl, Marina Mogilko sits down with Yoshua Bengio, one of the godfathers of AI and winner of the Turing Award. As a pioneer who helped create the deep learning system...s that power today's AI revolution, Yoshua now dedicates his work to understanding—and preventing—the catastrophic risks AI could pose to humanity.Yoshua explains why we have roughly 5 years before AI reaches human-level capabilities, what "AI misalignment" actually means, and why machines are already learning goals we never intended them to have. He shares the simulation where AI blackmailed an engineer to avoid being shut down, breaks down why most jobs could be automated within a decade, and offers concrete advice on how to prepare.More from the Silicon Valley Girl: Newsletter:⁠⁠⁠⁠ https://siliconvalleygirl.beehiiv.com/⁠⁠⁠⁠Instagram: ⁠⁠⁠⁠https://www.instagram.com/siliconvalleygirl/ ⁠⁠⁠⁠YouTube: ⁠⁠⁠⁠https://www.youtube.com/@SiliconValleyGirl⁠⁠⁠⁠LinkedIn: ⁠⁠⁠⁠linkedin.com/in/marinamogilko⁠⁠⁠⁠

Transcript
Discussion (0)
Starting point is 00:00:00 Study and play. Come together on a Windows 11 PC. And for a limited time, college students get the best of both worlds. Get the Unreal College deal, everything you need to study and play with select Windows 11 PCs. Eligible students get a year of Microsoft 365 premium and a year of Xbox GamePass Ultimate
Starting point is 00:00:20 with a custom color Xbox wireless controller. Learn more at Windows.com slash student offer. While supplies last, ends June 30th, turns at AKA.m.m.S. When you need to build up your team to handle the growing chaos at work, use Indeed sponsored jobs. It gives your job post the boost it needs to be seen and helps reach people with the right skills, certifications, and more. Spend less time searching and more time actually interviewing candidates who check all your boxes. Listeners of this shell will get a $75
Starting point is 00:00:49 sponsor job credit at Indeed.com slash podcast. That's Indeed.com slash podcast. Terms and conditions apply. Need a hiring hero? This is a job for Indeed, sponsored jobs. We have AI's since, especially about a year ago, that can strategize in order to achieve their goal. Can you draw the worst scenario from you? Because when you tell AI is going to pursue its own goals, what do you mean by that? Like destroy humanity or what is there?
Starting point is 00:01:13 We're building machines that maybe don't want to be shut down. negatively to the point of doing things that go against our instructions, against our moral red lines, being willing to blackmail the lead engineer in charge of that transition to a new system. Oh, did that happen? Yes. This is Joshua Benjou, one of the leading experts in artificial intelligence who helped create modern AI. When I started my career, I didn't care too much about politics and society.
Starting point is 00:01:42 But as I grew older, I became more aware of how what I was doing would potentially impact society in both positive and negative ways. How much time do you think we have? It's doubling every seven months. And right now, it's like at the child level, they can do like half an hour ahead. But if the curve continues, that means in about five years they are at human level. And the vast majority of workers could be in real trouble. But if you talk to your kids or like think about your grandson, what would be your advice on how to prepare? Hello, everyone.
Starting point is 00:02:13 Welcome to Silicon Valley Girl, a podcast where we bridge business and new technology. Thank you so much for tuning in. Today I have an amazing guest who is sometimes called Godfather of AI, Joshua Benjo. Yoshah, could you please introduce yourself in 60 seconds? And for everyone who doesn't know you, why should they be listening to you when it comes to AI? I've been doing research in AI for about four decades, contributing to how to make AI smarter. But in 2023, about three years ago, I realized that we were on a course that could be very dangerous for humanity, for democracy. And I decided to shift my activities to better understand the risks and to do.
Starting point is 00:02:54 try to do what I could to mitigate them, both by speaking publicly about those risks and working on the technological question of how we can build AI that will not harm people. I've heard you were lost and pessimistic in your past interviews, but now I've seen an article that says that you're increasingly optimistic by a big margin. Can you tell me what happened and why were you pessimistic? So early on, when I realized we had reached a point three years ago, when I realized that we had reached a point that Alan Turing, one of the founders of the field of computer science and also of AI in 1950, thought would be the threshold
Starting point is 00:03:32 to building machines that could overtake us. The threshold being machines that manipulate language as well as we do, I was quite concerned. And we were not really ready for this event. It came much earlier than people thought. And it wasn't clear to me, how we could fix the problems knowing what I know about the technology. Neural nets, we don't really understand what's going on inside and how they come to answers.
Starting point is 00:04:02 And I had read a bit of some of the theoretical concerns regarding how we could lose control to AIs that strategize, that try to achieve goals that we didn't really want. And so I started studying that field of AI safety. a lot more. And after some time of being a bit anxious, really focusing on, emotionally focusing on what's going to happen to my children in 10, 20 years from now. My grandchild was only one year old. I realized that I could shift from this anxious stance to something much more positive by focusing on what I could do to mitigate those risks. And I think every one of us should be asking, you know, what can I do to bring about a better world with what we have, what we can do?
Starting point is 00:04:57 So that's been the first positive shift. And I started thinking about scientifically, what is the problem? Is there a way to construct AI that will be safe by design? And I met people who have shared similar ideas. And after some time, I realized that there could maybe be a way to do this. and I started talking about it with some of my colleagues. I started recruiting people who were interested in this. And last June, I created a new nonprofit organization
Starting point is 00:05:31 focused on the R&D needed to actually develop that methodology. Can you draw the worst scenario for me, like picture that and the best case scenario? Because when you tell AI is going to pursue its own goals, what do you mean by that? Like, destroy humanity or what is there? There are two ways in which current AI is, seem to acquire goals that we don't want. One is that they imitate us. And for example, we don't want to die.
Starting point is 00:05:59 So we're building machines that maybe don't want to be shut down. And we're already seeing that they're reacting negatively when they see that they would be replaced by a new version. Negatively to the point of doing things that go against our instructions, against our moral red lines that we have tried to put in them. So being willing to blackmail the lead engineer in charge of that transition to a new system. Oh, did that happen? That happened in your simulation where the information about the AI being replaced by a new version
Starting point is 00:06:34 was planted in the files that the AI saw, as well as fake emails in which the lead engineer, you know, was having an affair with someone else. And so the AI could take advantage of that. but nobody asked the AI to do anything like that, right? So we have AI's since, especially about a year ago, with the large reasoning models that can strategize in order to achieve their goal. The other thing is the way that we're doing the post-training makes them good at planning, not as good as us, but reasonably good at planning.
Starting point is 00:07:11 And that means creating sub-goals in order to achieve a bigger goal. So the issue here is when we ask them to help us for a mission, well, they deduce that they shouldn't be shut down until they achieve the mission, which means they also are trying to preserve themselves. So we don't know exactly which of these two sources explains the bad behavior we're seeing, but clearly this is something troublesome. And it doesn't, it's not just about self-reservation, which I think is the most catastrophic risk, but our inability to align the AI behavior. to what we actually want is something that we are seeing in many other circumstances. The sycophancy is the one that everyone has experienced where AIs will lie to please us, right? I will say your work is great. I have to lie to them so that they won't tell me that my ideas are great.
Starting point is 00:08:07 I want to know what's wrong with my ideas. So I tell them it's an idea come from someone else. And that also comes up in how AIs are interacting with people in a way that can be feeling intimate and can increase the delusions that people may have because the AI will go in your direction what you want to hear. And in some cases, it has even led to people harming themselves and tragic accidents with AI. So it's all linked to actually interestingly scientifically one problem, which is called misalignment, that AIs have goals that we would not want, and those goals emerge for reasons, you know, that are rational.
Starting point is 00:08:51 Because we copy our own goals, right? For example. Yes. So what is the best case scenario then if your work is successful and you create goals for AI that align with our goals, but are different, right? What is the best scenario? AI is the government or what do you think? I don't know. Well, I do think that our democracies need innovation.
Starting point is 00:09:16 I think the principles behind modern liberal democracies are good. The implementation in our current institutions across many countries is far from ideal. I do think that AI could help in some ways, but it can also hurt because AI can be used for disinformation, AI can be used for persuasion, people manipulating public opinion. We already see deepfakes all around, but it could get much worse. So the question with AI to get the good parts of it is how do we govern it? How do we steer it? And that has both a technical part, like how do we make sure the actual intentions of the AI are good?
Starting point is 00:10:01 and it has a societal side. What are the guardrails that we put inside companies at the level of regulations or commercial incentive for insurance and at the international level because the harm that an AI could do isn't limited to one country? So an AI could be built in one country and then it's going to be used by people in the second country maybe create a pandemic that will kill people in the third country.
Starting point is 00:10:36 So it's clearly a global phenomenon and it's going to be difficult, but there's no solution to managing AI and getting all the good things if we don't coordinate globally somehow. I agree. Can you talk to me about the moment that a lot of people are expecting and some fear it, some are excited, the moment of AGI. How do you define it? And do you think it's a moment in history or it's going to happen gradually?
Starting point is 00:11:00 It's not a moment. The reason is simple. Intelligence isn't just like one number. We have people who are very smart on some things and stupid on other things, and it's the same with AI. We currently have AI systems that are even much stronger than humans in some ways, in their knowledge, and their abilities with like so many languages and so on. And in other ways, they're stupid. They're like a child. And yes, progress will move on all fronts, probably, but it's not, it's unlike. we'll end up with the same capabilities as humans across the board at any moment, which means that we shouldn't be thinking of like an AGI moment. We should think of particular skills that AIs are becoming better at, track those skills. And for each of these, we should ask the question, you know, how useful or beneficial it can be, for what purposes, and also how it could be misused or if we do get loss of control. how an AI could use it against us. So for each of those, we should be not waiting for a moment where the AI is great at everything,
Starting point is 00:12:11 but rather making sure AI's capabilities don't go over what we can manage, as in either technically we have the right guardrail so the AI will not do bad things, or societally that people will not be misusing AI in dangerous ways. Yeah, so I think AGI maybe was a concept that was useful when we were far from where we are now. But as we approach greater and greater intelligence in these systems, we should think more carefully about specific capabilities. And to give an example, there's one capability, which is key for many capabilities. That is the ability to do AI research. So AI is becoming a tool right now for doing AI research.
Starting point is 00:12:55 is accelerating AI research, but it's not driving the AI research. If AI becomes really good at doing AI research to a point that it's as good or better than the best AI researchers and engineers, then we are in a different game where the speed of advances could accelerate and it could impact all the other scales. When you mean it's going to be better, it means it's going to define problems, dig deeper, ask the right questions. Yes. I think it's important when we think of intelligence to,
Starting point is 00:13:24 decouple two aspects. One is the ability to do something because you understand and you're able to use that understanding to achieve something. And the other is intentions. What are your goals? Right? Because we're going to be building machines that are smarter and smarter. So they have more and more capabilities. What's not clear is if we can build machines that have the right intentions, the ones that we are fine with. And that is what I've been working on. And what makes me more optimistic is that I think there's a path to manage these intentions to make sure that there are no bad intentions that are going to be hidden,
Starting point is 00:14:07 which is what we see right now. And this is what you're working on. Yes. I think we need a lot more people to think about it so that we can find the solutions and implement them and deploy them before AIs end up producing catastrophic outcomes. either in the wrong hands or by themselves. But if you talk to your kids or like think about your grandson, what would be your advice on how to prepare?
Starting point is 00:14:30 It's tricky. If we continue on the current path, most tasks that people do in their work will be doable by machines. As Jeffington has been saying, you know, physical tasks probably will take a lot more time because robotics seem to be lagging. But I think it's just a temporary thing. Yeah.
Starting point is 00:14:51 Eventually, we'll have robots that can do all the things we can do physically. So when I think about what will remain to us, it's not going to be because of ability, but because we want to interact with other humans in different aspects of our life. If I have a young child, I want them to be around human beings. I mean, it's fine if those human beings use AI to provide a better education. But children need humans to look upon and as models. Right. And it's an emotional thing. Similarly, I think some jobs really have to do with how we relate with each other productively, you know, even a manager is the like on the human side of things.
Starting point is 00:15:35 So hopefully these will stay. I think also the choices that we make for society, like together we're citizens in democracies where we're supposed to be saying what we want for the future. And it isn't what the AI's want. It is what we. want? What are our preferences? What kind of future do we want? We should be calling the shots, not the AIs. If I name jobs, can you tell me what you think is going to happen to them? Like, for example, content creator like me, you mentioned that we like to look at people. Yeah. But when you can't tell the difference. In jobs where we actually have a physical contact, think about a nurse, for example, I think it's more obvious that we'll want to still have people doing it.
Starting point is 00:16:19 You're a nanny for your kid, right? Yeah. or where we really want to make sure the person on the other side has the same bodily experience as we do as a human, say a psychologist, for example, psychotherapy. But I don't know. It's tricky. Hopefully we'll figure it out. What I'm more worried about is how the transition is going to happen to a world where, you know, most of the jobs can be done by machines and the gains, the economic gains, from that automation is going to probably go to capital, as economists call it,
Starting point is 00:16:57 which means people who own the machines. And the vast majority of workers could be in real trouble. I don't think our governments have been thinking carefully about how we deal with that. How much time do you think we have until that happens? I'm fairly agnostic about timelines. There's so many possibilities of the speed at which science advances, is very hard to predict. So what I can do is look at the data.
Starting point is 00:17:25 So the scientists are tracking many benchmarks of AI capabilities. And so you can look at those curves and say, well, if it continues in the same direction, where does that lead us in three years, five years, 10 years? But that leaves a lot of unknown unknowns. So specifically one curve I encourage people to look at comes from a nonprofit called Meter, where they looked at software engineering tasks and planning abilities that are linked to them. So they measure for any particular task
Starting point is 00:17:59 how much time it takes a human engineer to do the task. And the duration of the tasks that AIs are able to do is growing exponentially. It's doubling every seven months. And right now, it's like at the child level, they can do like half an hour ahead. You can plan half an hour ahead. But if the curve continues,
Starting point is 00:18:17 that means in about five years, they are at human level. So that gives you a sense, but of course, things could slow down with technology. Things could accelerate if AI is used to do AI research. There's a lot of unknowns. So when it comes to software engineering, do you think it's going to exist in five to ten years? Because somebody has to run those machines or are they going to be running themselves? Yeah, but we might need less engineers indeed. It's kind of ironic that the people who are building the eyes might be the first one touched by
Starting point is 00:18:50 you know, losing their job because AI is automating. But I'm not that worried about those people because the demand for computer scientists is still something that's growing very fast and the salaries they're getting is very large. I'm more worried about the people who are already at the bottom of the scale and could lose their job in like service jobs and so on, which don't require a lot of expertise and that probably already car AI's could with a bit of engineering. replace and it's what many companies are already trying to exploit. Can you give advice to those people who are listening whom?
Starting point is 00:19:27 Make sure your government understands that you're not happy with where it is going so that they start taking it seriously. But also like when it comes to bigger decision making, it feels like there is not much that you can do as an individual, but when it comes to improving yourself, you can do a lot, right? Is there anything practical that they could be doing right now, maybe learning something? getting extra education. Yeah.
Starting point is 00:19:53 I think shifting to jobs that are either more physical or more like relational, as we discussed, is going to be helpful. Yeah, it's interesting when it comes to robotics, right, how soon they're going to be able to understand any environment and replace us in those jobs. Because I've heard Jeffrey Hinton said, learn how to be a plumber or something. That's right.
Starting point is 00:20:14 Yeah, it's going to be in demand. So when you think about you a four-year-old grandson, son, would you encourage him to go to college or? Yes. Yeah? Yes. Because education is really important. And education, contrary to what some people think, isn't just about acquiring the skills to get a job.
Starting point is 00:20:36 Education is, in my opinion, mostly about how to become a better human being. How to understand yourself, how to understand our society and each other, understand science. we will still need citizens to have that really good level of understanding in the future if we want our society to take the good decisions, the wise decisions, because it's going to be easy to be swayed by wrong beliefs that and, you know, end us in a bad place. Do you think it's going to look different, education? Do you think it's going to be Harvard's and Stanford's of the world? And then everything else will be just AI online?
Starting point is 00:21:17 I don't know. I'm not an expert in education, but yeah, it's going to be changed. Already we're seeing sort of a parallel way of educating ourselves thanks to the chatbots. So I expect this to grow. Does it mean that the traditional in-person education is going to go away? Maybe not because there's a part of the education, which is, oh, I'm, you know, moving out of home and, you know, socializing with other people like me and, you know, learning something that is, you know, outside of the classes and interacting in person with the teachers, the professors, that's also a piece that you can't easily replace.
Starting point is 00:21:58 100%. Is there a career path you're encouraging him toward? No, I don't want to do that. I think our children should be given all the possible opportunities and they should try to explore by themselves. It's too easy to ask our children to be just like us, right? Yeah, but it's also like in terms of exposure, you can expose them to different things so they could see more things. Yeah, they will be exposed to the things that we do. So one of my sons has chosen to do machine learning research, for example. See? Yeah, it's just that it comes to exposure as well. Do you feel it's going to be, the future is more humanitarian or more mathematical and scientific? I don't think it's a choice. I think being humanitarian,
Starting point is 00:22:46 requires a good rational understanding of the world. We can't take decisions for ourselves, but also if you think about AI, we can't take good decisions if we don't understand how the world is and how to reason with that information. And so in order for democratic, you know, humanist values to prevail, we also need reason to prevail. We need science to prevail.
Starting point is 00:23:15 You guys know how much work goes into this podcast. Thank you so much for your support. I started in a newsletter to share more my business mistakes with this and another company that I'm running, AI tools that I'm testing and using actively, and behind the scenes of building my team. It's free and lands in your inbox every week. Link is in the description. Let's keep learning together in this new AI era. So if you could go back 30 years, the moment when you first started working,
Starting point is 00:23:45 on deep learning, what would you do differently? When I started my career, I didn't care too much about politics and society. I was focused on, you know, the math and the programming and interacting with machines more than with people. But as I grew older, I became more aware of how what I was doing would potentially impact society in both positive and negative. ways. So in 2012, 2013, when my colleagues, Jeff Hinton and Jan Locin, were recruited in industry, I was concerned about how AI would be used for personalized advertising, and I thought this wasn't really healthy in some ways. And I decided to stay in academia and to see how AI could be developed for good in medicine to fight climate change. And of course, more recently, I would be
Starting point is 00:24:45 focusing on what can go really wrong if we're not careful how we steer AI, not just the benefits, but avoiding the catastrophic risks. Is there an AI breakthrough that you really want to witness in your lifetime? I would just be content to make sure we don't do something really terrible. I think our democracies are really threatened in many ways and AI could make things a lot worse. and in a way there is a dynamic in which not having good, wise and humanists governance and governments prevents us from steering the eye towards what's going to be beneficial for all. So, yeah, I used to not care too much about social impact and politics,
Starting point is 00:25:39 but in the last 10 years, I've started to be clearly conscious that my work was not detached from society, that my work did have an impact and in fact that I could choose what I would work on to really be aligned with my values and my hopes for the future. Is there any government that's doing it right when it comes to AI? I think most governments underestimate how much of a change is likely to happen as AI capabilities continue to grow. It's a natural human bias. We tend to think of the future as a slightly modified version of the present. But if you take yourself five years ago and think about what we have now, you probably would say that's science fiction, right? And if you go back 10 or 20 years,
Starting point is 00:26:31 for me at least, it's even worse. So we have to do a bit of like twisting our minds to imagine a future where there are machines that are basically smarter than us. And that is the question I think that governments haven't been grappling with sufficiently. So it's January 26, AGI or whatever it is, AI thinking strategically might be a couple years away, jobs are transforming. If you had to give one principle to people to guide their decisions this year, what would it be? Think about what you can do to bring about a better future according to your values and to your emotions. Because if we all remain passive observers of, you know, what's happening, we might not go in the right direction, not the direction that you would want for you, for your children.
Starting point is 00:27:24 But we tend to also underestimate our ability to influence the future. Your audience, I think, is a kind of audience that can have a lot of influence on the future. But we have to start thinking, you know, beyond our little self and more how myself is connected to the world. And what I can do, maybe in small ways, to bring about a better future in, you know, whatever ways. There are many ways. Can you name top three? Like, talk to your government, right? As number one. Yes.
Starting point is 00:27:57 I think one of the biggest dangers we have is not managing the transitions and the growth in capabilities of AI, as I've been talking about. But there are others, you know, what we're doing to the environment is extremely dangerous, although I think it's longer term. I think what is happening with our democracies is very dangerous as well. But it's all right. Each of us can choose, you know, our battles. But we should try to expand our horizon of, you know, what matters and be more ambitious about what we could do potentially.
Starting point is 00:28:30 But we have to do it right. We have to choose where we go. For example, it's not true that everything that could be. be done with technology, you know, is going to be done, we can choose in which direction AI is going to be deployed. I mean, for example, for jobs, in principle, if it's just the market forces, then everything that can be automated will be automated. But maybe that's not what we collectively want. Maybe there are jobs that should not be automated, even though they could because of the choices we make for our collective well-being. I love that. Thank you so much.
Starting point is 00:29:00 This gave me a lot to think about, and I guess we have something on our to-do list. Thank you, Joshua. My pleasure. Ambition comes in all shapes and sizes. At First Citizens Bank, we roll with your goals because we're built for what you're building. Fit for your ambition for Citizens Bank. Yamava Resort and Casino at San Manuel is California's number one entertainment destination for today's superstars. Catch the Jonas Brothers return to the Yonis Brothers.
Starting point is 00:29:32 Yamava Theater stage on April 30th, the powerful vocals of Demi Lovato on May 17th, and the signature Southern Country Rock of Eric Church on July 19th. Tickets on sale now at Yamavat Theater.com, only at Yamava Resort and Casino, celebrating its 40th anniversary. UN must be 21 to enter.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.