Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas - 30 | Derek Leben on Ethics for Robots and Artificial Intelligences

Starting point is 00:00:00 Ready or not, summer is coming, and Wayfair's Memorial Day clearance is on now. Right now through May 25th, get up to 70% off everything home at Wayfair. Plus, score amazing doorbuster deals all sale long and surprise flash deals on Memorial Day. We're talking thousands of products at every style and budget. Now is the time to save big on must-habs for your patio, backyard, and beyond. These savings won't last. So don't wait. Shot Wayfair's Memorial Day clearance now through May 25th. Wayfair, every style, every home.

Starting point is 00:00:30 You're confused about your credit score. One site has one number and another site, something completely... What? That can't be right. It's okay. Forget everything except MyFICO. These free scores from other apps can differ by as much as 100 points from your FICO score that 90% of top lenders actually use when you apply for a credit card, personal loan, car loan, or mortgage.

Starting point is 00:00:52 For the moments that matter, get the score that matters, your FICO score. Visit MyFICO.com and get started for free today. Hello everyone and welcome to the Mindscape podcast. I'm your host, Sean Carroll. And today's episode, we're going to see where the rubber hits the road in moral philosophy. And I mean that quite literally. You've all heard about self-driving cars, and you may have heard about the idea that self-driving cars are going to have to solve the trolley problem.

Starting point is 00:01:20 This famous thought experiment in philosophy where you can either continue to do something and several people will die, or you can take an action to prevent your current course of action and do something different and fewer people will die. Is it okay to intentionally kill a smaller number of people to save a larger number of people? You might not think that this is something you are going to need to deal with, but it's simply an illustration of the kinds of problems that all sorts of robots and artificial intelligences

Starting point is 00:01:50 are going to have to deal with. They're going to need to make choices, and in the extreme examples, they're going to need make hard choices about how to cause the least harm. As one example, should a self-driving car, if there are two bicyclists in the way, and it judges that it's going to have to hit one of them, should the self-driving car target a bicyclist with a helmet rather than one without, on the theory that wearing a helmet makes that bicyclist more safe, and therefore, in some sense, that person should get punished for wearing the helmet. That doesn't seem right. These are moral intuitions

Starting point is 00:02:25 that lead to really hard problems and we have to face up to them. Today's guest, Derek Lieben, is a philosopher who has written a new book called Ethics for Robots, where he tackles exactly these questions, not just self-driving cars, but the general idea of what kind of moral decision processes

Starting point is 00:02:43 should we program into our artificial intelligences? I think it's just a fascinating topic to think about because on the one hand, Derek's book involves big ideas from moral philosophy, utilitarianism versus deontology, John Rawls' theory of justice, things like that. On the other hand, very down-to-earth questions about game theory, the Prisoner's Dilemma, Nash Equilibrium, Pareto Optimality,

Starting point is 00:03:08 other sort of economic and rationality-oriented ideas need to come into play here. So to me it's a great example of how the abstract theorizing of philosophy suddenly becomes frighteningly relevant to real-world decisions. Personally, I do not own or have plans to buy a self-driving car in the near future, but I do think they're coming. Moreover, artificial intelligences of all sorts are all around us and have an increasing effect on our lives. Therefore, we should be thinking about these issues, and now's a good time as any. Let's go. Do you ever feel like you're drinking from a firehouse?

Starting point is 00:03:43 Paycor's intelligent HR solution empowers leaders to turn down the pressure. Their unified platform includes payroll, talent, management, compliance software, and a lot more. Connecting you to the people, data, and expertise you need to drive long-term business results. Visit paycourt.com slash leaders and go from work flood to workflow. That's paycourt.com slash leaders. When people turn to telehealth or weight loss, they're looking for real support. That's why more people are choosing orderly meds.com.

Starting point is 00:04:15 Orderly meds connects you with real doctors and access to proven GLP1 medications like semaglutide and terseptitide. No guessing, just a more supportive experience, and all ship directly to your door in discrete packaging. Do your research, ask questions, then visit orderlymeds.com slash podcast for an exclusive offer. That's orderlymeds.com slash podcast. Individual results may vary not. Medical advice, eligibility required seaside for details.

Starting point is 00:04:58 Derek Lieben, welcome to the Mindscape podcast. Thanks so much for having me. So this is a topic, you know, ethics and morality for robots and artificial intelligence. Everyone's thinking about this. We all know self-driving cars are coming, and they're going to be apparently running into people on the streets right and left, just deciding how many people to hit. So could you just give us your short version of why it's necessary even to talk about ethics or morality for robots? I mean, do they even have ethics? Should they just do what we program them to do?

Starting point is 00:05:31 Yeah, so that's a great place to start. As we are starting to develop more and more autonomous technologies in the fields of transatlantic. transportation, medicine, warfare, they are starting to make these decisions that are going to have impacts on human health and safety and opportunity. And for that reason, we need to start thinking about which actions are permissible for them to do and which actions are impermissible. And I take this to be just an inescapable fact of designing machines that are making complicated decisions about human well-being that are going to be made. making these decisions without very much human supervision, right? We're just going to have to inevitably decide what kinds of rules we want these machines to be following. Now, I'm not sure if we can use certain words to talk about these machines like responsible or not, or blameworthy or not,

Starting point is 00:06:31 but that's sort of beside the point for me. What I'm most interested in is what are the rules that we're actually going to be using to program into these machines because that's definitely something that we need to do. Yeah, you have used the word decisions as if the robots have the ability to make decisions. Is that an important word or is it just a way of talking about the fact that the robots are going to be doing something and we have to decide what we want it is for them to do? That's interesting. For me, it doesn't really matter whether you call this a decision or an algorithm. I know that maybe some people might think of decision as something performed by an agent with free will who could have done otherwise or something like that. But for me, that's not too important.

Starting point is 00:07:20 I'm not going to get hung up in those kinds of issues. I mean, certainly some moral theories would, like a Kantian about ethics is going to say that only an agent with. free will who understands what he or she is doing is capable of being a moral agent who makes decisions at all. But that may already reveal something about my normative assumptions that I'm making. And that's something that I think we'll see as we move forward, that every kind of choice you make in programming a machine is revealing something about your normative assumptions. It's sort of impossible to stay free of ethics, to just say, well, I'm going to avoid ethics entirely.

Starting point is 00:08:04 Right. Well, I agree with that very much. And also, I'm happy that it doesn't matter whether we attribute free will to the robots because I hate talking about free will and yet I end up doing it all the time. So I'm glad that we can avoid doing it for this one. Okay, so let's get one thing out of the way, which I'm sure you've been hit with before. What about Isaac Asimov? Didn't he explain to us how to make moral rules for robots?

Starting point is 00:08:29 doesn't he have three laws and should we just implement them? Yeah, so Asimov did propose these three rules, which are really just one rule and two sort of subservient rules that say obey people and protect yourself. But the first rule is really the one doing all the work and it says don't cause or allow harm to other humans. Now, on the face of it, that's actually pretty good. And I think the reason why there's a sort of variety of moral theories and a variety of different kinds of rule systems that we've constructed is that most of them do pretty well in normal circumstances. It's a bit like, I'm going to make the first of many analogies to physics here because I love to make analogies to physics and now's my chance. It's a bit like using classical mechanics in most normal circumstances where if you're not going to make analogies to physics, I love to make analogies to physics and now's my chance. circumstances where if you're not going very fast and you're dealing with moderately sized objects,

Starting point is 00:09:26 this works really well. However, when you get to these very sort of extreme situations, you start to see differences between the moral theories. And the problem with Asimov's laws is that it's vague about certain situations like the violation of property rights, the violation of dignity, insulting people. Does that count as harming them? Does trespassing on their property count as harming them? Does blowing smoke in someone's direction count as harming them? And also it fails in these situations where every action either does or allows harm to others. And these are called moral dilemmas. So in some scenarios, you can't avoid either causing or allowing harm to others. And Asimov law simply breaks down. Yeah, to be perfectly honest,

Starting point is 00:10:18 The question that I asked you was because I feel I have to ask it, but I think you're being far too generous to Asimov's laws. I think they're just silly. You know, the idea that a robot cannot through inaction allow a human being to come to harm is entirely impractical. Like, human beings come to harm over the world all the time. Every robot would instantly spring into action trying to prevent every human from stubbing its toe, right? Yeah, exactly. And this is a point that philosophers like Peter Singer have made for decades is that almost everything we do in the world has some kind of effect on people that we might not even be aware of. I mean, Singer has gone to great length to show that the way that we eat, the way that we travel, the way that we spend our money is actually having effects on other people.

Starting point is 00:11:07 We could be doing other things with that money, with that food, perhaps driving around in our cars. were not thinking about the damage that we're doing to the environment and to future generations. Yeah, and I also like the idea that you need to be a little bit more specific. I think a lot of people do have this idea that, you know, the example I used in my book, The Big Picture was in Bill and Ted's Excellent Adventure, we heard the moral rule that you should just be excellent to each other. And I try to make the point, that's fine, but it's not quite good enough. You haven't told us what excellence is.

Starting point is 00:11:42 And actually recently, I got into a Twitter conversation with Ed Solomon, who was the screenwriter for Bill and Ted's Excellent Adventure. And he was very pleased that even if I was saying it's not a good rule, that it made it into this kind of consideration. And certainly when robots are on the scene, we have to be a little bit more specific, a little bit more clear, a little bit more quantitative maybe, about what constitutes a morally correct action. Is that right? That's right. And I agree that what you're describing is a theory that you are familiar with, virtue ethics. And it basically says, do whatever a noble person would do. Now, this kind of works if you have a good exemplar of a noble person handy, but then there are all sorts of problems.

Starting point is 00:12:25 Like, how do you know that this person is actually a noble person whose behavior we should be trying to imitate or not? And if you are a noble person in one culture, that might be a very, very terrible person in another culture. And so that leads to this problem that you were just talking about, which is where do we look for these more precise, more quantitative, more formal approaches to designing moral algorithms? And there's a lot of different places. One place we might look is in human judgments and try to model machine behavior after human behavior. Another place we might look is to moral theories and try to actually take a historically important theory. and implement it into a machine. And so, yeah, in your book, Ethics for Robots,

Starting point is 00:13:16 there's just a wonderful little book. Everyone can read it, and it really delves into a lot of fun things. Am I accurate in saying that you contrasts three major approaches here, utilitarianism, libertarianism, and contractarianism? That's right, yeah. So I talk about some historically influential moral theories, utilitarianism, Kantian ethics, contractarianism,

Starting point is 00:13:40 And you could also include some other ones. You could talk about virtue ethics if you're interested in that or not. But like I said, I don't think that virtue ethics is going to be specific enough to make it into this club. Be a virtuous self-driving car is hard to actually implement in real life. It is. Unless you're doing something like let's train the machine in a sort of bottom-up approach, as they say, to be like human beings around us. There's actually a few people like Wendell Wallach and Colin Allen who have proposed that this is a good approach to designing moral machines.

Starting point is 00:14:18 However, my objection to that is that you're probably going to incorporate all of the terrible biases and limitations of the humans that you're using to model your machine's behavior on. So these machines are probably going to wind up being terribly racist and sexist and not. thinking very clearly about consequences versus rewards and so on. And also, isn't it just a little bit circular, you know, to what it means to be moral is to be like a moral person? I mean, I think I'm on your side here, if I'm right, that you would say we need to be much more explicit, both for robots and for human beings, about what it objectively means to be a moral person.

Starting point is 00:15:03 Yes, I totally agree. So it sounds like we are both on board with virtue ethics, not necessarily, being a good approach here. So maybe we can move on to what you were talking about, which is these historically important moral theories. And in your book, The Big Picture, which I also am a fan of, and I recommend, you talk a lot about how these theories are constructed around our moral intuitions. And there might be different, consistent, internally consistent set of rules that you could get from different kinds of moral intuitions. And I actually, actually think that is completely correct. I think that that is historically where these theories

Starting point is 00:15:44 have emerged from. And the question is not so much, well, which intuitions should we rely on? But what is, I think, the evolutionary function of the intuitions that we are using in the first place? Right. So one answer to this question is there's a lot of these different internally consistent sets of rules that are all more. or less intuitive in some kinds of circumstances, but we have no way of evaluating one versus the other. If somebody wants to be a utilitarian and say it's wrong to, for instance, buy a cappuccino when you should be giving money to famine relief. And another person wants to be a contian and say, well, your intentions when buying the cappuccino were just to have this delicious coffee beverage. and therefore it's only a side effect that these people were harmed, right?

Starting point is 00:16:42 How do we resolve the disagreement here? Now, you could just say, well, these are just equally important and equally coherent sets of rules, but I think a better approach is let's look at the evolutionary function of the intuitions that they are drawing on and say, is there a sort of unified form? framework that matches that evolutionary function, that original goal of the system. Do you ever feel like you're drinking from a firehouse? Paycor's intelligent HR solution empowers leaders to turn down the pressure. Their unified platform includes payroll, talent management, compliance software, and a lot more, connecting you to the people, data, and

Starting point is 00:17:29 expertise you need to drive long-term business results. Visit paycourt.com slash leaders and go from work flood to workflow. That's paycord.com slash leaders. When people turn to telehealth for weight loss, they're looking for real support. That's why more people are choosing orderly meds.com. Orderly meds connects you with real doctors and access to proven GLU-1 medications like semaglutide and terseptatide. No guessing, just a more supportive experience, and all shift directly to your door in discrete packaging. Do your research. Ask questions. Then visit orderlymeds.com slash podcast for an exclusive offer. That's orderlymeds.com

Starting point is 00:18:05 slash podcast. Individual results may vary not medical advice, eligibility required, C-Sight for details. Right. So, I mean, I think we agree on a lot of things, but maybe we disagree on the meta-ethical question of whether moral rules are objectively real or not. Is that right? That's right. And a lot of this hinges on what we mean by real. So I think I mean... It usually does, yes. Yeah, exactly. So when I say real, I mean real in the sense that smoking is bad for your health is a fact. It's a real fact about human beings and it's dependent on certain goals that we all share. Now, this is something that I think you're inclined to agree with too, that if we talk about morality as a set of, as the philosopher Philip Afoot once said, hypothetical imperatives as a set of sort of if then statements. If you want to be X, then you should do Y.

Starting point is 00:19:00 If you want to be healthy, then you should exercise and eat right and so on. then we can have objective answers to this question within the domain of us all sharing these goals. However, I am in agreement with you. And I completely agree with that, right? Yes. And so outside of these goals that we all share, there's no way of talking about one state as better or worse than another one. And that's where I think you and Sam Harris might disagree here. And I'm on, I'm on your side of that debate.

Starting point is 00:19:28 He has this thought experiment that he calls the worst possible suffering for everyone, where he says, imagine everyone is in total misery all the time. Clearly, that's objectively bad. But I think it only makes sense to say that that's objectively bad if we already are within the realm of caring about suffering and avoiding suffering. Yeah, I mean, to me, it's much like saying, well, aesthetic judgments are objective because if we imagine the world's ugliest painting, everyone would agree that it's ugly, therefore it must be objective. But I think that, you know, I think it's okay to admit that we're contingent human beings made of atoms, obeying the laws of physics, and this thing called our moral goals are very reliant on historical contingent things, right? Like different people could have different goals. But happily, we agree on a lot, and we can build from what we agree sensible moral systems.

Starting point is 00:20:28 Yes, I totally agree. Now, the question is, is which of these internally consistent and all apparently intuitive different sets of rules should we select from? And we're going to have to make choices like that because, as you said, in constructing a self-driving car, we're going to have to decide which paths that result in collisions are better and worse than others. Right. And it would be easy. it would be really wonderful if we could just avoid all collisions. That would be make my job completely unnecessary, and I am fine with that. If it turns out that we can avoid any kind of harm to anybody,

Starting point is 00:21:13 then we don't need ethics at all. But every time the machine is going to be evaluating one path and comparing it to another, it's going to have to decide which collisions are better and worse. Now, there's a lot of ways of doing it. It could, for instance, consider the driver and the passengers of its vehicle as more valuable than the other passengers. It could consider swerving to hit someone as better or worse than continuing straight. It could view hitting and injuring more people as better or worse than fewer people.

Starting point is 00:21:49 Now, some of those might seem more obvious to you or less obvious to you, but the problem is, how do we actually resolve this? And I think we need a moral theory. And for that, like I was saying, we need some framework of comparing these theories. And I think the only way of doing that is by saying which moral theory actually fulfills this metaethical assumption that moral theories evolve for the purpose of promoting cooperative behavior amongst self-interested organisms. If you have a moral theory that actually is better at creating cooperative behavior, then other, kinds of theories, then that's the one we should use, I propose, in designing these vehicles. Yeah, maybe it's good to clear up this issue before moving on to the specifics of the different moral theories. There is this objection out there to this kind of discussion that says, you know,

Starting point is 00:22:41 who cares? Cars are not really going to be solving trolley problems. If they see something bad, they're just going to hit the brakes and stop. And so this is all kind of irrelevant. And I suspect that's just an impoverished view of how moral reasoning works. And, you know, people get annoyed with philosophers for inventing trolley problems because they say, well, I just don't want to play this game. But it's a way of sharpening our intuition, right? And actually, it would be useful here if you laid out what a trolley problem is. It's conceivable that some of the audience doesn't know.

Starting point is 00:23:13 And you also, you know, you mentioned in the book that Harry Truman's decision to drop an atomic bomb on Japan was very much like a trolley problem. Yes, that's right. So these do happen. And unfortunately, they happen quite often in certain professions, like in medicine and warfare and business, where you have to make choices about causing harm to one person or allowing harm to many other people. Now, this sounds very strange to most people like me because I think about myself as, well, I'm just going through my day. I'm buying cappuccino. I'm grading papers. I'm watching Netflix, what am I doing that is causing harm to others? But in fact, if you're buying that cappuccino, you are weighing your own pleasure against the happiness and suffering of other people who you could be donating that money to. If you're watching Netflix, you could be doing more with your time. If you're eating meat, you're making judgments about whether animals are valuable or not. And the trolley problem is one of the scenarios that was constructed in order to demonstrate the differences between, well, initially between doing and allowing and killing many people versus one, or I should say sacrificing one person to save many. So in the scenario, you have a runaway train going down a track, it's going to hit five people.

Starting point is 00:24:39 and the only way to stop it is to either divert it onto a side track where a single person is or in another condition to push a large man in front of the train and stop the train. Now it turns out that most people say it's permissible in surveys to switch the track onto the single person but not permissible to push the large man to his death, even though that seems strangely inconsistent. If you are only considering the consequences, then it is the same exact effect. It's killing one to save five. But if you think that actually performing some kind of physical intrusion

Starting point is 00:25:20 into somebody else's personal space is important, or if you think about this in terms of a causal chain or your intentions or something like that, then maybe there is a difference here. So the trolley problem is one way of comparing what different moral theories might say. Utilitarians will say you always kill one to save five, a lot of deontologists, what they have in common, aside from just focusing on rights and duties,

Starting point is 00:25:46 is that they often say that there's nothing wrong with just standing back and allowing things to happen. So this is called the difference between a positive and a negative obligation. The philosopher Robert Nozik made a big deal about this and said, look, it's wrong of me to push you in front of a train. It's not wrong for me to allow you to be hit by a train. And so, according to most deontologists, it's wrong for me to push you in front of a train, and it might also be wrong for me to pull that switch causing the train to go on to you. But I think that it's a very helpful illuminating thought experiment precisely because how we

Starting point is 00:26:25 viscerally think about what's morally right and wrong might not match with what we say we're thinking, right? I mean, that's what the pushing the guy off of the footbridge really brings home, that, you know, if you would ask someone abstractly, is if given two choices, would you let five people die or one person die? They'll say, yeah, I would let the one person die. But then when you make it concrete, you have to actually kill this one person to save the other five. They're unwilling to do it. And, you know, that's kind of okay. And I don't think it's irrational. It's just revealing that our moral intuitions are not always very coherent or matching up with our moral cognition. Yeah, and as someone who's taught ethics classes to undergrads for a long time, I'm very familiar with how sort of inconsistent people's everyday moral intuitions are. You know, if you press just a little bit on some of these trolley problems, like, well, it's okay to pull the switch. And in fact, you should pull the switch to kill one, to say five, but what if it was your mother or your father, your best friend on the side track? Well, then people change their minds. And then you ask them, well, why do you think it's okay for you to value your parents over other people's parents?

Starting point is 00:27:38 Surely these people are equally valuable. You're not saying that your family is actually better than other people's families. But it turns out that people's moral judgments actually are sensitive to this. There was a fantastic experiment done by some researchers led by April Bleske-Retchick, and they found that alternating by condition the genetic, relatedness of the person on the sidetrack will alternate whether people are willing to pull the switch almost exactly as you would predict by the proportion of their genetic relatedness. So brother, cousin, and so on.

Starting point is 00:28:13 Right. But most more... I would sacrifice, you know, my brother or sister to save two cousins or something like that. Yeah, exactly. I think that was a quote from, was it Herbert Spencer or something. Yeah, that's right. And so that, but most moral theories agree that genetic. relatedness should not play a role. And in fact, if you press people and ask, well, are you saying

Starting point is 00:28:35 that people who are genetically related to you are better than the people who are not genetically related to you, they'll say, well, of course not. That's silly. I would never say that. But by acting in that way, they are revealing that judgment is actually playing a role in their behavior. Yeah. And I think that, again, part of it is that a lot of people have a presumption, whether it's explicit or implicit, that the right kind of... kind of ultimate moral theory will be something utilitarian. But maybe it's not. Like, maybe it's perfectly okay. Or at least, let's say, not maybe, but let's say I can imagine a perfectly coherent moral theory that very explicitly gives more credit to people who are closer to me when

Starting point is 00:29:18 I come to saving lives. Yeah, exactly. So, I mean, the utilitarian has to say some really, really, really weird things. But the other moral theories also sometimes have to say some really, really weird things. And this is what you have to get used to is that no consistent set of rules is going to give you everything that you want all the time. And my theory that I'm advocating also tells me some things that I really don't like and I think is really weird. Why don't we give you a chance to explain what your theory is, which is not completely yours. You know, you're building on quite a tradition here. That's right. So I am advocating a moral theory that is drawn from a tradition called contractarianism. And the most recent version of this that I'm using was from a philosopher, an

Starting point is 00:30:06 American philosopher named John Rawls. And in his book, A Theory of Justice from 1971, he proposed that the best way of designing a fair society is to imagine that we were in this original position where I don't know who I'm going to be. I could be anyone. And in this kind of idealized bargaining position, he thought we would all come to agree on certain basic distributions of what he called primary goods. And the distinction between primary goods and secondary goods are that when you don't know who you're going to be, you don't know if you're going to be male or female, tall or short, handicapped or perfectly abled, and so on. And if that's the case, you don't know what kinds of particular things you're going to

Starting point is 00:31:04 value. Are you going to like television or you're going to like coffee? Maybe, maybe not. These are all secondary goods. Primary goods are the kinds of things that all human beings value no matter what, that all human beings have to value in order to pursue any kind of goal at all. And this list includes things like your life, your health, your opportunity, and essential resources for survival. So no matter what you want to do with your life, if you want to be a juggler, if you want to be a lawyer, if you want to be a physicist, you need to have essential resources, opportunities, and health.

Starting point is 00:31:39 Right. And so going back to our previous discussion, these are the kinds of things of the kinds of of things that we have as you could call them common ground, that all human beings, as a matter of fact, just by virtue of being human beings, care about. And if someone says, well, I don't value these things, I could say, yes, you do, you're a human being and you pursue goals. And so you care about your health and safety and opportunity. Looking to start 2026 with an unforgettable getaway. Fiji Airways sale is on now. Fly nonstop round trip to Fiji from just 748. Or enjoy a sunny Fiji stopover on your way to Australia with round-trip fairs starting from 839.

Starting point is 00:32:17 As a One World Alliance member and American Airlines Advantage program partner, your miles go further across the South Pacific. Book now at Fiji Airways.com or contact your local travel agent. Conditions apply. Do you ever feel like you're drinking from a firehouse? Paycor's intelligent HR solution empowers leaders to turn down the pressure. Their unified platform includes payroll, talent management, compliance software. and a lot more, connecting you to the people, data, and expertise you need to drive long-term

Starting point is 00:32:54 business results. Visit paycourt.com slash leaders and go from work flood to workflow. That's paycourt.com slash leaders. You know, I actually took a class with John Rawls in graduate school. Really? Yes. It was very funny because I, well, I audited the class, but I went to the sections and everything. And I remember one day walking with a friend of mine across kids.

Starting point is 00:33:18 campus and the people who had actually taken the class for a grade were coming out of the final exam. And I just ran into them. And so I said, hey, you know, how did the exam go? And they were like, it's very fair. Which, of course, you know, my friend who was like, that must not be your physics friends. Those must be your philosophy friends, because no physics people have ever come out of an exam saying that was very fair. But Rolls' whole thing was justice as fairness and trying to make things as fair to everyone as possible. That's right. And Rolls, is primarily known as a political philosopher because the kinds of things he was talking about designing from this original position were mainly policies and the structure of our government

Starting point is 00:33:59 and social institutions. However, towards the end of the book, he talks a little bit about using this as a framework for individual decision-making, and that's what I want to be doing. I want to say that we can also use this as a way of thinking about what kinds of actions are wrong and what kinds of actions are permissible. From the original position, Rolls said we would all agree on a certain distribution principle that he called the Maximin principle. And the Maximin principle has a history from game theory. And in this context, it just means we would agree on a distribution which makes the poorest person as best off as possible. Yeah, actually, I do want to get into this, but I realize now while you're saying this, there's a prior thing I want to just touch on very quickly.

Starting point is 00:34:45 you mentioned the fact that I think that Rolls himself would have cast his theory as a political one, right, a way to organize our society. In fact, only a well-ordered society, right? I mean, he admits that there would be cases where things were in extreme distress where you'd have to violate his principles. But the idea, as I understood it, was that we could disagree on basic moral conceptions, but if we agreed to live together in a liberal democratic polity, then he had these rules for how to reconcile our different moral conceptions. And so you're going a little bit farther because you want to actually use this as a theory of morality as well, right? That's right. Well, I want to say that there are many kinds of values that are equally comparable to each other.

Starting point is 00:35:34 However, those all exist within a kind of space that is constrained by essentially this moral decision procedure. So there are lots of equally good distributions according to the maximin principle. And within that space of equally good distributions of primary goods, then we could go ahead and impose many different kinds of values and have interesting disagreements about which sets of values are the ones that are best. But importantly, that's all occurring within the space of a sort of maximin constraint. But it is a little bit, it's asking you a bit more, right? Because the original position is something where we forget some things about ourselves, like you said,

Starting point is 00:36:23 and we remember other things ourselves, the difference between primary versus secondary goals. And this seems potentially more problematic if we want to get out of it moral rules rather than just a political system. I mean, if someone is in real life very religious and has some religious convictions that strongly flavor their ideas of right and wrong. Are those convictions, things they will have to forget in the original position? Yes. Wouldn't that lead them to something that's a moral theory coming out of the original position that is very different than the one they actually have? Probably, yes. But I don't think we need John Rawls to convince us that religion is irrelevant to ethics. I think just some basic assumptions about what we mean by making moral choices can do that. And this goes back to

Starting point is 00:37:12 the dialogue, Uthafro by Plato, where he plausibly demonstrated that, look, even if God were to say that slavery and child abuse and rape are morally good, that doesn't make them good. And so usually when I'm talking to somebody who thinks that morality is based on a certain set of religious beliefs, it takes about 90 seconds of talking to them to get them to finally admit, well, yeah, I guess you're right, that it doesn't really matter. what the religious text says, what matters is something else. I think that I'm on board with the, I can never pronounce it, Euthyphro. Oh, Euthyphro, yeah. Uthifro dilemma and why there should be some criteria for morality other than what God says. But nevertheless, I think that I could imagine that the actual moral beliefs that a religious person has are different, not because God gave them the moral beliefs, but because their

Starting point is 00:38:10 religious beliefs affect how they think about the world, right? How they think about the ontology of reality. If you believe that human life begins at conception, you might have different views on abortion than if you believe it's just a bunch of cells obeying the laws of biology. Yeah, that's interesting. I think it's, the abortion case is difficult. And in fact, people often, when they're talking about ethics, jump right to abortion, which is one of the most complicated moral topics. And I usually like to point out, well, you know, you're you're basically starting off at the introduction of the book and just skipping right to the most complicated problem at the very end. Right.

Starting point is 00:38:49 There's a lot of stuff that goes on in between. And most problems that we face are actually, I think, ones that are plausibly ones that have good moral answers to them. Like, as I mentioned, slavery, child abuse, rape. And then, of course, we get to more difficult cases like, charity and eating animals and driving a fossil fuel vehicle, which I think are, in fact, things that most people think are maybe morally permissible or wrong and are probably mistaken about. And then we get to very, very difficult cases like abortion, which even if there is not an answer to that, and I think there is probably an answer to it, but even if there is not,

Starting point is 00:39:40 that doesn't invalidate everything that sort of went up to that point. Okay, but, you know, I'm just trying to get on the table the idea that when we get back to the self-driving cars killing people, solving their little trolley problems as they're going down the street, I could at least imagine that people have deep-seated moral convictions that wouldn't qualify in Rolls' conception as primary goods, and they might object to having those conceptions stripped away from them as we put them in the original position. I think that's correct, but I also think it's correct that any group of people who are interacting with each other are going to have certain beliefs that are not respected in the process of sort of interacting. And so if we are having a civil society together, just to take the political case,

Starting point is 00:40:32 then inevitably some of your beliefs are probably going to come into conflict. with other people's beliefs and with the institutions. So if I have a religious objection to say, oh, I don't know, respecting other people of different races, then you're going to say, as a government, no, you actually have to treat everybody equally, and it doesn't matter what your religious beliefs are. Right.

Starting point is 00:40:57 Yeah, no, I think that the reason that I harp on this is I've become very, very interested in this potential conflict between fundamental moral positions that individuals might have and the goal that we presumably share of living in a liberal democratic society. I think that we tend to paper over them these differences a little bit, but I think we should respect that they could be true conflicts and have to eventually say something like what you just said, which is that, yeah, you know, suck it up. Some people are going to have to make compromises if we're going to live together like this. Yeah, I totally agree. And I think instead of phrasing it,

Starting point is 00:41:34 I mean, I would also say suck it up, but that's sort of the Pittsburgh in me. I think that a more congenial way of phrasing that is that you might have very different religious values from me, but there are a set of values that we all share. And what we need to base a moral theory on is the sort of universal grounds that we all have in common, the kinds of things that enable all human beings to pursue their goals. You're right. That's a much more public relations friendly way of putting the points. I think that that's a wise way of doing it. Okay. So I know you want to get to the Maximian principle. So do I. But maybe even before we do that, I was, I really liked in your book the casting things in terms of game theory and prisoner's dilemmas and Nash Equilibria and Pareto Optimality and all these other buzzwords. And I think that, you know, this is why we have an hour-long podcast so we can actually explain a little bit how you're, you know, how you're you're thinking because it's a very helpful conceptual tool. So why is game theory something that is useful tool to have in mind when we think about these issues? Yeah, I am really excited to talk about this because I think this is the way forward in resolving these kinds of tensions between different consistent moral theories. This kind of tool was only available to philosophers

Starting point is 00:42:57 for the last 50, 60 years or so. A lot of times when I tell people, that I work on designing moral frameworks for machines. They'll say, well, haven't philosophers been doing this for thousands of years and they haven't gone anywhere? Now, my first response is, well, just because a problem has been around for a long time doesn't mean it can't be solved. And the second response is actually that I think the tools

Starting point is 00:43:23 for solving these kinds of problems have really only emerged in the last 50 or 60 years or so. And when we talk about evaluating things, based on which one promotes cooperative behavior. There's been a lot of talk in the history of philosophy about promoting cooperation. The British philosopher Thomas Hobbes talks a lot about if we didn't have any kinds of rules, we would need to invent ones in order to cooperate. You and I have been talking about living in a civil society and cooperating together. But exactly what does this mean when we talk about cooperation? Well, there is a very, very technical way of

Starting point is 00:44:01 describing this, and you can describe it as a kind of improvement from simple self-interested behavior. Self-interest is actually a very powerful tool, and in the 1950s, John Nash described a certain method of showing how self-interested agents would come to certain equilibria in interactions, in games with each other. And of course, games doesn't just mean poker and blackjack, but it can just mean any situation where two or more people are interacting, and there are gains and losses for those people. But poker was a big influence, right? That was a big inspiration.

Starting point is 00:44:40 That's right. Well, it also includes games, too. Yeah. So what we're talking about here are cases say where the prisoner's dilemma is usually phrased in terms of sort of cops and robbers drama, where let's say that I arrest you for drug dealing and I know you're dealing drugs, but I don't have enough evidence to convict you. Now, I make you a deal, you and your partner, a deal in separate rooms. I say, look, if you will confess to the crime, I'll let you off free, but I'm going to put your partner away for good. And you know,

Starting point is 00:45:19 if you both stay quiet, that you actually get, let's say, a very low sentence. If both of you confess, you both get a medium sentence. And it turns out, in this kind of scenario, a lot of people might think it's intuitive that you should stay quiet, but according to Nash, you should both confess. You should both squeal on each other. Right. Now, what's weird about this is you've got a conflict where it turns out there is an improvement. If both of you confess to the crime, then you both get a low sentence, but, or I'm sorry, a medium sentence. But if both of you were to stay quiet, you would have both gotten a low sentence.

Starting point is 00:46:04 Now, that's what's called a Pareto improvement because it's an improvement for everyone. It doesn't make anyone worse off. And these are the kinds of improvements that economists think are the bare minimum for rationality. Like if I finish my lunch and you're hungry and you're sitting next to me, I've still got some leftovers, it seems obvious that I should just give some to you. It doesn't make me any worse off. and it makes you better off. It's a Pareto improvement.

Starting point is 00:46:29 It's from the Italian economist Vilfredo Pareto. And so I'm defining cooperation problems as ones where self-interest here and self-interest I'm measuring as a Nash equilibrium. In fact, there are lots of different situations that have different kinds of Nash equilibrium. You could have multiple Nash equilibrium. But there exist Pareto improvements from Nash equilibrium. And the prisoner's dilemma.

Starting point is 00:46:56 If I understand it correctly, the Nash equilibrium is one where one person cannot unilaterally change to get a better outcome without hurting somebody else. But a Pareto improvement would be where if we all change at once, we will all be better off. Is that right? That's exactly right. And that's the challenge is that Pareto improvements from these Nash Equilibria are not things that self-interested rational agents can do. They're not capable of it. So there's a cooperation problem. Exactly.

Starting point is 00:47:27 There's a problem. And this is the challenge that Thomas Hobbes in the 1600s was describing, is that people in their own self-interest are going to be led to these outcomes that are actually not optimal for everyone. And so we need this sort of third party, as he called it, a Leviathan, or maybe a set of rules or a government to come in and force us to act in a way that's for everybody's mutual benefit. Right. And being, as you just alluded to, being Pareto Optimal is kind of something that nobody could disagree with, right? It's not necessarily the final answer to our right thing to do, but if there is something where if everyone acted in a certain way, literally everybody would be better off, or at least the same, then how could anyone object to that, right? Exactly. And it's the kind of thing that you would expect in the evolutionary history of our species and other species. as well, would motivate certain adaptive traits to emerge, would actually lead certain traits to emerge to force us to cooperate in places where we wouldn't have cooperated before. And you go on to propose the repugnant prisoner's dilemma as an illustration of how

Starting point is 00:48:45 straightforward utilitarianism can lead us wrong. And it's kind of a version of the utility monster thought experiment that, I guess, was it Nozic who put? propose that? Yeah. And where, you know, if one person can become way better off and everyone else suffers just a little bit, straightforward utilitarianism would say, yeah, sure, you know, like let everyone suffer just a little bit because this one person would be so much better off. And you want to argue that that's probably not the moral strategy we want to pursue. That's right. So the great thing about defining cooperation in this very formal sense is that we could actually go on and test which of our moral theories produce more and less cooperative solutions.

Starting point is 00:49:28 So I think that in most cases, like the prisoner's dilemma, the regular prisoner's dilemma, it turns out that utilitarianism, contractarianism, natural rights theories, contient ethics, they all produce the correct result. They all produce mutual cooperation, which is great. And I think our moral intuitions, this sort of mixed bag of cognitive mechanisms that have over time evolved to make these kinds of choices, that they also, in most situations, do a great job of promoting cooperative behavior. Except maybe libertarianism does not get the same answer for the iterated prisoner's dilemma? Maybe, yeah. It depends on how you define causing the harm. So if you, this is something that Nozick talks a lot about in Anarchy State and Utopia,

Starting point is 00:50:21 which was his rebuttal in the 70s to Rawls. He said that if you're talking about causing harm as sort of doing something where if you had done otherwise, she would have been worse off, then in this case, maybe the prisoner's dilemma is not an instance of causing harm to the other person. if you confess or if you stay quiet. But it's difficult to say in that scenario, really, what counts as causing harm. But nevertheless, I think I take your point that for the conventional prisoner's dilemma, we have a Nash equilibrium where everyone defects,

Starting point is 00:51:01 but most sensible people would say that both players in the game are better off if they cooperate. And so we can have that as a starting point of agreement and work from there. Yeah, and it's even more than most sensible people. it's over time, if you are a utilitarian or a contractarian playing prisoner's dilemma, over and over and over again, you will get better outcomes. If you're measuring this in money, you'll make more money over time. If you're measuring this in children, you'll have more children over time. Right. Okay, so now I'm going to finally let you tell us what a good contractarian believes.

Starting point is 00:51:36 I mean, you've mentioned the maximin principle, but how is it different? How would a good contractarian approach something like a prisoner's dilemma or other sorts of, of games differently than a straightforward utilitarian would. Right. So the Maximin principle says that we should prefer the distribution that makes the worst off person as best off as possible. And usually what that means is you have a set of outcomes. You attach values to each of those outcomes, number values.

Starting point is 00:52:05 And in each of the outcomes, you pick the worst off, then you put those into a group and select the highest of the lows. And that's the one you pick. Now, in terms of distribution of money, that's fairly straightforward, how you count and quantify those distributions. But in terms of other kinds of goods, it might be a little more complicated.

Starting point is 00:52:29 However, utilitarians have been spending years, decades, centuries, to try to convince us that pleasures and pains can be quantified and counted. And I think... And add it up. And add it up, that's right. And so the utilitarian wants to run essentially a summation function over all of this and just pick the highest of the sums.

Starting point is 00:52:53 Now, those usually produce very similar answers. So in the prisoner's dilemma, it turns out that adding up all the outcomes and running Maximin both say that we should cooperate with each other. Right. But in other scenarios, you could arrange it, and I mentioned, or you mentioned the repugnant prisoner's dilemma that I set up, you could arrange scenarios where the sum of all the payoffs for people is actually not either what Maximin would say, nor is it what Pareto-Optomality would predict over just self-interested behavior. And for that reason, I think that the Maximin principle is actually the better principle

Starting point is 00:53:35 than the utilitarian one. When people turn to telehealth for weight loss, they're looking for real support. That's why more people are choosing orderly meds.com. Orderly meds connects you with real doctors and access to proven GLP1 medications like semaglutide and terseptatide. No guessing, just a more supportive experience,

Starting point is 00:53:51 and all shift directly to your door in discrete packaging. Do your research, ask questions, then visit orderlymeds.com slash podcast for an exclusive offer. That's orderlymeds.com slash podcast. Individual results may vary, not medical advice, eligibility required, see site for detail.

Starting point is 00:54:05 Do you ever feel like you're drinking from a firehouse? Paycor's intelligent HR solution empowers leaders to turn down the pressure. Their unified platform includes payroll, talent management, compliance software, and a lot more, connecting you to the people, data, and expertise you need to drive long-term business results. Visit paycourt.com slash leaders and go from work flood to workflow. That's paycourt.com slash leaders. And it's precisely because this possibility that there could be great gains for one person, but other people have to suffer because of it.

Starting point is 00:54:45 Exactly. And that's almost a prediction of the theory, not necessarily a motivation for it. The real motivation for it is that it produces cooperative behavior in all scenarios. The prediction is that it will make the worst off as well off as possible. Usually this makes a lot of sense. intuitively. So in a, in a, like you said, a liberal democracy, very often progressives are wanting to benefit the poor before we benefit the rich, that they should have priority. So this is often called a prioritarian principle. However, there are some other situations where it wouldn't be

Starting point is 00:55:27 prioritarian. I think the real, where the rubber meets the road here is when you start actually attaching values to outcomes. And you have to do that by saying, here are the primary goods. What I was saying earlier are the kinds of goods that all human beings from the original position would care about, our health, our safety, our opportunity, and you try to quantify them and then calculate the effects of your action on those goods. So if I say, look, if I'm going to punch you in the face. I might get some amount of pleasure from that if I'm some sadistic weirdo. I don't want to do that. But if I did, but it would be terrible in terms of your health and opportunity. And that loss to you in primary goods is not equivalent or not made up for the gain to me in the secondary goods that I get,

Starting point is 00:56:21 namely my pleasure or something like that. And so when we talk about applying this to self-driving cars, what we need to do is we need to have a way of quantifying the effects of every collision on the health and safety of the passengers, of pedestrians, of people in other cars. And then what a contractarian would do is run a maxi-men function over all of that and say, here are three different collisions. What are the worst health outcomes in each of these collisions? And I'm going to pick the best of the worst case scenarios. So that sounds like something sensible, but just before we dig into that, I think it's safe to say that in the political sphere, where Rawls was originally talking about, his reasoning does seem to lead us to quite a redistributive way of running society, right?

Starting point is 00:57:15 The very worse off people have to be improved by any inequality that we allow. So it's a very different world than where we actually live, where in modern capitalism in the United States, there's plenty of people suffering a lot with the idea that there's other people who are doing really well off because of economic growth, and that's what a tradeoff we're willing to make. That's right. Now, if you're utilitarian, you care very much about the suffering of these other people. So you use the word suffering. But if I'm a contractarian, I don't care about their suffering. I care about the distribution of primary goods, namely their health, their safety, their essential resources.

Starting point is 00:57:56 And so what I care about is making sure that the worst off people in the population are brought up to a minimal level of, let's just call it, normal functioning, right? Now, there's a lot of discussion about this in bioethics, about what is normal functioning, how do we quantify normal functioning, but that's essentially what a contractor is trying to do, to bring everybody up to a minimal threshold of opportunity and safety, but not happiness. In fact, contractarians don't care about happiness. Happiness is not the good that we are calculating. Well, but for roles, certainly wealth that an individual has would be among the goods that we do calculate, right?

Starting point is 00:58:39 When we talk about the difference principle saying that inequalities should only be allowed to the extent that everyone is better off, Wealth is among the things that makes us better off. Sure, but only to the extent that wealth is able to get you the essential resources you need to pursue goals. If you are a masochist and you enjoy suffering, that's fine. As long as you have enough essential resources to continue being a masochist, then that's all a contractarian cares about. Sure, that's right. I'm just trying to, you know, they're, I'm just trying to, because when we get to the self-driving cars, there'll be competing conceptions of what the cars should be doing. So I just want people to know there are analogous competing conceptions in the political arena. You know, a Ralsian, at least at face value, would be much more democratic socialist, whereas a libertarian would be much more capitalist in terms of how the economy should run itself, right?

Starting point is 00:59:34 And these are both plausible theories that we can argue about. That's right. Yes. Good. So then when we come to the cars, you're going to try to implement some kind of. kind of Maximin algorithm in the mind of a self-driving car. That's right. So I think there needs to be a database of collisions and the effects of these collisions on

Starting point is 00:59:58 most people of comparable, let's say, size and position, right? Now, this is something that you might think is really, really complicated and even maybe a little bit silly, but I think the alternative is even sillier. So right now a lot of the major car companies have the official position of just saying, well, we think all collisions are bad and we want to avoid them all equally. But I think that's an incredibly ridiculous position to take because not all collisions are equal. Obviously, getting hit by a vehicle moving at two miles an hour is better than getting hit by a vehicle moving at 20 miles an hour. And I want vehicles that are evaluating different paths to say that one collision is,

Starting point is 01:00:43 better than another. Yeah, no, I think that I don't even quite understand the resistance to this way of thinking. I mean, if someone says, what is the best economic system? And someone else said, well, is this system where everybody is wealthy? That would not be very convincing to anyone. You're like, well, that's not the world, that we have to make some hard choices here. And the cars are, we should at least anticipate the reality that cars are going to be making some hard choices. I think that's the reality that is slowly coming, but I think it's sort of a public

Starting point is 01:01:13 relations nightmare for an industry that is already working hard to just convince people that these things are safe at all, much less to convince them that they should be evaluating which kinds of collisions are better and worse. And there is, I think, a point, I think that you made this point in the book that hadn't quite sunk into my brain before reading it, which is that neither you, the human driver, nor the car, the artificial intelligence, can say with perfect certain. what the outcome of a decision is going to be. Therefore, even if it's rare that someone actually gets run over, a car will constantly be talking, will be making decisions between higher risk

Starting point is 01:01:55 and lower risk actions. And that is really quite down to earth and it's going to be common, right? That's right. I mean, I've talked to a few people who are designing autonomous systems in the, mostly in academics, not in the industry. Industry doesn't want to talk at all about this kind of stuff. And I can understand why. But a lot of the people in academics who are working on this technology, I talk to, for instance, Benjamin Coypers, who is building, along with his former postdoc, Sean Jin Park, they built this wonderful autonomous robot that moves around the halls of the University of Michigan and detects obstacles and slows down and tries to avoid them. And Park used this system called model predictive control, where it essentially casts out a net

Starting point is 01:02:39 of many, many, many, many possible paths, many per second, and then it prunes those paths based on the likelihood that each of them is going to result in a collision. Now, likelihood is a really great method to evaluate paths, right? I want to take the paths that are least likely to result in collisions. But once again, I think we need more than just likelihood. I think we also need to say a likely collision with a pedestrian is worse than a likely collision with a tree. Right. Yeah, exactly. So, I mean, is there, how simple and straightforward does this suggested algorithm become when it comes to things like trolley problems or things like babies versus grownups or anything like that? I mean, there still seems to be a lot of wishy-washiness there. Yeah, so it will tell us what kinds of information is relevant in making this database in the first place, which I think is really important. So there was a recent experiment conducted by the MIT Media Lab that you're probably familiar with.

Starting point is 01:03:42 It was just published in nature a couple weeks ago, and it was called the Moral Machine Experiment. Yes. And so what they did is they asked people to make choices about self-driving car trolley problems, where they alternated things like the genders, the age, the social status of all the people involved. So would you rather run over two doctors and a homeless person to save one obese man and a dog or something like that? Now, the contractarian, as well as most moral theories, are going to say all of that information is irrelevant, or most of it is irrelevant. So whether a person is a doctor or a lawyer, whether a person is a Muslim or a Christian or an atheist, all of that is irrelevant. But what is relevant are things like your physical position, your physical

Starting point is 01:04:33 size, and maybe your age, because that information actually tells us about the effects of this collision with you. And so this is important in figuring out what kinds of databases are going to be discriminatory against people and what kinds are not. Well, you brought up this very interesting question, do you discriminate against people on motorcycles who are wearing helmets versus those who are not because presumably a collision with someone wearing a helmet will hurt them less than someone not wearing a helmet? So we actually punish them in some sense. Yeah, and that's one of the more counterintuitive predictions of my theory. My theory says a lot of things that I find pretty intuitive, but a few things that I find counterintuitive. And unfortunately, if my theory is correct,

Starting point is 01:05:22 just have to say, well, so much the worse for my intuitions here. Part of the problem might be the use of the word punish. So that's a little bit misleading. If the car is going to evaluate a collision with a bicyclist without a helmet as worse than a collision with a bicyclist with a helmet, that doesn't mean that it hates the one with a helmet or that it thinks that the one without a helmet deserves to die more than the one without a helmet. It's only saying this path is less dangerous than that path. And the reason why I agree with you that seems really weird is that you think, well, the person with the helmet was being safe. She's the one who left the house that day taking precautions. Why should she have the car target her or punish her more than the other one? And the

Starting point is 01:06:19 answer is, I think we need to stop using words like target or punish and just say that the path that leads to you was evaluated as better than the path that led to her. Okay, good. And so I think, yeah, there's two big looming questions that I'm not quite clear on here yet, but I think we can clear them up. One is you seem to be saying that the contractarian just treats every human being equally, roughly speaking. Maybe there's some health differences, like maybe a strong person.

Starting point is 01:06:49 wouldn't be as bad to get an accident with as a weak person because they're more likely to survive it. But this is contrary to how many people's intuition goes. One of the aspects of the MIT study, if I remember correctly, was that different people from different parts of the world gave different answers for injuring women versus men, young people versus old people, etc. But you're saying that you're advocating being ignorant over all of that. that's right so if people are preferring to collide with men over women my response would be that sexism and that's not something we want to incorporate into our machine right good and but so some people are not going to agree with this right you're going to have to try to convince them but that's okay

Starting point is 01:07:34 well yeah so i have to keep stepping back until we find some grounds that we could agree on so i'll i'll step back one step and say okay well what moral theory are you using and in just about any moral theory, you're not going to value men more than women or vice versa, not utilitarianism, not Kantian ethics, nothing. And if they say they still do, well, I'll take a further step back and say, okay, well, how should we even make decisions in the world, right? Should we just base off of the things that we all have in common? And if we're agreeing on that, then I'm going to say contractarianism is the best way of cooperating based on the values that we all share. But okay, what about babies versus grownups?

Starting point is 01:08:20 Babies versus grownups is difficult because a grownup, and when we're just talking about collisions, is more likely to survive a crash than a baby. And so in that case, the baby should be preferred, but not because we love babies more or they're more adorable, but because they are more vulnerable. Okay, that makes sense. Good. And then the other looming issue was, you know, let's be explicit about how we come down on the various trolley problem kind of scenarios here.

Starting point is 01:08:49 It sounds like contractarianism doesn't really care if one person versus five gets injured because of an active choice versus a passive one, right? It's just, it is a consequentialist point of view at the end of the day. Yeah, that's right. That's right. So I do think we need to evaluate outcomes based on the distributions they produce, and that is a kind of consequentialism. And so in that way, I think utilitarianism and contractarianism are sort of cousins in this respect. But I think the biggest difference is their way of quantifying the goods.

Starting point is 01:09:25 Do they quantify happiness and suffering or primary goods? And do they run a summation function or a maximon function? Right. And so in, say, the trolley problem, in most cases, they're going to agree. But in some cases, they're going to disagree. And in some cases, I find it really weird. So here's one of those cases. So according to my theory, this is something that a friend of mine, the philosopher Susan Anderson, pointed out.

Starting point is 01:09:51 She pointed out that according to Maximen, it would be better for the car to swerve into a crowd of 50 people and give them all a broken leg rather than to swerve into a brick wall and give the passenger in the vehicle two broken legs. Why is that? Because 50 broken legs is, or rather 50 single broken legs, is better than one instance of two broken legs. And I find that so strange. I find that crazy. But once again, I just have to say, just like Jeremy Bentham did in the 1700s, well, my theory says it, so I have to accept it. Jeremy Bentham was talking about homosexuality. And he said, according to my theory, I guess it's all right.

Starting point is 01:10:36 right, even though, according to him, it was really weird and gross, he had to accept it. Well, I mean, I agree with what the consequences are. Sometimes when our moral theories give us highly counterintuitive or weird sounding suggestions, we need to say, well, maybe I have the wrong moral theory, right? Well, that's actually something that Rawls thought. So Rawls agreed that we need to go through this process called reflective equilibrium, where we sort of tune our own intuitions to the theories that we are developing. However, that's where I think I would diverge from Rawls. I would say that, look, if there's a matter of fact about which actions create more cooperative behavior than others, then just like if the doctor tells me to stop smoking

Starting point is 01:11:27 and I really, really want to, I have to say, well, look, it's a matter of fact which actions are right and wrong, or which actions are healthy and unhealthy, but I could still say, I don't want to do that, or even I'm not going to do that. However, there's still a fact of the matter about what the right thing to do is. Right, but, you know, we do need to make some choice about whether or not we discovered that fact of the matter through our moral theorizing or whether or not we should be a little bit less confident that, you know, you've chosen a function, A utilitarian chooses a function over all utility, which is to say, add it all up and maximize it. And you've in some sense chosen a function over utility, which is to say, look at the utility of the worst off person and maximize that.

Starting point is 01:12:15 Right. And maybe there's some happy medium so that the 50 people don't get their legs broken. I don't see how that would work, although I'm open to thinking about it. So there are a lot of people who want to sort of have hybrid versions between these two. But once again, the problem is if you want to mix the two theories together, I think you need to have a third theory that tells you when do you take the utilitarian choice, when do you take the contractarian choice? And I just don't know what that third theory would look like. Well, you say that now when the 50 people are suing you because they all have their legs broken,

Starting point is 01:12:49 you might feel differently. Oh, no, don't say that. Yeah, actually, someone at a conference recently joked to me that if I'm wildly successful beyond my dreams and this actually was used, in self-driving cars, I could be responsible for millions of injuries and deaths. And I laughed because I know that that's not going to happen. Right. But there was a part of me that sort of was a little bit afraid.

Starting point is 01:13:16 I mean, I know that the car companies need to work out some kind of solution to this. And the problem is they're not talking about what they're doing. Yeah, I mean, you could equally well say that if you succeed beyond your wildest dreams, you'll be responsible for saving enormous amounts of death and suffering in the world, right? Sure, sure. So let me ask you, if you don't mind, are you, I assume you're sort of taking the more utilitarian approach here. I got from what you were saying that generally you take a sort of utilitarian, although

Starting point is 01:13:47 utilitarian constructivist from your big picture book, but still more or less good, old-fashioned utilitarian approach to most kinds of decisions like this. Actually, no, I'm just trying to give you a hard time. Because I'm just trying to figure out what the right thing to do is. I don't really have strong substantive moral theory myself. Like I don't believe in utilitarianism because I totally buy the utility monster kind of responses or the repugnant conclusion. I mean, Derek Parfit had this very similar argument that it would always be better just to have more children, just have like more and more killed kids because there can be more and more people having happiness.

Starting point is 01:14:28 I'm extraordinarily skeptical of the idea that we can, number one, calculate individual utility for people. Maybe that's possible, but then number two, add them up on some commensurable scale. Seems like the wrong thing to do to me. So I'm almost to the point where I'm willing to accept some kind of deontology rather than some kind of consequentialist way of thinking, but I'm not quite sure what that would be. That's interesting. So you mentioned two objections to utilitarianism.

Starting point is 01:14:55 One of them is that it doesn't match your intuitions on some way. weird cases. And the other one is that it's just very, very hard to implement. It's hard to to calculate pleasures and pains. Now, if I'm a utilitarian, I might say, as to the first one, well, so much the worst for your intuitions. And in addition, I might point out, now I'm being a utilitarian for some reason, by the way, but I might also point out that any moral theory is going to say really, really counterintuitive things. So I'm not sure why we should care if there's a crazy sounding scenario with an alien where it doesn't seem to match our intuitions. Do you expect that there's going to be a moral theory that matches all of your

Starting point is 01:15:35 intuitions at some point? No, but I actually do by Rawls's point on reflective equilibrium, because as a moral anti-realist, I think that we're getting our starting point for morality are our moral intuitions. As a cognitive realist, I understand that those intuitions might be incoherent, and therefore there's work for moral philosophy to do in from our moral intuitions, building them into the best fit, sensible, logical, coherent system. So I think it's evidence when the system that I've tried to build is wildly in conflict with the intuitions I started with, that might be either because I got to get rid of that intuition or because I did a bad job building a system. I'm open to both possibilities.

Starting point is 01:16:21 I think that's fair. I think that's fair. I think my main concern about using intuitions and evaluating the theory is that I'm so aware of the history of strong intuitions that have been false, that I just give them virtually no evidential weight whatsoever. I mean, in addition to that, you know, there's a lot of intuitions I have right now that I suspect almost any moral theory is going to tell me is wrong. Like I said, I love the taste of meat. Yeah. But any plausible moral theory is going to tell me that if I can lead a happy, healthy life without eating meat, that I really should. I love driving fossil fuel vehicles. I love it. But most moral theories tell me that if that has terrible effects on the environment,

Starting point is 01:17:09 and I really don't need to be doing that, I live in a city. Yeah. I have public transport. So, again, most moral theories are going to tell me things I really don't want to hear. And I'm very sensitive. We have to be open to throwing out this or that moral intuition, or at least dramatically changing it. And I think this is what makes human beings pretty cool, is that we don't only have our moral intuitions. I mean, they're where we start, but we also have our rational cognitive capabilities.

Starting point is 01:17:39 And we can, there's feedback, right? We can go from rationality to alter our moral feelings. And, you know, it could happen. Like, I'm a meat eater and I drive a fossil fuel car. I want to get rid of the fossil fuel car, but I'm not going to get rid of eating the meat. But I'd be happier if we could make artificial meat and wouldn't have to kill any animals to do it. Sure, I could see that. I mean, and the problem in appealing to pure or rational sort of corrections here is that if there's nothing outside of our intuitions that we're appealing to to correct them,

Starting point is 01:18:13 then I'm not sure how we escape the inevitability of an internally coherent system that's just, completely mistaken. Yeah, I don't necessarily believe that the word mistaken has any reference there in the world. I think that... Yeah, I can understand that. I think then we're just coming to blows about whether we think the function of moral theories is to produce cooperative behavior among self-interested organisms or whether it's to produce sort of satisfying solutions according to our contingent intuitions that we all happen

Starting point is 01:18:48 to share. or some of us happen to share. Yeah, I think that in both senses, we're trying to be, you know, coherent and rational, either individually or collectively. So I think it's, you know, it's interesting to me is that people have strong disagreements about moral realism versus anti-realism,

Starting point is 01:19:08 and those disagreements are almost entirely uncorrelated with their ideas about what actually is and is not moral. That is really fascinating to me, too. I find that in talking to most sort of well-educated people in my friend circles, that they are explicitly moral relativists, but implicitly utilitarians. Interesting. And yeah, well, usually, like you said, sort of good utilitarians, where they want, they're willing to sacrifice one person to save many,

Starting point is 01:19:40 but then they also don't want to sacrifice cappuccinos and fossil fuel and eating meat and so on. Right. And so if you push them far enough, maybe they'll admit that explicitly, but then they might fall back on relativism and say, well, it's all relative or something like that. So they're relative when it's convenient. Yeah, for me, utilitarianism is an example of something that I reasoned to myself out of as far as I'm concerned. I think that it sounds superficially like the right thing to do, but I think the objections to it are good enough that I'm looking for something better. Yeah. Just in case you're curious, I was going to bring this in. I have a survey that was conducted by, the website, Philpapers.org, run by David Chalmers and his group, and they asked professional philosophers, do you accept the category of normative ethics described as consequentialism, deontology, or virtue ethics?

Starting point is 01:20:33 Right. And it's roughly split as 25% accept or lean deontology, 23% and some change, accept or lean towards consequentialism, 18% towards virtue ethics, and 32.3% other. And this actually reminds me of a poll that you took and you described in one of your blog posts about your survey of interpretations of quantum mechanics. Yep, not a lot of consensus. Yeah, not a lot of consensus. And you called this a huge embarrassment for the field of physics. And I kind of feel that way about my own field in some ways. I must admit, I feel like it is a little embarrassing that these are not just theories that are sort of fun to think about, but they actually make a difference in how we live and how

Starting point is 01:21:26 we design artificial intelligence. And it turns out that there's not a lot of consensus in the field where there should be. Do you have the numbers there for moral realism versus anti-realism? I actually might, yeah, hold on a second. Because that was also a Philpapers, uh, question. I remember that. Yes, I do. I think that most philosophers are realists, right? That's right. 56.4 accept or lean towards realism, 27.7 anti, and then 15% other. All right. I mean, it's, it is interesting. I think it's more embarrassing that we don't

Starting point is 01:21:59 have a consensus on quantum mechanics, because quantum mechanics should be easier than ethics or morality, but it's more important that we don't have a consensus on ethics or morality. Right. That's where the analogy must be. might end is that most of the versions of quantum mechanics, if I understand it, make essentially similar predictions. However, the moral theories, although one could say in 99% of cases, most of the moral theories probably make the same predictions. It's just these rare scenarios of, especially involving, say, opportunity cost, what you could be doing instead of what you're doing right now, that there's the biggest and most important disagreements. Sure. And if you're that crowd of

Starting point is 01:22:42 50 people who's going to get their legs broken, it's extremely relevant to you that your car is programmed one way or the other. So just to sort of wrap it up, put a bow on it, I guess we glossed over a little bit about the implementability of this plan. I mean, you sketched out a sort of a database idea where we would have all these different possibilities. How real world is this prospect of making contractarianism the way that our self-driving cars go about making moral decisions? Yeah, that's a terrific question, and my answer is, I don't know. But if you are working on this kind of technology out there, I would love to hear from you. I want to know how plausible is it to be able to design autonomous vehicles and other autonomous systems that can quantify the effects of these actions on primary goods and then run maxim and functions over them. I've talked to people in the field who say, well, it seems like this might be plausible.

Starting point is 01:23:42 I see my job as saying, if we are going to design autonomous systems, here's what they need to be capable of doing. And if they are not capable of doing this, then we should slow down or maybe even halt the development of this technology. Right, right. And I think that's especially relevant in the domain of autonomous weapons systems. Well, good. I want to, so here are my final two questions, which could be short answers or longer. but one of them is you brought up an issue in the book that, again, I was sort of hadn't, I was surprised because I hadn't even thought of it.

Starting point is 01:24:17 Is it a problem if an artificially intelligent system does things that seem to be ethical to us, but it can't articulate why it's doing them? This is an issue for deep learning systems, right, where it can recognize a picture, but it can't tell us why it recognized a picture. A human being would be able to articulate an answer. that might not be the correct reason why they did something, but at least it can try. Should we expect the same from AI? That's a great question.

Starting point is 01:24:47 And the answer is, I'm not sure. There's a Kantian position here that says that it's not a real decision, getting back to, you know, the very first thing we talked about. It's not a decision you're responsible for unless you can actually articulate the reasons for it. You can tell me why you did it. Otherwise, you're just sort of an animal or a child acting on instinct. Now, to me, it doesn't so much matter if you can articulate it. What matters is, are you following the Maximin principle?

Starting point is 01:25:19 And I think the best way of doing this is actually constructing the Maximum and principle in these autonomous systems in what's called a top-down approach. However, I'm also open to the possibility of what you might call approximating a Maximum in principle through these more bottom-up methods, if the machine learning system produced outputs that always matched the Maximin principle in the kinds of cases we observe. And we had good reason for thinking that it would continue to run this program that approximated Maximin in future cases. Then I would say that would be, let's say, close to good enough or sufficient in that case.

Starting point is 01:26:02 I think so. I mean, I think it's a little bit too much to demand of our AI systems that they be articulate moral philosophers as long as they seem to be doing mostly the right things. Right. Well, as long as it says something like the reason why I chose this path instead of this path is that the worst collision in this path is better than the worst collision in that path. It doesn't need to say something like, I traveled into the original position and I realized from there that MaxiMen was actually up. Yeah, that'd be too much to ask. And the other final question was something you already alluded to, a potential difference between the everyday life circumstance of a car driving around and trying to avoid accidents and the everyday, but not everyone's life case of people at war or machines that were intentionally built in order to inflict harm in a certain way. How do the moral considerations change?

Starting point is 01:26:57 And I realize this is a huge topic, but maybe a simple introduction to the different. differences between that and everyday life and wartime? Well, the smallest cases that this might be applied in right now are what you could call security robots. And in fact, these are currently being used in some airports in China and other places in East Asia, where they have, in some cases, taser technology equipped with them. And so there are good things about this kind of technology. and in fact if someone is harming another person,

Starting point is 01:27:33 it is good if a robot could step in and actually neutralize the threat, but the problem is in doing so, it needs to be capable of identifying when there is a threat, what kind of threat this is, and what the proportional amount of force is to neutralize that threat. And so contractarianism does make predictions about this.

Starting point is 01:27:56 If you could quantify the kind of harm being, done by that threatening agent, that enemy agent. And you could say usually that neutralizing the threat is better than just say killing the agent, because that would be certainly making the agent now worst off. Yeah. But neutralizing the threat would be the best of all possible outcomes. And so you could imagine security robots and in the extreme military robots being designed with their goal of neutralizing enemies and neutralizing threats, because I think that would be the Maximin approach to it. Right, but Maximin seems to, maybe I'm just not conceptualizing it correctly,

Starting point is 01:28:40 but it seems to fail us a little bit when literally our goal is to kill people. That's right. And I think that you could imagine cases, and I do imagine cases where the ideal autonomous robot in what, would be commanded to kill an enemy soldier and the robot would say, no, thank you, but I am going to apprehend him and take him into prison. Right. In fact, that's the goal. I mean, going back to our good friend Emmanuel Kant, he famously and shockingly said, if God commands you to kill your own children, the correct response is, no, I'm not going to do that. And we tell people in

Starting point is 01:29:22 military ethics and the ethics of war that if your commanding officer tells you to kill innocent people in war, the correct answer is no, but I am happy to do other things that are not war crimes. But do I remember correctly that in the book you suggested or at least wondered out loud whether or not it might be okay to ultimately have autonomous self-driving cars or drones and so forth, but not in the theater of war, that autonomy should be always in the hands of human agents who actually can take responsibility. That's right. There was a letter that was recently signed by a number of people who work on ethics and political philosophy, who, and these group of people were arguing that autonomous systems should not be used in war at all. Now, I wouldn't go that far, but I would agree that the kinds of capabilities they would require in order to be, I don't want to use the word responsible, but in order to make the right choices in war are unlikely to happen anytime soon.

Starting point is 01:30:33 And so all of my claims are a big hypothetical, which is if we are going to design these kinds of machines, these are the kinds of capabilities that they would require. And I'm willing to do that for military robots as well with more skepticism that they are actually going to achieve this level of sophistication than in the case of medical technology or transportation technology. It's very interesting to me because I see a philosophical version of what happens in physics, in particular in science more generally, where concepts that we could have ignored at earlier. times are forced to the forefront of our attention by the progress of technology, right? And so this, it's a wonderful thing. I think it's a wonderful thing for philosophy that our discussions about morality are being sharpened a little bit by the fact that we can't be wishy-washy. We can't be fuzzy about them. We can't just say be excellent to each other. We need to tell, you know, machines that will listen to us quite literally how to behave in a wide variety of

Starting point is 01:31:42 circumstances. Yeah. And, you know, I have to admit, a friend of mine pushed me on this position that I take. He said it was kind of bullshit what I'm doing, because if I was taking a strong moral stance against autonomous weapon systems, can I say bullshit on your program? Absolutely. Okay. So that it's kind of bullshit in that I am being hypocritical or I am, I am not really caring about the use of this technology, that in fact, I'm just saying, if you're going to build it, here's the right way of doing it. And I'm sensitive to that objection. I'm very sensitive to it.

Starting point is 01:32:18 I'm not convinced that autonomous weapon systems or autonomous vehicles or autonomous medical care bots are actually a good idea in the long run. And I'm sensitive to the fact that maybe this position I'm taking is a little bit, to corporate. But that being said, if any corporations would like to pay me large amounts of money, I am more than available. Very good. Well, I do hope they take you up on that. I think that I certainly on your side in thinking that this is something where we should face up to the problems rather than ignore them. Derek Leibin, thanks so much for being on the podcast. Thank you, Sean.

Starting point is 01:33:33 What if you could have even more and more and more help to pursue your goals? At LPL Financial, we offer more ways for advisors and their clients to thrive. So what if you could? Paid advertisement. Investing involves risk, including potential loss of principal, LPL Financial LLC member FINRA SIPC.

Sean Carroll's Mindscape: Science, Society, Philosophy, Culture, Arts, and Ideas - 30 | Derek Leben on Ethics for Robots and Artificial Intelligences

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.