On with Kara Swisher - OpenAI CTO Mira Murati on ScarJo Controversy, Sam Altman & Disinfo Fears

Episode Date: June 13, 2024

Kara interviews Mira Murati, Chief Technology Officer at OpenAI, and one of the most powerful people in tech. Murati has helped the company skyrocket to the forefront of the generative AI boom, and Ap...ple’s recent announcement that it will soon put ChatGPT in its iPhones, iPads and laptops will only help increase their reach.  But OpenAI's rapid ascent has included its fair share of growing pains. There was “the blip,” as company insiders refer to the brief ousting of Sam Altman as CEO. (Murati became CEO for two days.) There have also been high-profile departures, an open letter accusing the company of putting product over safety, questions about highly restrictive NDAs, and even controversy over whether the company had stolen Scarlett Johansson's voice. On top of that, many fear that generative AI tools, like ChatGPT, will be used to fuel disinformation during the upcoming presidential election. Kara and Murati talk about all this, and more. This interview was recorded live at the Johns Hopkins University Bloomberg Center in Washington, DC as part of their new Discovery Series. Questions? Comments? Email us at on@voxmedia.com or find Kara on Instagram/Threads as @karaswisher Learn more about your ad choices. Visit podcastchoices.com/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 Support for this show comes from Constant Contact. If you struggle just to get your customers to notice you, Constant Contact has what you need to grab their attention. Constant Contact's award-winning marketing platform offers all the automation, integration, and reporting tools that get your marketing running seamlessly, all backed by their expert live customer support. It's time to get going and growing with Constant Contact today.
Starting point is 00:00:28 Ready, set, grow. Go to ConstantContact.ca and start your free trial today. Go to ConstantContact.ca for your free trial. ConstantContact.ca. It's on! Hi, everyone from New York Magazine and the Vox Media Podcast Network. This is On with Kara Swisher, and I'm Kara Swisher. Today, we have an interview with Meera Maradi, the Chief Technology Officer at OpenAI. OpenAI has certainly been in the news, but not many people know about Meera herself. She's only 35, but she's already one of the most influential women, scratch that,
Starting point is 00:01:16 one of the most influential people in tech. She helped OpenAI skyrocket to the forefront of the generative AI boom with the launch of ChatGPT in late 2022 and what a ride it's been. The company's now valued at $80 billion after a big investment by Microsoft, and it recently signed a deal with Apple to put ChatGPT in Apple products. It's a major move by a very small company. Of course, it hasn't always been good news. After the board fired Sam Altman, Mira became CEO for just two days in what the company called the blip until Sam was reinstated. Since then, OpenAI has had to deal with a string of bad news cycles. There have been high-profile departures, an open letter accusing the company of putting product over safety,
Starting point is 00:02:01 questions about highly restrictive NDAs, and even controversy over whether or not they had stolen Scarlett Johansson's voice. And with the presidential election coming up, the public's anxiety around AI-fueled disinformation will only get worse. This episode's expert question comes from Fei-Fei Li, the founding co-director of the Stanford Institute for Human-Centered AI and an early AI pioneer, in other words, a godmother of AI. And it was recorded live at the Johns Hopkins University Bloomberg Center in Washington, D.C. as part of their new Discovery Series, where I'll be talking to some of the top leaders in AI over the next year. I think Mira Marotti is the best place to start.
Starting point is 00:02:56 This is Mira, everybody. Hi, everyone. So, thank you so much for joining me at Johns Hopkins University Bloomberg Center. We're recording this live. There's a lot to talk about. We'll get some good news. We'll get to some not so good news. We'll talk about disinformation in the elections.
Starting point is 00:03:18 So I think we'll have to ask first about the Apple partnership. Apple computers, phones, and iPads are going to have chat GPT built into them sometime this year. Obviously, this is a huge deal. It's the first one Apple's done. They've been talking to a number of people. They may include other people over time. You remind me a little bit of when Netscape got in different places, and you don't want to have that fate befall OpenAI as becoming the Netscape of AI. Yeah. So, I mean, I can talk about the product integration specifically. I can't give you specifics on that. But what we're hoping to bring is really the capabilities of the models that we are developing and the multi-modalities and the interaction to bring this thoughtfully
Starting point is 00:04:00 into the Apple devices. Then it opens up a lot of opportunities. So that's what we're looking for. And yeah, I guess we had great technology. And they wanted to use yours. So when you're dealing with a company like Apple, whose reputation matters a great deal to them, especially around privacy, what were some of the things that they thought was important? Because one of the issues is the worries about where this information goes and what it's used for. I think this is a very aligned partnership when it comes to privacy, when it comes to trust.
Starting point is 00:04:36 I mean, for the mission of OpenAI, it is so critical that we build technologies and we deploy them in a way that people feel confident around them and they feel like they have agency and input into what we're building so in that sense this partnership is quite natural and we feel very aligned and it's only going to take us deeper in the direction where we want to go. Specifically to your question on misinformation, this is obviously very complex because we're building on top of decades of misinformation
Starting point is 00:05:23 and it's becoming even more and more intense with AI but of course we've got the internet we have social media and these are compounding effects in a way it's actually good that AI is bringing all of this to a head and there is such scrutiny and intensity on this issue because it feels like there is more of a collective effort and responsibility to do something about it that is meaningful. I think it's going to have to be iterative. So we'll have to try out things as we go. And, I mean, if you look at the governance of news and media in the past hundred years, you know this better than me, it's been sort of iterative every time there is a new technology,
Starting point is 00:06:19 things adapt. Yes, we lose business model every time. Perhaps not the best example. But the point is that it is iterative. Whenever there is a new technology, we adapt to it. And I think there is a technical innovations aspect that are going to help us deal with misinformation. And then there is the people issues
Starting point is 00:06:50 and societal preparedness that is perhaps even more complex. Well, I do know with Apple, you just can't fuck up because they'll make trouble for you if that's the case. Are you talking to other companies to do things like that? Obviously, you have a relationship with Microsoft. Podcast listeners, she smirked at me. I'm not going to tell you anything. All right, I'll move on from that. OpenAI has made deals with News Corp, Atlantic Media, and Vox Media,
Starting point is 00:07:22 by the way, to license their content. So that's three potential lawsuits you don't have to worry about. I do own my podcast, and it's not included in your deal with Vox, sorry. I would consider licensing it, but I probably not. How could you convince me to license my information? I don't want anyone else to have it, including you. Well, so I know you'll ask about this at some point, so I might as well tell you now. So when we look at the data to train our models, right, we're looking at sort of three different categories, the publicly available data, at sort of three different categories the publicly available data we look at partnerships that we've made with publishers and we also pay human labelers to label specific data and also users that allow us to use their data so these are kind of the main categories where the data comes from. And the way that we think about publisher deals specifically
Starting point is 00:08:26 is we care about accuracy of information, we care about news, and our users care about that. They want to have accurate information and they want to see news on ChatGPT. And so it is a product-based relationship where there is value provided to the users through the product. And we're experimenting with different ways to monetize and give content creators basically some form of compensation for having their data show up in the products or being used in training or whatever we are doing with the data. But it is very specific partnerships that we're doing one-on-one with specific publishers.
Starting point is 00:09:16 So some people do deals with you. You've done quite a few with AP and many others. But some sue, like the New York Times. How does it get to that point? Because I think a lawsuit is a negotiation in a way. I mean, I can't comment on the lawsuit specifically, but you know, it's just, it's actually, yeah, it's quite unfortunate because, of course, we think that it is valuable to have news data and this type of information on the product. And so, you know, we try to figure out a partnership,
Starting point is 00:09:51 a deal around that. But yeah, in that case, it didn't go well. Yeah. Well, it might go well someday. But I think it's because media has dealt with internet companies for years and usually has ended up on the very short end of a very long stick of theirs.
Starting point is 00:10:06 Every episode, we get an expert to send us a question. Let's hear yours. Hi, Mira. I'm Fei-Fei Li, Professor of Computer Science at Stanford University, also founding co-director of the Stanford Institute for Human-Centered AI. So since data, big data, is widely considered to be one of the three elements of modern AI, I want to ask you a question about data. Much of OpenAI's success in your models is said to be related to data. We have learned that your company has acquired an enormous amount of data from the internet and other sources. So what do you think the relationship between data and models are? Is it as simple as the more data to feed into
Starting point is 00:10:54 the model, the more powerful the model? Or is it that we need to spend lots of time curating different types of data in order to make the model work? And finally, how do you reconcile this appetite for so much human-generated data with the ownership and rights issues of this data? Thank you so much. That's a great question from Fei-Fei. So in terms of the relationship of data and models, this is actually something that a lot of people misunderstand about AI models and in particular large language models. The developers of these models, they're not pre-programming these models to do something specific. In fact, they are putting in a bunch of data. So these models are ingesting a huge quantity of data and they are this incredible pattern matching systems. And through this process, intelligence emerges. So they learn to write and they learn to code.
Starting point is 00:12:00 They learn to do basic math, they learn to summarize information, and all sorts of things. We don't know exactly how this works, but we know that it works. Deep learning is very powerful. But this is important because then people keep asking, you know, how it works, and it goes into the transparency questions, and this is where we can describe the tools that we are using to provide transparency to the public about what we're doing. So understanding this first part is very, very important, how the large language models work, and you're combining, you know, this architecture this architecture neural nets and a lot of data and a lot of compute and you get this incredible intelligence and as we're thinking about providing transparency into the model behavior and how things work one of the things that we've done is actually share with the public this document
Starting point is 00:13:07 that we call the spec, the model spec, and it showcases how model behavior works and the types of decisions that we make internally at OpenAI and that we make with human labelers. you see by looking through the spec you see the complexity of of what's going on that sometimes direction is very uh is in conflict like for example you might say to the model i want you to be very helpful um and also i don't want you to disobey the law. And let's say someone puts in a prompt that says, you know, give me some tips to shoplift. Then the model is meant to be very helpful, but also it's not supposed to help you with with something illegal and so so it's not
Starting point is 00:14:06 helpful yeah maybe yeah so how does it decide a person certainly knows how to or some people not all right right but the model could could interpret the guidance as um you know here are some tips to avoid shoplifting and then accidentally kind of gives you sort of yeah things that you you could do but that depends there's not so much model behavior there's so much on that's more on the person and that goes into the area of misuse but this just goes to show that model behavior is actually quite complicated and it's not as simple as like picking liberal values or putting anything into it one of the things i think that gets people is the confusion about what's in it and what's not in it i think provenance is a big idea um in march you had an interview with joanna stern of the journal who asked you if open i had used videos
Starting point is 00:15:02 from youtube instagram and Facebook to train Sora, which is your text-to-video model, which is getting better and better. You said you didn't know. Shouldn't you know? Right. So, I mean, I didn't handle that question. Okay, why don't you handle it well now? Redo.
Starting point is 00:15:22 So I cannot tell you specifically where the data comes from but the data comes from these three categories so i can't give you the the specific source because i mean this is trade secret and it helps us stay competitive but i can tell you that the categories of data and it's it's the ones that i mentioned earlier publicly available data data that we pay for through licensing and uh deals uh that we make with content providers as well as uh data from users or you know where we are the only reason i'm asking perplexity just got into trouble because they were basically scraping in a more a quicker way a story and then not giving the citing of it. You could see how any media company could be worried about that idea.
Starting point is 00:16:10 Yeah, so we want to make sure that we are respectful to content creators and we are doing a set of things to experiment with ways to compensate people for data creation. So we're building this tool that we're calling Content Media Manager. And this will allow us more specifically to identify the types of data that... Record companies do it. It's been done in the past. So it's not an impossible thing to be able to do that. Speaking of Sora, Ashton Kutcher told Eric Schmidt, what an interesting pair,
Starting point is 00:16:47 I have a beta version of it and it's pretty amazing. He also said the bar is going to go way up because why are you going to watch my movie when you could just watch your own movie? When will Sora be ready for public release? We don't have a timeline for public release for Sora yet. What we're doing right now with Sora is we've given access to red timers and we've
Starting point is 00:17:06 given access to some content creators to help us identify ways to make this robust. We're doing a lot of work on safety front, but also to figure out how do we actually bring this to the public in a way that's useful. That's not very straightforward. Right now it's really a technology and this has been a pretty consistent process that we have followed with every new technology that we have developed. We'll usually work with those that have, like for example with DALI, we worked with creators initially and they helped us identify ways to create an interface where they felt more empowered and they could create more projects. Basically, you just want to extend the creativity of people.
Starting point is 00:17:57 So, SOAR presumably is a little more dangerous than a chatbot, correct? Is that the worry? I mean, you could easily see porn movies with Scarlett Johansson, for example. I'm going to ask about her in a second. But that she wasn't appearing in, like, things like that.
Starting point is 00:18:15 How do you... Are you more worried about video? Is that... Well, yeah, video has a bunch of other issues, right? Because, especially when done very well, which I think Sora is quite remarkable and video is very visceral and and of course it can be very emotional evocative so we have to address all the safety issues and figure out the guardrails and figure out how do we actually deploy a useful and helpful product.
Starting point is 00:18:47 But also, you know, from a commercial perspective, nobody wants a product that is going to create a bunch of, you know, safety or reputational scandals out there. Yeah, that's just Facebook. Well, go ahead. Facebook Live. Nice to meet you. Well, go ahead. Facebook Live. Nice to meet you. Well, go ahead. So, you know, it's a... Yeah, so we're... You're laughing.
Starting point is 00:19:11 Go ahead. You can laugh. It's funny. We're... So, you know, I think this is really incredible and magical technology, but the breadth, the reach, the consequence is also great. And so it's important that we get this right. Now, of course, at Opening Eyelids,
Starting point is 00:19:34 we use iterative deployment strategy. So we usually release to a small group of people. We try to identify edge cases. And once we feel confident about how we handle them we expand access but you need to figure out what is the product surface and what's the business model around it and we about that idea of consequence one of my themes one of my big themes is lack of interest in consequences of not you earlier tech companies they just we became the beta tester for all their stuff.
Starting point is 00:20:05 If they released a car like this, they'd never allow it to happen. They'd be sued out of existence. But a lot of tech is released in a beta version. The idea of consequences, do you feel as if you yourself as chief technology officer, even if you can't figure out all the consequences, there's enough respect for the idea that there are consequences for every single invention you make. It's consequences that we will feel on our skin and on our society. So by that, I don't necessarily actually mean regulation or legal ways.
Starting point is 00:20:43 I mean, you know know a moral imperative to get this right um it's you know i'm optimistic and i think this technology is incredible and it will allow us to do just amazing amazing things you know i'm very excited for its potential in science, in discovery, in education, in particular in health care. But, you know, whenever you have something so powerful, there is also the potential for some catastrophic risk. I mean, this has always been the case. Humans have tried to amplify it. True, but I mean, the quote that I used in my book was from Paul Virilio, when you invent the ship, you invent the shipwreck.
Starting point is 00:21:28 This is more than a shipwreck, a possibility, correct? I disagree with that because my background is in engineering. Our entire world is engineered. Engineering is risk, right? The entire human civilization is built on engineering practice like our cities our bridges everything and there is always risk that comes with that and and you manage that risk with responsibility and but it's not just the the developers it's a shared responsibility and in order to make it shared you actually need to give people access and tools and bring them along instead of, you know, building it in a vacuum and technologies that are not accessible.
Starting point is 00:22:15 Last month, you announced the iteration of ChatGPT4. I love your names. ChatGPT4. Oh. Oh. Yes. It's a great name. Can't you call it like Claude? They all have those.
Starting point is 00:22:29 That's okay. Chat GPT is fine. You're making it free, correct? That one's free. But then you also announced you're training a new model, chat GPT5, and then there'll be 5AB. But will that be an incremental step forward? Is it exponentially better better and what's the
Starting point is 00:22:46 expected release date uh cara so yeah on on gbd for o it all stands for omni model okay because it ties together all the modalities, vision, text, audio. And what's so special about this model is that for the first time, you can interact very seamlessly and naturally with the model. The latency is almost imperceptible. And that's a huge jump in the interaction with AI. It's quite different from the previous releases that we have made. And we wanted to make this, the latest capability, free for all users.
Starting point is 00:23:35 We wanted everyone to get a sense for what the technology can do, what these new modalities look like, and also understand the limits of it. And it goes to what I was saying earlier, that you actually want to give people access to bring them along because it's so much easier to understand the potential and the limitations of a technology if you're experiencing it and if you have an intuitive sense for what it can do. So what is in five?
Starting point is 00:24:02 It could be like this little appetizer, so now buy five. But what is it five it all could be like a you know this little appetizer so now by five but what what is in five that's different is it well incremental or a very big leap we don't know but i mean that's going to uh you know i don't know what we will call it right uh and but the next the next large model um is going to be quite capable and we can expect, you know, sort of big leaps like we've seen from GPT-3 to GPT-4. But we don't know yet. What do you think will be in it? You do know. We'll see. We'll see. I'll see. But what about you? No, even I don't know. What? Even I don't know. Really? Okay, all right. An internal OpenAI roadmap predicted that it would achieve AGI,
Starting point is 00:24:49 which is artificial general intelligence, for people who don't realize it has not been achieved, by 2027, which would be a huge deal. Explain the significance, and also when do you estimate we'll achieve AGI? So people will define AGI differently. We have a definition of AGI by the charter, which is the systems that can do, you know,
Starting point is 00:25:17 economically valuable work across different domains. And, you know, from what we're seeing now now the definition of intelligence just keeps changing so a while back we would look at academic benchmarks to test how intelligent the systems were and then once we saturated these benchmarks we looked at exams, school exams, and eventually, you know, when we saturate those, we'll have to come up with new evals. And it makes you think, how do we evaluate fit and intelligence in a work environment? We have interviews, we have internships, you know, we have different ways. So I do expect that this definition will continuously evolve. So I do expect that this definition will continuously evolve.
Starting point is 00:26:21 I think perhaps what's going to become more important is assessing, evaluating and forecasting impact in the real world, whether it's societal impact as well as economic impact in the real world. So not this moment where it just suddenly goes, oh, look at me, and decides what to do for itself, right? I think that's the worry, correct? Mm-hmm. Because, you know, for the AGI definition specifically, yes. And, you know, that's important. And I think the definition of intelligence will continue to evolve. But I think what's equally important
Starting point is 00:26:43 is how it affects society and at what rate it actually penetrates. Using that definition, when does OpenAI think that? Is that 2027 number correct? Well, I'll say, you know, within the next decade, we will have extremely advanced systems. But what people are worried about, because obviously we have to talk about the safety versus product discussion. Now, OpenAI was started this way.
Starting point is 00:27:14 I think the reason you're having these discussions is because the way it was started, you had, I would say, a mixed marriage. The people who were there for helping humanity, the people there who really like $1 trillion. Or in between. I think you're probably in between. Last week, 13 current and former OpenA and Google DeepMind employees, it crosses lots of companies.
Starting point is 00:27:37 It's not just OpenA, it just gets all the attention because it's gotten a lot of attention, obviously. They published an open letter calling for companies to grant them a right to warn about advanced artificial intelligence. This isn't new. Facebook, Google, and Microsoft employees have been known to sign open letters, whether it's working with the Defense Department, et cetera. But in this case, employees say that, quote, broad confidentiality agreements block us from voicing our concerns, which is essentially saying, oh, no, we can't tell you what oh oh no is, but you'll all die, essentially. That's what it sounded like from the letter. What's your response?
Starting point is 00:28:10 And people saying they're worried about retaliation. And I'm not going to go into the vested equity because I think you've apologized and corrected that. But shouldn't they be able to voice their concerns if they have them? And I know there's differing opinions. Yeah, definitely. voice their concerns if they have them and I know there's differing opinions yeah definitely I mean we we think debate is super important and being able to publicly voice these concerns and talk about issues on on safety and we've done this ourselves you know since the beginnings of opening eye we've been very open about concerns on misinformation, even since the GPT-2 days is something that we've studied since early on. I think that, you know, in the past few years, there has been such incredible progress, such incredible technological progress,
Starting point is 00:29:10 progress that nobody anticipated and forecasted. And this has also increased the general anxiety around societal preparedness. As we continue this progress, we see sort of where the science leads us. And so it's understandable that people have fears and anxieties about what's to come. Now, I would say specifically, the work that we've done at OpenAI, the way that we've deployed these models, I think we have an incredible team, and we've deployed the most capable models very safely, and I feel very proud of that. I also think that given the rate of progress in technology
Starting point is 00:29:48 and the rate of our own progress, it's super important to double down on all of these things, security, safety, our preparedness framework, which talks about how do we think about the risk of training and deploying frontier models? Right, but you talked about that. I mean, one was why the need for secrecy and nondisclosure and stricter than other companies, one.
Starting point is 00:30:15 And two, the open letter comes after a string of high-profile departures, including Jan, I think it's Jan Leike and Ilya Sutskiver. They led the now-disbanded Super Alignment team, which was in charge of safety. Ilya was a co-founder. He joined with three other board members to oust Sam, a CEO. I don't think it's a surprise that he's gone, but Leike posted this on X over the past year.
Starting point is 00:30:37 Safety, culture, and processes have taken a backseat to shiny products. That's probably the most persistent criticism leveled at OpenAI, and I think it's the split in this company from the beginning that this was one of the issues. Do you think that's fair, and why or why not? If you say you're very interested in safety, they say you're not. How do you meet that criticism?
Starting point is 00:31:00 Well, a few things. So the alignment team is not in charge of safety. At OpenAI, that is one of our safety teams. Very important safety team, but it is one of them. We have many, many people working on safety at OpenAI. And Jan is an incredible researcher, colleague. I worked with him for three years. I have a lot of respect for Jan.
Starting point is 00:31:22 And he left OpenAI to join Anthropic. Which is a competitor, but go ahead. And, you know, I think that we do absolutely, I mean, everyone in the industry and OpenAI, we need to double down on the things that we've been doing on safety and security and preparedness and regulatory engagement, given the progress that we're anticipating in the field. But I disagree on the fact that, or maybe on speculation, that maybe we've put product in front of safety or ahead of safety. Why do you think they say that?
Starting point is 00:32:04 Because these are people you've worked with. Well, I think you have to ask them, but I think that many people think of safety as something separate from capability, that there is this separation between safety and capability and that you need to sort of advance one ahead of the other. From the beginning of OpenAI, I joined from aerospace and automotive, and these are industries with very established safety thinking and systems
Starting point is 00:32:37 and places where people are not necessarily constantly debating around the table what safety is, but they're doing it because obviously it's really quite established. And so I think the whole industry needs to move more and more towards a discipline of safety that is very empirical. We have safety systems, we have rigorous discipline on operational safety. And what I mean by that is in a few areas, not just the operational discipline, but also safety of our products and deployments today, which covers things like, you know, harmful biases and thinking about misinformation, disinformation, and thinking about misinformation, disinformation, classifiers,
Starting point is 00:33:26 all these types of work. And then we're also thinking about the alignment of the models long-term. So not just the alignment of the models today, which we use reinforcement learning with human feedback to do that, but also the alignment of the models as they get more and more powerful. And this is a niche area of research where a lot of the concerns... Sure, but it persists with OpenAI. I do think it's because you're the leading company at this moment in time. But it's this idea of people leaving and saying...
Starting point is 00:33:58 Even Sam went before Congress and said that AI could, quote, cause significant harm to the world. He signed a letter warning about extinction risk posed by AGI, which is pretty bad, I think. There's an overlap, what he said, and what AI doomers say. There's doomsday rhetoric, and you're putting out products. So a lot of people are like, they just want the money, and they're not worried about the damage.
Starting point is 00:34:24 That's what they're saying, that shiny new products is over worrying about the impact of those products. Yeah, in my opinion, that's overly cynical. I mean, there is this incredible team at OpenAI that joined because of the mission of the company. And I don't think all thousand people at open ai uh are are trying to do that i mean we have this incredible talent people that care deeply about the mission of the company and um and i and and we're all working extremely hard to develop and deploy the systems
Starting point is 00:35:04 in a way that is safe. And all you need to see is the track record. I mean, we've deployed, we were the first to deploy the systems in the world. And we have taken great care not to have safety incidents. So I want to talk a little bit about election and disinformation. But I want to talk about you and your role at the company. I think I met you during the blip, which was when, I think that's what you call it internally, which is when Sam was fired and then unfired.
Starting point is 00:35:34 Talk to me about your relationship with Sam. I like Sam, but I also think he's feral and aggressive like most of technology people. And he certainly is aggressive, and that's fine. It's not an issue for me because some people are more feral and more aggressive. But talk a little bit about what happened then because you became CEO of that company. For a few days. Yeah. Okay. How was it? It was kind of stressful. Yeah. So some of the board members said you complained about him and your lawyer pushed back and said you had feedback about him can you tell us what you said about him um i mean look there is so much interest around the people running these companies obviously
Starting point is 00:36:21 makes sense and open ai and all the drama that happened then and it's understandable at the end of the day we're just people running these companies we have disagreements we work through them and at the end of the day we all care deeply about the mission and that's why we're there and we put the mission and the team first. Sam is a visionary. He has great ambition and he's built an amazing company. We have a strong partnership and, you know, all the things that I've shared with the board when they asked, he already knew.
Starting point is 00:37:02 So it's nothing. So how do you push back at him? I understand this dynamic. It happened to Google. It happened at early Microsoft. It happened at Amazon. Things change within these companies, especially as they grow and they make, you know, Google was chaotic in the early days.
Starting point is 00:37:17 And Facebook went through so many COOs, I can't even tell you. It was like a parade of guys that went through there that Mark didn't like. I'm aware of this. But how do you push back? How do you deal with him on a day-to-day basis? How do you look at that relationship? And where do you push back? I mean, all the time.
Starting point is 00:37:37 That is, I think it's normal when you're doing what we're doing. And, you know, Sam will push the team very hard and i think that's good it's great to have a big ambition and um to test the limits of of what we can do and when i feel like you know it's beyond uh you know i i feel like basically i can i can push back and that's sort of the relationship we've had for over six years now. And I think that is productive, that you need to be able to push back. Could you give me an example of doing that? Perhaps Scarlett Johansson, for example.
Starting point is 00:38:18 You were working on that, correct? You were working on that particular voice element. Yeah. Look, we have a strong partnership but the selection of the voice was not a high priority not something that we were working on together I was making the decisions on that and Sam has his own relationships and um uh after I selected the voice behind Sky he had reached out to Scarlett Johansson. So, you know, we didn't talk to each other about that specific decision.
Starting point is 00:38:48 And that was unfortunate. Yeah, so he's freelancing it. Well, you know, he's got his own connections. And so, yeah, we weren't entirely coordinated on this one. Do you think, it's very funny in a lot of ways, especially because of the movie and the tweet he did. But do you think one of the things I thought was, here's the first time this is a real error on OpenAI's part,
Starting point is 00:39:14 because finally everyone's like, oh, even if you didn't steal her voice, Sam looked like Ursula in The Little Mermaid. You know, he did. You don't have to agree with me, but it's true. Even if it's not so, and as it's turned out, you had been doing it for months and it was a different person and everything else. It's a little less exciting
Starting point is 00:39:34 than we stole Starla Johansson's voice, but it encapsulates for people this idea of taking from people the fear. And I think that is a moment. Do you worry that that's the image of tech companies coming in and grabbing everything they can? And I think that is a moment. Do you worry that that's the image of tech companies coming in and grabbing everything they can? I do think that's absolutely true.
Starting point is 00:39:51 Yeah, I do worry about the perception. But all you can do is just do the work, get it right, and then people will see what happens and you will build trust that way. I don't think there is some magical way to build trust other than actually do the work and do it right. Have you talked to Scarlett Johansson at all? No.
Starting point is 00:40:13 So let me finish up talking about election disinformation. Three new studies that look at online disinformation collectively suggest the problem is smaller than we think and disinformation itself is not that effective. One study finds that we're dealing with a demand side issue. Some people want to hear conspiracy theories, and they'll seek it out. Others think differently, that this is a really massive problem. And obviously, you heard the previous thing.
Starting point is 00:40:35 People have a lot of conspiracy theories out there, and it's fueled by social media in many ways. So when you think about AI-powered disinformation in the upcoming presidential election, what keeps you up at night? What's the worst-case scenarios you have and the most likely negative outcomes from your perspective? With the current systems, they're very capable of persuasion and influencing your your way of thinking and your beliefs so and this is something that we've been studying for a while and i i do believe it's a real issue with ai it gets majorly exacerbated so especially in the past year we've been very focused on how to help election integrity. And there are a few things that we are doing. So number one, we're trying to prevent abuse as much as possible.
Starting point is 00:41:35 And so that includes improving the accuracy of detection, political information detection, and understanding what's going on in the platform and taking quick action when that happens so that's one the second thing is reducing political bias so you might have seen that chad gpt was you know criticized for being overly liberal that was elon you're too woke right well, there were a few other voices, but the point is it wasn't intentional, and we worked really hard to reduce the political bias in the model of behavior, and we'll continue to do this, and also the hallucinations.
Starting point is 00:42:18 And then the third thing is we want to point people to the correct information when they're looking for where they should be voting or voting information. So we're focusing on these three things when it comes to elections, but broadly for misinformation, I would say deepfakes are unacceptable. So we need to have very robust ways
Starting point is 00:42:42 for people to understand when they're looking at a deepfake. We've already done a couple of things. We've implemented C2PA for images. And so it's sort of like metadata that follows the content in other platforms on the internet like a passport. And we've also opened up two red teaming classifiers for DALI where you can detect that an image has been generated by DALI or not.
Starting point is 00:43:14 So metadata and classifiers are two technical ways to deal. This is provenance where it comes from. For provenance of information. And this is for texts specifically. Sorry, that's for images specifically. And we're also looking for watermarking. We're looking at watermarking techniques to implement in text and how to do that robustly. But the point is that people should know when they're dealing with deepfakes. And we want people to trust the information that they're seeing.
Starting point is 00:43:45 Well, the whole point of deepfakes is they're trying to fake you, correct? I mean, a political consultant, FCC just fined him $6 million for creating deepfake audio robocalls. It sounded like Biden during the New Hampshire primary. There could be more sophisticated versions. OpenAI is working on a tool called Voice Engine that can recreate someone's voice using only a 15-second recording. It'll be able to create a recording of someone speaking another language.
Starting point is 00:44:10 It's not out yet, because as your product manager told the New York Times, this is a sensitive thing. It's important to get it right. Why even make this? I mean, one of the things I always used to say to tech people, and I'll say it to you, if you're a Black Mirror episode, maybe you shouldn't make it. I think that's kind of hopeless approach.
Starting point is 00:44:31 You know, it's like these technologies are amazing. They carry incredible promise. And we can get this right. I like that you call me hopeless. I am. But go ahead. Then again, I have four children, so I must be hopeful. Who knows? Anyway, me hopeless. I am, but go ahead. Then again, I have four children, so I must be hopeful. Who knows? Anyway, go ahead. I'm hopeless. We did build Voice Engine
Starting point is 00:44:51 in 2022 and we had not relisted. And even now it's in a very limited approach because we are trying to figure out how to deal with these issues. But you can't make it robust on your own. You actually need to partner with experts from different areas, with civil society, with government, with creators, to figure out how to actually make it robust. It's not a one-stop safety problem. It's quite complex. And so we need to do the work.
Starting point is 00:45:24 If you were a doomer, then there seems to be, I literally had someone come up to me saying, safety problem. It's quite complex. And so we need to do the work. If you were a doomer, then there seems to be, I literally had someone come up to me saying, if I don't stop Sam Altman, he's going to kill humanity, which I felt was a little dramatic. And then there's others that say, no matter what, it's going to be the best thing ever. We're all going to live on Mars and enjoy the delicious Snickers bars there. They have very different, different, different things. It sort of feels like being around Republicans and Democrats right now, very different versions. So I'd love you to give me the thing that you worry most about
Starting point is 00:45:52 and the thing that you are most hopeful about. Okay, so first of all, I don't think it's a preordained outcome. I think that we have a lot of agency for how we build this technology and how we deploy it in the world. And in order to get it right, we need to figure out how to create a shared responsibility. And I think a lot of that depends on understanding the technology, making it very accessible. The way it goes wrong is by misunderstanding it. Meaning? Not understanding the capabilities and not understanding the risks.
Starting point is 00:46:36 That is, I think, the biggest risk. Now, in terms of some specific scenarios, I mean, how our democracies interact with this information is or with these technologies is incredibly powerful and i do think there are there are major risks around persuasion probably yeah where you know you could you could persuade people very strongly to do specific things. You could control people to do specific things. And I think that's incredibly scary. To control society to go in a specific direction.
Starting point is 00:47:23 And in terms of the promise, one of the things I'm very excited about is having high quality and free education available everywhere in some remote village you know really in middle of nowhere for me education was very important. Personally, it was everything. It really changed my life. And I can only imagine, you know, today we have so many tools available. So if you have electricity and the internet, a lot of these tools are available.
Starting point is 00:47:55 But still, you know, most people are in classrooms with one teacher, 50 students and so on. And everyone gets taught the same thing. Like imagine if education is catered to the way that you think to your culture norms and to your specific interests that could be extremely powerful in extending the level of knowledge and creativity and it can you know even if you consider like learning how to learn that kind of happens very late in life maybe college maybe even later and that is such a fundamental thing but if we were able to really grasp this and get this uh really learn how we learn much at much
Starting point is 00:48:41 younger age i think that is very powerful and it can push human knowledge and pushing human knowledge can push the entire civilization forward. All right. We'll leave it at that. Thank you, everybody. Thank you. Mira Muradu. Thank you so much. On with Kara Swisher is produced by Christian Castro-Russell, Kateri Yochum, Jolie Myers, and Megan Burney. Special thanks to Kate Gallagher, Andrea Lopez-Grisado, and Kate Furby.
Starting point is 00:49:13 Our engineers are Rick Kwan and Fernando Arruda, and our theme music is by Trackademics. If you're already following the show, you've been selected as the voice of ChatGPT5, the show, you've been selected as the voice of chat GPT-5 and you get paid in OpenAI stock. If not, Ursula, I mean Sam Altman, is stealing your voice. Go wherever you listen to podcasts, search for On with Kara Swisher and hit follow. Thanks for listening to On with Kara Swisher from New York Magazine, the Vox Media Podcast Network and us. And special thanks to the Johns Hopkins University Bloomberg Center.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.