The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Forms Catastrophic Risk Preparedness Team

Starting point is 00:00:00 Today on the AI breakdown, we're looking at OpenAI's new catastrophic risk preparedness team. Before that on the brief, an incredibly cool model that can watch videos and actually understand them. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube channel, our Discord, and our newsletter. Welcome back to the AI breakdown brief. All the AI headline news you need in around five minutes. And man, we have some really cool stuff today. We kick off with a new foundation model. It is a video language model from the company 12 Labs. Alongside the announcement of this model, they are also announcing an investment from invidia, Intel, and Samsung,

Starting point is 00:00:46 which I think probably tells you all you need to know about just how exciting this is, given those huge companies participating in this investment. So like I said right at the beginning, this is a model where you can have it watch a video and then ask it to summarize, to give highlights, to generally interact with the video in an intelligent way. Now, you may have seen other tools that do this, but 12 Labs says that their solution is somewhat different. They write, Contrary to existing solutions that either utilizes speech-to-text conversions or rely solely on visual frame data, Pegasus 1 integrates visual, audio, and speech information to generate more holistic text from videos. Basically, what they're saying is that we are not taking this,

Starting point is 00:01:26 translating it into a script, from which we can interact with it in the same way that a normal LLM would, but instead that they have built a model from the ground up that integrates all of this different type of information. Pegasus 1, they write, has approximately 80 billion parameters, with three model components jointly trained together, their video encoder, their video language alignment model, and their language decoder. Now, yes, this looks cool, but does it perform? According to their research, the answer is yes. They write, Pegasus 1 exhibits massive performance improvement over previous state-of-the-art video language models and other approaches to video summarization. Now, the company has released a technical report on this that you can go read,

Starting point is 00:02:04 where you can get more information around how they compare to other types of models, both in terms of their strategy and in terms of their performance, and you can also learn more about the dataset underlying it as well. What you can't do, unfortunately, is actually use it yet, but you can join a wait list. This is a product category that has tons of opportunity, and I think that we'll really break down the barriers between the different categories of content, that we all consume. We're moving quickly to a world where regardless of what format the creator creates content in, you can consume it in a format of your choice, be it audio, video, or otherwise, with the help, of course, of AI. Next up, in the world of AI image generation, a slightly different approach from

Starting point is 00:02:43 shutterstock. Obviously, in this space, one of the big questions is where regulation, policy, and the courts are going to come down on questions of copyright, and we've seen a number of different strategies to try to get out ahead of that. Companies like Adobe have created models in their Firefly 2 that are trained entirely on images that they have the rights to, trying to sidestep those issues. And of course, they've also offered to pay for legal fees should that not be sufficient. Shutterstock is walking even farther down that path, basically by allowing users to transform photos that are already in the Shutterstock library and paying artists if those images are licensed after they're edited with AI. So really, in some ways, this is more of an AI.

Starting point is 00:03:24 extension of existing stock photo libraries, but one that solves some of the questions of artist compensation. Some of the ways that users will be able to modify those images, include expanding the image or background, resizing the image to match the required dimensions, removing the background, and doing something akin to Adobe's generative fill, where you can select an area of the photo and describe a natural language what you want to add, replace, or erase. Now, obviously, ultimately, this is still more creativity constrained than the pureplay image generators that many people are using now, but in a way that does answer a lot of questions that are still dogging that space. Moving back to the world of business, we had another earnings call

Starting point is 00:04:02 that was totally focused on AI, and that comes from Amazon. Earlier in the week, we had earnings calls from Microsoft and Google. Microsoft was the big winner out of that, seeing its Azure business grow even more than analyst's expectations. Google's cloud business had a little bit of a tougher time, but in both cases, AI was clearly the feature. So two was it with Amazon's earnings call on Thursday. Yahoo Finance kicks off their article, Amazon entered the AI hype zone. Now, there were two things going on that analysts liked here. One was that Amazon beat earnings estimates, which is the most important thing. But two, CEO Andy Jassy said that their AI business was an opportunity that was worth, quote, tens of billions of dollars to their Amazon Web Services business. Earlier this year, of course,

Starting point is 00:04:42 Amazon launched Bedrock, which is a way for enterprise clients to help customize develop and implement AI models, and Jassie said, quote, our genitive AI business is growing very, very quickly. In fact, he said it was growing so quickly they were even surprised at the growth of that business. Now, last weekend, we talked about how Amazon fits into the larger big tech competition around AI and how they're trying to operate at the level of applications and enterprises and actually getting this technology into companies that need it, which is, of course, their bedrock approach, but also on the level of actual foundation models, specifically with their recent tie-up with anthropic, and then down even a little farther, into their approach to developing AI chips.

Starting point is 00:05:21 Seeing that it is actually already having an impact on the bottom line, means that markets are likely to be very eager that they continue down this path. Meanwhile, over in the land of Google, things were a little quiet this week, but the company did announce a new bug bounty program specific to generative AI. Cyber threats, they write, evolve quickly, and some of the biggest vulnerabilities aren't discovered by companies or product managers, but by outside security researchers. Today they write, we're expanding our vulnerability rewards program to reward for a tax scenario specific to generative AI.

Starting point is 00:05:50 Now, this just makes total sense to me and I think is a thing that you're going to see from lots of companies, especially because the very existence of AI increases the variety of threats that companies are going to face. Now, lastly today, I want to talk about a really interesting survey from the information. The information is a Silicon Valley publication. You hear about it a lot here because they have a really good beat on, particularly the big tech companies. They have been behind a lot of the scoops when it comes to new forthcoming features in the AI space, and their readership also sort of reflects, I think, their sources, right? In other words, they are a very tech-native publication through and through from their reporters to what they cover to the audience that reads them.

Starting point is 00:06:30 They recently did a survey with their readers around how they're using AI and whether or not they're paying for it. The way that I would treat this is as a look at perhaps how an early adopter cadre is actually using these technologies. And in that, I think there's some pretty interesting data. When they asked the question, do you ever use any AI services for personal or professional use? The overwhelming majority did. Only 12% in fact said no. 11% said yes for professional use, 19% said yes for personal use, and 58% said yes for both. But what about what they spend on AI? Well, apparently one in five respondents who use AI, which again represents about 90% of the people surveyed, said that they

Starting point is 00:07:07 spend at least $100 monthly for AI services. That's specifically for AI services at work. A similar portion of the group said they spend at least $30 monthly on AI services for personal use as well. What's more? It seems like people are pretty happy with these tools. When asked if they plan on continuing to pay for these same AI services next year, more than half, 54% said yes, and another 30% said probably. In other words, the vast vast majority are not churning and are finding enough value to continue paying. Now, what about which services they use? Totally unexpectedly, chat GPT is right at the top of the heap, with only 14% of people not using chat GPT, and a full 51% using a paid version of it. Google Bard was next with 45% using a free

Starting point is 00:07:48 version. After that, were Dolly and Mid Journey. Dali had 15% using a paid version with 22% using a free version, not surprising given that it's now integrated into chat GPT, and Mid Journey had 15% using a paid version. GitHub co-pilot had around 13% using a paid version, and another 6% using a free version. Anthropics Clod had 15% using a free version, but only 6% using a paid version. One of the takeaways for the information authors was that people were more willing to pay for AI services that have very specific applications like generating code or generating images. Now, when it comes to why people are using these tools professionally, some are because it saves them money, and some are because it saves them time. One respondent runs a company called Element

Starting point is 00:08:31 Human that uses machine learning to analyze body language and says that he spends around 500 pounds a month on AI services, but that that saves the company 2,000 pounds per month and likely more in the future. Another respondent uses AI to prepare for interviews on his podcast. He said, normally it'd take a couple hours to put together eight questions for an interview, and using AI, I can cut that down to under 10 minutes. Now, overall sentiment was the best that they've had since they began their monthly surveys back in February. Fifty-eight percent of respondents said they think conditions will improve, and only 14 percent said they think conditions will get worse. Like I said, this is a very early adopter kind of group.

Starting point is 00:09:05 A full 56% of the people who participated in this survey actually work in tech, but it still gives, I think, a great little insight into what early adopters are doing and how much these tools are changing people's workflows right now. However, that will do it for today's AI breakdown brief. Next up, the main AI breakdown. Welcome back to the breakdown. There is a lot, and I mean a lot of discourse right now around questions of AI safety and AI risk in general. but specifically catastrophic risk in AI, real human extinction level risk. The reason for that is a couple parts. One, it's just the culmination of a narrative which has been coming up all year.

Starting point is 00:09:50 You've seen a steady beat of scientists and academics and some people from industry, saying that these are issues that we really need to take seriously, that we really need to think about. Another part of that is that it's coinciding with a growing policy conversation. governments around the world are figuring out how they are going to handle the full spectrum of risks from AI, as well as take advantage of it and leave their populations in the best possible place, and most sees its advantages to help their populations. Now, along those efforts, next week we are getting the UK's much-ballyhooed and much-discussed AI Safety Summit. This has been in the news for a variety of reasons.

Starting point is 00:10:28 One, because the UK Prime Minister's office has been really putting emphasis on this and making it a big political issue. Two, because there is intrigue and controversy around it. One of the decisions that Rishi Sunox's government made was to invite and include China in the proceedings. That apparently angered allies, especially the United States, who has taken a very, it's us against them approach when it comes to China and AI. That has obviously been embodied in the extension of an increase in export restrictions around AI chips, including cutting off loopholes that China was using to get around the export restrictions that were put into place last year at this time.

Starting point is 00:11:04 So, this event is coming up next week. It's happening right at the beginning of November, and around that it seems like both governments and labs are positioning themselves vis-a-vis this type of issue. We talked yesterday about how the White House is planning to release their executive order on AI on Monday, and OpenAI has also just announced a new initiative around this, which is directly focused on these catastrophic misuses of AI. So, in a blog post called Frontier Risk and Preparedness, OpenAI announced their preparedness team and the Preparedness challenge. The company writes, as part of our mission of building safe AGI, we take seriously the full spectrum of safety risks related to AI, from the systems we have today to the furthest

Starting point is 00:11:44 reaches of superintelligence. In July, we joined other leading AI labs and making a set of voluntary commitments to promote safety, security, and trust in AI. These commitments encompassed a range of risk areas, centrally including the frontier risks that are the focus of the UK AI Safety Summit. As part of our contributions to the summit, we have detailed our progress on frontier safety, including work within the scope of our voluntary commitments. Now, we will come back to exactly what they said and what they reported in that in just a moment. But Open AI continues that when it comes to catastrophic risks from Frontier AI,

Starting point is 00:12:14 there are a set of questions that society needs to answer. One, how dangerous are Frontier AI systems when put to misuse? Two, how can we build a robust framework for monitoring, evaluation, prediction, and protection against these problems? Three, if our Frontier AI model weights were stolen, how might malicious actors choose to leverage them? Because these questions need answers, OpenAI is now convening this preparedness team. They write that the team will connect capability assessment, evaluations, and internal red teaming

Starting point is 00:12:41 for frontier models, from the models we develop in the near future to those with AGI-level capabilities. As a little aside, they are clearly saying that we are not developing AGI-level AI just yet. They go on, the team will help track, evaluate, forecast, and protect against catastrophic risks spanning multiple categories, including individualized persuasion, cybersecurity, chemical, biological radiology and nuclear threats, autonomous replication, and adaptation. Now, that first category, individualized persuasion is interesting. Given a tweet the other day from OpenAI CEO Sam Altman who wrote, I expect AI to be capable of superhuman persuasion well before it is superhuman at general

Starting point is 00:13:17 intelligence, which may lead to some very strange outcomes. Okay, so coming back to this preparedness team, they're helping track, evaluate, forecast, and protect against these risks. But I think it's clearly that last one that we're most interested in, right? how do you actually help protect against it? One of the specific things is that they're developing what they're calling a risk-informed development policy. This, they say, will, quote,

Starting point is 00:13:38 detail our approach to developing rigorous frontier model capability evaluations and monitoring, creating a spectrum of protective actions, and establishing a governance structure for accountability and oversight across the developmental process. Basically, to me, this reads like the mandate of the preparedness team vis-a-vis this risk-informed development policy is to actually integrate how the company deals with these risks to the actual development and deployment process. In other words, rather than having it be totally disconnected,

Starting point is 00:14:07 or something that you just look at once the model is ready to go, it seems at least they're giving the indications that this preparedness team will actually integrate those questions into the process of how open-eye works in general. If that's the case, and certainly I could be optimistically reading into it, that I think would be a huge and meaningful development. One of the big concerns that many have pointed out, including a piece that we will read for long reads this weekend, is that there is an incredible disparity between the amount of money and just the general resources being spent to add capabilities to AI as opposed to align and minimize risks of AI. If that becomes a more integrated process within Open AI, and especially if that

Starting point is 00:14:46 sets a template and a model that others can follow, or even that government's mandate others follow, that could be a meaningful change to how the industry is evolving. So as part of this launch, they also announced their AI preparedness challenge. They say to identify less obvious areas of concern and build the team, we're offering 25,000 in API credits to up to 10 submissions on catastrophic misuse prevention. This is now available on the website. There's a preparedness challenge. It's just opena.com slash form slash preparedness dash challenge. And if you're interested, there's not a lot of words. You give them your name and your LinkedIn. And then basically you have around 600 words total, along with the three-page max PDF to explain a particular misuse your

Starting point is 00:15:24 concerned about, why it might lead to catastrophic harm, and a plan for how to actually deal with it. I'll include a link to this in the notes so that if any of you have these ideas or want to get involved with this team, you have easy access to it. Now, as I mentioned, Open AI said that as part of this announcement, they were also sharing an update around the UK AI Safety Summit on their approach to frontier risk. This is sort of building off those voluntary commitments that came earlier in the summer. So what have they done around those commitments? Well, they say when they launched Dolly 3, which is their first major public release of a new, frontier model within the scope of those voluntary commitments. They, quote, did critical safety

Starting point is 00:15:59 work including pre-deployment safety evaluation and red teaming. In addition, they say, we are working towards new methods to empower people to track the provenance of AI-generated media, end quote, have continued to invest in responsible practices through a rollout of voice and image analysis capabilities in chatGBT, although they don't say what those responsible practices are. They also said that they've met their voluntary commitment to, quote, establish or join a forum or mechanism through which we can develop, advance, and adopt shared standards and best practices for Frontier AI Safety. That is the Frontier Model Forum, which we discussed the other day, as it just got itself its first executive director. Still, it's quite clear that when it comes to what they

Starting point is 00:16:33 think is significant here, it really is this risk-informed development policy. The vast majority of this blog post is in fact spent on that policy. They basically expand upon what they said before, that the RDP will detail their approach to evaluations and monitoring, as well as establishing a governance structure, and that it will provide for a spectrum of actions to protect against catastrophic outcomes. Now, they do note that this is meant to extend their existing work, including the work of their safety systems team to conduct research, their superalignment team, which has this moonshot mandate of aligning super intelligent AI systems with human intent in the next three and a half years now. And they also discuss a joint deployment safety board

Starting point is 00:17:09 with Microsoft, which they say, quote, approves decisions by either party to deploy models above a certain capability threshold. The DSB focuses specifically on deployment decisions rather than on earlier steps, such as deciding whether or not to train models of a certain scalar capability level. Now, another part of this note talks about just how much of a scientific challenge alignment really is.

Starting point is 00:17:29 They put it really bluntly. Our current techniques for aligning AI, such as reinforcement learning from human feedback, rely on human ability to supervise AI. But these techniques will not work for superintelligence because humans will be unable to reliably supervise AI systems much smarter than us. Now, I will also share a link to this because there is some really interesting information

Starting point is 00:17:46 about how their model evaluations and red teaming work. There's also an acronym that they're using a lot, which is CBRN, which refers to chemical, biological, radiological, and nuclear uses and risks of use of AI. And I think if you take anything away from this write-up and from the preparedness team announcement, it's that while there isn't a substitute for real actions, the presence of this larger conversation around AI risks is having an impact in how people are thinking about their development. There is quite clearly serious effort being put into this. Now, we can argue about whether there is enough and about whether that effort needs to be matched by resources in terms of

Starting point is 00:18:20 both money and compute. This certainly looks different than the conversation did even just six months ago. Now, two other sort of related announcements heading into the AI Safety Summit next week. The first is that the United Nations has created an advisory body addressing AI governance. This was announced yesterday by the Secretary General, and it is a 39-member advisory body that can include tech company executives, government officials, and academics. Said Secretary General Antonio Gutera's, the transformative potential of AI for good is difficult even to grasp. And without entering into a host of doomsday scenarios, it is already clear that the malicious use of AI could undermine trust and institutions, weaken social cohesion and threaten democracy itself. So basically, this body is not one

Starting point is 00:19:02 that is actually going to have any sort of real governance oversight, but is one which is meant to help make initial recommendations for intergovernmental cooperation around these issues. The UN said that the immediate tasks including building a global scientific consensus on those risks and challenges and strengthening international cooperation on AI governance. First meeting of this body is taking place actually today. Lastly, yesterday, of course, the British Prime Minister Rishi Sunak had a speech about AI heading into next week's summit, where frankly a lot of the conversation was around the decision to invite China, but he also announced, as reported by Reuters, that they are setting up the world's first AI safety institute.

Starting point is 00:19:39 Sunuk said that the Institute will, quote, advance the world's knowledge of AI safety and will carefully examine, evaluate, and test new types of AI so that we understand what each new model is capable of, exploring all the risks from social harms like bias and misinformation through to the most extreme risks of all. So like I said, if you are trying to step back and get a sense of what's happening right now,

Starting point is 00:19:58 it's that the world is very clearly taking these threats seriously. Fascinatingly, you have people on either side of this question who are upset about this for one reason or another. The accelerationists are of course concerned that anything that stands in the way of AI is a failure to realize how beneficial it can be to humanity. And so of course they don't want to see Open AI slowed down because they're worried about these risks. And then on the flip side, you have plenty of people who say either A, it's not enough, and that even more serious commitments are needed, that more government power is needed to exert control on these big AI labs. Or another dimension of that is that we're focusing on the wrong issues, that there's too much time spent on these big future theoretical scenarios and not enough time on the existing problems like biases that show up in LLM. right now. My general take is that if a lot of people on lots of sides of an argument are upset about

Starting point is 00:20:43 something that's happening in the middle for different reasons, then it might, might, just be the start of a good compromise approach. But of course, this is an extremely dynamic and fast-moving space. We will hear a lot more, I'm sure, about all of this next week with the Safety Summit actually happening. However, for us today, we will wrap it there. I appreciate you guys listening or watching as always. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Forms Catastrophic Risk Preparedness Team

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.