The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Sets Moonshot Goal of AI Superalignment in 4 Years

Starting point is 00:00:00 Today on the AI breakdown, we're talking about OpenAI's massive moonshot goal to reach super alignment in four years. Before that, on the brief, Google's new data policies and why AI is killing the open web. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to breakdown.network for more information. Is it the end of the open internet? After a number of recent moves by tech giants, it certainly seems that way. Welcome back to the AI breakdown brief, all the AI headline news you need in five-ish minutes or less. We kick off today with something that is in many ways a continuation of the story of Reddit and

Starting point is 00:00:40 Twitter's changes in their API considerations from last week. Over the weekend, Google updated its privacy policy, and it basically says that the company reserves the right to scrape pretty much anything that anyone posts online. Now, specifically, it says that it's allowed to use this data to train its AI models. So for all intents and purposes, if you create words on the internet that Google can see, Google says it has the right to train AI on them. Now, in terms of how this changes things, part of it is that it's expanding the types of AI that Google says that it could use this data for.

Starting point is 00:01:12 As Gizmodo writes, previously Google said the data would be used for language models rather than AI models, and whereas the older policy only mentioned Google Translate, BARD and Cloud AI are now listed as well. Gizmodo also sums up the stakes pretty well. They write, the practice raises new and interesting privacy questions. People generally understand that public posts are public, but today you need a new mental model of what it means to write something online. It's no longer a question of who can see the information, but how it could be used. Now, of course, these sort of policies are exactly what's behind recent changes to Twitter and Reddit's API.

Starting point is 00:01:44 In April, Reddit announced that it would now be charging companies who scraped its data, and Elon on Twitter has been on an anti-scraping tirade recently. The biggest manifestation of that so far is Twitter's move to cap the number of tweets that any account can read. with the specific numbers based on whether you're verified or not, and to many, the implications of this are quite a bit larger than just which internet services you use. Bob McElrath writes, there is no more open internet.

Starting point is 00:02:08 It's all walled gardens under the lie of free speech. Rob Sheridan writes, the thing we warned about way back in the early days of the net neutrality fight was that media and telecom companies wanted to turn the open internet into an entertainment platform of walled gardens, paid tiers and proprietary content. Now, that call is coming from inside the net.

Starting point is 00:02:26 The way the verge sums it up is this. Add it all up, and the social web is changing in three crucial ways. It's going from public to private. It's shifting from growth and engagement, which broadly involves building good products that people like to increasing revenue no matter the trade-off, and it's turning into an entertainment business. It turns out there's no money in connecting people to each other, but there's a fortune in putting ads between vertically scrolling videos that lots of people watch. So the social media era is giving way to the media with comments era, and everything is an entertainment platform now. Now, of course, part of the downside of these walled gardens is in just that they create balkanized experiences, but that they give platforms much more control than they previously had. Professor Ethan Mollock writes, everything the early open internet hackers were worried about turned out to be true. If you can't have your own local copy of something, it can be disappeared. Any corporate online platform can become a walled garden overnight and your content taken away. Axios sums it up public web unravels an AI-driven storm. The old web is coming apart at the seams faster than efforts to shape a new one can fill the gap. The Axios piece continues. Any site

Starting point is 00:03:27 that depends on contributions from the public, text messages, product reviews, photo or video uploads is preparing to be swamped with AI-generated input that will make finding signal in the noise even harder for human users. At the same time, these sites are trying to shut their technical gates so others can't gobble up troves of data for AI models to study. The tech world has built 30 years of growth on the idea of the open web. But the emergence of chat GPT and other AI tools trained on this stockpile of human expression, along with a financial downturn that's made firm scramble for revenue, has imperiled the old ideal of the web. as a public resource.

Starting point is 00:03:59 Now, of course, one of the other battles that this sets up is the one that has been anticipated in the Web 3 space for some time. As Axios puts it, many of the users and volunteer moderators who contributed all that valuable data to these sites are chiming in, too, to say, hold on, we created all that value for your company. We should have a say in how it gets used and maybe a share of the profit. I think we are just at the beginning of this battle, but it is remarkable to note how fast the rise of generative AI has fundamentally shifted the way that people think about the open

Starting point is 00:04:28 web, or perhaps to put it more accurately, the way that platforms think about their opportunities and challenges in the context of the open web. I do think data-hungry AI is in this case combining with the fallout of the shift away from zero interest rate policies and the incredible financial pressure that internet companies now find themselves under. Sadly, from where I'm sitting right now, I don't totally disagree with Axios' conclusion that, whatever happens, the online world is going to end up with less commons and more silos. Next up today, another area where the challenges of that move away from ZERP policies has been very clear. Venture capital funding is down 49% year over year in Q2 and 51% for the first half of 2023 as compared to 2022. As CrunchBase puts it, AI was

Starting point is 00:05:13 quarter two's big hope to reverse the global venture funding slowdown. It wasn't enough. Now, the picture that this research paints is just a significant decline in venture capital funding. And by and large, this has been expected for some time. The venture capital space is just finally now starting to deal with the fundamental shift in interest rate policy and how that impacts where investors are looking for yield. And the byproduct is simply less money for startups. Now, many think that this is going to create an extinction level event over the next 18 months as companies that raised at high valuations over the last few years

Starting point is 00:05:43 are forced either to get to profitability or have to raise money at what will in many cases be down rounds where the valuation is lower than it was in previous fundraising. Now, when it comes to AI, companies that are categorized as AI in Crunchbase raised $25 million in the first half of this year, which was down from $29 million invested in the first half of last year, but up as a portion of total funding. AI companies represented 18%, so almost a fifth of total global funding over the first half of this year. Now, that does include $10 billion in funding to Open AI that was led by Microsoft, so the statistic might be a little bit warped by that. A couple more quick ones before we get out of here, the trend of entertainment unions adapting to or

Starting point is 00:06:21 trying to adapt to AI continues, as IATSE, which is a union that represents around 160,000 professionals in the entertainment space, has released their core principles for applications of artificial intelligence and machine learning. If I had to sum up these core principles, it would be that one, artificial intelligence is here and it's not going away. Two, as they write, it threatens to fundamentally alter employers' business models and disrupt IATSE members' livelihoods. And so three, this union is going to work hard to prepare its members for this new future. IATSE tweets the stakes involved are high. Therefore, our approach as a union must be comprehensive, focusing on research, collaboration, education, political and legislative advocacy, organizing, and collective bargaining.

Starting point is 00:07:02 One of the things here that they discuss extensively is upskilling or re-skilling. They write, We assert that our members have the right to receive adequate training and upskilling opportunities to navigate any changes brought about by AI in their work environment. We will continue to work to equip our members with the skills to navigate this technological advancement and build a culture of continuous education. Now, speaking of upskilling, another little Passover news item from Reuters, India's Tata Consultancy, which is one of the biggest IT consultancies in the world, has announced a new generative AI offering, as well as a plan to upskill 25,000 of their engineers on Microsoft Azure's OpenAI. One of the interesting dynamics of this new era of AI is the way in which it levels the playing field

Starting point is 00:07:40 between highly skilled workers in places like America and their engineering counterparts in other countries. In other words, this could be a big moment for global competitiveness. Finally, a story from Bloomberg, which I definitely think is going to be reflective of a larger trend. The piece is called the U.S. military is taking generative AI out for a spin. It focuses on how the Defense Department is testing five different large language models to see how they do when trained on proprietary secret military data and can be used to quickly help military organizations ask questions, access information, make predictions, test scenarios, and more. Bloomberg writes, The military exercise, which runs until July 26th, will also serve as a test of,

Starting point is 00:08:18 whether military officials can use LLMs to generate entirely new options they've never considered. For now, the U.S. military team will experiment by asking LLMs for help planning the military's response to an escalating global crisis that starts small and then shifts to the Indo-Pacific region. In other words, they're testing to see what would happen if we got in a conflict with China around Taiwan. Speaking of that, in reaction to the U.S.'s growing controls around AI chips sent to China, China has announced plans to curb exports of AI chip-making materials. On July 3, the Chinese Ministry of Commerce issued a statement saying that beginning on August 1st, export of a couple products that are used in chip manufacturing, will now require a government-issued license.

Starting point is 00:08:57 Now, I don't think, given the U.S.'s aggressive anti-China moves as it relates to AI, that this is all that surprising, but it does help demonstrate just how tense and geopolitical the AI issue really is. Anyways, guys, that is going to do it for today's AI breakdown brief. If you're enjoying it, go check out the AI breakdown newsletter. You can find that at the AI breakdown.bighive. That's B-E-E-H-I-I-V.com. Thanks for watching and listening, and I'll be back soon with the main AI breakdown. Welcome back to the AI breakdown. Today, we are talking about a new initiative from OpenAI that is meant to help make it so that AI doesn't take over the world. Over the past few months concerned around AI safety, and in particular, the extinction risk or existential risk that comes from a super-intelligent, AI or an advanced general intelligence have really rocketed into mainstream consciousness.

Starting point is 00:09:48 One of the big drivers of this has been Jeffrey Hinton. At the beginning of May, the New York Times published this piece called The Godfather of AI leaves Google and warns of danger ahead. Now, the idea that someone could be ludicrously well compensated as someone at Google is, and still leave and effectively turn one's back on their entire lifetime of research and work, has really captured people's attention and put the spotlight on these questions of AI risk. Now, when it comes to AI risk, part of the way that people in the space try to address or think about it is what they call alignment. In other words, aligning AI with human values.

Starting point is 00:10:24 The presumption is that an advanced general intelligence wouldn't necessarily share human values a priori. And so in order to avoid the various scenarios that end up with AI exterminating the human race in one way or another, we need to figure out ways to align advanced AIs with our own societal norms and values. The problem historically has been that there has far less time, energy, compute, and other resources dedicated to alignment than there is to advancing AI capabilities. This makes sense in the context of our market system, which is going to reward people more for their advances in capabilities than it is for their advances in alignment, but that gap has been hugely problematic, and part of the reason why AI Dumers are so, so scared. It was notable then yesterday

Starting point is 00:11:03 when OpenAI, the company behind ChatGPT and pretty much the most influential company in the space, published a blog post called Introducing Superalignment. The lead of the post, reads, we need scientific and technical breakthroughs to steer and control AI systems much smarter than us. To solve this problem within four years, we're starting a new team co-led by Ilya Sutskever and Jan Lakey, and dedicating 20% of the compute we've secured to date to this effort. The post begins, Superintelligence will be the most impactful technology humanity has ever invented and could help us solve many of the world's most important problems. But the vast power of superintelligence could also be very dangerous and could lead to the disempowerment of humanity or even human

Starting point is 00:11:41 extinction. While superintelligence seems far off now, we believe it could arrive this decade. Managing these risks will require, among other things, new institutions for governance, and solving the problem of superintelligence alignment. In other words, how do we ensure AI systems much smarter than humans follow human intent? Currently, we don't have a solution for steering or controlling a potentially super intelligent AI and preventing it from going rogue. Our current technologies for aligning AI, such as reinforcement learning from human feedback, rely on humans' ability to supervise AI, but humans won't be able to reliably supervise AI systems much smarter than us, and so our current alignment techniques will not scale to superintelligence. We need new scientific

Starting point is 00:12:20 and technical breakthroughs. So there is a lot here just in this first section. One, once again, Open AI notes that the extreme possibility of human extinction is real. I've said it before and I'll say it again that it is quite significant that the leading company in this space speaks so openly about that risk. It's exactly the type of thing that in most historical context a company would try to tamp down or even malign and cast a gate is crazy. A second interesting thing about this opening is what Open AI's sense of timeline is. They believe they say that super intelligence could arrive this decade, given that historically people's estimates of how fast AI was going to develop have been off but in the other direction, as in AI has developed much

Starting point is 00:13:00 faster than they thought. This prediction that superintelligence could be here before 2030 is super, is super, super notable. Finally, this acknowledgement that today's techniques are not sufficient is put really plainly and really clearly. There is an acknowledgement from OpenAI that in the context of a super intelligent AI that is smarter than us, we simply won't be able to monitor it, like a zealous parent would. So to me, right from the beginning, it's clear that there is a serious level of intent behind this new initiative. So what is Open AI's approach going to be? They write, our goal is to build a roughly human level automated alignment researcher. We can use vast amounts of compute to scale our efforts and iteratively align superintelligence.

Starting point is 00:13:39 To align the first automated alignment researcher, we will need to, one, develop a scalable training method, two, validate the resulting model, and three, stress test our entire alignment pipeline. They also say that they're expecting their research priorities will evolve substantially as we learn about the problem, and so this solution is not necessarily the one that they'll end up with, but it is where they are starting. From there they get into the resources that they're going to dedicate to this. First of all, it's a new team, and it's being led by their chief.

Starting point is 00:14:04 scientists than a co-founder of the company. More than that, however, is the announcement that they're dedicating a fifth of the compute that they've secured to solving this particular problem. Now, you may say a fifth isn't all that much, but to put it in context, OpenAI has had to delay features this year because they simply don't have enough compute to push them forward. They had intended, for example, to release a multimodal version of chat GPT, and it appears that that's just not going to be possible because of lack of access to compute. So to take 20% of that and move it away from non-commercial resources is not an insignificant move. Still, maybe the most notable thing about this is the moonshot nature of it.

Starting point is 00:14:40 OpenAI writes, our goal is to solve the core technical challenges of superintelligence alignment in four years. Going from something we have no idea how to do to actually solving it in four years is an incredibly, incredibly ambitious goal. And this is absolutely something that people in the community have taken note of. And I think that's a good segue to start talking about what people in the community actually think about this. Let's start with the AI safety community, who are the folks who are most likely to be skeptical of it. Zvi Maushowitz, who's one of the best writers on AI safety right now, sent out a Twitter poll saying, if your top priority is AI not killing everyone, joining the Open AI Super Intelligence Alignment team is the options he listed

Starting point is 00:15:20 were a great play, a good play, about neutral or an actively bad idea. In reverse order, of the 500-plus voters, 12.7% said an actively bad idea, 19.1% percent. said about neutral, 38.2% said a good play, and 30% said a great play. So almost a third of people think that this would be a great approach for someone who wanted to focus on eliminating AIX risk. And overall, about two-thirds of people thought that it was a great or good play. Eliezer Yudkowski initially had concerns that there was a disparity between how the people who were focusing on the alignment side of things were being compensated as compared to those who were focused on the capability side, tweeting, based on previous reports that a typical ML person

Starting point is 00:15:59 at OpenAI earns 930,000 per year. It looks like you're planning to pay alignment people one-fourth to one-third. What capabilities people earn at OpenAI? Am I in error? To that, a researcher at OpenAI said you are. The alignment research scientist's role page says the annual salary range for this role is $245,000 to $450,000. The generic research scientist role page says the annual salary range for this role is $200,000

Starting point is 00:16:20 to $300,000 to $370,000. To that, Liezer said, okay, I'm temporarily relieved. And even tweeted out, I'm in error. Yay. Daniel F writes, glad they're not dancing around this issue or using euphemisms here. As an aside, superalignment sounds like trying to align superheroes, which makes me think of the TV show The Boys, it's a goofy term, but one that seems hard to co-opt or watered down. And maybe the most

Starting point is 00:16:40 telling response to me comes from Loran Shapira, who wrote, hot take, that's awesome, more than I was expecting and more than any competitor is doing. Now, most importantly, Leron points out, they didn't have to make themselves vulnerable to accusations of failing to meet the standard they set for themselves, but they did. And what he's talking about is what it means to set this four-year goal. In the absence of a four-year goal, in the absence of identifying a percentage of compute that's being used for this, it would be easy for this to be a sort of corporate philanthropy-type effort that is ultimately greenwashing or just some PR-focused thing. By announcing that they're using 20% of their compute for this goal, and by announcing their

Starting point is 00:17:15 goal of four years, those are benchmarks that people can measure against to see how OpenAI is doing at its own initiative. Now, of course, this isn't to say that there is some skepticism. Autism Capital writes OpenAI owning alignment research is like the cigarette companies owning cancer research. TK. Rangrenagin writes OpenAI has no incentive to avoid even known downsides of the AI tech they create. The superalignment is simply a checkbox faint to avoid regulations. Maybe more middle of the road, Roheat asks, practically, does OpenAI's announcement about super alignment meaningfully change anything from what they were already doing before? 19.3% said yes. 60.6% said no. 20.1% said a secret third thing.

Starting point is 00:17:54 For what it's worth, I am definitely in the yes column on this question. I think that putting a co-founder and chief scientist on a team that is going to have access to a fifth of the compute resources of a company, where compute resources are inevitably and inexorably scarce, and where there is a four-year moonshot goal for one of the hardest problems in the world, is significantly different than just having a bunch of people who focus on alignment kicking around the office. Now, from the less cynical side, a lot of people recognize that moonshot nature of this and appreciated the ambition behind it. Riley Goodside, who was featured on that cognitive revolution,

Starting point is 00:18:24 episode that we had on our feed a few weeks back wrote, A Statement of Purpose in Giant Font, toward an ambition like no other. Meanwhile, others like Dr. Jim Fan from Nvidia are jumping in with their own ideas already. He writes, Open AI's alignment strategy says that humans won't be able to reliably supervise AI systems much smarter than us.

Starting point is 00:18:41 But I think we can move humans up the supervision chain, i.e. feedback to feedback. He then gives an example of writing malware and how a human system could be redesigned to have humans be able to supervise much better than they could today. Now, what about the wisdom of the crowds? On manifold markets, there are a number of prediction markets around this particular issue now. One of them asks, will OpenAI superalignment project produce a significant breakthrough in alignment research before 2027,

Starting point is 00:19:06 which currently stands at a 62% chance of yes? Now, when the question was asked a little bit more concretely, i.e. not just a generally important breakthrough, but actually achieving the goal of superalignment, the confidence interval was a bit lower. Victor Lee created a prediction market. Will the OpenAI Super Alignment team believe that their goal has been achieved after four years? And only 26% say yes. Now, of course, the big question that overhangs is what happens if the alignment team gets to the end of four years and hasn't made the breakthroughs that they want?

Starting point is 00:19:35 What happens if instead they've convinced themselves that there aren't good ways to align superintelligence? Nathan Young on Twitter asks, Will OpenAI stop developing AGI if their alignment team is pessimistic in four years? I think that's a really good question to ask, and one that's worth. having discussions about now. You have to think that OpenAI is having some of those conversations, but ultimately I'm sure they're going to reserve judgment until they're actually able to make some progress on this challenge. Now, I definitely find myself in the optimistic camp around this. Not optimistic in the sense that I'm convinced that in four years we can solve alignment to

Starting point is 00:20:07 superintelligence, but optimistic in the sense that this is a serious effort at it. It's an attempt to align their company at least around a moonshot goal, and I wouldn't be surprised if this is influential in how other companies have to follow suit as relates to some of these issues. Anyways, I am interested to know what you guys think. Do you have that cynical take on this, that this is just for PR or an ability to avoid regulations? Are you less cynical about the intention, but skeptical of the capacity to actually make progress? Do you think that an even broader effort is needed? And if so, what do we do to encourage it? I think if nothing else, this is a good moment to ask those questions. So hit me up in the comments, come join me on Twitter, and let's have the super alignment conversation.

Starting point is 00:20:46 do it for today's AI breakdown. If you enjoyed it, go share this episode with one person who you think would be interested in these issues. I'd love to have them join the community as well. Until next time, guys, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Sets Moonshot Goal of AI Superalignment in 4 Years

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.