The AI Daily Brief: Artificial Intelligence News and Analysis - A Critique of Marc Andreessen's "Why AI Will Save The World"

Starting point is 00:00:00 Today on the AI breakdown, we're reading a pretty definitive response to Mark Andreessen's seminal essay that we shared last week, why AI will save the world. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Like, subscribe and share and go to Breakdown.network for more information. Hello, friends. Welcome back to another AI long read, in this case, an AI long read Saturday. This week, I have not lost my voice and so, as exciting as the AI experiment was, you've actually got to be a real read. the real me this time. That said, we are staying on the themes from last week. You can probably tell at this point that it's my goal to give you lots of different perspectives when it comes to questions of AI safety and risk. Given where he sits in the technology space and society more broadly,

Starting point is 00:00:48 anything that Mark penned about this particular issue, and especially something that he is so clearly trying to make a strong and definitive statement, was going to get picked up, it was going to get tons and tons of attention. There have been a few different responses, but the one of the one I've seen recommended most to me as a good follow-up to read comes from Dwarkesh Patel, the host of the Lunar Society. The piece is called Contra Mark Andresen on AI and was published on Dworkesh's blog. Dwarkeh writes, Mark Andresen published a new essay about why AI will save the world. I had Mark on my podcast a few months ago and he was, as he is usually, very thoughtful and interesting. But in the case of AI, he fails to engage with the worries about AI misalignment. Instead, he

Starting point is 00:01:31 substitutes aphorisms for arguments. He calls safety warriors cultists, questions their motives, and conflates their concerns with those of the woke trust and safety people. I agree with his essay on a lot. People grossly overstate the risks AI poses via misinformation and inequality. Regulation is often counterproductive and naively regulating AI is more likely to cause harm than good. It would be really bad if China outpaces America in AI. And technological progress throughout history has dramatically improved our quality of life. If we solve alignment, we can look forward to material and cultural abundance. But Mark dismisses the concern that we may fail to control models,

Starting point is 00:02:08 especially as they reach human level and beyond. And that's where I disagree. Section, it's just code. Mark writes, quote, My view is that the idea that AI will decide to literally kill humanity is a profound category error. AI is not a living being that has been primed by billions of years of evolution to participate in the battle for the survival of

Starting point is 00:02:28 the fittest, as animals are, and as we are. It is math, code, computers, built by people, owned by people, used by people, controlled by people. The idea that it will at some point develop a mind of its own and decide that it has motivations that lead it to try to kill us is a superstitious handwave. End quote. Dwar Keshe goes on. The claim that you will completely control any system you build is obviously false, and a hacker like Marx should know that. The Russian nuclear scientists who built the Chernobyl nuclear power plant did not want it to melt down. The biologists at the Wuhan Institute of Virology didn't want to release a deadly pandemic, and Robert Morris didn't want to take down the entire internet.

Starting point is 00:03:06 The difference this time is that the system under question is an intelligence capable, according to Mark's own blog post, of advising CEOs and government officials, helping military commanders make better strategic and tactical decisions, and solving technical and scientific problems beyond our current grasp. What could go wrong? I just want to take a step back and ask Mark, or those who agree with him, What do you think happens as artificial neural networks get smarter and smarter? In the blog post, Mark says that these models will soon become loving tutors and coaches,

Starting point is 00:03:33 frontier scientists and creative artists, that they will, quote, take on new challenges that have been impossible to tackle without AI, from curing all diseases to achieving interstellar travel? How does it do all this without developing something like a mind? Why do you think something so smart that it can solve problems beyond the grasp of human civilization will somehow totally be in your control? Why do you think creating a general intelligence just goes, well by default. Saying that AI can't be dangerous because it's just math and code is like saying

Starting point is 00:04:00 tigers can't hurt you because they're just a clump of biochemical reactions. Of course it's just math. What else could it be but math? Mark complains later that AI warriors don't have a falsifiable hypothesis. And I'll address that directly in a second. But what is his falsifiable hypothesis? What would convince him AI can be dangerous? Would some wizard have to create a shape-shifting intelligent goo in a cauldron? Because short of magic, any intelligence we build will be made. of math and code. As a side note, I wonder if Mark has tried to think through why the pioneers of deep learning, including multiple Turing Award winners, the CEOs of all the big AI labs, and the researchers working most closely with advanced models think AI is an existential risk. Surely he doesn't

Starting point is 00:04:40 think he could just hand this essay to them and they'd say, oh, it's just code. Why didn't I think of that? What was I even worried about? AI doesn't even need to have bad goals in order to be disastrously dangerous. It can simply pursue a goal we give it in a way we never intended. Even within human societies optimizing on a seemingly banal goal has led to awful consequences. Maximize equality leads to communism causing uniform suffering and death, expanding your nation's fear of control, leads to leaders starting a war that kills millions of people. With advanced AI, the situation is much worse. We're talking about something that is potentially more powerful than any human. It doesn't start off with the hundreds of nuanced and inarticulable

Starting point is 00:05:17 values embedded in human psychology as common sense. We laugh at toy examples like when tasked with solving depression, the AI develops an advanced opioid and hooks everyone up to an IV. The AI is not doing this because it's stupid, but because it doesn't automatically have all of our contradictory and often counterbalancing values. We live in a world where the only intelligent optimizers are other humans that basically think like us and want similar kinds of things. So it becomes hard for us to imagine what it would be like to have a different type of mind come online, and how strange and destructive its methods for executing our commands might be. Section. The Testable Hypothesis

Starting point is 00:05:51 from Marx essay, quote, My response is that their position is non-scientific. What is the testable hypothesis? What would falsify the hypothesis? How do we know when we are getting into a danger zone? These questions go mainly unanswered apart from you can't prove it won't happen. End quote.

Starting point is 00:06:07 And back to Dorcas. We're already in a situation where Sidney Bing threatened to blackmail, bribe and kill people, albeit in acute and endearing way. And GPT4 told the user how to manufacture biological weapons, and we're still far from human-level intelligence. Obviously, Sidney Bing can't hurt you given its current capabilities, but do you feel confident that whatever caused it to go rogue won't happen on a much larger and more destructive scale, with a model that is much smarter?

Starting point is 00:06:33 If so, why? A technologist and venture capitalists like Mark should be the most open to the possibility that there will be big, unprecedented changes whose possibility we can't immediately verify. When Mark created the first web browser in 1993, what was his immediately falsifiable thesis about the internet? Of course, you could just build GPT-8 and see if it does exactly what we want it to do in exactly the way we want it. We can then hope that gradient descent on predicting the next token somehow, by default, creates the drive to act in the best interest of humanity. Though evolution optimizes our ancestors over billions of years to leave behind the most possible copies of our genes,

Starting point is 00:07:07 and here we are using condoms and pulling out. We're hastily putting together a new airplane. Its smaller prototypes sometimes fall out of the sky in weird ways that we don't really understand. Would you like to be the test pilot? I wonder if Mark actually has asked a bunch of people working on technical alignment, whether they have a falsifiable hypothesis, or if he's just assuming that the answer must be no. I posted Mark's question in a group chat, and just off the cuff, Tristan Hume, who works on interpretability alignment research at Anthropic, supplied the following list, edited for clarity.

Starting point is 00:07:38 One, I'd feel much better if we solved hallucinations and made models follow arbitrary rules in a way that nobody succeeded in red-teaming, in a way that wasn't just confusing the model into not understanding what it was doing. Two, I'd feel pretty good if we then further came up with and implemented a really good supervision setup that could also identify and disincentivized model misbehavior to the extent where me playing as the AI couldn't get anything past the supervision. Plus evaluations that were really good at eliciting capabilities and showed smooth progress and only mildly superhuman abilities. And our data centers were secure enough I didn't believe that I could personally hack any of the major AI companies if I tried. Three, I'd feel great if we

Starting point is 00:08:12 solve interpretability to the extent where we can be confident there's no deception happening, or develop really good and clever deception evals or come up with a strong theory of the training process and how it prevents deceptive solutions. Section. The only thing that can stop a bad guy with an LLM. From Mark's essay, quote, In short, AI doesn't want, it doesn't have goals,

Starting point is 00:08:33 it doesn't want to kill you because it's not alive. An AI is a machine. It's not going to come alive anymore than your toaster will. End quote. Back to Dorcasch. This argument that AI is just like a toaster, contradicts the entire rest of the essay, which argues that AI will be so powerful that it will save the world. The very reason why AI will be capable of creating stupendous benefits for humanity, if we get it right,

Starting point is 00:08:57 is that it is much more powerful than any other technology we have built to date. If it was just like a toaster, it couldn't, quote, make everything we care about better. Even if it does exactly what a user wants it to do, how do you ensure that GPT7 doesn't teach a terrorist how to synthesize and transport a few dozen new deadly infectious diseases? Are you saying we could prevent such output? If so, what technique for reliably controlling GPT-7's behavior have you discovered which OpenAI will surely pay you a fortune to reveal? And even if you had such a technique, I'm sure they exist in principle,

Starting point is 00:09:29 otherwise alignment would be hopeless, how do you make sure everyone who builds advanced models implements them? After all, Mark advocated against mandated evaluations, standards, and other regulations at the end of the essay. Later, he writes, quote, I said we should focus first on preventing AI-assisted crimes before they happen. Wouldn't such prevention mean banning AI? Well, there's another way to prevent such actions, and that's by using AI as a defensive tool. The same capabilities that make AI dangerous in the

Starting point is 00:09:54 hands of bad guys with bad goals make it powerful in the hands of good guys with good goals, specifically the good guys whose job it is to prevent bad things from happening. End quote. Back to Dwar Keshe. The tech tree of synthetic biology or cyber warfare isn't necessarily organized according to catchy aphorisms. How exactly does a good guy with a bioweapon stop a bad guy, the bioweapon. While we might hope that new technology must always help defense more than offense, there's no metaphysical principle that says it must be so. In fact, even in theory, offense would be more advantaged. Rockets are much easier to build than missile defense systems. Synthesizing and transporting a single new virus into the U.S. is much easier than implementing

Starting point is 00:10:33 metagenomic sequencing across every single airport, harbor and border crossing, official and unofficial. The reason a terrorist hasn't done something 100x worse than 9-11 is not because the tech has in principle ruled out such attacks, in that our defense has progressed faster than their offense, rather there just aren't that many very smart people who want to cause mass destruction. I don't know what happens when only a modestly smart person can ask a chatbot for the recipe to cause one million deaths, but I see no reason to be confident that the outcome will be positive. Section. Regulation. Quoting Mark. This causes some people to propose, well, in that case, let's not take the risk. Let's ban AI now before this can happen. Unfortunately, AI is not some

Starting point is 00:11:12 esoteric physical material that is hard to come by, like plutonium. It's the opposite. It's the easiest material in the world to come by, math and code. The AI cat is obviously already out of the bag. You can learn how to build AI from thousands of free online courses, books, papers, and videos. And there are outstanding open source implementations proliferating by the day. AI is like air. It will be everywhere. The level of totalitarian oppression that would be required to arrest that would be so draconian, world government monitoring and controlling all computers, jackbooted thugs and black helicopters seizing rogue GPUs, that we would not have a society left to protect. End quote. Back to Dwar Keshe. Regulation may not be the answer and may indeed

Starting point is 00:11:55 make things worse, as in my view it has done in many other areas. I also worry that regulations will ban naughty thoughts in the name of AI safety in a way that does nothing about the larger alignment problem. But dismissing the legitimate worries about AI risk makes this more like, not less. After all, if people think that making GPD 4 woke and making sure GPT7 doesn't release a bio weapon are the same thing, and we want to make sure GPD 7 doesn't release a bio weapon, then I guess we got to make GPT4 woke. The best LLMs today require hundreds of millions of dollars to train, and something close to human level will probably take billions of dollars using the current approach. The idea that we'll need to seize the 3070s in every teenager's gaming rig

Starting point is 00:12:31 in order to evaluate and monitor the most advanced models is obviously ridiculous. Identifying and monitoring training runs that require tens of thousands of the newest server GPUs will require no more totalitarian oppression than the International Atomic Energy Agency requires to keep track of all the world's nukes. You cannot argue, as Mark has repeatedly done, that regulations are so destructive that they have paralyzed entire sectors of the economy, like energy, medicine, education, law, and finance, and at the same time argue that powerful technologies are so inevitable that trying to constrain them would be hopeless anyways. Regulations either have effects good or bad, or they don't.

Starting point is 00:13:07 Section. Motives. From Mark Andreessen's essay. Quote, In fact, these Baptists' position is so non-scientific and so extreme, a conspiracy theory about math and code, and is already calling for physical violence, that I will do something I would not normally do

Starting point is 00:13:24 and question their motives as well. Specifically, I think three things are going on. First, recall that John von Newman responded to Robert Oppenheimer's famous hand-wringing about his role creating nuclear weapons, which helped end World War II and prevent World War III with, some people confess guilt to claim credit for the sin. What is the most dramatic way one can claim credit

Starting point is 00:13:43 for the importance of one's work without sounding overly boastful? Dot, dot, dot. Second, some of the Baptists are actually bootleggers. There is a whole profession of AI safety expert, AI ethicist, AI risk researcher. They are paid to be doomers, and their statements should be processed appropriately. Third, California is justifiably famous for our many thousands of cults.

Starting point is 00:14:02 And the reality, which is obvious to everyone in the Bay Area, but probably not outside it, is that AI Risk has developed into a cult, which has suddenly emerged into the daylight of global press attention and the public conversation. This cult has pulled in not just fringe characters, but also some actual industry experts and not a small number of wealthy donors, including, until recently, Sam Bankman-Fried. And it's developed a full panoply of cult behaviors and beliefs. It turns out that this type of cult isn't new.

Starting point is 00:14:27 There's a long-standing Western tradition of millinarianism, which generates apocalypse cults. The AI Risk cult has all the hallmarks of a millenarian, Apocalypse cult. End quote. Back to Dwar Keshe. Mark is right here. The original AI safety thinkers are weirdos.

Starting point is 00:14:42 The AI ethics people are mostly grifters. And hyping up AI risk may plausibly be a way to give your AI work a larger significance in the grand scheme of things. So what? This is the obvious ad homonym. The idiosyncrasies of the people who develop a set of ideas are irrelevant to the debate around those ideas themselves. Every group of people that has done something important in history has been weird.

Starting point is 00:15:00 Normal people usually don't do cool shit. Are you throwing away your math textbooks because many of history greatest mathematicians were mentally ill? Will you smash your iPhone because the people who started the IT revolution were eccentric, autistic, and culty? It would be intellectually dishonest of me to write a section about how Mark is arguing against AI warriors because he wants to discourage regulations and industry standards that would harm his investments. I'm confident Mark genuinely believes what he says, but even if I'm wrong, who cares? The debate must always be about the ideas, not the people. Oppenheimer was correct that he was involved in a project that changed the course of human history.

Starting point is 00:15:33 Maybe Mark is right in that Oppenheimer was trying to aggrandize himself. Again, so what? If we're going to play a game of guilt by affiliation with Sam Bangman-Fried, might I point out that one of the world's largest crypto investors probably has fewer degrees of separation between themselves and Sam than a Turing award winner? To be clear, I think it would be ridiculous to say that SBF is a stand on Mark. But it's even more bizarre to name-drop SBF to dismiss AI Risk Warriors. Section, China from Mark's essay. China has a vastly different vision for AI than we do. They view it as a mechanism for authoritarian population control full stop. They're not even being secretive about this. They are very clear about it. And they are already pursuing their agenda. And they do not intend to limit their AI strategy to China. They intend to proliferate it all across the world.

Starting point is 00:16:17 Everywhere they are powering 5G networks, everywhere they are loaning built-in-road money, everywhere they are providing friendly consumer apps like TikTok that serve as front-ends to their centralized command and control AI. The single greatest risk of AI is that China wins global AI dominance, and we, the United States and the West, do not. not. I propose a simple strategy for what to do about this. In fact, the same strategy President Ronald Reagan used to win the first Cold War with the Soviet Union. We win, they lose. End quote. Back to Dwar Keshe. A world in which the CCP develops AGI first is a very bad world. But a lot of

Starting point is 00:16:50 the harm from China developing AI first comes from the fact they probably will give no consideration to alignment issues. So if we developed AI in the same cavalier attitude in an effort to beat them, and humanity's future is thwarted, what have we gained? Thankfully, we have a lead on China right now. We should maintain that lead and use it to dramatically increase investment in technical alignment. I don't understand how we get from develop AI tech faster to China is prevented from abusing AI. Do we use AI to fight a war against the CCP that takes them out of power and prevents an internal Chinese panopticon? If not, how exactly does the pace of AI progress in America change about what the CCP will do in China? At the end of the essay, Mark

Starting point is 00:17:29 advocates for free proliferation of open source AI. If the real threat from AI is the real threat from AI is that China catches up to American companies, then surely this is a bad idea. When we were competing with the Soviet Union, we wouldn't have allowed our advanced weapons technology to be leaked to the enemy. If AI is the most powerful tool in a conflict with China, then shouldn't we be similarly worried about sharing AI secrets with China through open source? Note that in terms of concrete ideas, there's a lot of overlap between China Hawks and AI safety warriors. They both want to prevent China from hacking into the AI labs and stealing weights. And I'm sure AI safety warriors were quite pleased by Biden's restrictions on China's

Starting point is 00:18:02 access to advanced semiconductors and the equipment used to make them. Export controls can harm China's ability to compete with us on AI without increasing the risk from more advanced models. Final section. AI isn't social media. People like Mark are still pretty hung up on the social media culture war. He had a front row seat to this battle for over a decade. Mark is on the board of meta and his firm A16Z was both an early investor in the original Twitter and now in Elon's takeover. Especially since 2016, there's been a huge moral panaceous. around misinformation, hate speech, and algorithmic bias. Social media companies did not stand their ground

Starting point is 00:18:37 and explain why these concerns were exaggerated or misplaced, at least until recently. Instead, they repeatedly conceded to the demands of activists, governments, and the media. It seems Mark wants to draw a line in the sand this time. We will not give in to your hysterical demands to control our technology. But a website where you post life updates, memes, and news articles, is just not the same thing, as a steadily improving general intelligence capable of doing science and tech.

Starting point is 00:19:00 And you're missing the story, you insist on thinking about the development of this current infant alien intelligence using your decade old culture war lens. Imagine the first hunter gatherers to plant some seeds across a patch of land. In the grand scale of human history, the creation of AI is as significant as the agricultural revolution. Forcing your perspective on AI through the current trust and safety wars is like the proto-agriculturalists killing each other over whether they dedicated their first harvest to the tree spirit or the fire deity. All right, back to NLW here. For just a quick round, up. Now, I'm not going to go point by point on whether I think Mark is more correct or Dwar

Starting point is 00:19:37 Keshe is more correct. Again, this is part of a longer exploration of the questions of AI safety and AI risk, and my goal is to be a forum that really gives lots of perspectives as long as they are engaged with in good faith and thoughtful consideration. What I will say, for a little bit of contextualization, because it's something I've been thinking about a lot this week, is I think that it may be important in this particular set of discourses to understand or to at least ask the question of what the goal of Mark's piece might have been. The more that I've spent time with it, the less convinced I am that it was meant to be his definitive statement about AI safety concerns. And instead, I get more convinced that it was meant as a very, very specific response to the way in which

Starting point is 00:20:25 AI risk has found its way into the media discourse specifically over the last month or so. We've talked a lot about how the folks who care about AI safety have been effectively screaming into a void for years, only to now find themselves at the center of an extremely mainstream topic. Every day there is some shocking new headline about how 42% of CEOs think that AI could end the world within five to 10 years or take your pick, right? There is always something like that in the news now. And I think it's reasonable to ask, even if you are an AI safety advocate, to what extent those headlines are driven by the media establishment thinking that this is an important discourse, which in many cases it will have been, versus them just being absolutely

Starting point is 00:21:11 sensational attention-grabbing, click-baiting, engaging headlines. And I think that that question does matter. On the one hand, you could argue that who cares? As long as the conversation is happening, it doesn't matter why the media has decided to take it up as a topic. And to some extent, that may be true. However, the risk, and I do think it is a profound risk, is that people become naturally skeptical or wary of doom-laden discourse. There have been a lot of things over the last few years that have been presented to people as the likely or potential end of humanity.

Starting point is 00:21:45 COVID-and-pandemic dialogue falls into this, as does even more notably climate change. There has been a never-ending increase and ratcheting up of the rhetoric around climate change for again what might be completely legitimate reasons of frustration that people aren't getting with the picture. The problem is that the net effect of that is that people naturally get tired. They don't want to think about their doom all the time. And so it becomes easier to inoculate oneself against those things as sensationalism, the diatribs of people who just want something from you, then as a real concern. It's a classic

Starting point is 00:22:20 boy who cried wolf, except the problem is it's not even the AIC. safety people crying wolf, it's that they have to deal with the fact that mainstream audiences feel like others have cried wolf for a long time. So there is this real question of the media's intentions and what happens if this just becomes fodder for headlines and will they turn off it because people stop engaging. But in the short term, there's no doubt that the media narrative has shifted dramatically. And to me, Entresen's YAI will save the world reads at least in a meaningful part, like a response not to Eliezer Yudkowski and Jeffrey Hinton and Max Tagmark, but to the New York Times, to the Washington Post, to CNN.

Starting point is 00:23:03 Now, I do think that Dworkesh addresses this at least a little bit at the end of the show, where he discusses that he worries that this might be fighting the social media battle one more time. There is no doubt that Silicon Valley and particularly the New York Times, but really the media establishment, have been in an influence battle for coming on five years now, that was especially ratcheted up during COVID. It is always the case that the beginning of new battles are fought like the end of old ones, and so to some extent this is inevitable. But I do think that the AI safety conversation deserves its own discourse.

Starting point is 00:23:34 That is, as much as we can do it, reflective of its own reality, rather than just echoes of technology issues passed. Anyways, guys, I hope that this was a nice rounding out perspective on that piece that we read last week. Thanks again to Dwar Keshe for writing it. There are a lot of people in this space who maybe had disagreements with it but weren't willing to or weren't able to put them together in such a coherent, clear way. And of course, thanks to you guys, as always, for listening. I hope you are having a great weekend. And we'll be doing a version of the weekly recap tomorrow, although this week, it'll be about the most interesting research that we didn't have a chance to talk about yet.

Starting point is 00:24:11 Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - A Critique of Marc Andreessen's "Why AI Will Save The World"

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.