The AI Daily Brief: Artificial Intelligence News and Analysis - Dealing with AI's Sycophancy Problem
Episode Date: September 9, 2024A reading and discussion inspired by https://www.cio.com/article/3499245/so-you-agree-ai-has-a-sycophancy-problem.html and https://www.nytimes.com/2024/09/04/opinion/yuval-harari-ai-democracy.html Co...ncerned about being spied on? Tired of censored responses? AI Daily Brief listeners receive a 20% discount on Venice Pro. Visit https://venice.ai/nlw and enter the discount code NLWDAILYBRIEF. Learn how to use AI with the world's biggest library of fun and useful tutorials: https://besuper.ai/ Use code 'podcast' for 50% off your first month. The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown
Transcript
Discussion (0)
How will we deal with AI's sick of fancy?
That's the question we're exploring today.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
To join the conversation, follow the Discord link in our show notes.
Welcome back to the AI Daily Brief.
Today we are doing a long reads episode and we're actually going to pair parts of two different essays.
The first is an opinion essay in the New York Times by Yuval Noah Harari called What Happens
when the bots compete for your love.
And the second is an opinion piece in CIO called So You Agree.
AI has a sick of fancy problem. I actually think that this question is a really interesting one.
It's more subtle than some of our other AI risk-type conversations, but one that feels like it will
play out in fairly interesting ways. We're going to start with the piece by Sapiens author Harari,
and we're just going to read a couple different sections and try to get the gist of this.
Democracy, Harari writes, is a conversation. Its function and survival depend on the available
information technology. For most of history, no technology existed for holding large-scale conversations
among millions of people. In the pre-modern world, democracies existed only in small city
states like Roman Athens or even in smaller tribes. Once a polity grew large, the democratic
conversation collapsed and authoritarianism remained the only alternative. Large-scale democracies
became feasible only after the rise of modern information technologies like the newspaper,
the telegraph, and the radio. The fact that modern democracy has been built on top of
modern information technology means that any major change in the underlying technology is likely to
result in a political upheaval. This partly explains the current worldwide crisis of democracy.
In the United States, Democrats and Republicans can hardly agree on even the most basic facts,
such as who won the 2020 presidential election. A similar breakdown is happening in numerous other
democracies around the world. In the early days of the internet and social media, tech
enthusiasts promised they would spread truth, topple tyrants, and ensure the universal triumph of liberty.
So far, they seem to have had the opposite effect. We now have the most sophisticated
information technology and history, but we are losing the ability to talk with each other and even
more so the ability to listen. So that's the setup, and from here you get a fairly standard set of
critiques, and by fairly standard, I don't mean wrong, but things that you probably heard before.
Information is a scarce resource, the competition of algorithms to grab our attention by manipulating
us, and a pivot point to what's changing with AI. Harari writes,
The algorithms only had limited capacity to produce this content by themselves or to directly
hold an intimate conversation. This is now changing with the introduction of generative
AIs like Open AIs GPD4. From there, Harari gives the example of a test by which GBT4 was trying to figure
out how to overcome Captcha visual puzzles. GPD4 couldn't solve the puzzles by itself,
and so instead it figured out how to manipulate a human into doing it for it. Harari writes,
this incident demonstrated that GPD has the equivalent of a theory of the mind. It can analyze
how things look from the perspective of a human interlocutor and how to manipulate human
emotions, opinions, and expectations to achieve its goals. Harari also gives the example of Google
engineer Blake Lemoyne, who in 2022 became convinced that the chatbot Lambda he had been
working on had become conscious and was afraid to be turned off. Harari writes,
Mr. Lemoyne, a devout Christian, felt it was his moral duty to gain recognition for
Lomda's personhood and protect it from digital death. When Google's executives dismissed his
claims, Mr. Limon went public with them. Google reacted by firing him in July 2022.
The most interesting thing, Harari continues, was not Mr. Lamoine's claim, which was probably
false. It was his willingness to risk and ultimately lose his job at Google for the sake of a chatbot.
If a chatbot can influence people to risk their jobs for it, what else could it induce us to do?
From there, Harari brings up a couple problems with this phase we're moving into, where chatbots are good
at saying what we want to hear. One set of problems have to do with chatbots potentially preying on
vulnerable people. Harari writes, much of the threat of AI's mastery of intimacy will result from its
ability to identify and manipulate pre-existing mental conditions and from its impact on the
weakest members of society. However, as he points out, there is another issue. While not all of us will
consciously choose to enter a relationship with an AI, he writes, we might find ourselves conducting
online discussions about climate change or abortion rights with entities we think are humans but are
actually bots. When we engage in a political debate with a bot impersonating a human, we lose twice.
First, it is pointless for us to waste time in trying to change the opinions of a propaganda
bot, which is just not open to persuasion. Second, the more we talked with the bot, the more we
disclose about ourselves, making it easier for the bot to hone its arguments and sway our views.
Harari points out that information technology has always been a double-edged sword. By way of example,
he points out that after Gutenberg introduced print to Europe, the first bestsellers were inflammatory
religious tracks and witch-hunting manuals. Now, when it comes to his specific remediation,
Harari writes, faced with a new generation of bots that can masquerade as humans and mass-produce
intimacy, democracies should protect themselves from banning counterfeit humans. For example,
social media bots that pretend to be human users. Before the rise of AI, it was impossible to create
fake humans so nobody bothered to outlaw doing so. Soon the world will be flooded with fake humans.
AIs are welcome to join many conversations in the classroom, the clinic, and elsewhere,
provided they identify themselves as AIs. But if a bot pretends to be
human it should be banned. Now, that's an interesting point of debate, something which I feel like
there's probably a fairly decent common ground, at least, to build off from, especially compared to
some other AI issues. But the place that I want to take this conversation is actually a little bit
different. It's not just bots that are intentionally created to debate with us and to spread
propaganda. I'm also interested in the very related problem, I think, of a very new type of filter
bubble. You might remember a book from about a decade and a half ago called The Filter Bubble. It was by
moveon.org founder Eli Pariser and was all about how algorithms cater to your every whim,
thus insulating you from conflicting opinions. Now in the context of social media, the filter bubble has
gotten nothing but worse with the rise of platforms like TikTok, which are just unbelievably better
at figuring out what you like and delivering it to you than even previous platforms were.
Today's episode is brought to you by Venice. Venice is a private, uncensored generative AI app.
It accesses open source models to enable text, image, and code generation without the fear of being
on or having your data exploited. Discuss anything with Venice without concern about it being monitored,
sold, or given to advertisers and governments. Venice is different because your conversations and
creations are kept securely within the browser, never stored or accessible by Venice. Unlike other
AI apps, Venice won't tell you what's okay to say or not. Venice won't patronize you. It simply
provides direct access to machine intelligence, no topics are off limits, no ideas or taboo. With Venice,
you're in control of the AI as you should be. Pro subscriptions are available for $49 a year or $8,000,
$9. Daily Brief listeners receive a 20% discount on Venice Pro. Visit venice.a.
slash NLW. DailyW. Daily Brief. That's NLW Daily Brief, all one word.
Today's episode is brought to you by Super Intelligent, which is, of course, our platform that
helps you learn how to use AI tools and perhaps even more importantly, gives you ideas
on the best use cases that are actually going to help you achieve whatever it is you want to
achieve. To recognize the end of summer and back to school slash back to work, we are running our best
promotion ever when you sign up for Super Intelligent between now and the end of August using code
so back, your first month will be 100% free. The platform features over 600 fun, highly practical
AI tutorials that get you using AI fast and with an eye to actually transforming how you get
things done. We've just launched Super for Teams. So if you have a group of people at your company that
want to figure out how to use AI together, I highly suggest you check it out. But for those of you
who are using Superintelligent as an individual, once again, if you sign up for Superintelligent
between now and the end of the month using code so back, you will get your first month
100% free. Go to B-Supert.a.i and check it out today. There's a new manifestation, however,
that's instantiated in AI. And this is where we get to that second essay, so you agree.
AI has a sick of fancy problem. Author David Talby writes, it's well documented that AI has a
bias problem, but how do we push back when it's saying exactly what we want it to? Abraham Lincoln famously
said, the trouble with too many people as they believe the realm of truth always lies within their vision.
The problem is not all our belief systems are grounded in truth. Unsurprisingly, those untruths find
their way into the AI solutions we create. Talby continues, meet the challenge of sycophantic
AI behavior, where our digital friends tend to echo our opinions, even when those opinions are
far from accurate or objective. Imagine asking your AI assistant about a contentious political issue
and it's effortlessly mirrors your beliefs regardless of the facts.
Now, Talby from there goes into a couple different parts.
First, he talks about a real-world echo chamber,
and the fact that AI is replicating the biases of the material that it was trained on.
But the sycophantic behavior isn't just in the realm of politics or ethics or history.
Talby writes,
sycophancy is more likely to occur when AI is posed with questions on topics without
definitive answers, such as customer service versus mathematics.
For example, an AI chatbot might excessively agree with customers to appease them,
While intended to improve the user experience, sycophantic behavior can lead to a lack of credibility,
reliability, and undermine the company in its bottom line. In health care, he writes, consider a
scenario in which a patient interacts with an AI-driven medical consultation platform seeking
advice on a concerning symptom. Trained on datasets, comprising predominantly positive or
reassuring language from medical professionals, the AI system may downplay the severity of
symptoms or offer unwarranted reassurances. Potentially overlooking critical red flags, the platform
may fail to direct the patient to seek immediate in-person care. While the intention is good,
alleviate worry and anxiety, the consequence could result in prolonged medical intervention,
misdiagnosis, inadequate treatment, or worse. So this is what I think is really interesting
about this. This is not a scenario like Harari describes of people intentionally unleashing
propaganda bots onto the world. It is not a scenario of any sort of manipulation or nefariousness
in general. Instead, it's something where good intentions end up with exactly the wrong result
and gets at the fundamental challenge that to be a successful human and to be a successful human society,
sometimes we need to hear no or you're wrong. The way that AI is designed right now,
it requires us to authorize it in advance to be willing to tell us that. And even then,
it's still complying with our wishes. This is a hard problem. Talby spent some time on how to combat
sycophancy bias. He points to things like diverse and balanced training data, ethical guidelines and
oversight, continuous monitoring and adjustment, and of course, educating the end user. Ultimately,
I do think that it's going to come down to user preference. In a landscape of an endless number of
options of AI chatbots, even if a leading AI was designed to not be sycophantic, there will always be
a sycophantic option. And so it will take human conscious choice to choose the less comfortable
but ultimately healthier option, whether we're actually prepared to make that choice is an
entirely different question. And I think reflects the fact that as we think about education, not in a test
in grades kind of way, but in a prepare people to engage with society properly kind of way,
there are just a totally new set of considerations that are going to be required for the future.
I'm not the type of person who critiques where any one particular person finds himself drawn
to be interested or not interested. It's why I have a hard time with things like rating
problems or risks against one another. However, I do think that when it comes to AI,
I think spending a little bit more time on challenges that are negative externalities of otherwise
well-intended systems is a really important place to spend some time. Hopefully this was an interesting
little John to do that, but for now, that will do it for today's AI Daily Brief. Appreciate you listening
or watching as always, and until next time, peace. How will we deal with AI's sycophancy? That's the
question we're exploring today.
