The AI Daily Brief: Artificial Intelligence News and Analysis - The gpt2 Chatbot is Back as OpenAI Talks Data
Episode Date: May 9, 2024Today on The AI Daily Brief, NLW looks at expanded restrictions the White House is considering to deny China access to advanced models. Plus, an absolute slew of news around OpenAI. ** Check out the ...hit podcast from HBS Managing the Future of Work https://www.hbs.edu/managing-the-future-of-work/podcast/Pages/default.aspx Join Superintelligent at https://besuper.ai/ -- Practical, useful, hands on AI education through tutorials and step-by-step how-tos. ** ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI. Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/
Transcript
Discussion (0)
Today on the AI Daily Brief, more confirmation that search is coming or even already here for OpenAI, but maybe a delay in the announcement.
Before that in our headlines, the White House might be moving into a next phase of trying to deny China access to advanced AI models.
The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI.
Check out the Discord link in our show notes to join the conversation.
Hello, AI friends, one quick note before we get into today's episode.
You may have seen that we just announced that the breakings.
the Crypto Podcast, as well as the Bitcoin Builders show that I started last year,
and a bunch of related media properties are being acquired and moving over to Blockworks.
Blockworks is an excellent crypto media company.
I'm really excited for the breakdown and the other crypto content to be over there.
But the AI show is staying put right here.
As part of the transition, we are shifting the name to the AI Daily Brief instead of the AI
breakdown, just to remove any brand confusion.
The format will remain the same.
The cadence will remain the same.
Basically, if you like the AI breakdown, the AI Daily Brief is pretty much exactly the same thing.
Anyways, just wanted to share why you had been hearing a new name, but there is a lot to discuss
today, so let's get into it.
Welcome back to the AI headlines on the AI Daily Brief.
It's all the AI headline news you need in around five minutes.
We kick off today with yet further efforts from the U.S. government to limit China's access to
advanced AI.
For the last couple years, AI has been a centerpiece of the geopolitical tension between China's
and the USA. Since the end of 2022, the Biden administration has increasingly tightened export restrictions
on advanced AI chips, while at the same time trying to create incentives to bring chip manufacturing
back to the U.S. Now it appears that there is going to be a new front in that effort, with Reuters
reporting that the White House has, quote, preliminary plans to place guardrails around the most advanced
AI models like Chat Chhabit. Continuing Reuters writes, the Commerce Department is considering a new
regulatory push to restrict the export of proprietary or closed source AI models. Any action they write would
complement a series of measures put in place over the last two years to block the export of
sophisticated AI chips to China in an effort to slow Beijing's development of the cutting-edge
technology for military purposes. According to Reuters' sources, any new export controls would
likely target Russia, China, North Korea, and Iran. Back in February, Microsoft said it in a report
that it had tracked hacking groups affiliated with the governments of China and North Korea as well
as Russian military intelligence in trying to use their software to improve their hacking.
In terms of determining which models would be advanced enough to be under these controls,
it seems like it might be based on the computing power it took to train a model.
Reuters again writes,
The sources said the U.S. may turn to a threshold contained in an AI executive order issued last October
that is based on the amount of computing power it takes to train a model.
When that level is reached, a developer must report its AI model development plans
and provide test results to the Commerce Department.
That computing power threshold could become the basis for determining what AI models would be subject
to export restrictions.
If used, it would likely only restrict the export of models that have yet to be released since
none are thought to have reached that threshold yet.
According to the reporting, the agency is, quote, far from finalizing a rule proposal,
but just the fact that they're considering it, Reuters argues, shows how seriously the
U.S. government is taking AI as a geopolitical concern.
Now, of course, one of the challenges, if anything like this were to be imposed, is how much
open source just complicates things, especially given how close to state-of-the-art open source is
getting, it could be a real challenge.
To reiterate, this is currently just reporting from unnamed sources.
When Reuters reached out, the Commerce Department declined to comment, while the Russian embassy
in Washington did not respond.
The Chinese embassy, though, you know they were going to get a comment in, describing it
as a, quote, typical act of economic coercion and unilateral bullying which China firmly opposes.
Meanwhile, in terms of things that have actually happened on this area, the U.S. government
has revoked licenses that allowed Qualcomm and Intel to supply chips to Huawei.
The information writes,
U.S. lawmakers and officials have been alarmed by Huawei's ability to make advanced chips for smartphones
despite years of Western sanctions. Republican lawmakers concerned about national security risks
had been urging the U.S. government to cancel licenses that allow U.S. companies, including Qualcomm and Intel,
to continue to sell chips to the Chinese company. In many ways, this is just a continuation of policies
that were already started, but shows that in general, this is not just idle talk. Another part of the
world that is impacted by the China-U.S. battle around AI is, of course, the Middle East. For some time,
the Middle East has been positioning itself as a literally middle player, maintaining relationships
with both China and the U.S. However, the U.S. has been increasingly uncomfortable about the Middle
East relationship with China, seeing it as a way for China to getting around policies like the chip
export controls. Recently, UAE company G42 explicitly started to move away from its relationships
with China and took on a $1.5 billion minority investment from Microsoft, which was in part
set up by the U.S. Commerce Department. Now, Time magazine is reporting that the head of Saudi Arabia's
new investment fund for AI, said that if the U.S. asked them to divest from China, they would.
Amit Midha said, so far the requests have been to keep manufacturing and supply chains completely
separate, but if the partnerships with China would become a problem for the U.S., we will divest.
In another interview, Midha said, we are seeking trusted secure partnerships in the U.S.
The U.S. is the number one partner for us and the number one market for AI, chips,
and semiconductor industry. What's more, the U.S. government is not just prohibiting China
from using AI or trying to, but also figuring out how to use AI itself. For example, the
of Homeland Security is now piloting using AI to train officers who review applicants for refugee
status. Basically, the central idea of this pilot is that it can be really difficult for immigration
officers to do interviews with refugees who have experienced significant trauma. Said Secretary Alejandro
Mayorkas, refugee applicants given the trauma that they have endured, are reticent to be
forthcoming in describing that trauma. So then in the pilot, DHS is training the AI to act like
refugees so that their officers can practice interviewing them. Meanwhile, while over the last few weeks,
Microsoft has made a number of big investment announcements in Southeast Asia, they're now bringing it back
home with an over $3 billion investment to build AI in Wisconsin. Today, President Biden will speak with
Microsoft President Brad Smith in Mount Pleasant, Wisconsin to announce a $3.3 billion investment in a new
data center there. In addition to the data center, Microsoft said that they're also investing in a new
AI lab at the University of Wisconsin-Milwaukee to train employees to use AI. Said Brad Smith in an interview,
we have a huge responsibility to help ensure this technology serves people. Part of that is ensuring
that it works safely and remains under human control. But another part of it is really supporting
and aiding the transition of the economy. So lots going on in the world of AI and geopolitics,
but for now, that is going to do it for the headline section of the AI Daily Brief. Next up,
the main episode where we talk all about what's going on with Open AI, or really what isn't
going on with Open AI. As a listener of this show, I have a strong feeling you like to stay
up to date on all things artificial intelligence, including its impact on the workforce, which is why
I highly recommend checking out managing the future of work, the chart-topping business podcast from
Harvard Business School.
HBS professors Bill Kerr and Joe Fuller talked to business leaders, technologists, and
policymakers grappling with the forces like AI, globalization, and demographic shifts that are
reshaping the nature of work.
Recent guests include IBM's CHRO, Nicol-Lamuro, on how Big Blue is adopting AI,
Morningstar CEO, Kunal Kapoor, on how AI can raise the investment IQ.
Microsoft Corporate Vice President Jared Sparro on how the tech giant is experimenting its way
from AI assistance to autonomous agents and many other prominent movers in business and the
workforce ecosystem. So don't miss out. Follow managing the future of work on Apple Podcasts,
Spotify, or wherever you're listening now. Hello, AI friends today. I want to tell you about
our platform super intelligent. In short, it's a platform for useful, practical, immediately
applicable AI learning. We have nearly 400 video tutorials, each of which comes with step-by-step
how-toes, and the idea is to get you actually using these AI tools we talk about every day
in a matter of minutes to actually solve problems, create new opportunities, and just do really
cool things. To learn more and subscribe, go to be super.a.I. And if you do decide to subscribe,
use code podcast for 50% off your first month. Again, that's besupor.aI.
Welcome back to the AI Daily Brief. Boy, is there a lot of open AI news today.
None of it's huge or transformational. It's just really really.
diverse. As you'll see, there is enough that it will easily fill an episode. First up, we got some
more confirmation that OpenAI does appear to be getting ready to launch a search product that will
rival Google and other AI native companies like perplexity. According to Bloomberg, quote,
the feature would allow users to ask chat GPT a question and receive answers that use details from
the web with citations to sources such as Wikipedia entries and blog posts. One version of the product
also uses images alongside written responses to questions when they're relevant. If a user asks chat
GPD to change a doorknob, for instance, the results might include a diagram to illustrate the task.
Bloomberg also gave a shout out to the Twitter sleuth who found search.chatschapet.com last week,
but we're still a little bit in the realm of unknown sources.
The other open AI related thing from last week was, of course, that mysterious GPT2 chatbot
that appeared on an LLM ranking site, and which seemed to many to be more performant and
advanced than anything else out there.
After it got all of that attention last week, it was taken off of LIMSIS, but then a couple days
ago it came back. Once again, people are really impressed. This time it was called I'm a good
GPT2 chatbot, and much of the response I saw was people like Pietro Sherano who wrote,
I'm a good GPT2 chatbot is so good that it created a code interpreter that uses Claude Opus
for me. Excuse me as I faint in ontological shock. Lior at Alpha Signal AI tweeted a network error
due to high traffic that seemed to point to OpenAI as confirmation that it was behind the chatbot.
There was also Sam Altman, who on May 5th tweeted, I'm a good GPT2 chatbot.
Siki Chen from Runway suggested that this chatbot also revealed that chat GPT had already
stealth launched the search.
Siki writes, did chat GBTGPT already stealth launched search?
ChatGPT used to reject prompts asking for current events.
But now I get this without even a little browsing with Bing message.
He shared an image where he had asked, what's the latest news on this I'm a good GBT2 chatbot
model?
ChatGPt responds, the I'm a good GPT2 chatbot model that has recently surfaced in the AI
community is wrapped in a bit of mystery.
This model appearing on the LIMSys chatbot arena generated considerable interest due to its sudden
appearance and impressive performance, sparking speculation about its origins and capabilities.
There were initial reports that it performed better than GBT4, but concrete details about its
design or purpose were not disclosed and it was subsequently taken offline.
What was interesting about this is that it also cited its sources, pointed to Daily AI and some
other sources as where it had drawn this information.
And again, this was not technically supposed to be the Browse with Bing version.
All this is to say, there is even more evidence now.
that search with chat GPT is coming, but it seems also like we might have to wait just a little
bit longer to find out more about it. The information reported earlier this week that OpenAI
was considering postponing an event that had been planned for this Thursday, where they had
been intending to show off a set of new products, including presumably this search product.
The information says the spokesperson from OpenAI declined to elaborate on the reasons for
the change. If this alone were the OpenAI news slate, it would be a lot, but we are not even
close to done. Yesterday, OpenAI announced that they were working on new tools to detect
AI-created images. The blog post was called Understanding the Source of What We See and Here
Online. There were a couple things that were announced as part of this post. One was that OpenAI
was joining the steering committee of something called C2PA, the Coalition for Content Providence
and Authenticity. Earlier this year, they wrote, we began adding C2PA metadata to all images
created and edited by Dolly 3, our latest image model, in ChatchipT and the OpenAI API API.
We will be integrating C2PA metadata for SORA, our video generation model, when the model is
launch broadly as well. They also announced that they were working on new technology in this area,
including what they describe as temper-resistant watermarking, i.e. marking digital content like audio
with an invisible signal that aims to be hard to remove, as well as detection classifiers
or tools that use artificial intelligence to assess the likelihood that content originated from generative
models. As part of the announcement then, they shared that they were opening applications
for access to OpenAI's image detection classifier to a first group of testers. They write that in
there were internal tests, the classifier correctly identified around 98% of Dolly 3 images, and
less than 0.5% of non-AI generated images were incorrectly tagged as being from Dolly 3.
They write the classifier handles common modifications like compression, cropping, and saturation
changes with minimal impact on its performance, but other types of augmentations, such as
adjusting the hue or adding moderate amounts of Gaussian noise, can make a significant
difference. Lastly, on this front, Microsoft and OpenAI also announced that they were launching
a $2 million societal resilience fund, which is basically all about
combating deepfakes. Then there was another blog post from yesterday, this time called Our Approach to Data
and AI. The big TLDR of this comes in the section called We Respect the Choices of Creators and Content
Owners on AI. They write, decades ago, the Robots.TXT standard was introduced and voluntarily
adopted by the internet ecosystem for web publishers to indicate what portions of websites
web crawlers could access. Last summer, OpenAI pioneered the use of web crawler permissions
for AI, enabling web publishers to express their preferences about the use of their content in
AI. We take these signals into account each time we train a new model. That said, we understand that
these are incomplete solutions, as many creators do not control websites where their content may appear,
and content is often quoted, reviewed, remixed, reposted, and used its inspiration across multiple
domains. They conclude, we need an efficient, scalable solution for content owners to express their
preferences about the use of their content and AI systems. OpenAI's answer is something they're
calling media manager. It's a tool that will, quote, enable creators and content owners to tell us what
they own and specify how they want their works to be included or excluded from machine learning research
and training. They didn't explain exactly how, but that's the idea. There was a lot of commentary
here. Some people were pretty skeptical, but Brian Merchant noted that, quote, the mere fact that
OpenAI feels it needs to release a statement about training data principles and to announce a program
to let creators opt out of having their works included in training data shows how powerful the
movement against it has become. And then finally, we get to today where OpenAI announced
something that they called their model spec. Sam Altman writes,
we are introducing the model specifies how our model should behave. We will listen, debate,
and adapt this over time, but I think it will be very useful to be clear when something is a bug
versus a decision. We want to give users lots of control of AI with some hard boundaries that
society eventually agrees on. This is another step. While this is definitely very different
than Anthropics' constitutional approach to AI, in the sense that this is a public-facing document,
not a document that specifically is meant to inform AI behavior, it still has some of the same
ideas of articulating what they're trying to have their AI models be. So the approach includes,
one, objectives, broad general principles that provide a directional sense of the desired
behavior, such as assisting the developer and end user, benefiting humanity, and reflecting well
on open AI. There are tools, instructions that address complexity and help ensure safety and
legality. These include things like follow the chain of command, comply with applicable laws,
don't provide information hazards, respect creators and their rights, protect people's privacy,
and don't respond with not safe for work content. Lastly, there are default behaviors. Or as
They describe guidelines that are consistent with objectives and rules, providing a template for
handling conflicts and demonstrating how to prioritize and balance objectives.
These include things like assuming best intentions from the user or developer, asking clarifying
questions when necessary, being as helpful as possible without overstepping, assuming an objective
point of view, encouraging fairness and kindness and discouraging hate, and more.
Joanne Zhang, who worked on this product writes, I'm personally excited about this concept
of a model spec for three reasons.
One, there will be more clarity on whether something is a policy or an RLHF bug.
Two, principles are easier to debate and get feedback on versus hyper-specific screenshots or
abstract feel-good statements.
The way that she describes this is, it's easy for most people to agree on model should be
something, but the more important questions lie deeper in thorny scenarios.
For example, how should the model engage with someone who claims the earth is flat?
Third, she writes, model spec feedback will help us steer our efforts and steerability.
Unexplicit non-goal she writes for the model spec is to reach consensus on a one-size-fits-all
model.
That will never happen.
We want to give users and developers as much control as possible while staying within
hard boundaries that people understand. Hearing feedback on where and how everyone wants to steer the
model is helpful in A, designing a more rigorous survey process, and B, informing the research and product
roadmap. So just tons and tons of stuff happening, but when push comes to shove, what everyone's
really waiting for is the next big products. OpenAI CEO Brad Lightcap told a conference this week
that today's AI systems will seem laughably bad his words in 12 months after they ship GPT5. However,
Gurggerliaros writes, OpenAI was amazing in 2022, 2023 because they shipped a product that
spoke for itself. Jaws dropped by those using it and seeing it for themselves. To see the company
hype up future unreleased products feels like a major shift. If it's that good, why not ship it?
Like before. It seems like we will get answers to that question sooner rather than later-ish,
but for now, still plenty of Open AI news to consume a cycle. That is going to do it for today's
AI Daily Brief. Until next time, peace.
