The AI Daily Brief: Artificial Intelligence News and Analysis - The gpt2 Chatbot is Back as OpenAI Talks Data

Episode Date: May 9, 2024

Today on The AI Daily Brief, NLW looks at expanded restrictions the White House is considering to deny China access to advanced models. Plus, an absolute slew of news around OpenAI. ** Check out the ...hit podcast from HBS Managing the Future of Work https://www.hbs.edu/managing-the-future-of-work/podcast/Pages/default.aspx Join Superintelligent at https://besuper.ai/ -- Practical, useful, hands on AI education through tutorials and step-by-step how-tos. ** ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Daily Brief, more confirmation that search is coming or even already here for OpenAI, but maybe a delay in the announcement. Before that in our headlines, the White House might be moving into a next phase of trying to deny China access to advanced AI models. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. Check out the Discord link in our show notes to join the conversation. Hello, AI friends, one quick note before we get into today's episode. You may have seen that we just announced that the breakings. the Crypto Podcast, as well as the Bitcoin Builders show that I started last year, and a bunch of related media properties are being acquired and moving over to Blockworks.
Starting point is 00:00:42 Blockworks is an excellent crypto media company. I'm really excited for the breakdown and the other crypto content to be over there. But the AI show is staying put right here. As part of the transition, we are shifting the name to the AI Daily Brief instead of the AI breakdown, just to remove any brand confusion. The format will remain the same. The cadence will remain the same. Basically, if you like the AI breakdown, the AI Daily Brief is pretty much exactly the same thing.
Starting point is 00:01:06 Anyways, just wanted to share why you had been hearing a new name, but there is a lot to discuss today, so let's get into it. Welcome back to the AI headlines on the AI Daily Brief. It's all the AI headline news you need in around five minutes. We kick off today with yet further efforts from the U.S. government to limit China's access to advanced AI. For the last couple years, AI has been a centerpiece of the geopolitical tension between China's and the USA. Since the end of 2022, the Biden administration has increasingly tightened export restrictions
Starting point is 00:01:36 on advanced AI chips, while at the same time trying to create incentives to bring chip manufacturing back to the U.S. Now it appears that there is going to be a new front in that effort, with Reuters reporting that the White House has, quote, preliminary plans to place guardrails around the most advanced AI models like Chat Chhabit. Continuing Reuters writes, the Commerce Department is considering a new regulatory push to restrict the export of proprietary or closed source AI models. Any action they write would complement a series of measures put in place over the last two years to block the export of sophisticated AI chips to China in an effort to slow Beijing's development of the cutting-edge technology for military purposes. According to Reuters' sources, any new export controls would
Starting point is 00:02:13 likely target Russia, China, North Korea, and Iran. Back in February, Microsoft said it in a report that it had tracked hacking groups affiliated with the governments of China and North Korea as well as Russian military intelligence in trying to use their software to improve their hacking. In terms of determining which models would be advanced enough to be under these controls, it seems like it might be based on the computing power it took to train a model. Reuters again writes, The sources said the U.S. may turn to a threshold contained in an AI executive order issued last October that is based on the amount of computing power it takes to train a model.
Starting point is 00:02:43 When that level is reached, a developer must report its AI model development plans and provide test results to the Commerce Department. That computing power threshold could become the basis for determining what AI models would be subject to export restrictions. If used, it would likely only restrict the export of models that have yet to be released since none are thought to have reached that threshold yet. According to the reporting, the agency is, quote, far from finalizing a rule proposal, but just the fact that they're considering it, Reuters argues, shows how seriously the
Starting point is 00:03:07 U.S. government is taking AI as a geopolitical concern. Now, of course, one of the challenges, if anything like this were to be imposed, is how much open source just complicates things, especially given how close to state-of-the-art open source is getting, it could be a real challenge. To reiterate, this is currently just reporting from unnamed sources. When Reuters reached out, the Commerce Department declined to comment, while the Russian embassy in Washington did not respond. The Chinese embassy, though, you know they were going to get a comment in, describing it
Starting point is 00:03:33 as a, quote, typical act of economic coercion and unilateral bullying which China firmly opposes. Meanwhile, in terms of things that have actually happened on this area, the U.S. government has revoked licenses that allowed Qualcomm and Intel to supply chips to Huawei. The information writes, U.S. lawmakers and officials have been alarmed by Huawei's ability to make advanced chips for smartphones despite years of Western sanctions. Republican lawmakers concerned about national security risks had been urging the U.S. government to cancel licenses that allow U.S. companies, including Qualcomm and Intel, to continue to sell chips to the Chinese company. In many ways, this is just a continuation of policies
Starting point is 00:04:05 that were already started, but shows that in general, this is not just idle talk. Another part of the world that is impacted by the China-U.S. battle around AI is, of course, the Middle East. For some time, the Middle East has been positioning itself as a literally middle player, maintaining relationships with both China and the U.S. However, the U.S. has been increasingly uncomfortable about the Middle East relationship with China, seeing it as a way for China to getting around policies like the chip export controls. Recently, UAE company G42 explicitly started to move away from its relationships with China and took on a $1.5 billion minority investment from Microsoft, which was in part set up by the U.S. Commerce Department. Now, Time magazine is reporting that the head of Saudi Arabia's
Starting point is 00:04:44 new investment fund for AI, said that if the U.S. asked them to divest from China, they would. Amit Midha said, so far the requests have been to keep manufacturing and supply chains completely separate, but if the partnerships with China would become a problem for the U.S., we will divest. In another interview, Midha said, we are seeking trusted secure partnerships in the U.S. The U.S. is the number one partner for us and the number one market for AI, chips, and semiconductor industry. What's more, the U.S. government is not just prohibiting China from using AI or trying to, but also figuring out how to use AI itself. For example, the of Homeland Security is now piloting using AI to train officers who review applicants for refugee
Starting point is 00:05:19 status. Basically, the central idea of this pilot is that it can be really difficult for immigration officers to do interviews with refugees who have experienced significant trauma. Said Secretary Alejandro Mayorkas, refugee applicants given the trauma that they have endured, are reticent to be forthcoming in describing that trauma. So then in the pilot, DHS is training the AI to act like refugees so that their officers can practice interviewing them. Meanwhile, while over the last few weeks, Microsoft has made a number of big investment announcements in Southeast Asia, they're now bringing it back home with an over $3 billion investment to build AI in Wisconsin. Today, President Biden will speak with Microsoft President Brad Smith in Mount Pleasant, Wisconsin to announce a $3.3 billion investment in a new
Starting point is 00:05:58 data center there. In addition to the data center, Microsoft said that they're also investing in a new AI lab at the University of Wisconsin-Milwaukee to train employees to use AI. Said Brad Smith in an interview, we have a huge responsibility to help ensure this technology serves people. Part of that is ensuring that it works safely and remains under human control. But another part of it is really supporting and aiding the transition of the economy. So lots going on in the world of AI and geopolitics, but for now, that is going to do it for the headline section of the AI Daily Brief. Next up, the main episode where we talk all about what's going on with Open AI, or really what isn't going on with Open AI. As a listener of this show, I have a strong feeling you like to stay
Starting point is 00:06:37 up to date on all things artificial intelligence, including its impact on the workforce, which is why I highly recommend checking out managing the future of work, the chart-topping business podcast from Harvard Business School. HBS professors Bill Kerr and Joe Fuller talked to business leaders, technologists, and policymakers grappling with the forces like AI, globalization, and demographic shifts that are reshaping the nature of work. Recent guests include IBM's CHRO, Nicol-Lamuro, on how Big Blue is adopting AI, Morningstar CEO, Kunal Kapoor, on how AI can raise the investment IQ.
Starting point is 00:07:07 Microsoft Corporate Vice President Jared Sparro on how the tech giant is experimenting its way from AI assistance to autonomous agents and many other prominent movers in business and the workforce ecosystem. So don't miss out. Follow managing the future of work on Apple Podcasts, Spotify, or wherever you're listening now. Hello, AI friends today. I want to tell you about our platform super intelligent. In short, it's a platform for useful, practical, immediately applicable AI learning. We have nearly 400 video tutorials, each of which comes with step-by-step how-toes, and the idea is to get you actually using these AI tools we talk about every day in a matter of minutes to actually solve problems, create new opportunities, and just do really
Starting point is 00:07:48 cool things. To learn more and subscribe, go to be super.a.I. And if you do decide to subscribe, use code podcast for 50% off your first month. Again, that's besupor.aI. Welcome back to the AI Daily Brief. Boy, is there a lot of open AI news today. None of it's huge or transformational. It's just really really. diverse. As you'll see, there is enough that it will easily fill an episode. First up, we got some more confirmation that OpenAI does appear to be getting ready to launch a search product that will rival Google and other AI native companies like perplexity. According to Bloomberg, quote, the feature would allow users to ask chat GPT a question and receive answers that use details from
Starting point is 00:08:28 the web with citations to sources such as Wikipedia entries and blog posts. One version of the product also uses images alongside written responses to questions when they're relevant. If a user asks chat GPD to change a doorknob, for instance, the results might include a diagram to illustrate the task. Bloomberg also gave a shout out to the Twitter sleuth who found search.chatschapet.com last week, but we're still a little bit in the realm of unknown sources. The other open AI related thing from last week was, of course, that mysterious GPT2 chatbot that appeared on an LLM ranking site, and which seemed to many to be more performant and advanced than anything else out there.
Starting point is 00:09:01 After it got all of that attention last week, it was taken off of LIMSIS, but then a couple days ago it came back. Once again, people are really impressed. This time it was called I'm a good GPT2 chatbot, and much of the response I saw was people like Pietro Sherano who wrote, I'm a good GPT2 chatbot is so good that it created a code interpreter that uses Claude Opus for me. Excuse me as I faint in ontological shock. Lior at Alpha Signal AI tweeted a network error due to high traffic that seemed to point to OpenAI as confirmation that it was behind the chatbot. There was also Sam Altman, who on May 5th tweeted, I'm a good GPT2 chatbot. Siki Chen from Runway suggested that this chatbot also revealed that chat GPT had already
Starting point is 00:09:40 stealth launched the search. Siki writes, did chat GBTGPT already stealth launched search? ChatGPT used to reject prompts asking for current events. But now I get this without even a little browsing with Bing message. He shared an image where he had asked, what's the latest news on this I'm a good GBT2 chatbot model? ChatGPt responds, the I'm a good GPT2 chatbot model that has recently surfaced in the AI community is wrapped in a bit of mystery.
Starting point is 00:10:03 This model appearing on the LIMSys chatbot arena generated considerable interest due to its sudden appearance and impressive performance, sparking speculation about its origins and capabilities. There were initial reports that it performed better than GBT4, but concrete details about its design or purpose were not disclosed and it was subsequently taken offline. What was interesting about this is that it also cited its sources, pointed to Daily AI and some other sources as where it had drawn this information. And again, this was not technically supposed to be the Browse with Bing version. All this is to say, there is even more evidence now.
Starting point is 00:10:33 that search with chat GPT is coming, but it seems also like we might have to wait just a little bit longer to find out more about it. The information reported earlier this week that OpenAI was considering postponing an event that had been planned for this Thursday, where they had been intending to show off a set of new products, including presumably this search product. The information says the spokesperson from OpenAI declined to elaborate on the reasons for the change. If this alone were the OpenAI news slate, it would be a lot, but we are not even close to done. Yesterday, OpenAI announced that they were working on new tools to detect AI-created images. The blog post was called Understanding the Source of What We See and Here
Starting point is 00:11:07 Online. There were a couple things that were announced as part of this post. One was that OpenAI was joining the steering committee of something called C2PA, the Coalition for Content Providence and Authenticity. Earlier this year, they wrote, we began adding C2PA metadata to all images created and edited by Dolly 3, our latest image model, in ChatchipT and the OpenAI API API. We will be integrating C2PA metadata for SORA, our video generation model, when the model is launch broadly as well. They also announced that they were working on new technology in this area, including what they describe as temper-resistant watermarking, i.e. marking digital content like audio with an invisible signal that aims to be hard to remove, as well as detection classifiers
Starting point is 00:11:45 or tools that use artificial intelligence to assess the likelihood that content originated from generative models. As part of the announcement then, they shared that they were opening applications for access to OpenAI's image detection classifier to a first group of testers. They write that in there were internal tests, the classifier correctly identified around 98% of Dolly 3 images, and less than 0.5% of non-AI generated images were incorrectly tagged as being from Dolly 3. They write the classifier handles common modifications like compression, cropping, and saturation changes with minimal impact on its performance, but other types of augmentations, such as adjusting the hue or adding moderate amounts of Gaussian noise, can make a significant
Starting point is 00:12:20 difference. Lastly, on this front, Microsoft and OpenAI also announced that they were launching a $2 million societal resilience fund, which is basically all about combating deepfakes. Then there was another blog post from yesterday, this time called Our Approach to Data and AI. The big TLDR of this comes in the section called We Respect the Choices of Creators and Content Owners on AI. They write, decades ago, the Robots.TXT standard was introduced and voluntarily adopted by the internet ecosystem for web publishers to indicate what portions of websites web crawlers could access. Last summer, OpenAI pioneered the use of web crawler permissions for AI, enabling web publishers to express their preferences about the use of their content in
Starting point is 00:12:56 AI. We take these signals into account each time we train a new model. That said, we understand that these are incomplete solutions, as many creators do not control websites where their content may appear, and content is often quoted, reviewed, remixed, reposted, and used its inspiration across multiple domains. They conclude, we need an efficient, scalable solution for content owners to express their preferences about the use of their content and AI systems. OpenAI's answer is something they're calling media manager. It's a tool that will, quote, enable creators and content owners to tell us what they own and specify how they want their works to be included or excluded from machine learning research and training. They didn't explain exactly how, but that's the idea. There was a lot of commentary
Starting point is 00:13:32 here. Some people were pretty skeptical, but Brian Merchant noted that, quote, the mere fact that OpenAI feels it needs to release a statement about training data principles and to announce a program to let creators opt out of having their works included in training data shows how powerful the movement against it has become. And then finally, we get to today where OpenAI announced something that they called their model spec. Sam Altman writes, we are introducing the model specifies how our model should behave. We will listen, debate, and adapt this over time, but I think it will be very useful to be clear when something is a bug versus a decision. We want to give users lots of control of AI with some hard boundaries that
Starting point is 00:14:06 society eventually agrees on. This is another step. While this is definitely very different than Anthropics' constitutional approach to AI, in the sense that this is a public-facing document, not a document that specifically is meant to inform AI behavior, it still has some of the same ideas of articulating what they're trying to have their AI models be. So the approach includes, one, objectives, broad general principles that provide a directional sense of the desired behavior, such as assisting the developer and end user, benefiting humanity, and reflecting well on open AI. There are tools, instructions that address complexity and help ensure safety and legality. These include things like follow the chain of command, comply with applicable laws,
Starting point is 00:14:42 don't provide information hazards, respect creators and their rights, protect people's privacy, and don't respond with not safe for work content. Lastly, there are default behaviors. Or as They describe guidelines that are consistent with objectives and rules, providing a template for handling conflicts and demonstrating how to prioritize and balance objectives. These include things like assuming best intentions from the user or developer, asking clarifying questions when necessary, being as helpful as possible without overstepping, assuming an objective point of view, encouraging fairness and kindness and discouraging hate, and more. Joanne Zhang, who worked on this product writes, I'm personally excited about this concept
Starting point is 00:15:13 of a model spec for three reasons. One, there will be more clarity on whether something is a policy or an RLHF bug. Two, principles are easier to debate and get feedback on versus hyper-specific screenshots or abstract feel-good statements. The way that she describes this is, it's easy for most people to agree on model should be something, but the more important questions lie deeper in thorny scenarios. For example, how should the model engage with someone who claims the earth is flat? Third, she writes, model spec feedback will help us steer our efforts and steerability.
Starting point is 00:15:40 Unexplicit non-goal she writes for the model spec is to reach consensus on a one-size-fits-all model. That will never happen. We want to give users and developers as much control as possible while staying within hard boundaries that people understand. Hearing feedback on where and how everyone wants to steer the model is helpful in A, designing a more rigorous survey process, and B, informing the research and product roadmap. So just tons and tons of stuff happening, but when push comes to shove, what everyone's really waiting for is the next big products. OpenAI CEO Brad Lightcap told a conference this week
Starting point is 00:16:10 that today's AI systems will seem laughably bad his words in 12 months after they ship GPT5. However, Gurggerliaros writes, OpenAI was amazing in 2022, 2023 because they shipped a product that spoke for itself. Jaws dropped by those using it and seeing it for themselves. To see the company hype up future unreleased products feels like a major shift. If it's that good, why not ship it? Like before. It seems like we will get answers to that question sooner rather than later-ish, but for now, still plenty of Open AI news to consume a cycle. That is going to do it for today's AI Daily Brief. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.