The AI Daily Brief: Artificial Intelligence News and Analysis - US Gov't Says AI Labs Need to Notify Them Around LLM Training

Episode Date: January 30, 2024

NLW looks at recent comments from Secretary of Commerce Gina Raimondo about an LLM disclosure regime. Plus, Elon Musk's role in AI. ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the mo...st important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI breakdown, we're looking at Elon Musk's role in AI. Before that on the brief, the U.S. government announces that OpenAI and other AI labs have to notify them about large language model training. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube, our Discord, and our newsletter. The U.S. is using the Defense Production Act to require companies to notify them when they're training large. large language models. Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes. One of the big themes for 2024 was always going to be the growing importance of the conversation about artificial intelligence in Washington, D.C. Of course,
Starting point is 00:00:49 last year we got President Biden's executive order, and some of the fruits of that are starting to bear now. U.S. Secretary of Commerce, Gina Raimondo, has announced more details about requirements for companies training large language models, with the government saying that the defense Production Act authorizes them to get notification when those companies are training advanced models. Said Raimondo at an event at Stanford University's Hoover Institute last week, we're using the Defense Production Act to do a survey requiring companies to share with us every time they train a new large language model and share with us the results, the safety data so we can review it. Now, Ramondo tried to minimize how burdensome this sort of notification requirement
Starting point is 00:01:25 is going to be, calling it, quote, just a quick heads up we're building powerful LLMs that might be a major national security risk. Relatedly, Raimondo said, quote, we're beginning the process of requiring U.S. cloud companies to tell us every time a non-U.S. entity uses their cloud to train a large language model. Now, this is something that's gotten increasingly popular and I think is going to be an even bigger discussion going forward, which is monitoring cloud usage as a way to keep tabs on who is potentially training advanced models. Now, in some ways, these might seem simple, but they are likely to be controversial,
Starting point is 00:01:56 so I will keep you posted as discussion about these new policies. kicks up. Meanwhile, over in China, it's not just notification but approval that is required by the government before companies can release their AI models to the public. Last week, Chinese regulators granted approval to a group of 14 new LLMs for public use, which represents the fourth batch of approvals that China has granted. Overall, China has approved more than 40 AI models in the first six months since they began this approval process. It appears that so far, the most used Chinese chatbot is Bidu's ErnieBot, which according to that company's CTO, has more than 100 million users. In terms of what's different about this new batch, it appears that it's not just generalist models,
Starting point is 00:02:37 but a number of them are actually industry-specific LLMs as well. Those include LLMs focused on cybersecurity and video solutions. Now, speaking of LLMs, Chatbot Arena is a crowdsource platform for LLM evaluations that has collected over 200,000 human preference votes to rank LLMs with an ELO rating system. A big jump on that arena leaderboard has come from Bard's new Gemini Pro version, which has jumped above everything except GPT4 Turbo. Currently, the rankings are at 1 GPT4 Turbo, at 2 barred with Gemini Pro, at 3GPt40, at 4GPT4-0613, at 5 Mistral Medium, at 6 Claude 1, and at 7, Claude 2.
Starting point is 00:03:20 Now, as many pointed out, Gemini Pro is not Gemini Ultra, and so for many, this has reignited excitement about what Gemini Ultra might actually be able to do. However, some aren't buying it. Anjini Midha from A16Z writes, Bard is the only model in this list with internet access, which the table should make clearer. Open book versus closed book exams should be graded differently. Moving back to a story from last week, you'll remember me discussing a robo call that New Hampshire Democrats got from what turned out to be an AI synthesized President Biden urging them not to vote in that week's election primary, saying that their vote only mattered in November. While a cybersecurity research firm Pindrop Security was able to identify that it had been
Starting point is 00:04:00 11 Labs platform that had been used to make that voice, and once Pindrop Security shared that information with 11 Labs, they were able to figure out who the user was and suspend that account. Expect this process to play out a lot more during this election season. A new interesting AI feature from Apple, Apple Podcasts is now offering AI-generated transcripts. This is for the upcoming iOS 17.4, which is expected to come to users in March. Writes Apple Insider, the feature is new and still in beta, so there isn't any information on implementation or expected accuracy. Users should expect most, if not all shows, deliver through Apple's podcast network to be
Starting point is 00:04:35 transcribed. For what it's worth, as a person who maintains multiple podcasts through Apple, we were all asked to sign new terms of service that reflected these changes, so yes, indeed, transcripts are coming to Apple Podcasts. Now, speaking of podcasts, one very cool company in the AI podcast and media creation space is Wondercraft. Wondercraft is in some of the same space as companies like 11 Labs with some overlapping features, but is focused on being a really wonderful tool for creating audio content and then now also turning that into video content.
Starting point is 00:05:06 The company has just announced a $3 million round, and I'm excited to see what they do with those resources. Now, as we get used to new tools and new interfaces, obviously a lot of the companies, obviously a lot people have switched their research behavior from Google to perplexity, while now next generation browser, ARC, has decided to make Perplexity the default search engine option. In a blog post on the Perplexity site, they write, by combining ARC's minimalist interface and Perplexity's intelligent search, we've created a streamlined browsing experience, no more sifting through a relevant results or dead-end links, just fast, straightforward access to the information you need.
Starting point is 00:05:42 Speaking of new features, Mid-Journey V6 has been out for a little while now, but they have continue to announce new features, and just recently they've released a set including panning, zooming, and varying part of a region, which lots and lots of the AI artists out there are getting very excited about. Now, lastly today, an interesting one about AI infrastructure. Blackstone is a massive private equity firm that manages basically a trillion dollar empire, and in 2021, the company spent $10 billion taking over data center operator QTS. At the time, the company had about a billion worth of property under development, which has risen now in just a few short years to $15 billion, thanks in large part to increase demand for lease data center capacity because of artificial intelligence.
Starting point is 00:06:27 Blackstone is now reporting that QTS could be one of the best investments in their history. So there you have it, folks, lots and lots of interesting things to start your week. That's going to do it for today's AI breakdown brief. Next up, the main AI breakdown. Today, we are talking Elon and AI. Welcome back to the AI breakdown. Today we are talking about Elon Musk and his relationship to AI, and the catalyst for this is a set of comments and some recent actions
Starting point is 00:06:57 that show the diversity of the places that he inhabits in this industry. And I thought it was worth exploring a little bit more about all of those diverse connections and giving you sort of a primer on where he sits relative to this field. So let's talk first about those comments and those actions, starting with a recent discussion around how much Tesla would be spending on Nvidia chips and how much it takes to just compete in AI in general. In a post on Elon responded to another post saying, the governor is correct that this is a dojo supercomputer, but $500 million, while obviously a large sum of money, is only equivalent to a 10KH100 system
Starting point is 00:07:31 from Nvidia. Tesla will spend more than that on Nvidia hardware this year. The table stakes for being competitive in AI are at least several billion dollars per year at this point. Now, those table stakes comments have been reported widely. And it reflects something that we've talked about a lot, here, that the scale of capital needed to compete in this space is just so different than any technology startup industry we've ever seen. It's forcing different dynamics upon the industry because venture capital alone can't cover the need. It's one of the reasons that big tech companies are so intrinsically involved even at this early stage. Now, when someone asked him if they also plan to buy chips from AMD, he said yes, but that's not really the main point here.
Starting point is 00:08:09 Now, the other interesting event that shows Elon in the context of his ex-leadership is around the viral NSFW Taylor Swift AI deepfakes that were released and distributed on that platform last week. Now, we talked about how the Swifties had risen up last week and tried to push down the illicit results by sharing legitimate photos of Taylor with the same hashtag, as well as a new hashtag, something like Protect Taylor Swift, but Elon and X took it a step farther and just started to block searches for Taylor Swift. So if you searched Taylor Swift over the weekend, you would get a message that said something went wrong, try reloading. And although X didn't make an official statement to reporters, its head of business operations Joe Benarok told BBC News that the action was taken, quote,
Starting point is 00:08:51 with an abundance of caution as we prioritize safety on the issue. On X's safety account, they said, posting non-consensual nudity images is strictly prohibited on X, and we have a zero-tolerance policy towards such content. Now, as we discussed last week, this is definitely putting the issue of deep fakes and non-consensual nudes, even higher on the political agenda than it had been. I basically said last week that if Taylor Swift couldn't get people paying attention to this issue, then nothing could. One of the things that this X story shows, though, is that there are going to be multiple dimensions to this. There is not just the image generation side guardrails, which, of course, the major platforms like ChatchipT can do, but is much harder if you just implement
Starting point is 00:09:30 an open source version of stable diffusion or something like that, but there's also guardrails on the distribution side. It kind of makes it clear that distribution and creation are going to have to work together on these policies for them to be actually effective in any meaningful way. There's also almost inevitably going to be a bit of whack-a-mole on the distribution side, and so response time is going to be paramount as well. Now, another interesting story around Elon and AI has been the question of whether X.comaI is raising external capital or not. This has been going on for a month or so, but the most recent version of it is that on January 26th, the Financial Times reported that X.a.I was seeking to raise $6 billion at a $20 billion
Starting point is 00:10:09 valuation. They reported that Elon had been courting wealthy individuals around the world, including family offices in Hong Kong, as well as sovereign wealth funds in the Middle East, to see if there was appetite for that sort of fundraise. But then again, after this Financial Times piece went up, Musk took to Twitter again to say, XAI is not raising capital and I have had no conversations with anyone in this regard. The previous week he had also denied a report that the company had secured $500 million in commitments towards a $1 billion fundraising goal. Now, one thing that Elon did say about GROC in a response to Jordan Peterson on X was that GROC 1.5 would be coming sometime in the next month.
Starting point is 00:10:47 Now, as the FT points out, one of the things that could be interesting, or challenging really for Elon, is if he really is trying to raise money in Hong Kong and even in some parts of the Middle East, that could run a foul of increasing tensions between those regions in the U.S. when it comes to AI policy. That alone could explain why Elon is keen to tamp down any rumors before any deals are done. Going back to our theme from right at the beginning, when Elon talked about the poker analogy and the table stakes is spending several billion dollars just to compete, the FT writes, the scale of the attempted fundraising reflects the enormous cost required to develop generative AI, which requires huge computing power, vast amounts of data, and cutting-edge chips.
Starting point is 00:11:25 Now, one thing that could defray some of those costs are tax breaks. It's not XAI. but X itself, aka Twitter, was just granted a $10 million tax break for hardware in Georgia. Basically, the company wanted to install new AI hardware at its data center environment in a QTS facility, QTS, by the way, being the company that we just talked about in the brief, and this week, the Development Authority of Fulton County voted 62 in favor of granting a 10-year tax break on that $700 million project. writes datacenter dynamics.com, the social networking firm, formerly known as Twitter, said the computer infrastructure would be used to develop and train
Starting point is 00:12:01 artificial intelligence products for the X platform, including large language models and semantic search. Now, Twitter slash X had threatened to move the data center to Portland, Oregon if the tax break didn't come through. Now, of course, the one other big, interesting Elon story of late has been the way that he's been trying to leverage artificial intelligence to get more of Tesla, responding to another post on X back on January 15th, where someone had basically said that Elon shouldn't need more of an incentive to keep building Tesla out. Elon responded, I'm uncomfortable growing Tesla to be a leader in AI and robotics without having around 25% voting control. Enough to be influential, but not so much that I can't be overturned. Unless that is the
Starting point is 00:12:42 case, I would prefer to build products outside of Tesla. You don't seem to understand that Tesla is not one startup, but it doesn't. Simply look at the delta between what Tesla does and GM. As for stock ownership itself being enough motivation, Fidelity and others own similar stakes to me. why don't they show up for work? I should note that the Tesla board is great. The reason for no new compensation plan is that we are still waiting for a decision in my Delaware compensation case. The trial for that was held in 2022, but a verdict has yet to be made. I put compensation plan in quotes because from my standpoint, this is primarily about ensuring the right amount of voting influence at Tesla. If I have 25%, it means I am influential but can be overridden if twice as many shareholders
Starting point is 00:13:18 vote against me versus for me. At 15% or lower, the four against ratio to override me makes a takeover by dubious interest too easy. I would be fine with a dual-class voting structure to achieve this, but I am told it is impossible to achieve post-IPO in Delaware. So basically what Elon is asserting here is that the stakes are so high around AI and robotics that heat fields that he needs to ensure that it is very difficult, although not impossible, to overrule him when it comes to key decisions. This, of course, brings up a question around concentration of power in AI. There is clearly concentration at the top of these big AI labs. Zuckerberg has always had incredible control at Facebook, now meta, and the whole escapade with Sam Altman being forced out and then invited back at
Starting point is 00:14:04 OpenAI shows just how much control he has there, even though it's such a different type of environment. The question, of course, is it just bluffing? What would it look like for Elon to not grow Tesla to be a leader in AI and robotics? There are a lot of people who have put a lot of money into the idea that Tesla is the single most important real-world AI and robotics company that exists. So does Elon really have the option of not growing it into being that leader? Just one of the many fascinating dynamics around this guy that make him such a pivotal, like it or not, figure in the AI space. And of course, we haven't even really got into his AI safety perspective, where he is purportedly
Starting point is 00:14:42 much more on the concern for humanity side of things than the effective accelerationist side of things, for example. If you haven't read it yet, I highly recommend you go read the New York Times piece back from December called Ego Fear and Money how the AI fuse was lit. It tells the story of Elon and Larry Page's breakup of their friendship really around this AI question and could give you a little bit more insight into a man that you will continue to hear about whether you want to or not. However, that is going to do it for today's AI breakdown. I appreciate you listening or watching as always. And until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.