The AI Daily Brief: Artificial Intelligence News and Analysis - Google and HuggingFace Partner in Boost to Open Source AI

Starting point is 00:00:00 Today on the AI Breakdown, we're looking at a new partnership between Hugging Face and Google. Before that on the brief, the FTC has officially opened up antitrust investigations around the deals between Anthropic, Google, Amazon, and Microsoft, and OpenAI. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube, our newsletter, and our Discord. Welcome back to the AI Breakdown Brief, all the AI headline news you need in around five minutes. Recently, we had shared reporting that suggested that there was a lot of interest among big government agencies investigating whether some of these partnerships in the artificial intelligence space were in violation of antitrust laws.

Starting point is 00:00:45 In fact, it appear that there was a bit of a territorial match, but that finally seems to have been resolved. The FTC is officially launching a probe of how investments between companies like Microsoft and OpenAI and Amazon and Anthropic impact competition. Now, it's important to note that so far, there is no accusation of wrongdoing. This isn't the FTC saying that the deal between Microsoft and OpenAI was an antitrust violation, or the deals between Google and Amazon and Anthropic are antitrust violations. But at the same time, the organization has expressed concern that big tech has an unfair advantage

Starting point is 00:01:20 when it comes to this increasingly important field. According to the Wall Street Journal, the FTC said it would use the study to probe the company's investments in partnerships and how they affect the competitive landscape for AI. FTC Chair Lena Kahn said, our study will shed light on whether investments in partnership pursued by dominant companies risk distorting innovation and undermining fair competition. The piece from the Wall Street Journal also affirms that the FTC had been, in fact, negotiating with the Justice Department around how to divide this work, although it seems that progress is now being made.

Starting point is 00:01:49 Now, apparently the FTC has been a little bit nervous about this ever since the big Microsoft investment in OpenAI. They were apparently frustrated and concerned that they hadn't been told about that investment in advance. Now, part of what makes this complicated is that these partnerships reflect not just big tech being aggressive, but big tech being the only institutions with deep enough pockets to actually fund these projects on the scale that they cost. There aren't venture funds, for example, out there that could match Microsoft's $13 billion investment into Open AI. They just don't

Starting point is 00:02:17 exist. The economics of venture funds don't work at that scale. At the same time, open AI clearly isn't ready to go public, and so there's a very limited set of options. Now, when it comes to the argument against this sort of inquiry, Adam Kovacevich, the chief executive of the Chamber of Progress, said that the FTC's focus on Amazon, Google, and Microsoft, as well as OpenAI and Anthropic, shows that the FTC has, quote, embraced the idea that the AI space is already locked up and views these investment deals as evidence of that. Kovacevich thinks that that's patently absurd, saying every single company is talking about what they will do with AI. Now, in terms of the companies themselves, Google said in a statement that it's not locking the companies

Starting point is 00:02:54 that it invests in to work with competitors. Case in point, both Google and Amazon have invested billions of dollars into Anthropic. Microsoft also commented saying that their partnership, in fact, promotes competition and accelerates innovation. But whatever the case, this investigation is now happening, and it shows just how important AI has gotten in Washington. Speaking of which, Politico today published a piece called the Campaign to Take Down the Biden AI Executive Order. Republicans, right-of-centered think tanks and tech lobbyists are working to defang the industry-facing sections of Biden's AI executive order. Now, in some ways, this isn't just a story about AI. It's a story about the way that executive power is wielded in the U.S. right now,

Starting point is 00:03:32 which is, in of itself, a story about the power realignment between the executive and the Congress in a world where Congress can never get anything done. But the key note is that the Biden administration has argued that AI represents a national security threat, which warrants using the Defense Production Act, which gives the federal government powers over parts of the private sector. This group of Republicans, however, say that's overreach. Now, there are a few ways that folks are trying to combat this. There are some number of lawsuits from private sector think tanks, but political also reports, quote, Republican Senate commerce staff are also reportedly slowing down all AI regulation

Starting point is 00:04:04 passing through their committee. Senate Republican whip John Thune told reporters he was working with Ted Cruz and other Republican lawmakers to curb the DPA's ability to create new AI testing and training protocols for the tech industry. AI may not be an election issue yet, but it is definitely political. Moving back over to the private sector for a moment, the Financial Times is reporting that Elon Musk and his ex.a.i are seeking to raise $6 billion at a $20 billion valuation. Says the FT, Elon has courted wealthy individuals and investors around the world in recent weeks, according to multiple

Starting point is 00:04:34 people familiar with the matter. These talks have included family offices in Hong Kong. Another person said that Elon had also targeted sovereign wealth funds in the Middle East, but some said that at this stage, Elon is testing to see how much investor appetite there is for numbers at this scale. back to the reality of the costs of operating this market that we were discussing earlier, the FT writes, the scale of the attempted fundraising reflects the enormous costs required to develop generative AI, which requires huge computing power, vast amounts of data and cutting-edge chips. So whether Elon ends up fundraising right now or not, it seems very clear that he is set on having

Starting point is 00:05:07 a war chest to actually have X.a.i and GROC compete. Anyways, friends, that is going to do it for today's AI breakdown brief. Next up, the main AI breakdown. Before we get to the main part of the episode, one quick note about the AI education beta that you have been hearing about all week. This is the learning program we've been running for a couple months now that's all about tutorials, challenges, basically getting you learning AI by actually using AI faster, more effectively, more efficiently, and hopefully in a way that's more fun too. If you sign up, you'll get access to a library of 60 plus tutorials, case studies, and challenges,

Starting point is 00:05:42 new lessons that drop every weekday, and a passionate community of like-minded learners. We cover everything from LLMs and prompt engineering to image generators, voice synthesizers, AI no-cold tools, as well as professional applications like presentation generation website building and more. Due to a bunch of people saying that the window for enrollment was too short, I'm actually going to extend it to Sunday night at 1159 East Coast time. So you have a couple more days to register. Go to bit.ly slash AI Beta to learn more and sign up. I'd love to see you there, but now let's get back to the show. Welcome back to the AI breakdown.

Starting point is 00:06:18 Today we got some news that was very exciting to the open source advocates within the AI community. Google and HuggingFace announced a major new partnership, specifically between HuggingFace and Google Cloud. Philip Schmidt from HuggingFace tweets, we will collaborate with Google to foster open AI innovation across open science, open source, cloud, and hardware. Why this matters? keeping AI open, accessible, and efficient.

Starting point is 00:06:45 So before we get too deep into this, let's not assume that everyone knows exactly what Hugging Face is, and by way of summing up, let's turn to our favorite research tool, perplexity. I ask Perplexity, what is Hugging Face, and it describes it as a French-American company that provides a machine learning and data science platform and community. It is designed to help users build, train, and deploy machine learning models, offering infrastructure for running AI applications. The platform is known for its open-source Transformers Library, which is widely used for NLP tasks. HuggingFace is often referred to as the GitHub of machine learning because it allows

Starting point is 00:07:14 developers to share and collaborate on machine learning models and datasets. So this is a company that's been around since 2016 and has become a hub for, as it said, AI researchers and developers to share what they are working on, to find open source models, to run them. Often you will see, for example, when I'm talking about some new tool, that the demo that I'm running and sharing with you guys is hosted on Hugging Face. Now, one of the other important things to know about Hugging Face, is that it has positioned itself as one of the pillars of the open source AI movement. Once again, the way the perplexity sums this up is that Hugging Face's commitment to open source collaboration has catalyzed innovation and natural language processing,

Starting point is 00:07:52 allowing for communal growth and development of the technology. HuggingFace has positioned itself as a hub where data scientists, researchers, and ML engineers converge to exchange ideas seek support and contribute to open source initiatives. Now, a couple of examples include their Transformers Library, as well as them releasing open source serving solutions for LLMs and an open source chat UI for LLMs that are tools that tons and tons of companies out there use. So, key takeaway here is that HuggingFace has a long-term demonstrated commitment to the open source side of AI. So let's talk about what this actual partnership includes. On the Google Cloud blog, they write, Google Cloud and HuggingFace announced

Starting point is 00:08:27 strategic partnership to accelerate generative AI and ML development. Developers will be able to train, tune, and serve open models quickly and cost-effectively on Google Cloud. So basically, the partnership at Core is one that will, quote, allow developers to utilize Google Cloud's infrastructure for all HuggingFace services. It will enable training and serving of HuggingFace models on Google Cloud. Now, it is explicitly about a commitment to open AI development. The partnership they write advances HuggingFace's mission to democratize AI and further's Google Cloud support for open source AI ecosystem development.

Starting point is 00:08:59 With this partnership, Google Cloud becomes a strategic, cloud partner for HuggingFace and a preferred destination for HuggingFace training and inference workloads. So basically in some ways, Google Cloud is trying to become an infrastructure layer for open source AI via this partnership with HuggingFace. Again, they write developers will be able to easily utilize Google Cloud's AI optimized infrastructure, including compute, tensor processing units, and GPUs to train and serve open models and build new generative AI applications. So some of the specifics include connecting HuggingFace models with Vertex AI, which they describe as Google Cloud's purpose-built end-to-end MLOP service. They want to give more open-source developers

Starting point is 00:09:36 access to cloud TPU version 5E. But overall, it feels to me like Google is really trying to establish its place in this overall open source space. Now, you may remember last year, one of the big interesting and revelatory documents was a memo leaked from inside Google by someone who worked on AI inside that company, who argued that open AI wasn't Google's true competition in the AI space, but instead it was the open source movement. They talked about how, ironically, because of the leak around Meta's Lama 1 model's weights, meta, Mark Zuckerberg's company, of all things, had found itself at the very center of a radical shift in open source AI development.

Starting point is 00:10:15 The Googler at the time noted that open source developers were making surprisingly quick progress on problems that Google had identified as significant for the space and thought it was going to take a lot longer to solve. And why this is relevant, of course, is that as all of these big tech companies try to position themselves in the larger AI game, their relationship with open source, or contra it in the case of, for example, OpenAI in Microsoft, is one of the key pieces of positioning. Meta has played this card very well and has become a central piece of that ecosystem, although even meta has found itself recently competing more aggressively because of the rise

Starting point is 00:10:49 of Mistral, which has created sort of a renegated upstart competition with models that developers really love. So coming back to the Google Cloud and Hugging Face partnership, again, And part of it reads as a positioning play for them. Now, Jeff Boudier from Huggingface, who does product and growth, joined Brad Smith at Yahoo Finance for an interview and talked a lot more about what the partnership was and what it wasn't. Now, the first question that Smith asked was, is this Google Hugging Face deal a can or analogous to the Microsoft Open AI deal?

Starting point is 00:11:18 Jeff says, no, I would say it's quite different. And it's really an opportunity to make it easy for Google Cloud customers to build their own AI with open models. Today we host over a million models, data sets, applications for text, audio, video, time series, biology, you name it. And all of these models will be very easy to access from Google Cloud. We'll be very easy to use with the latest hardware available on Google Cloud. So already we have an interesting contrast, whereas Google Cloud's announcement about this was all about how they were supporting open source developers. In other words,

Starting point is 00:11:46 developers that were already in that space. Jeff and Huggingface are basically saying that this alliance actually allows more existing Google Cloud customers to become part of that open developer community. Now, in addition to details of the partnership, Smith also asked about some of the underlying principles. He said, one of the interesting comments that we heard from the CEO of Salesforce Mark Beniof during our conversation with him at Davos at the World Economic Forum was talking about how AI should be a human right. He also did address some of the concerns. What concerns still linger from your perspective around generative AI that the industry still needs to solve for? Jeff responds, The open source community has been tackling all of the main issues that have been

Starting point is 00:12:22 identified by the AI research community and working through those issues openly, transparently through open source. And I think that's a huge part of the solution. And what we want at Hugging Face is also to make those open models and open source AI easily accessible to companies so that they control their own AI destiny so that they can host themselves, the models, and they can protect their data as well. Now, of course, this gets into a much larger conversation around open source and its role with AI. On the one hand, AI safety advocates are hugely concerned about open access to advanced AI models. The argument tends to follow some version of, do we really want terrorists, for example, to have access to technology that can fabricate new types of weapons, especially

Starting point is 00:13:00 biological or chemical weapons? And indeed, this isn't just some fringe view. Obviously, the folks at Open AI tend to have it as well, and have said explicitly that they believe that their previous take on open source was incorrect. Another way to put that is that there is a set of people who believe that the risks of making super advanced AI freely available in an open source way, are greater than the risks that come with concentration of advanced AI in the hands of a small number of companies. There is, of course, then the opposite view, that the biggest risk is not rogue elements getting access to advanced AI, but advanced AI being controlled by just a small number of people. Ironically, given his 20-year history, Zuckerberg has become part of the

Starting point is 00:13:39 headliners of that second perspective. Most recently, he made news when he announced that meta's new goal was not just to create advanced AI and release it open source, but to actually create artificial general intelligence that could be released open source. Now, he didn't go so far as to commit 100% that if they achieved AGI, they would release it open source, but only because he said, who knows what we'll discover, and we need to leave ourselves some room. I don't want to be trapped by things I say now, but it's very clear that this is the direction they're moving. Jan Lecun, the chief AI scientist at Meta, also is one of the loudest and most vocal advocates for this. After Zuckerberg's announcement, Aravan Shrinivas from Perflexity wrote,

Starting point is 00:14:16 Open Source AGI is an amazing vision. You are building a very powerful technology and actually aligning it to what makes sense for the world. More people have a say in what makes sense and doesn't. Zuck and Jan Lakuna are leading the revolution, and it's going to have an incredible short-term impact for the AI ecosystem as a whole. Meta gets a lot of crap, but there's literally no other company doing this today. But then again, this pronouncement brought articles like this one from The Guardian that reads,

Starting point is 00:14:38 Very Scary, Mark Zuckerberg's pledge to build advanced AI alarms experts. Met a CEO accused of being irresponsible by considering making tools on par with human intelligence, open source. Now, one of the issues that people have, basically with all of these debates, are how many of these decisions are being made in the boardrooms of extremely powerful companies rather than out in the open. Dr. Andrew Rogoyski, for example, a director at the Institute for People Centered AI at the University of Surrey said, there are deep and complex arguments about the merits of open source and current AI models. Pushing that into the realm of AGI could be world-saving or catastrophic. These decisions need to be taken by international consensus, not in the boardroom of a tech giant. Now, one thing I will note along those lines is that I watch this very closely. And people like Sam Altman, who are much more on the open

Starting point is 00:15:22 source could be very bad for us side of things, have also said things like this, that there needs to be a broader set of stakeholders involved in these types of decisions. However, so far that group hasn't really advanced what that could meaningfully look like. And in the absence of some vision to rally around, it defaults back to the companies that are building it. Now, if you're interested in staying at the edge of this open source AI debate, let me give you two Twitter account recommendations. First, Jan Lacoon, who I just mentioned, at Y-L-E-C-U-N on Twitter slash X. He is always commenting on these issues, always resharing every post that relates to them, and so you will definitely get the pro-open source side from him. On the flip side is the at AI safety memes

Starting point is 00:16:02 account, which is literally at AI safety memes. The accounts bio reads techno-optimist, but AGI is not like other technologies. This account tends to do a lot of curation of these discussions, and while it comes from a very discreet perspective, it does so with extremely good faith. So if you're looking to really understand that side and that set of concerns, you can't do much better than starting there. You'll also see a lot of other accounts that you might be able to follow. As always, my recommendation is to listen to everyone and think deeply about these issues because they're going to impact us all. For now, however, that is the story for today. Hugging Face and Google have teamed up. Google is planting a bigger flag in the open source AI space,

Starting point is 00:16:40 and this whole world continues to be interesting. Thanks for listening or watching as always, and until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - Google and HuggingFace Partner in Boost to Open Source AI

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.