The AI Daily Brief: Artificial Intelligence News and Analysis - Meta, IBM Form AI Alliance to Counter OpenAI with More Open AI

Episode Date: December 5, 2023

OpenAI is not open enough; at least not according to the new AI Alliance - a group led by Meta, IBM, AMD and others that is trying to ensure AI stays more open and not controlled by just a handful of ...companies. Before that on the Brief, Getty Images is going to court against Stability AI while forming partnerships with Runway, and Google delays Gemini again. Today's Sponsors: Listen to the chart-topping podcast 'web3 with a16z crypto' wherever you get your podcasts or here: https://link.chtbl.com/xz5kFVEK?sid=AIBreakdown  ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Breakdown, we're looking at the new AI Alliance led by META and IBM, and that stands as a direct contrast to OpenAI. Before that on the brief, Google delays Gemini again, but maybe doing virtual previews as soon as this week. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube, our Discord, and our newsletter. Welcome back to the AI Breakdown Brief. All the AI headline news you need is. in around five minutes. Another day, another delay of Google Gemini. For the last few weeks, the only news we've gotten about Google Gemini are, in fact, delays. The latest comes from the information Google preps public preview of Gemini AI after postponing in-person launch events.
Starting point is 00:00:48 Now, this is actually a two-part story. The first part was about Google scrapping a series of in-person events that were meant to launch Gemini, and then the second part, which was updated just yesterday, was the report from inside the company that instead they were planning a very, virtual preview that could come as soon as this week. All right, so let's go to the first part of the story first. The TLDR is that Google had been planning to debut Gemini, which has just an absolute ton of pressure around it. This can't just be another interesting, even highly capable AI model.
Starting point is 00:01:17 It has to, at the very least, meet GPT4 capabilities, and frankly probably needs to exceed them at least in some fairly meaningful ways. Writes the information, Google CEO Sundar Pichai recently decided to scrap a series of of Gemini events originally scheduled for next week in California, New York, and Washington, after the company found the AI didn't reliably handle some non-English queries. The planned events, which hadn't been publicized, would have marked Google's most important product launch of the year, after it strained its computing resources and merged large teams in an urgent pursuit of Open AI.
Starting point is 00:01:48 The publication writes, it's rare for Google to launch a major product between Thanksgiving and the end of the year, but Google intended to make an exception for Gemini because it's arguably the company's most important initiative in a decade. The Gemini event in Washington was intended to showcase the technology to policymakers and politicians, which have increasingly discussed potential regulations involving AI. Bichai and other executives have been concerned that OpenAI's chat GPT has become a household name and that Microsoft's co-pilot features for productivity software, including for software developers, are turning into a significant business too. Now, here was the most interesting paragraph to me. A key challenge for the Gemini team is
Starting point is 00:02:21 making sure the primary model is as good as or better than GPT4. It has met that standard in some respect, said one of the people familiar with it, but the company is still making a improvements because it wants the technology to work well globally in numerous languages. Now, to me, the whole thrust of this is very clearly how much pressure Google feels on this release. As of the time of recording, we haven't got any more information about these virtual previews, but apparently Google's team has been for weeks giving private demonstrations of Gemini to business partners, even though they say that people won't actually be able to get their hands on it until January. Anyways, yet another interesting moment in the saga that is Google Gemini,
Starting point is 00:02:56 and I will certainly be keeping a lookout for any more information about these virtual previews should it come out. Now, one interesting aside, given how generally Google's other AI model that is available in Bard has been treated, Neil Patel from NPD shared some really interesting information today. He wrote, do people prefer content written by ChatchipT or Bard? We had Chatchipt create 1,000 articles on various topics. We then gave Bard the same instructions on the same topics. In total, we created 2,000 pieces of content with AI. 1,000 from Chatchipt, 1,000 from Bard. We then paid people to read the articles we didn't tell them that they were written by AI based on their interests.
Starting point is 00:03:32 In total, 249 articles were read from ChatGBT, GBT, and the 249 counterpart versions from Bard. We then asked which one they preferred. Interestingly, Bard absolutely crushed ChatGBTGPT in this one study. 188 of the articles written by Bard were preferred to just 61 written by ChatGBT.T. In other words, 75.51% of the time people prefer the Bard articles. Now, the one thing that people are asking is whether this was GPT 3.5 or whether it was GPT4, and obviously without that, it's incomplete information, but still an interesting piece of counterintuitive data.
Starting point is 00:04:05 Next up on the brief two interesting stories surrounding Getty images. The first is that their lawsuit against Stability AI is heading to trial in the United Kingdom. Basically, a UK court has ruled that there are merits to Getty's arguments that its copyrighted material was used to train AI models, and in spite of Stability's argument that the case should not be heard in the UK, because, as the Verge put it, it said no one involved in the training or development of stable diffusion was based in the UK, a UK justice Joanna Smith didn't agree. Instead, she said that evidence from the company and its CEO, quote, raised the specter that evidence is either inaccurate or incomplete, at the very least, suggest a conflict of evidence. So, on the one hand, we have Getty images suing a company
Starting point is 00:04:43 and Stability AI that is using their images they claim in a way that violates copyright to train AI models. But then we also have the announcement of a partnership between Getty Images and Runway, which is, of course, a text-to-video leader. A runway blog post reads, Runway is partnering with Getty Images to launch a new video model for enterprise customers, addressing companies' ever-growing need for high-quality customized content. This model will combine the power of runway with Getty Images' world-class, fully licensed creative content library, providing a new way to bring ideas and stories to life through video in enterprise-ready and safe ways.
Starting point is 00:05:14 Now, basically what's going on here is that Runway is trying to get out ahead of more or business use cases of their text-to-video tools by partnering with the content creators so that they can get around these potential legal issues. Again, from their blog post, they write, this new runway Getty Images model, RGM, will provide a baseline model upon which companies can build their own custom models
Starting point is 00:05:33 for the generation of video content. Runway enterprise customers will be able to fine-tune RGM using their own proprietary data sets. This enables companies in all fields, Hollywood Studios, advertising, media broadcasting, and more to enhance their creative capabilities and provide new channels for video creation by powering entirely new content workflows
Starting point is 00:05:49 and making it easy to craft delightful experiences tailored to enterprises styles and brand identities and to their unique audiences. Now, get outside of all the announcement language here, and what this is really saying, is that a lot of different companies and a lot of different industries are going to use these text-to-video tools,
Starting point is 00:06:05 and here's a version where you can be confident that you're not going to get sued for copyright violations. Now, of course, what's interesting to me is that we're seeing Getty images play both sides of the AI space. On the one hand, they're suing people that they say are violating their copyright, and on the other hand, they're partnering with companies to provide copyright-protected versions of those models. It's frankly a pretty smart strategy from a business perspective and kind of gives them two bites at the
Starting point is 00:06:25 apple if court decisions don't go their way. That's not to say that the questions that are playing out in court aren't significant and very, very complex. Billboard recently wrote a piece called As AI grows, Artisan labels consider who owns a voice. Some record deals they write may cover voice rights that could be used for artificial intelligence. Artists are negotiating for clarity and exceptions. Rights are an extremely complicated thing. In music, for example, there is already an established difference between rights that relate to the recordings of songs and rights that relate to the underlying melodies and words. This is the reason that Taylor Swift can re-record her own music. As per the terms of her old contracts, while she doesn't own the masters to her recordings,
Starting point is 00:07:04 she does own the publishing rights. So basically, she granted herself the right to cover her own songs, producing a version where she now owns not only the publishing rights, but also the actual recording rights as well. Effectively, this Billboard article is about a new vector for rights, which is, of course, who owns voices and who can use them in what ways when it comes to AI. I think this is a question that is going to get immensely more complicated before it gets solved. Over in the world of big AI fundraising, Fidelity has just led a new investment round in VAST data. Vast is an AI software and storage startup that has been backed by other companies, including Nvidia, and the latest round values the company at more than $9 billion, that's up from
Starting point is 00:07:40 $3.7 billion back in 2021. From the information, Vast data sells software to companies like Pixar, Verizon, and high-frequency traders to store data for AI applications. The 600-person company is one of a number of startups building digital or physical infrastructure for AI-focused apps that has gotten a lift from the rise of chat GPT. Vast expects to pull in more than 100 million in revenue this year, more than triple last year's sales, one of the people said. Speaking of big fundraisers, much buzzed about Mistral, whose open-source 7B model LLM has gotten a ton of buzz among AI developers, has raised a new 450 million euro or 487 million dollar round that values the company around $2 billion. The investment is being led by Andrews and Horowitz, but also includes participation from Nvidia and Salesforce.
Starting point is 00:08:23 writes Bloomberg, the $2 billion valuation for a company less than a year old underscores the tech world's unbridled optimism about the future promise and profit of artificial intelligence companies. Now, what's interesting to me is that, their first round, which came pre-product and was once again a nine-figure round, and this one, the company has actually released a large language model that people really like. And if you are interested in the open source competition with more closed models from the big AI labs, well, then stick around for the main part of the episode, which is coming up next. Quickly a brief word from today's sponsor. As a listener of this show, I suspect you like
Starting point is 00:08:59 to stay up to date on all things AI and tech, which is why you have to check out the chart-topping podcast Web 3 with A16Z Crypto. Produced by venture firm Andresen Horowitz, Web3 with A16Z is the perfect companion podcast to the AI breakdown. Web3 with A16Z crypto is your definitive resource for the future of the internet, whether you're interested in the convergence of AI and crypto or simply curious about what's next. If you need a place to start, they recently released an excellent episode with Stanford Cryptography Professor Dan Bonay and former Google X engineer Ali Yaya in conversation with host Sonal Choxi about the intersection of AI and Crypto. From fighting deepfakes and proving humanity to large language models like ChatGBT,
Starting point is 00:09:39 GPT, they cover it all. I highly recommend checking it out, especially if you'd like to learn more about how AI and crypto will impact our everyday lives. Beyond Crypto and AI, this show is for creators seeking more ways to truly own their work, for business leaders trying to prepare for the future today, and for innovators exploring trending tech topics. Don't miss out. Follow Web3 with A16Z Crypto on Apple Podcasts, Spotify, or your favorite listening app.
Starting point is 00:10:06 Welcome back to the AI breakdown. One of the big themes for the show over the last week or so, which is pretty inevitable and always going to happen, is where the industry is at the one-year anniversary of Chatchip-T. Whether you like Chat-GPT or loathe it or like OpenAI or dislike them, there's no denying that the launch of Chat-GPT set off this wave of generative AI and will be seen as the starting point for this whole movement and industry that we're all a part of. Now, of course, the drama surrounding the OpenAI leadership has been the dominant theme However, it's not the only one.
Starting point is 00:10:38 Today, we are discussing a new alliance led by META and IBM, which is a fairly strong statement of disagreement with the way in which OpenAI and some of their peers are behaving as relates to the artificial intelligence industry. So let's talk about what the news is, and then we'll do a little bit of analysis and interpretation. The Wall Street Journal writes, Meta and IBM launch AI Alliance.
Starting point is 00:10:59 One year after the debut of ChatGBT, companies including AMD and Service Now are teaming up with academia to promote Open alternatives to Open AI. So a couple things to note about this alliance. First of all, it is not just meta and IBM. There are more than 50 AI companies and research institutes that are part of this alliance. As I mentioned, it is called perhaps not all that creatively and certainly with difficulty to Google the AI Alliance. Other members include Intel, Oracle, Cornell University, and the National Science Foundation. The journal describes its members as, quote, largely supporting open source,
Starting point is 00:11:31 an approach in which technology is shared free and draws on a history of collaboration among big tech, academics, and a fervent movement of independent programmers. And to be clear, this is absolutely a response to the way that the industry has developed, and specifically the way that narratives around the industry have developed. Said Dario Gill, senior vice president at IBM and director of IBM research, frankly, we've been a little bit unsatisfied with the overall debate and the discussions on AI over the last year. We did not feel that it reflected the diversity of the ecosystem that is making this AI moment
Starting point is 00:12:01 possible. Said Met as president of Global Affairs, Nick Clegg, We believe it's better when AI is developed openly. More people can access the benefits, build innovative products, and work on safety. Now, in addition to any sort of statement of principle, there is also a clear business objective here. Again from the Wall Street Journal, many of the alliances members are companies that have their own AI products, but are struggling to catch up with the rush of attention that Open AI and its investment partner Microsoft are drawing. The article also points out that although they have been working on this alliance since all the way back in August,
Starting point is 00:12:31 the timing is opportune. quote, since the upheaval at OpenAI in late November, businesses want to have more providers of AI products to diminish the risk of working with a single vendor and are exploring other AI systems as viable alternatives. The timing of the AI alliances launch underscores that message IBM's Gill said. This other way, it's a much more distributed approach, but much more resilient, because no given institution can derail the success of the Open Engine. It definitely feels a little bit like an alliance of companies who are just a little behind. For example, quote, advanced micro devices, the chipmaker aiming to take a piece of Nvidia's dominance in AI chips,
Starting point is 00:13:04 said that it will support an open AI ecosystem with its hardware and that it, along with other alliance members, will build the software that enables businesses to use the chips, said Forrest Narod, AMD's executive vice president and general manager of its data center group. The company is set to spotlight AI accelerator chips this week that Narod says will be a strong alternative to Nvidia's offerings. Said IBM's Gill, if you think the future of AI is going to be determined by two, three, or five institutions, you're mistaken. I hope that it gives more clarity and confidence that the world of open innovation is a world to bet in. Now, interestingly, there is far more coverage given to
Starting point is 00:13:37 those competitive dynamics of the alliance than there are to what they're actually doing. For example, the only real mention of what they're doing in that Wall Street Journal piece are the last couple sentences which say, the alliance is focusing on six areas, including regulation and safety as near-term initiatives. Gil said it would soon release a benchmarking tool for AI safety and model validation. Now, what's really interesting about this to me is that on the one hand, it's totally easy to be cynical about. To see this is something that is forced largely by competitive pressures. Basically, the open AIs and anthropics of the world, along with their big tech partners, have sucked all the oxygen out of the room, dominating not only the media coverage,
Starting point is 00:14:11 but by extension, the enterprise business. And this is a way to try to reclaim some of that narrative space. However, even if that's true, that's not to say that this message, that the future of AI is not going to be determined by two, three, or five institutions, isn't an important and resonant one. TechCrunch's coverage of the alliance gets at some of this skepticism. In fact, the title of their piece is Meta and IBM form an AI alliance, but to what end? Now, part of TechCrunch's questions come from the fact that there are already similar alliances to this in the space. They write, so what will the AI alliance do exactly and how will its work differ from the quite similar, at least in terms of its overarching mission members and tenants,
Starting point is 00:14:48 its partnership on AI? The Partnership on AI years ago promised to publish research using open-source licenses and minutes from its meetings, too, as the AI Alliance purportedly seeks to do, educate the public on pressing AI issues of the day. Well, confusingly, the partnership on AI is, in fact, a member of the AI Alliance. Giving a little bit more detail on the practicals, tech crunch writes, the AI Alliance's members will first form working groups, a governing board, and a technical oversight committee dedicated to advancing areas like AI trust and validation metrics, hardware and infrastructure that supports AI training in open source AI models and frameworks. They'll also establish product standards and guidelines and then partner with important
Starting point is 00:15:22 existing initiatives, initiatives conspicuously not named in the press release, from government, non-profit and civil society organizations who are, quote, doing valuable and aligned work in the AI space. If that sounds a lot like what the inaugural members of the alliance were already doing independently, you're not wrong. But in the release, the AI Alliance stresses that its work, whatever form it ultimately takes, is intended to be complementary and additive, rather than needlessly duplicative. The release reads,
Starting point is 00:15:45 More collaboration and information sharing will help the community innovate faster and more inclusively and identify specific risks and mitigate those risks before putting a product into the world. That stands in contrast to a vision that aims to regulate AI innovation and value creation to a small number of companies with a closed proprietary vision for the AI industry.
Starting point is 00:16:03 Now, in a section called key subtext, it gets into the fact that there is clearly a dividing line sort of battle happening here that is being calcified and hardened by this announcement. On the one hand, you have companies who are not included in this AI alliance, that include Google, OpenAI, Microsoft, Anthropic, coherer, who are skeptical, if not downright, opposed
Starting point is 00:16:20 to the open-source approach taken by a company like Meta. The piece points out that also notably absent are institutions like Stanford and MIT, and that Invidia isn't a member either. They write, I'll note that Invidia isn't a member of the AI alliance either, a suspect absence given that the company is by far the dominant provider of AI chips
Starting point is 00:16:36 and a maintainer of many open-source models in its own right. Perhaps the chipmaker perceived a conflict of interest in collaborating with Intel and AMD, or perhaps it decided to cast its lot with Microsoft, Google, and the rest of the tech giants opting out of the alliance for strategic reasons. Who can say? Now the flip side is that they did note that the groups that have come together for the AI Alliance represent a very wide swath of companies.
Starting point is 00:16:57 Others they name include CERN, Yale, Imperial College of London, Stability AI, Hugging Face, and more. TechRunch writes, the AI Alliance's initial cohort is exceptionally broad, sitting at the intersection of not just AI and enterprise, but healthcare, silicon and software as a service as well. But they write, Without the participation of so many major AI industry players and lacking deadlines or even concrete objectives, can the AI alliance succeed? What would success even look like?
Starting point is 00:17:20 Beats me. The vast number of competing interests from health care networks to insurance providers won't make it easy for the alliance's members to coalesce around a single United Front. And for all their talk of openness, IBM and META aren't exactly the poster children for the future that the alliance has released to Picks, casting doubt on their sincerity. Perhaps I'm wrong and the AI Alliance will be a smash success, or perhaps it'll crumble under mistrust in its own bureaucracy. We'll see. Time will tell.
Starting point is 00:17:44 Now, overall, I just don't think it's all that complicated. There is a major dividing line in and around the artificial intelligence space, around people's attitudes towards open source and openness in general. To me, the AI Alliance simply reads like a codification of those sides. But those sides aren't necessarily any different than they were yesterday before the AI Alliance was announced. Now, is it opportunistic for certain of these members to have joined the AI Alliance versus the uncodified alliance of big tech and closed AI people? Sure, maybe.
Starting point is 00:18:14 And is it opportunistic or at least opportune to use the chaos of open AI in the last few weeks as a moment to try to convince particularly enterprise customers and policymakers that maybe a more decentralized approach that is more open and less reliant on a small handful of companies might be a better one for this critical infrastructure and new technology? Sure. But as the famous saying goes, all is fair in an AI arms race. I think it's reasonable to have the sort of skepticism that that tech crunch office does, that such a broad-based coalition could actually find much to coalesce around, but even for
Starting point is 00:18:44 those who are skeptical of open-source AI and who are nervous about what it might mean, having a body where proposals for standards and guardrails and things like that can at least be discussed, in other words, where there is a space for ideas to flow between these different people who share this sort of sentiment, seems like a net good thing rather than a bad thing. So overall, I'm not sure. It could be that this ends up being a nothing burger, just a nice press release and a thing to discuss on podcasts for a day or two. Or it could be something more. No matter what, I do think it reflects the fact that we were moving to a new stage of the conversation, one with real policy implications, where enterprises are making buying decisions, in other words, one where the stakes are higher than ever. Interesting stuff at the beginning of this week, and something to keep an eye on for sure.
Starting point is 00:19:24 For now, however, that is going to do it for the AI breakdown. I appreciate you listening or watching. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.