The AI Daily Brief: Artificial Intelligence News and Analysis - Mistral Launches Close-to-GPT-4-Level Model with Microsoft Partnership

Episode Date: February 26, 2024

The model is proprietary rather than the open source Mistral has become known for. Also, a monster fundraising round for Figure AI. INTERESTED IN THE AI EDUCATION BETA? ONE DAY LEFT TO REGISTER! Lear...n more and sign up https://bit.ly/aibeta ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI breakdown, Mistral announces Mistral Large and a big partnership with Microsoft. Before that on the brief, robotics company figure raises a monster round with Bezos, Microsoft, Nvidia, and Amazon all participating. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown. Not Network for more information about our YouTube, our newsletter, and our Discord. Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes. AI is of course not something that just exists in two dimensions on our screens, in our chat GPT apps. No, one of the big prizes in AI is of course AI in the context of humanoid robots.
Starting point is 00:00:43 There are a ton of efforts in this space right now, obviously, big companies like Tesla or some of the leaders. And there is a strong sense that in the same way that the foundation model game is a massive market opportunity, so too is the humanoid robot space. Well, one of the hot companies in that space is called Figure. News broke today that the company is raising $675 million at a $2 billion pre-money valuation. Now, this has not been confirmed by Figure, but is being reported by Bloomberg, and they have lots of details. Perhaps most notably among the details is who's getting involved. In addition to VCs and corporate investments from Intel, Samsung, Parkway Venture Capital, Align Ventures,
Starting point is 00:01:25 the company is also reportedly getting $100 million from Jeff Bezos, $95 million from Microsoft, and 50 million each from Nvidia and Amazon. Bloomberg first reported this funding round back in January, and it appears that the interest of companies like Microsoft and OpenAI helped bring these other big actors to the table. Now, those reports say that the deal is very close to done, that wiring could actually happen today. But in the meantime, while we haven't gotten confirmation of that, figure CEO Brett Adcock did share video of the figure 01 robot completing a number of real-world tasks. He writes, this is end-to-end autonomous. We've made advances in our autonomous navigation, learn perception models, manipulation robust to pose variation, and generalizable systems for future
Starting point is 00:02:04 applications. The video puts the speed to human ratio as 16.7%, as in the figure 01 is around 17% as fast as a human at completing these tasks. Robotics is, of course, a very divisive area of this space. Take, for example, this tweet from More Perfect Union, which writes, Jeff Bezos made billions and billions exploiting low-wage workers. Now he's using some of his hoarded wealth to fund AI robots aimed at eliminating human labor. Others are more focused on the horse race comparison between figure and Tesla's optimists. Then again, there are others, and this might explain the massive venture round, who think that while as fun as those comparisons might be, this is far from a winner-take-all market. Right, Sophia and Malik, who is notably a small
Starting point is 00:02:43 investor in figure, this is going to be one of the largest tech companies of the 2030s. The total addressable market for humanoid robots is basically all of Earth. According to Vinod Kossala, we'll see one billion bipedal robots by 2040. C-3PO, eat your heart out. Now, moving on to last week's big story, Google Deep Minds Demis Hasabas was at the Mobile World Congress today and discussed the controversy around Geminii's historical images, which were often inaccurate to reflect a bias towards diversity even in situations where there was none. Said Hasabas, we've taken the feature offline while we fixed that. We're hoping to have that back online very shortly in the next couple of weeks, a few weeks. Now, this is not a small controversy.
Starting point is 00:03:21 In addition to it being loud on Twitter, Alphabet's shares were also down 3.5% today. That was actually the single biggest drag on the S&P 500. There is buzz that the upset goes even farther. Brian Romley tweeted earlier today, I just got off the phone with a Wall Street firm that has been politically neutral, and they just added themselves to a lawsuit that is being passed around against Alphabet Google on the release of Gemini. The amount they are asking for class members is massive.
Starting point is 00:03:45 But what is the basis? They are big shareholders, and this is an obvious flawed product. They feel they have the right to sue, and remove the board of directors for not removing management that allowed a flawed product to be released. We will see. Now, speaking of the market, the conversation that we may be in an AI bubble is kind of growing a little bit.
Starting point is 00:04:00 Bloomberg, for example, points to Kathy Wood and Arc selling more Nvidia as well as cutting their TSM stake. Bloomberg writes, Kathy Wood sold shares of Taiwan Semiconductor Manufacturing Corp for the first time in more than two years, adding to moves to cut exposure in the chipmaker's key customer, Nvidia Corp. Wood is trimming her holdings in the global chip bellwethers as the artificial intelligence frenzy intensifies. Wood was one of the most prominent voices predicting AI would be a game changer, despite that she sold Nvidia shares throughout last year, betting on growth potential and less talked about software companies such as UiPath Inc. And Twilio Inc. Now, one thing that's worth noting whenever you see any Kathy Wood headlines is that different arc funds have different rules
Starting point is 00:04:35 about how much of the fund anyone's stock can make up. And so you always have to ask, and I'm not sure if this is the case when it comes to Nvidia, are these reflections of those rules and overall portfolio balancing issues versus what they're being presented as, which is getting out ahead of a bubble popping. Now, whether the market thinks it's a bubble or not, one area that that continues to be white-hot is the talent battle. A leader of Google's video generation efforts has now joined TikTok owner BiteDance as the introduction of Sora has definitely churned up more intensity around AI video efforts. Now, obviously, this type of lateral company movement is nothing surprising or different, but I am certainly watching to see whether Sorah specifically
Starting point is 00:05:10 has an impact on other company behavior as it has on consumer perception. Lastly, one more update on a story we've been following for a while now, that Biden robocall in New Hampshire that kicked up so much dust in D.C. That was, of course, a deep fake of Biden saying that Democrats shouldn't go out and vote and they should save their vote till November. Well, now we've learned who was behind it. writes Axios, a former political consultant for President Biden's longshot Democratic primary challenger, Representative Dean Phillips, said Sunday he commissioned an AI-generated
Starting point is 00:05:38 robocall impersonating the president. The consultant was named Steve Kramer, and Axios said that the statement he first shared with NBC News, quote, showed no sign of remorse about the deep fake, and instead said it was a wake-up call for regulation. Kramer said, with a mere $500 investment, anyone could replicate my intentional call. Immediate action is needed across all regulatory bodies and platforms. Even individuals acting alone can quickly and easily use AI for misleading and disruptive purposes. Seems a little unlikely to me that his purpose was actually a big active political awareness
Starting point is 00:06:07 raising. Then again, what do I know? However, that is going to do it for today's AI breakdown brief. Next up, the main AI breakdown. Hello, AI friends. Quick note before we get back into the show, we have just opened up registration for the March edition of the AI Education Beta Program. The whole philosophy of this program is to get you learning by doing. So we have short tutorials, think three minutes, five minutes,
Starting point is 00:06:29 seven minutes, around specific features and use cases in AI, followed by challenges that are step-by-step instructions that get you actually using the most interesting and relevant tools. We have now built out a library of more than a hundred of these lessons in step-by-step companion instructions, and we'll be dropping more each week. For the first time, we'll also be moving beta users this month. We'll to a new dedicated platform where you can access that library of content, build list of lessons you want to learn from later, and other features that we hope will help make this the single best AI learning experience available. If you want to check it out, go to bit.ly slash AI beta. That's BIT.L.L.Y. slash AI beta. Registration is only open this week until next Monday,
Starting point is 00:07:10 so go check it out. Welcome back to the AI breakdown. One of the busiest companies in the AI space is called Mistral. It is full of former Google and Meta and OpenAI engineers. It's based in France, and it's been in many ways a standard bearer for the open source AI movement. Before Mistral came along in the middle of last year, Meta's Lama was absolutely dominating the entire open source conversation. A huge number of developers were building off it in one way or another. When Mistral arrived, though, it really started to suck some of the oxygen out of that open source room, and as I've discussed before in the show, seems to start putting some pressure on meta, who couldn't any longer guarantee that they were just going to be the default top of the
Starting point is 00:07:52 open source LLM heap. Well, now, Mistral has made a set of interesting announcements today. First, they announced a new Large Model. Their co-founder and chief scientist, Guillaume Lampal, writes, Today we are releasing Mistral Large, our latest model. Mistral Large is vastly superior to Mistral Medium, handles 32K tokens of context, and is natively fluent in English, French, Spanish, German, and Italian. We have also updated Mistral Small on our API to a model that is significantly better and faster
Starting point is 00:08:18 than Mixtral 8x7B. Now, from a performance standpoint, the numbers are looking really promising, with Mistral Large performing only below GPD4 and ahead of Claude 2 and Gemini Pro. What's more, the fact that it is natively multilingual is something that people are taking note of. Most LLMs are currently optimized just for English, so the fact that this is natively fluent in English, French, Spanish, German, and Italian covering a much larger swath of the world is super notable.
Starting point is 00:08:42 But to some extent, the biggest notable part of the least. announcement is the fact that Mistral Large is not being released open source, and in fact, will be delivered through Azure as part of a partnership with Microsoft. We're going to get into that and the community's reaction to it, but just to give a bit more of a background on the space in the AI world that Mistral is trying to carve out, let's look at a Wall Street Journal piece that was published today called the nine-month-old AI startup challenging Silicon Valley's giants. The piece begins, this time last year, Arthur Mench was 30, still employed at a Google unit here, and artificial intelligence had just started to take off in the public consciousness,
Starting point is 00:09:13 something more than science fiction. Mench's startup, called Mistral AI, is challenging the conventional wisdom that the winners of the AI race will emerge from among the tech industry's U.S. giants. Mensh, who founded the company with two engineering school friends, doesn't think enormous scale is essential, or that the U.S. will necessarily dominate. Mensch said, I've always regretted that there was no big tech in Europe. I think this is our chance to become one. Now, I know I'm reading a bit more than I normally would, but it's relevant. Mistral, which has raised just over 500 million from investors, including Andriesen Horowitz, remains tiny compared to the Goliaths of the industry. Microsoft-backed OpenAI and Alphabet's
Starting point is 00:09:45 Google are pouring billions of dollars into training the latest AI systems, leveraging their access to the specialized computer chips needed to build such systems and the fat balance sheets needed to pay for the electricity those chips consume. Mistral, named for a strong wind that blows from France, is founded in part on the idea that a lot of that money is being wasted. Mench said, we want to be the most capital-efficient company in the world of AI. That's the reason we exist. Now, for some sense of what that means or what that looks like, mistral large, which again is performing at a level somewhere between Claude 2 and GPT4, was apparently trained for around $22 million. While we don't know exactly how much it
Starting point is 00:10:20 cost to train GPT4, OpenAI CEO Sam Altman had said after the release of GPT4, the training it costs, quote, much more than $50 million to $100 million. Now, there's also a whole section around how Mistral is impacting the geopolitics of AI, with it particularly influencing how France has positioned itself vis-a-vis the EU's AI Act. France is very protective of mistral and was concerned about a number of provisions in that act that MENCH said could slow Mistral and companies like it down. But what about this question of open source versus proprietary? Open source was at the very core value proposition of Mistral, and indeed just being more engineer and developer-focused, has been one of their main brand strengths. I remember back in December, when Google announced
Starting point is 00:11:01 that Jem and I would be coming soon, Mistral just quietly dropped their latest model as a torrent that anyone could download with no announcement or fanfare, which was hugely beloved by the developer community. The Wall Street Journal's one quote about their new Mistral Large model not being available open source is MENC saying, it's obviously a thin balance between building a business model and sticking to our open source values. We want to invent new things, new architectures, and we still want to have something to sell extra to our customers. So what does their deal actually look like? The way that Microsoft frames it is this. Microsoft and Mistral announce a new partnership to accelerate AI innovation and introduce Mistral Large First on Azure. The press release and announcement from Microsoft
Starting point is 00:11:39 emphasizes that Mistral is on the very frontier of AI, that they are a leader in the open source community, and that perhaps most importantly for Azure's customers, they will get access to Mistral Large First. Microsoft writes that their partnership, which they call multi-year, is focused on three areas. The first is super computing infrastructure. They write Microsoft will support Mistral AI with Azure AI supercomputing infrastructure delivering best in class performance and scale for AI training and inference workloads for Mistral's AI flagship models. Number two, scale to market. Basically, this is their section that's all about how Mistral's models will now be the next to be available to Azure customers on top of OpenAI models. Finally, the third pillar is AI research
Starting point is 00:12:16 and development. They write Microsoft and Mistral AI will explore collaboration around training purpose-specific models for select customers, including European public sector workloads. Now, as you might imagine, a lot of the press coverage is focusing on what it means for Microsoft's Open AI relationship. The Verge writes Microsoft partners with Mistral and second AI deal beyond OpenAI. The piece notes that in addition to those three pillars articulated in the Microsoft announcement, the company will also take a minority stake in Mistral, although how much is not disclosed. And a real emphasis for this piece is embodied in the last paragraph, which reads, Microsoft Investment comes months after a rocky period for its main AI partner, OpenAI.
Starting point is 00:12:51 And of course, then it rehashes Sam Altman's firing, re-hiring, and the board recomposition, which is still a work in progress. Now, this is not the first time we've seen press want to make a Microsoft partnership out of OpenAI be a threat to OpenAI. Back in August, for example, Reuters published a piece, Microsoft plans AI service with Databricks that could hurt OpenAI. This actually came from a report from the information, which read, Microsoft plans to start selling a new version of Databricks software that helps customers make AI apps for their businesses. The Databricks software, which Microsoft would sell through its Azure Cloud Server unit, helps companies make AI models from scratch or repurpose open source models as an alternative to licensing OpenAI's proprietary
Starting point is 00:13:28 ones. Basically, what they're getting at here is that the philosophy of a company like Databricks, which is much more similar to something like Amazon's Bedrock, is that enterprises are going to want to have lots of different options, not be locked into any single one, and so perhaps Microsoft partnering with them suggested that they were heading more in that direction rather than being locked into just one partner, even a good one like OpenAI. Now, as I mentioned at the time, Microsoft and OpenAI's relationship is really complex territory. It doesn't really have precedent in the past, and is a particular reflection of the fact that big tech is the only capital source deep enough for competing at the highest levels
Starting point is 00:14:01 of LLM development, at least outside of Mistral. As much as it looks to us and we want to read it like a full acquisition or a default acquisition, that's never what it was. And there are elements of the relationship that are much more frenemy than friend. The amount that OpenAI makes, for example, from when Microsoft sells OpenAIs models through Azure, as opposed to when enterprise customers go directly to OpenAI, is a huge difference, creating inherently attention there, where presumably OpenAI would prefer to have all those customers going through it, and Microsoft would prefer to have all of them coming through its setup. But this doesn't mean the companies are secretly enemies, and I don't think that it means that deals like this one with Mistral necessarily
Starting point is 00:14:36 represent some lack of faith or growing discontent with their partnership with OpenAI. I do think that it's reasonable to believe that between this and that Databricks news, Microsoft does have at some sense that perhaps there is going to be a demand for a diverse set of models or at least a choice between models, much more akin to, for example, the positioning that Amazon Bedrock has taken. Now, interestingly, Amazon Bedrock also announced at the end of last week that Mistral 7B and Mixtral 8X7B, two of Mistral's other models, were also coming soon to Amazon Bedrock. That makes Mistral's Amazon's seventh official foundation model partner alongside Stability AI, meta-cohere, Anthropic, and others. Now, there was one other piece of news in this dense mistral
Starting point is 00:15:16 which is that they were releasing their own chat-GBT equivalent called La-Chat. In his Twitter thread about the announcement, Arthur Mench said, as a small surprise, we're also releasing LeChat Mistrel, a front-end demonstration of what Mistral models can do. And so, given all this, what has been the response of the community? You can find some concern that this deal effectively represents a new phase for the company, moving them farther away from their open-source roots. Bantag on Twitter writes,
Starting point is 00:15:42 Mistrel drops a new model that Bench is second after GPT4 as a hosted API, their glory days of magnet links might be over. The company seems in bed with Microsoft now is hinted by the mentions of Azure. Max von Thune writes, yet another independent startup sucked into big tech's gravitational field, this time French AI darling Mistral. Until we create alternative pathways for startups to access computing power and commercialize their products, this will keep happening. To be honest, though, this type of tone was far less prevalent than I might have assumed. Alex Volkov from the Thursday pod said, just a reminder to y'all before you go hyperbolic, Microsoft has been supporting Mistral and has outlined their strategy to serve open source models back in
Starting point is 00:16:17 November. And indeed, if you go back to that announcement page about this new partnership, Microsoft writes, in November 2023 at Microsoft Ignite, Microsoft unveiled an integration of Mistral 7B into the Azure AI model catalog accessible through Azure AI Studio and Azure Machine Learning. Basically the point being that this is not actually the beginning of a relationship between Mistral and Azure, but the expansion of that relationship. Even more, it just seems to be broadly speaking, the people understand that there does have to be a business model here. Bindu Ready, the CEO of Avicus, who advocates about as loudly as anyone for open source models, writes, a new model from Mistral, Mistral Large. With an MMLU of over 81, it's got pretty impressive numbers.
Starting point is 00:16:54 Sadly, they didn't choose to open source mistral medium. That said, good to see yet another LLM in the marketplace, which I think is pretty reflective of the tone overall. Indeed, some people went straight to brass tacks and functional comparisons. Matt Schumer from Hyper Right AI, for example, writes, Mistral Large is priced around 20% cheaper than GPT4 Turbo. It's a slightly weaker model as well. Curious to see how things play out and whether this is a worthwhile tradeoff for many applications. So lots of interesting things here, much to consider and take away. One thing that's for sure is that Mistral continues to earn its place at the very heart of the AI conversation and has not appeared to be doing anything but increasing that perception as time goes on.
Starting point is 00:17:32 However, that is going to do it for today's AI breakdown. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.