The AI Daily Brief: Artificial Intelligence News and Analysis - Mistral Launches Close-to-GPT-4-Level Model with Microsoft Partnership
Episode Date: February 26, 2024The model is proprietary rather than the open source Mistral has become known for. Also, a monster fundraising round for Figure AI. INTERESTED IN THE AI EDUCATION BETA? ONE DAY LEFT TO REGISTER! Lear...n more and sign up https://bit.ly/aibeta ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI. Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/
Transcript
Discussion (0)
Today on the AI breakdown, Mistral announces Mistral Large and a big partnership with Microsoft.
Before that on the brief, robotics company figure raises a monster round with Bezos, Microsoft,
Nvidia, and Amazon all participating.
The AI breakdown is a daily podcast and video about the most important news and discussions in AI.
Go to Breakdown. Not Network for more information about our YouTube, our newsletter, and our Discord.
Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes.
AI is of course not something that just exists in two dimensions on our screens, in our chat
GPT apps. No, one of the big prizes in AI is of course AI in the context of humanoid robots.
There are a ton of efforts in this space right now, obviously, big companies like Tesla or some of the
leaders. And there is a strong sense that in the same way that the foundation model game is a
massive market opportunity, so too is the humanoid robot space. Well, one of the hot companies
in that space is called Figure.
News broke today that the company is raising $675 million at a $2 billion pre-money valuation.
Now, this has not been confirmed by Figure, but is being reported by Bloomberg, and they have lots of details.
Perhaps most notably among the details is who's getting involved.
In addition to VCs and corporate investments from Intel, Samsung, Parkway Venture Capital, Align Ventures,
the company is also reportedly getting $100 million from Jeff Bezos, $95 million from Microsoft,
and 50 million each from Nvidia and Amazon. Bloomberg first reported this funding round back in January,
and it appears that the interest of companies like Microsoft and OpenAI helped bring these other big
actors to the table. Now, those reports say that the deal is very close to done, that wiring
could actually happen today. But in the meantime, while we haven't gotten confirmation of that,
figure CEO Brett Adcock did share video of the figure 01 robot completing a number of real-world
tasks. He writes, this is end-to-end autonomous. We've made advances in our autonomous navigation,
learn perception models, manipulation robust to pose variation, and generalizable systems for future
applications. The video puts the speed to human ratio as 16.7%, as in the figure 01 is around 17%
as fast as a human at completing these tasks. Robotics is, of course, a very divisive area of this space.
Take, for example, this tweet from More Perfect Union, which writes,
Jeff Bezos made billions and billions exploiting low-wage workers. Now he's using some of his
hoarded wealth to fund AI robots aimed at eliminating human labor. Others are more focused on the
horse race comparison between figure and Tesla's optimists. Then again, there are others, and this
might explain the massive venture round, who think that while as fun as those comparisons might be,
this is far from a winner-take-all market. Right, Sophia and Malik, who is notably a small
investor in figure, this is going to be one of the largest tech companies of the 2030s. The total
addressable market for humanoid robots is basically all of Earth. According to Vinod Kossala,
we'll see one billion bipedal robots by 2040. C-3PO, eat your heart out. Now, moving on to
last week's big story, Google Deep Minds Demis Hasabas was at the Mobile World Congress today
and discussed the controversy around Geminii's historical images, which were often inaccurate
to reflect a bias towards diversity even in situations where there was none. Said Hasabas,
we've taken the feature offline while we fixed that. We're hoping to have that back online
very shortly in the next couple of weeks, a few weeks. Now, this is not a small controversy.
In addition to it being loud on Twitter, Alphabet's shares were also down 3.5% today. That was actually
the single biggest drag on the S&P 500.
There is buzz that the upset goes even farther.
Brian Romley tweeted earlier today,
I just got off the phone with a Wall Street firm that has been politically neutral,
and they just added themselves to a lawsuit that is being passed around against
Alphabet Google on the release of Gemini.
The amount they are asking for class members is massive.
But what is the basis?
They are big shareholders, and this is an obvious flawed product.
They feel they have the right to sue,
and remove the board of directors for not removing management that allowed a flawed product
to be released.
We will see.
Now, speaking of the market,
the conversation that we may be in an AI bubble is kind of growing a little bit.
Bloomberg, for example, points to Kathy Wood and Arc selling more Nvidia as well as cutting their
TSM stake. Bloomberg writes, Kathy Wood sold shares of Taiwan Semiconductor Manufacturing Corp for the first time
in more than two years, adding to moves to cut exposure in the chipmaker's key customer, Nvidia
Corp. Wood is trimming her holdings in the global chip bellwethers as the artificial
intelligence frenzy intensifies. Wood was one of the most prominent voices predicting AI would be a game
changer, despite that she sold Nvidia shares throughout last year, betting on growth potential and less
talked about software companies such as UiPath Inc. And Twilio Inc. Now, one thing that's worth noting
whenever you see any Kathy Wood headlines is that different arc funds have different rules
about how much of the fund anyone's stock can make up. And so you always have to ask, and I'm not
sure if this is the case when it comes to Nvidia, are these reflections of those rules and
overall portfolio balancing issues versus what they're being presented as, which is getting out
ahead of a bubble popping. Now, whether the market thinks it's a bubble or not, one area that
that continues to be white-hot is the talent battle. A leader of Google's video generation efforts
has now joined TikTok owner BiteDance as the introduction of Sora has definitely churned up more
intensity around AI video efforts. Now, obviously, this type of lateral company movement is
nothing surprising or different, but I am certainly watching to see whether Sorah specifically
has an impact on other company behavior as it has on consumer perception. Lastly, one more
update on a story we've been following for a while now, that Biden robocall in New Hampshire
that kicked up so much dust in D.C.
That was, of course, a deep fake of Biden saying that Democrats shouldn't go out and vote
and they should save their vote till November.
Well, now we've learned who was behind it.
writes Axios, a former political consultant for President Biden's longshot Democratic
primary challenger, Representative Dean Phillips, said Sunday he commissioned an AI-generated
robocall impersonating the president.
The consultant was named Steve Kramer, and Axios said that the statement he first shared
with NBC News, quote, showed no sign of remorse about the deep fake, and instead said it was
a wake-up call for regulation.
Kramer said, with a mere $500 investment, anyone could replicate my intentional call.
Immediate action is needed across all regulatory bodies and platforms.
Even individuals acting alone can quickly and easily use AI for misleading and disruptive purposes.
Seems a little unlikely to me that his purpose was actually a big active political awareness
raising.
Then again, what do I know?
However, that is going to do it for today's AI breakdown brief.
Next up, the main AI breakdown.
Hello, AI friends.
Quick note before we get back into the show, we have just opened up
registration for the March edition of the AI Education Beta Program. The whole philosophy of this
program is to get you learning by doing. So we have short tutorials, think three minutes, five minutes,
seven minutes, around specific features and use cases in AI, followed by challenges that are step-by-step
instructions that get you actually using the most interesting and relevant tools. We have now built out
a library of more than a hundred of these lessons in step-by-step companion instructions, and we'll
be dropping more each week. For the first time, we'll also be moving beta users this month. We'll
to a new dedicated platform where you can access that library of content, build list of lessons
you want to learn from later, and other features that we hope will help make this the single
best AI learning experience available. If you want to check it out, go to bit.ly slash AI beta.
That's BIT.L.L.Y. slash AI beta. Registration is only open this week until next Monday,
so go check it out. Welcome back to the AI breakdown. One of the busiest companies in the
AI space is called Mistral. It is full of former Google and Meta and OpenAI engineers. It's based
in France, and it's been in many ways a standard bearer for the open source AI movement. Before
Mistral came along in the middle of last year, Meta's Lama was absolutely dominating the entire
open source conversation. A huge number of developers were building off it in one way or another.
When Mistral arrived, though, it really started to suck some of the oxygen out of that open source
room, and as I've discussed before in the show, seems to start putting some pressure on
meta, who couldn't any longer guarantee that they were just going to be the default top of the
open source LLM heap.
Well, now, Mistral has made a set of interesting announcements today.
First, they announced a new Large Model.
Their co-founder and chief scientist, Guillaume Lampal, writes,
Today we are releasing Mistral Large, our latest model.
Mistral Large is vastly superior to Mistral Medium, handles 32K tokens of context, and is
natively fluent in English, French, Spanish, German, and Italian.
We have also updated Mistral Small on our API to a model that is significantly better and faster
than Mixtral 8x7B.
Now, from a performance standpoint, the numbers are looking really promising, with Mistral Large
performing only below GPD4 and ahead of Claude 2 and Gemini Pro.
What's more, the fact that it is natively multilingual is something that people are taking
note of.
Most LLMs are currently optimized just for English, so the fact that this is natively fluent in
English, French, Spanish, German, and Italian covering a much larger swath of the world is super
notable.
But to some extent, the biggest notable part of the least.
announcement is the fact that Mistral Large is not being released open source, and in fact,
will be delivered through Azure as part of a partnership with Microsoft. We're going to get into
that and the community's reaction to it, but just to give a bit more of a background on the space
in the AI world that Mistral is trying to carve out, let's look at a Wall Street Journal piece
that was published today called the nine-month-old AI startup challenging Silicon Valley's giants.
The piece begins, this time last year, Arthur Mench was 30, still employed at a Google unit here,
and artificial intelligence had just started to take off in the public consciousness,
something more than science fiction. Mench's startup, called Mistral AI, is challenging the conventional
wisdom that the winners of the AI race will emerge from among the tech industry's U.S. giants.
Mensh, who founded the company with two engineering school friends, doesn't think enormous scale
is essential, or that the U.S. will necessarily dominate.
Mensch said, I've always regretted that there was no big tech in Europe. I think this is our
chance to become one. Now, I know I'm reading a bit more than I normally would, but it's relevant.
Mistral, which has raised just over 500 million from investors, including Andriesen Horowitz,
remains tiny compared to the Goliaths of the industry. Microsoft-backed OpenAI and Alphabet's
Google are pouring billions of dollars into training the latest AI systems, leveraging their
access to the specialized computer chips needed to build such systems and the fat balance sheets
needed to pay for the electricity those chips consume. Mistral, named for a strong wind that
blows from France, is founded in part on the idea that a lot of that money is being wasted.
Mench said, we want to be the most capital-efficient company in the world of AI. That's the reason
we exist. Now, for some sense of what that means or what that looks like,
mistral large, which again is performing at a level somewhere between Claude 2 and GPT4,
was apparently trained for around $22 million. While we don't know exactly how much it
cost to train GPT4, OpenAI CEO Sam Altman had said after the release of GPT4, the training
it costs, quote, much more than $50 million to $100 million. Now, there's also a whole section
around how Mistral is impacting the geopolitics of AI, with it particularly influencing how France
has positioned itself vis-a-vis the EU's AI Act. France is very protective of mistral and was
concerned about a number of provisions in that act that MENCH said could slow Mistral and companies
like it down. But what about this question of open source versus proprietary? Open source was at the
very core value proposition of Mistral, and indeed just being more engineer and developer-focused,
has been one of their main brand strengths. I remember back in December, when Google announced
that Jem and I would be coming soon, Mistral just quietly dropped their latest model as a torrent that
anyone could download with no announcement or fanfare, which was hugely beloved by the developer
community. The Wall Street Journal's one quote about their new Mistral Large model not being available
open source is MENC saying, it's obviously a thin balance between building a business model and
sticking to our open source values. We want to invent new things, new architectures, and we still
want to have something to sell extra to our customers. So what does their deal actually look like?
The way that Microsoft frames it is this. Microsoft and Mistral announce a new partnership to accelerate
AI innovation and introduce Mistral Large First on Azure. The press release and announcement from Microsoft
emphasizes that Mistral is on the very frontier of AI, that they are a leader in the open source
community, and that perhaps most importantly for Azure's customers, they will get access to
Mistral Large First. Microsoft writes that their partnership, which they call multi-year, is focused
on three areas. The first is super computing infrastructure. They write Microsoft will support
Mistral AI with Azure AI supercomputing infrastructure delivering best in class performance and scale for
AI training and inference workloads for Mistral's AI flagship models. Number two, scale to market.
Basically, this is their section that's all about how Mistral's models will now be the next to be
available to Azure customers on top of OpenAI models. Finally, the third pillar is AI research
and development. They write Microsoft and Mistral AI will explore collaboration around training
purpose-specific models for select customers, including European public sector workloads.
Now, as you might imagine, a lot of the press coverage is focusing on what it means for Microsoft's
Open AI relationship. The Verge writes Microsoft partners with Mistral and second AI deal beyond OpenAI.
The piece notes that in addition to those three pillars articulated in the Microsoft
announcement, the company will also take a minority stake in Mistral, although how much is not
disclosed. And a real emphasis for this piece is embodied in the last paragraph, which reads,
Microsoft Investment comes months after a rocky period for its main AI partner, OpenAI.
And of course, then it rehashes Sam Altman's firing, re-hiring, and the board recomposition,
which is still a work in progress. Now, this is not the first time we've seen press want to
make a Microsoft partnership out of OpenAI be a threat to OpenAI. Back in August, for example,
Reuters published a piece, Microsoft plans AI service with Databricks that could hurt OpenAI. This
actually came from a report from the information, which read, Microsoft plans to start selling a new
version of Databricks software that helps customers make AI apps for their businesses. The Databricks
software, which Microsoft would sell through its Azure Cloud Server unit, helps companies make AI
models from scratch or repurpose open source models as an alternative to licensing OpenAI's proprietary
ones. Basically, what they're getting at here is that the philosophy of a company like
Databricks, which is much more similar to something like Amazon's Bedrock, is that enterprises
are going to want to have lots of different options, not be locked into any single one,
and so perhaps Microsoft partnering with them suggested that they were heading more in that
direction rather than being locked into just one partner, even a good one like OpenAI.
Now, as I mentioned at the time, Microsoft and OpenAI's relationship is really complex
territory. It doesn't really have precedent in the past, and is a particular reflection of
the fact that big tech is the only capital source deep enough for competing at the highest levels
of LLM development, at least outside of Mistral. As much as it looks to us and we want to read it
like a full acquisition or a default acquisition, that's never what it was. And there are elements
of the relationship that are much more frenemy than friend. The amount that OpenAI makes, for example,
from when Microsoft sells OpenAIs models through Azure, as opposed to when enterprise customers
go directly to OpenAI, is a huge difference, creating inherently attention there, where
presumably OpenAI would prefer to have all those customers going through it, and Microsoft would
prefer to have all of them coming through its setup. But this doesn't mean the companies are
secretly enemies, and I don't think that it means that deals like this one with Mistral necessarily
represent some lack of faith or growing discontent with their partnership with OpenAI. I do think
that it's reasonable to believe that between this and that Databricks news, Microsoft does have at
some sense that perhaps there is going to be a demand for a diverse set of models or at least
a choice between models, much more akin to, for example, the positioning that Amazon Bedrock
has taken. Now, interestingly, Amazon Bedrock also announced at the end of last week that Mistral 7B
and Mixtral 8X7B, two of Mistral's other models, were also coming soon to Amazon Bedrock.
That makes Mistral's Amazon's seventh official foundation model partner alongside Stability AI,
meta-cohere, Anthropic, and others. Now, there was one other piece of news in this dense mistral
which is that they were releasing their own chat-GBT equivalent called La-Chat.
In his Twitter thread about the announcement, Arthur Mench said,
as a small surprise, we're also releasing LeChat Mistrel,
a front-end demonstration of what Mistral models can do.
And so, given all this, what has been the response of the community?
You can find some concern that this deal effectively represents a new phase for the company,
moving them farther away from their open-source roots.
Bantag on Twitter writes,
Mistrel drops a new model that Bench is second after GPT4 as a hosted API,
their glory days of magnet links might be over. The company seems in bed with Microsoft now is hinted
by the mentions of Azure. Max von Thune writes, yet another independent startup sucked into big tech's
gravitational field, this time French AI darling Mistral. Until we create alternative pathways for
startups to access computing power and commercialize their products, this will keep happening.
To be honest, though, this type of tone was far less prevalent than I might have assumed.
Alex Volkov from the Thursday pod said, just a reminder to y'all before you go hyperbolic,
Microsoft has been supporting Mistral and has outlined their strategy to serve open source models back in
November. And indeed, if you go back to that announcement page about this new partnership,
Microsoft writes, in November 2023 at Microsoft Ignite, Microsoft unveiled an integration of
Mistral 7B into the Azure AI model catalog accessible through Azure AI Studio and Azure Machine
Learning. Basically the point being that this is not actually the beginning of a relationship
between Mistral and Azure, but the expansion of that relationship. Even more, it just seems to be
broadly speaking, the people understand that there does have to be a business model here. Bindu Ready,
the CEO of Avicus, who advocates about as loudly as anyone for open source models, writes,
a new model from Mistral, Mistral Large. With an MMLU of over 81, it's got pretty impressive numbers.
Sadly, they didn't choose to open source mistral medium. That said, good to see yet another
LLM in the marketplace, which I think is pretty reflective of the tone overall. Indeed, some people
went straight to brass tacks and functional comparisons. Matt Schumer from Hyper Right AI, for example,
writes, Mistral Large is priced around 20% cheaper than GPT4 Turbo. It's a slightly weaker model as well.
Curious to see how things play out and whether this is a worthwhile tradeoff for many applications.
So lots of interesting things here, much to consider and take away. One thing that's for sure
is that Mistral continues to earn its place at the very heart of the AI conversation and has
not appeared to be doing anything but increasing that perception as time goes on.
However, that is going to do it for today's AI breakdown. Until next time, peace.
