The AI Daily Brief: Artificial Intelligence News and Analysis - RUMOR: GPT-4.5 Coming Before EOY (Maybe Even This Week?)

Episode Date: December 11, 2023

NLW looks at rumors from some sources that have previously had OpenAI insider leaks that GPT-4.5 could be coming as soon as this week and likely before the end of the year. Also a look at fast moving ...open source developments with Mistral closing its fundraise and releasing its API. Today's Sponsors: Listen to the chart-topping podcast 'web3 with a16z crypto' wherever you get your podcasts or here: https://link.chtbl.com/xz5kFVEK?sid=AIBreakdown  ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI breakdown, we're looking at Mistral's latest release. It appears that open source AI might have a new leader. Before that on the brief, rumors are flying that GPT 4.5 could be coming before the end of the year and maybe even this week. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.net network for more information about our YouTube, our newsletter, and our Discord. Welcome back to the AI Breakdown Brief. All the AI headline news you need in our digital. around five minutes. We start today's episode in a slightly different place than normal, which is in
Starting point is 00:00:36 the realm of speculation. However, one of the interesting quirks of the Twitter slash X space right now is that there are a couple accounts that seem to have insider information on what happens next at OpenAI. The two accounts are the Jimmy Apples account, which we've talked about on the show before, and then Flowers from the Future at Futurist Flower, which only started in October, but has already built a bit of a reputation. All the way back at the end of October, on October 26th, that account had tweeted OpenAI big announcement in December, which is a fairly generic and open-ended thing. And obviously, this was all before the Open AI drama, so who knows if that would still be true. But in terms of establishing their bona fides, let's look at another tweet from October 26th,
Starting point is 00:01:16 where that same account tweeted, just verified one of my sources, Gem and I will come in the second week of December. As Pedro responded on December 7th, I came from the future, second week of December. this is actually true. Now, of course, one correct prediction does not mean that we should believe everything that this account says, but suffice it to say that there are a lot of people taking it seriously. And so, last week, when on December 7th, that account said, some of you can look forward to a little something coming next week, a lot of people stood up and took notice. Now, they expanded on that about a day later when they wrote, okay, so there's one big thing and one small open-eye thing waiting for us. The small thing is currently delayed due to company dramas, and the big
Starting point is 00:01:54 thing seems to be progressing according to plan. December remains exciting, especially next week. That was tweeted on the 8th, so next week is, in fact, this week. That account also posted this cryptic image of a black screen with a blue chat GPT logo in the middle. The AI Safety Memes account writes, GPD 4.5 next week, Jimmy Apples and Futurist Flower, both of whom have made some accurate leaks are dropping gossip.
Starting point is 00:02:17 Futurist Flower writes, there's one big thing and one small open AI thingy waiting for us. This is the same quote that I just read you. Now, back to AI safety memes, they continue. The big thing could be GPT 4.5. The small thing? Maybe that was the GPT store that just got delayed to 2024. Now, in the same post, they also referenced a Jimmy Apples tweet. Keep an eye out on a potential end of December GPT4.5 drop and new multimodal from Anthropic.
Starting point is 00:02:40 How big of a deal with this be? I don't know. But remember, the jump from GPT3 to GPD3.5 was massive. Gemini might look quaint overnight. The AI safety memes account also shared a tweet that Jimmy Apples deleted that came out right after Gemini was launched that I had referenced on this show where he wrote, At Sam Altman, what are you waiting for? Going to take the mitts off yet and release what you are sitting on or keep drip feeding. Cat emoji. Now, AI entrepreneur Bindu Reddy also tweeted
Starting point is 00:03:06 about GPT4.5, writing, fascinating. Grock, in response to the prompt, what were the top AI tweets in the last two days? Grock has decided that OpenAI has released GPT4.5. It is hallucinating, but maybe it knows something we don't know. I'm hearing juicy rumors about 4.5 dropping before end of year. Now, what Kroc had come back with was a theoretical tweet from OpenAI that said introducing GPT 4.5 the next step in our journey towards more advanced and capable AI. Stay tuned for more updates and demos. There has been some reporting recently, although it's unverified as of yet, that GROC is actually accessing draft tweets, which obviously would be a big security problem if it's true. But anyways, all of this adds up to probably nothing, but certainly
Starting point is 00:03:47 some interesting speculation for a Monday morning. Is this important? Is this something you should plan around? Obviously not. Could be that we just wasted a few minutes on rampant speculation, but the reality is that right now, coming off of this Gemini announcement of an announcement, as well as the OpenAI power struggle a couple weeks ago, this is for better or worse, something that everyone is interested in. Meanwhile, in terms of real news, the EU has finally come to agreement over the EU AI Act. Now, if you've been paying attention to this, you're probably confused around exactly how the EU lawmaking process works, because we reported a few months ago that the EU AI Act had been passed, but then there were a few remaining steps.
Starting point is 00:04:22 But then it turned out those few remaining steps were very controversial or at least had room for some controversy. And now once again, an agreement had been reached. But as the New York Times writes, the law still needs to go through a few final steps for approval. So, TLDR, this podcast is much too short to try and actually explain the European Union political process. But suffice it to say that the AI Act has moved through yet another critical step and one which did see a fair bit of controversy and disagreement. Now, the biggest issue recently had been around what the rules would be for foundation model makers like OpenAI. A lot of the other rules, such as basically banning facial recognition AI, had been there from the beginning. Remember, the EUAI Act
Starting point is 00:04:58 has been in the works for many years, in fact, longer than ChatGPT has been around. Part of the reason that there was such controversy around things like ChatGPT is that it was kind of a new addition late in the process, and there were some who wanted to punt it off entirely and basically figure it out later. Now, when it comes to those rules for Foundation models, they did not end up exempting them entirely, but as Axios sums up, foundation model providers will need to submit detailed summaries of the training data they use to build their models. However, Axios also writes, quote, there's a big catch in the EU's landmark new AI law, which is the fact that it doesn't come into effect until 2025, which is, of course, an eternity in terms of the development of AI systems.
Starting point is 00:05:36 Again from Axios, the hiatus leaves plenty of room for the US or others to undercut the EU's plans before they go into effect by, for instance, implementing less restrictive rules before Europe's kick in. Senate Majority Leader Chuck Schumer has expressed concern that EU-style laws enacted by the U.S. would put American firms at a disadvantage competing with China. Now, EU lawmakers like Tieri Breton hail the agreement tweeting out an image that pointed out that the EU was now the only continent with comprehensive AI regulation. Many commentators, however, noted that the EU, screaming about how good at regulating it was, sort of just reinforced the perception of its place as a backwater for technology. Adam Singer tweets, you hate tech and economic growth. No one takes you
Starting point is 00:06:15 seriously, along with the graph, showing how few European tech companies there were relative to Asia and the U.S. Stephen Serges tweets, You need to understand that people want industry that works, and regulation can be required to make sure this exists, but nobody is clamoring for regulation in the absence of industry. Mike Salana writes, I am genuinely so embarrassed for you. And Roheet writes genuinely thought this was someone making fun of the EU. Now, all of that said, there are indications that regular citizens in a lot of countries, including America, are not as universally pro-AI as the people tweeting in response to the EU
Starting point is 00:06:49 might make it seem. The AI Safety Memes account again writes, a new poll dropped and wow are Americans anti-AGI-I, anti-opense source, and anti-effective accelerationism. Three to one want to prioritize AI safety over AI innovation. Five to one want to stop the AGI race by banning development of ASI, and five to one are opposed to EACC. Now, once again, these results do come from the AI Policy Institute, whose whole self-admitted role is to slow down. the development of AI, but they're still pretty interesting statistics. The question I think is to what extent these numbers reflect people's considered opinions or the incredible amount of fear-based media that has been published this year around the potential risks of AI. In either case, the numbers are
Starting point is 00:07:29 so stark that they're probably fairly hard to ignore, and so really the question becomes, I think, what U.S. policymakers do with all of that. One last note, for those of you who are sad that that Google demo video where it was identifying the duck and then the hand motions about what was going on, ended up being faked or fabricated or at least just not exactly what it appeared. Someone wired up GBT4 Vision to see if chat GPT4 could do it, and sure enough, it was able to perform a little bit better. Anyways, friends, that is going to do it for today's AI breakdown brief. Up next, the main AI breakdown.
Starting point is 00:08:01 Quickly a brief word from today's sponsor. As a listener of this show, I suspect you like to stay up to date on all things AI and tech, which is why you have to check out the chart-topping podcast Web3 with A16Z Crypto. Produced by venture firm Andresen Horowitz, Web3 with A16Z is the perfect companion podcast to the AI breakdown. Web3 with A16Z crypto is your definitive resource for the future of the internet. Whether you're interested in the convergence of AI and crypto or simply curious about what's next. If you need a place to start, they recently released an excellent episode with Stanford Cryptography Professor Dan Bonay and former Google X engineer Aliya in conversation with host Sonal Choxi about the intersection of AI and crypto.
Starting point is 00:08:41 From fighting deepfakes and proving humanity to large language models like chat GPT, they cover it all. I highly recommend checking it out, especially if you'd like to learn more about how AI and crypto will impact our everyday lives. Beyond crypto and AI, the show is for creators seeking more ways to truly own their work, for business leaders trying to prepare for the future today, and for innovators exploring trending tech topics. Don't miss out. Follow Web3 with A16Z crypto on Apple Podcasts, Spotify, or your favorite listening app.
Starting point is 00:09:11 Welcome back to the AI breakdown. Mistral first jumped into the headlines back in June when the company, which was at the time only four weeks old, announced that they had raised a $113 million seed round. Yes, a nine-figure seed round, which many at the time took as indication of just how overhyped and crazy the AI world was. However, subsequent to that, the company's first model that they released, their Mistral 7B model started to become a favorite among open source developers. Indeed, whereas Meta's Lama had commanded so much of the energy and attention throughout the year, Mistral really became another genuine
Starting point is 00:09:50 open source competitor of the big AI labs at companies like Gemini Anthropic OpenAI. Now, as we head into the end of the year, there has been a lot of speculation around what the open source side of the AI arms race will look like going into 2024. Arvin Shrinivas, the CEO of Perplexity writes, next year we will see at least two of meta, mistral, and anthropic catch up to Gemini and GPT4, going to be exciting times. Well, today we got a bunch of announcements from Mistral that suggest that Arvin may be even more right than he seemed. Now, first of all, was the confirmation of the funding round, which was first reported last week. The final tally was $385 million euros or $415 million, and the valuation, which again had been reported last week, was $2 billion. The round was led by
Starting point is 00:10:34 Andresen Horowitz, and other investors including Lightspeed, Salesforce, B&B Parabas, General Catalyst, and more also participated. Now, given that a big topic of our conversation during today's brief was the EU-AI Act, it's interesting to note the extent to which Mistral is using European pride as a selling point for this company, said co-founder and CEO Arthur Mench, since the creation of Mistral AI in May, we have been pursuing a clear trajectory, that of creating a European champion with a global vocation in generative artificial intelligence, based on an open, responsible, and decentralized approach to technology. Now, in terms of how Mistral thinks about European regulations, they are one of the companies that has been lobbying for a total exemption for foundation models, with their argument
Starting point is 00:11:15 being as TechCrunch sums up, quote, that regulation should apply to the use cases and companies working on products that are used by end users directly. Now, part of what makes Mistral interesting is that their 7B model was released under the Apache 2.0 license, which means no restrictions on user reproduction outside of attribution. Now, for those wondering how Mistral plans to make money, today also came with the announcement of a new API program. Through that, developers will be able to access their new Mixtral 8x7B model, which they call Mistral Small, and that will be for free again under the Apache 2.0 license. But a third model, Mistral Medium, is only available through the paid API platform. Devendra Trapp low from the company writes,
Starting point is 00:11:53 Excited to release Mistral Platform, three chat endpoints with competitive pricing. Mistrel Tiny, Mistral 7B instruct version 0.2, upgraded base model with higher context length 8K to 32K and better fine tuning. Mistral small. Mistral 8X7B instruct version 0.1 matches or exceeds GPT3.5 performance multilingual. Mistral Medium outperforms GPT3.5 on all metrics multilingual. All endpoints have a 32K context size.
Starting point is 00:12:21 Author Carlos E. Perez writes, Mistral Medium blows away GPD 3.5. What's Mistral Medium? Hyperite CEO Matt Schumer says, Looks like Mistral has a model that's even better than Mixtral 8x7B, and they're serving it to alpha users of their API, scoring 8.6 on MT Bench that's frighteningly close to GPT4 and beats all other models tested. This is their medium size. Large will likely beat GPT4. Carlos Z. Perez again writes, Mistral has released their report on mixtral. It's very impressive. It challenges Lama 70s. and GPT 3.5 on less hardware and at lower inference latency. Furthermore, it uses only 12 billion
Starting point is 00:12:58 active parameters to achieve this. So how does this all work? Well, if you guys are willing, let's wade into the realm of the technical for just a moment. And for this, I'm just sharing other explanations from folks who are attempting to provide a little bit of a layman's explanation of how mistral is approaching things, even if, as you'll see, it's still a little bit dense. Dmitro Zulgakov writes, Mistral model is hot with mixture of experts like GPT4. It promises faster speed and lower cost than model of the same size and quality. Fascinatingly, the speed up is unevenly distributed, running on a laptop or the biggest GPU server benefits the most. Here's why. The new Mixtral 8x7b model has around 47 billion parameters. Each token uses only a subset of those approximately 25%,
Starting point is 00:13:39 two experts out of 8. But when running multiple requests in the same batch, different requests will activate different experts. For a small batch, speed is determined by how quickly the GPU can read waits for memory. For a single request, reading two out of eight experts is four times faster. At a small concurrent load and most of the speed up evaporates, all experts have to be loaded. If one can fit a big batch with many requests in a GPU, the situation changes. Speed now mostly depends on how many multiplications, flops, are done. Mixtral MOE still does only 25% of those, and we can expect speed to approach 4x. But running a big batch comes with its own challenges. The model needs to be aggressively split into many GPUs. A mixture of experts brings more options here
Starting point is 00:14:19 like expert-level parallelism. GPT4 is rumored to use mixture of experts too. It serves a lot of traffic and thus benefits from this regime. Bindu Reddy also wrote an explainer writing, Transform a mixture of expert architectures, why they are more efficient. Mistral 8x7B M-OE model is a solid 70B GPT 3.5 class model. Instead of having every part of the model work on every task, a mixture of expert model splits up the work among many specialized submodels or experts. Each expert is good at handling specific types of tasks or data. When an MOE receives a request, it routes the request to the right expert. The good news is it is more efficient when you have a diverse number of requests as you don't
Starting point is 00:14:56 hit on all the experts at the same time. In the case of Mistral for each token, only two experts are used, and this could potentially mean a 4x speedup compared to something like Lama 2. GPT4 is rumored to be an MOWE model as well. It's likely to have a greater number of larger size experts. When it comes to training MOEs, there are complexities around dealing with training the gating network or router and load balancing, avoiding over-reliance on a few experts. Since MOEs are more parameter-efficient, you don't require quite so many computational resources. It will be really interesting
Starting point is 00:15:24 to see how much the open source community will improve Mistral MOE in the coming weeks. Already, I am seeing several fine tunes, but nothing very compelling in terms of metrics yet. Anyways, there is probably a much deeper show that could get into all of this if that's something that interests you. But for now, the other piece of the story that I want to talk about is how they launch this. Again, from Bindu, the right way to drop a model, she writes. Mistral AI just dropped a mixtral 8x7b model as a torrent link. Christmas came early for open source AI. Eric Jang writes,
Starting point is 00:15:52 Mistrel's brand is already becoming one of my favorites in the AI space, releases 87-gibite torrent containing an 8X7B M-OE model via tweet and refuses to elaborate. That was a couple days ago before this whole announcement came out. Nick Dobos pointed out the contrast to what we got from Google around Gemini last week. Mistral launching a torrent link versus Google Gemini Ultra blog post waitlist. Mistral actually understands the market it's building for, power users and hacker. Google keeps trying these disappointing consumer demos to convince the uninterested.
Starting point is 00:16:20 Don't understand, Doom, to fail. Now, the other thing that some people are discussing is how much mistral is sucking the oxygen out of the room for other open source competitors like META in particular. Farr L writes, META is losing Mineshare. Hopefully they release Lama 3 via Apache 2. Accelerate Harder, writes, If META really wants to make an impact on the open source world,
Starting point is 00:16:40 they would choose a fully open source license and they'd do it soon, before mistral steals their thunder completely. Now, bringing this conversation back to the very beginning of today's brief, the way that the flowers from the future account kicked off was actually with rumors about Lama 3. On October 23rd, that account wrote, Lama 3 will come in different sizes, but the most capable one will be around 120B.
Starting point is 00:17:02 Oh yes, and it will come February 24. We'll be multimodal on par with GPT4. Still others don't want to wait for February. Boyan Tungu says, all I want for Christmas is Lama 3 175B. Now, lastly, on this Mistral announcement, a number of people including One Little Coder, noted that in their announcement post, they had a space for a link to a safety document that they ended up not actually linking. One Little Coder writes, is Mistral actually trolling us, or maybe they forgot the link?
Starting point is 00:17:29 Professor Ethan Mollock writes the significance. For those who don't follow AI closely, one, an open source model, free anyone can download modified beats GPT 3.5. Two, it has no safety guardrails. There are good things about this release, but also regulators, IT security experts, etc., should note the genie is out of the bottle. However, to Mistral's credit, AI safety memes writes, quote, it has no safety guardrails. No open source models do, since you can just strip them off in two hours.
Starting point is 00:17:55 Mistral is just honest about it. Anyways, different people might have different takes on this, but from a technology perspective, what Mistral is putting out is pretty impressive and certainly makes these big investments and big valuations make a little bit more sense. That, however, is going to do it for today's AI breakdown. I appreciate you listening or watching wherever you are. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.