The AI Daily Brief: Artificial Intelligence News and Analysis - GPT-4.5 - Does a Cached Announcement Blog Prove It’s Coming?

Starting point is 00:00:00 Today on the AI breakdown, the European Parliament has passed the AI Act. Before that on the brief, does a cash blog suggest that GPT4.5 is coming? The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to breakdown.network for more information about our Discord, our YouTube, and our newsletter. Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes. Questions around OpenAI's next model have reached an absolute. fever pitch recently. In general, people have been waiting for a very long time for OpenAI to release whatever is coming next, given that their GPT4 has been the standard bearer for more than a year.

Starting point is 00:00:46 However, what has really increased focus on when OpenAI's next model is going to come is, of course, the release of Claude 3, which is the first time that there is broad consensus that a model is actually better than GPT4 in practice. Sure, Google's Gemini Ultra claim to be better on certain benchmarks, but Claude 3 is really the one that I'm seeing people actually switch their behavior to. Take that plus Elon Musk's lawsuit and you have a fierce little cauldron of rumor and innuendo. One of those rumors exploded yesterday when people started reporting that they were finding a cached blog post showing up in search results on Duck, Dock, Go, and on Bing. It was theoretically a blog post on OpenAI.com, where the preview reads, OpenAI announces GPT4.5 turbo, a new model

Starting point is 00:01:29 that surpasses GPT4 turbo and speed, accuracy, and scalability. Learn how to hear how to be. how GPD 4.5 can generate natural language or code with a 256K context window and a knowledge cut off of June 2024. You might notice that June 2024 is in the future from where we are now, leading to speculation that the plan was to be training it up until then and then releasing it. More than a lot of rumors, I saw numerous people tweeting that they had repeated this and they had also found it. Some used co-pilot to try to get more information out of that cash piece, and keep in mind this is from not necessarily normal hypey sources. Still, some people aren't it. Matt Popovich, the co-founder and CEO of Legislature AI, says,

Starting point is 00:02:07 really doubting the GPT 4.5 turbo-cashed Bing result means anything. One, there's hints 4.5 is being skipped over in favor of five. Two, I doubt they'd release a turbo version of a brand new model substantially more powerful than GPT4. Three, the cached page is over five months old. Now, on that 4.5 being passed over for five point, some of the folks best associated with OpenAI leaks, like the Flowers from the future account have been on this tip that we will not see a GPT4.5 but will instead go straight to GPT5. On March 9, Flowers tweeted, no GPT5 is not coming next week and GPT 4.5 is still canceled, as I said. That didn't stop Flowers, however, from saying that this will be an incredible

Starting point is 00:02:46 week that will shock the entire industry, but it won't be GPT4.5 or GPT5. Given how much chatter there has been about Cognitions Devon, the AI coding platform, maybe that's what they're referring to. In any case, I think that the sourcing is about as accurate as this tweet from Accelerate Harder who writes, it was revealed to me in a dream that GBT 4.5 is coming. Basically where we are right now is that there is a very broad and strong sense that open AI has to do something that for the first time since we've all been paying attention, they are plausibly behind and getting more so every day. And I think that a lot of the energy around the discussion shows the general anticipation

Starting point is 00:03:21 and assumption that something big is coming. Meanwhile, moving over into Google Land for a minute, the company has announced some new policies around the 2024 elections. Specifically, it is now restricting Gemini from answering any questions about 2024 elections in countries that are holding those elections this year, including the U.S. and India. The company's India team writes, out of an abundance of caution on such an important topic, we have begun to roll out restrictions on the types of election-related queries for which Gemini will return responses. This is an update from a policy announced in December that said it would be limiting election-related queries,

Starting point is 00:03:55 obviously ratcheting things up even a little bit more. Although this started in India, TechCrunch confirmed with Google that it was going to roll out these restrictions globally. Now, some argue that this should give us pause about Gemini's capacity to give us good information in the first place. Cornell Professor Daniel Seusser writes, If Google's generative AI tools are too unreliable for conveying information about Democratic elections, why should we trust them in other contexts such as health or financial information? What does that say about Google's long-term plans to incorporate generative AI across its services, including search? From where I'm sitting, the more notable response is actually the lack of response.

Starting point is 00:04:28 This is the type of thing that I think a month ago would have provoked a big outcry on Twitter slash X about Google censoring things, et cetera, et cetera. But in the wake of the dust up around Gemini's image inaccuracy, it feels more and more like this is kind of assumed behavior and that people are, at least for now, writing Google off. That may just be a reflection of the moment, or it may be that people just aren't that interested in election policies, but it does feel to me at least a little bit notable. Even while big companies are restricting their tools, this week has seen a raft of startup announcements.

Starting point is 00:05:00 One of those companies is called Physical Intelligence, and here's how one of the founders, Lockhee Groom, describes it. Lockhey writes, There's been so much progress in AI, but it's been largely constrained to the digital world. At physical intelligence, we're developing foundation models to power any and all physically actuated devices with an early focus on dexterous manipulation. We're spending most of our time on scale data collection, algorithmic development, and training large-scale models. We are extremely early in setting out to create a world where atoms are

Starting point is 00:05:25 almost as easy to manipulate as bits. I'm excited to share a lot more soon. Well, one of the things that was shared was that the company is well-resourced to go after this mission. Physical intelligence, or Pi, has raised $70 million in a pre-seed round. To do this, it is set to work creating its own AI model design to bring basic human abilities to machines. But what fundamentally makes humans interesting is the brain, not our hardware. If you've been listening to this show, you know that the robotics AI overlap is heating up in a significant way, and it's pretty interesting to see how this company is approaching those problems a little bit differently. For now, however, that is going to do it for today's AI breakdown brief. Up next, the main AI breakdown. The tools that

Starting point is 00:06:04 help us manage our projects and endeavors become way more significant feeling than most other normal tools we interact with. They are the things that help us breathe life into ideas, and that's why I am so pleased that today's episode is brought to you by Notion. Notion is the power center for all of my projects from the AI breakdown to the new education initiative that so many of you are participating in. Notion combines your notes, documents, and projects into one space that's simple and beautifully designed. They've also now brought the power of AI directly into the platform so that when it comes

Starting point is 00:06:35 to writing, brainstorming, getting creative with new ideas, strategic planning, all of these things where AI can be an incredible companion and co-pilot, all of that is available to you without leaving Notion. Now, Notion is a tool for you, but it's also for your teams. In fact, the more your team uses it, the more useful it becomes for everyone. Notion is used by over half of Fortune 500 companies. And teams that use Notion send less email, cancel more meetings, save time searching for their work, and reduce spending on tools, which helps keep everyone on the same page. Try a Notion for free when you go to notion.com slash AI breakdown. That's all lowercase letters,

Starting point is 00:07:10 notion.com slash AI breakdown to try the powerful, easy-to-use Notion AI today. And when you use our link, you're supporting the show. Notion.com slash AI breakdown. Welcome back to the AI breakdown. The breaking news this morning is that the European Parliament has passed the AI Act. I couldn't help but laugh when I saw a meme from Kier Lamont on Twitter who wrote, How the EU AI Act News sounds to us non-EU legislative process understanders, with an image that says, rest assured, this will be the sixth time we have passed the EU AI Act,

Starting point is 00:07:41 and we have become exceedingly efficient at it. The joke here is, of course, that it is a very complex process to get legislation into law in the European Union, but this was, in fact, a significant and important step. Back in December, the European Union had gotten to provisional political consensus, which was itself not the first time that they had voted on this set of rules, but in Wednesday's session, the European Parliament actually endorsed the full act. The final tally was 523 votes in favor, 46 against, and 49 votes abstaining. Now, for a little bit of background, if you were just coming to this conversation, the AI Act did not, as one might have guessed, start with the initiation of chat GPT.

Starting point is 00:08:19 In fact, they have been negotiating this law since 2021. And indeed, one of the big challenges for them, in terms of finalizing it, was how to deal with generative AI, given that it hadn't been part of the early negotiations. The central underlying premise of the EUI Act is to divide artificial intelligence into different categories based on risk. Louisa Dirovsky on Twitter did a good breakdown writing, The AI Act follows a risk-based approach. Some AI systems she points out her band, including those that involve cognitive behavioral manipulation of people or specific vulnerable groups,

Starting point is 00:08:51 social scoring, i.e. classifying people based on behavior, socioeconomic status, or personal characteristics, biometric identification and categorization, real-time and remote biometric identification systems such as facial recognition. You may notice, in other words, that what the European Union has classified as the riskiest potential applications of AI are basically the Skynet totalitarian surveillance types of applications. The next category down is a high-risk category, which is fairly broad.

Starting point is 00:09:15 Some of the systems that fall into that category include critical infrastructure like transport that could put the life and health of citizens at risk, education or vocational training that may determine access to education and professional course of someone's life, for example scoring of exams, safety components of products, the example given being AI application, in robot-assisted surgery, employment, management of workers, and access to self-employment, think CB sorting software for recruitment, essential private and public services, e.g. credit scoring, denying citizens' opportunity to obtain a loan, law enforcement that may interfere with people's fundamental rights,

Starting point is 00:09:45 for example, the evaluation of the reliability of evidence, migration, asylum and border control management, e.g. automated examination of visa applications, and administration of justice and democratic processes like AI solutions to search for court rulings. So of course, the common thread here is that these are all applications, where the stakes of what happens or what is decided, is extremely high for the person whose life is affected by that decision. It's not just physical harm or risk, but things that could impact someone's career, someone's livelihood, someone's legal treatment. And so what does it mean to be in the high risk category? Well, it doesn't mean that things are banned, like the highest risk category that we were just discussing. Instead, these high risk systems have to

Starting point is 00:10:23 be assessed before being put on the market and throughout their life cycle. What's more, there will be specific procedures and processes for people to file complaints about how these systems are being used. What about generative AI? This was a big sticking point and the subject of lot of intense lobbying and negotiation. These types of tools will not be classified as high risk, but will have their own set of rules as per the AI Act. Specifically, they'll have to comply with certain transparency requirements in EU copyright law, such as things like disclosing that content was generated by AI, an obligation to design models to prevent them from generating illegal content, and publishing summaries of the copyrighted data that was used for training. The Wall Street

Starting point is 00:10:58 Journal describes some additional requirements around generative AI, quote, makers of the most powerful AI models deemed to have what the EU calls a systematic risk, will be required to put those models through state-of-the-art safety evaluations and notify regulators of serious incidents that occur with their models. They will also have to implement mitigations for potential risks and cybersecurity protection. The provisions on general-purpose AI were added during the EU's legislative process. Industry groups in some European governments pushed back against the introduction of blanket rules for general-purpose AI, saying legislators should focus instead on risky uses of the technology rather than the models that underpin its use.

Starting point is 00:11:30 France, home to Mistral AI, and Germany sought to water down some of the legislation's proposals. Mistral's chief executive Arthur Mensch said recently that the AI Act would, after some changes in final negotiations, that lighten some obligations, be a manageable burden for his company, even if he thinks the law should have remained focused on how AI is used and not the underlying technology. So how is this being received? Well, obviously, the lawmakers themselves are hailing it. EU Commission President Ursula von der Leyen writes,

Starting point is 00:11:55 I welcome the European Parliament vote on the AI Act. Europe's pioneering framework for innovative AI with clear guardrails. This will benefit Europe's fantastic pool of talent and set a blueprint for trust-worthy AI throughout the world. Lawmaker Tierie Britain wrote, Democracy 1, Lobby Zero. I welcome the overwhelming support from European Parliament for our AI Act, the world's first comprehensive binding rules for trusted AI.

Starting point is 00:12:16 Europe is now a global standard setter in AI. We are regulating as little as possible, but as much as needed. And indeed, that really seems to be the line that they've tried to hit, or at least that's the narrative they're going for. Romanian lawmaker Dragos Tudorik writes, The AI Act has pushed the development of AI in a direction where humans are in control of the technology and where the technology will help us leverage new discoveries for economic growth, societal progress, and to unlock human potential.

Starting point is 00:12:41 Not everyone agrees, however. Some of the loudest commentary if you look at Twitter slash X are from those who feel that the law was too watered down and not strong enough. Daniel Leifert tweets, EU officials boast about being global trendsetters in regulating AI, but with this law sets the lowest bar possible. The AI Act is full of concessions to industry lobbying, exemptions for law enforcement, and prohibitions full of loopholes. Amnesty International's tech account writes, the AI Act is adopted today. EU policymakers are hailing the AI rulebook as a global paragon for AI regulation, but the

Starting point is 00:13:10 legislation fails to take basic human rights principles on board. Countries outside of the EU should learn from the bloc's failure. Going on, Amnesty writes, the AI Act offers only limited protections to impacted and marginalized people. It does not ban the reckless use and export of draconian AI technologies, while also failing to ensure equal protection to migrants' refugees and asylum seekers. Similarly, it lacks proper accountability and transparency provisions which will likely exacerbate human rights abuses. The notable thing here is that Amnesty's complaint is not actually about generative AI or big company

Starting point is 00:13:39 power. It's more about the potential for abuse. Fannie Hedvegi, the Europe advocacy director at AccessNow writes, the EU Act is a failure for human rights, a victory for industry, and law enforcement. Access Now points out a similar set of issues as Amnesty, but also writes that the law, quote, creates a glaring loophole via Article 6.3 for developers to exempt themselves from the obligations of high-risk systems. On the flip side are those, often represented from an American perspective, who think that this legislation is doomed to relip urugate to an innovation backwater. In other words,

Starting point is 00:14:07 to incentivize European talent that might create companies and AI applications at home to instead seek less regulated places to build. Lawyer Preston Byrne writes, the passage of the EU AI Act once again creates a rift between the USA and the EU, which the USA can exploit with legal protection for AI development. Europe regulates. America innovates created the USA's multi-trillion dollar internet boom, it can do so again. Of course, I think that we will have to wait and see to what extent that actually is the case. Will companies like Mistral give up on the European tech dream and simply return to the U.S. and to Silicon Valley, places like it, or is that concern overblown?

Starting point is 00:14:41 I am by no means an expert on this, and I have not gone super deep on this act, other than following along in the context of what we've been discussing here. Still, my general sense of legislation is that when people on both sides are pre-exper, pretty angry about it, that probably means it's hit this sort of uncomfortable compromise where most workable laws seek to hit. In other words, if one side or another is too happy about the rules, it may not represent the averaged out interests of lots of different groups. One of the only people I saw representing that perspective was the account from growth to value on Twitter who wrote, The European Parliament has approved the AI Act. Everyone saying this is

Starting point is 00:15:14 awful is either very biased or has not looked at it and probably both. I think overall it's good. Social scoring, as is done in China, is absolutely forbidden. AI models can do no emotion recognition at school or work. It's forbidden to take pictures of people on social media for databases for facial recognition. Existing AIs like OpenAIs chat GPT will have to be clearer that a text or image is generated with AI, more attention to copyright infringements, three sensitivity categories from high risks to medium to low risk, plus forbidden AI. Overall, I think this is quite balanced. Now, in terms of where we go from here, the AI Act should officially become law in May or June, and then the provisions start coming into effect in stages. Six months after the law officially

Starting point is 00:15:51 comes into effect, countries will be required to ban those prohibited AI systems. A year later, the rules for general purpose AI systems start to apply, and two years after that, the whole AI Act will then be enforceable. Notably, these rules apply not just to companies from Europe, but for anyone operating in Europe, and fines for noncompliance can be up to 7% of worldwide annual turnover. From where I'm sitting, I don't expect to debate about these rules to end any time soon, but they exist and I think reflect the beginning of a regulatory catch-up period that is just starting right now. That is going to do it for today's AI breakdown. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - GPT-4.5 - Does a Cached Announcement Blog Prove It’s Coming?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.