The AI Daily Brief: Artificial Intelligence News and Analysis - With MAI-1, Is Microsoft Trying to Compete with OpenAI?

Starting point is 00:00:00 Today on the AI Daily Brief, Microsoft is reportedly launching a model to compete with GPT4. Before that on the brief, is Stack Overflow's new deal with OpenAI the end of Stack Overflow? The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, check out the Discord linked in the show notes. Welcome back to the AI Daily Brief, kicking off with today's AI headline news. Yesterday, Stack Overflow and OpenAI announced a new partnership. On an announcement blog post, Stack Overflow wrote, today the company's announced a new API partnership

Starting point is 00:00:37 that will empower developers with the collective strengths of the world's leading knowledge platform for highly technical content with the world's most popular LLM models for AI development. So what does this partnership include? Well, one piece of it is that OpenAI will have access to Stack Overflow's Overflow API product and will, quote, collaborate with Stack Overflow to improve model performance for developers who use their products. The goal here is to help OpenAI improve its models

Starting point is 00:01:00 using quote, enhanced content and feedback from the Stack Overflow community, while also providing attribution to that community. Indeed, they say OpenAI will surface validated technical knowledge from Stack Overflow directly into ChatGPT, giving users easy access to trusted, attributed, accurate, and highly technical knowledge, and code backed by the millions of developers that have contributed to the Stack Overflow platform for 15 years. The other part of the partnership is that Stack Overflow will utilize OpenAI models as part of the development of their own Overflow AI.

Starting point is 00:01:27 Now, this is a notable partnership for a couple of different reasons. Stack Overflow has been largely seen by many to be one of the chief victims of the AI era. The chart that's currently on the screen shows the decline of Stack Overflow's traffic, going back all the way to the release of GitHub co-pilot in October 21, but especially accelerating with the release of ChatGPT in November of 2022. Between a peak of around 20 million, daily page views in the 2020-to-2020 era, down to around 14 million in the wake of the release of copilot, and going all the way down to 10 million, or half of their previous traffic by July of 2023.

Starting point is 00:01:59 Francesco writes, So it finally happened. Stack Overflow announced its partnership, surrender, with OpenAI. I have mixed feelings about that and haven't used Stack Overflow in a while, and it always seemed like a useful but toxic place, especially for beginners. Commenter Saba Kisi writes, partnership or they'll die. No other option on the table. I have barely used Stack Overflow since Chat Chb-T was released.

Starting point is 00:02:21 While the feelings were mixed for some, they were decidedly less mixed for others. A UI programmer at Epic Games tweeted, Stack Overflow announced that they are partnering with OpenAI, so I tried to delete my highest-rated answers. I found that Stack Overflow does not let you delete questions that have accepted answers in many upvotes because it would, quote, remove knowledge from the community. So instead, I changed my highest-rated answers to a protest message. Within an hour, Mods had changed the questions back and suspended my account for seven days. Ben goes on, I'm requesting that my questions and answers be permanently deleted under GDPR. It's just a reminder that anything you post on any of

Starting point is 00:02:53 these platforms can and will be used for profit. It's just a matter of time until all your messages on Discord, Twitter, etc., are scraped, fed into a model and sold back to you. Lucille Danil Danilov retweeted Ben's message and wrote, Be Good Guy Dev, contribute to Stack Overflow for years, suddenly Stack Overflow for years. Suddenly, Stack Overflow decides to sell out to Open AI and scrape your highest rated answer, get banned, and have your original answer forcibly reinstated. How is this legal? Nick's Craft writes, everyone should immediately stop contributing to the Stack Overflow and its

Starting point is 00:03:21 network. The human touch is what made it unique. Delete your profile from S.O. and all your answers. Free loaders are making money out of human contributors. OpenAI, Microsoft, and Google together kill the open web. Thousands of independent blogs and forums are now nowhere in search engines or push back to page two to support their AI and partnerships with Reddit, Stack Overflow, and more. Many humans contributed to these sites hoping to build a knowledge base for humanity, but now greedy people like Sam Altman and OpenAI are taking over everything. I share these not because I think they're the right take, but just to show how intensely people feel about this. It is a complicated issue that is completely uncomplicated for some. I think it's an indicator of the portion of the

Starting point is 00:03:56 developer community, and indeed probably the larger community, that feels like any form of AI training is just another word for theft. At the same time, however, this sort of deal with publishers, where it's not so much about OpenAI getting access to their data, but Open AI actually attributing things to them as well, seems to be the way the winds are blowing. Right now, it's kind of split between newspapers, for example, who are suing Open AI and newspapers who are cutting these deals, and I think that pretty well demonstrates the particular bifurcation of this moment. Now, speaking of OpenAI, the information is reporting that two senior Open AI executives have left the company, those include vice president of people, Diane

Starting point is 00:04:30 Yun, and head of nonprofit and strategic initiatives, Chris Clark. I'm always a little skeptical of this type of story. Especially in the technology space, people move between companies all the time. Executives leaving a company doesn't necessarily mean that something bad is going on in that company. It could be for any number of personal reasons. It could be because these folks were able to cash out some chunk of what they had vested with that tender offer at an $86 billion valuation. Of course, what the press tends to notice is when there seems to start to be a pattern. The information, for example, connects this to OpenAI co-founder Andre Carpathy leaving earlier this year, and there are some other notable departures as well, but for now it feels to me more like the natural ebbs and flows of

Starting point is 00:05:09 people in and out of OpenAI rather than a smoke signal that something really bad is going on. Finally, today some announcements in the Apple and Chip world. First, Apple has unveiled its latest iPad Pro, the first time that we're seeing its new M4 chip. During the presentation, Tim Millett, the vice president of platform architecture at Apple said, with this level of performance, the neural engine in M4 is more powerful than any neural processing unit in any AI PC today. As Reuters writes, debuting its latest chip in a tablet rather than its Mac laptops is unusual for Apple, and suggests it is eager to give app makers a head start creating AI-related software ahead of its annual software developer conference next month. Continuing, they write,

Starting point is 00:05:47 precisely what AI features the new M4 chips could power might not become clear until June 10th at Apple's annual Worldwide Developers Conference, where it often shows new capabilities for Siri its voice assistant as well as the rest of its operating systems. Ben Bajor and the chief executive of consulting firm Creative Strategies said of the announcement, essentially there was a lot of foreshadowing on where they can go or will go with generative AI on their hardware. So we have this one official announcement, but then we also got a report from the Wall Street Journal that Apple has also been working on custom chips to run AI in data centers. According to reports, the server project is internally codenamed project ACDC, which stands

Starting point is 00:06:22 for Apple Chips in Data Center. The journal writes, Project ACDC has been in the works for several years, and it is uncertain when the new chip will be unveiled if ever. For now, it's mostly just being taken as yet another sign that Apple is indeed quietly working on AI stuff and that the market should price it as such. However, that is going to do it for the headline section of today's AI Daily Brief. Next up, the main part of the episode. Hello, AI friends. Today, I want to tell you about our platform super intelligent. In short, it's a platform for useful, practical, immediately applicable AI learning.

Starting point is 00:06:55 We have nearly 400 video tutorials, each of which comes with step-by-step how-toos, and the idea is to get you actually using these AI tools we talk about every day in a matter of minutes to actually solve problems, create new opportunities, and just do really cool things. To learn more and subscribe, go to B-Supert.a.i. And if you do decide to subscribe, use code podcast for 50% off your first month. Again, that's B-Super.AI. Welcome back to the AI Daily Brief.

Starting point is 00:07:23 For a while now, the AI industry has been closely watching the evolving relationship between Microsoft and OpenAI. There have been some big questions about whether Microsoft is over that relationship, just trying to hedge their bets, or going in fundamentally different directions. The latest saga in this story comes with reports about MAI 1. The information rights meet MAI 1, Microsoft Reddy's new AI model to compete with Google and OpenAI. So let's see what this model is, what the community's take on all this is, and what it means for this relationship in the long run. The information writes, for the first time since it invested more than $10 billion into open

Starting point is 00:07:58 AI in exchange for the rights to reuse the startup's AI models, Microsoft is training a new in-house AI model large enough to compete with state-of-the-art models from Google, Anthropic, and OpenAI itself. Now, they go on to write that the model is internally being referred to as M-AI-1, and it's is being overseen by Mustafa Suleiman. Now, that's notable because the last time that we were having this conversation about Microsoft and OpenAI was when Microsoft hired the majority of the startup inflection, as well as paying $650 million for the rights to its IP.

Starting point is 00:08:27 It was an acquisition in anything but name. Mustafa was, of course, not only the CEO and founder of Inflection, but before that, he had been one of the co-founders of DeepMind who sold that company to Google. It felt to some, like Suleiman and Inflection were being hired into the role that had for a time during the whole Sam Altman saga, been where Altman himself might be headed. Remember, when Altman was fired as CEO, for a moment, it appeared that he would be heading to Microsoft to lead a new AI division there. Obviously, that's not how that situation played out, but it seems in retrospect like either A, Microsoft had already had a plan like that, or B,

Starting point is 00:08:58 they liked the idea so much that even after Altman went back to Open AI, they decided to go through with it as well. Now, as for MAI 1, the information says that this is not a model that's carried over from inflection, that it is instead a homegrown Microsoft model. According to sources, MAI-1 will have roughly 500 billion parameters, as opposed to GPT-4's estimated more than 1 trillion parameters, but a lot more than other open-source models like meta and mistral 70 billion parameters. The information's analysis is this. That means Microsoft is now pursuing a dual strategy of sorts in AI, aiming to develop both small language models that are inexpensive to build into apps and that could run on mobile devices alongside larger state-of-the-art AI models. The last note from the report

Starting point is 00:09:38 is that Microsoft has been setting aside a large cluster of servers with Nvidia GPUs, as well as compiling training data. So what do the community think? Well, BinduReady from Abacus writes, as I predicted Microsoft is training its own LLM. It's called MAI 1, a 500 billion parameter model and may be previewed at their build conference. When this model becomes available, it will only be natural for Microsoft to push this instead of the GBT line. As predicted, OAI and Microsoft are becoming more and more competitive. Any which way, LLMs have become a commodity. Andrew Curran writes there have been indications for some time that Microsoft was intending to take their own path, and we're planning to train a bespoke model to compete with GPT4. We now have a name, MAI 1. Microsoft, for their part, tried to tamp down the idea

Starting point is 00:10:19 that this was somehow representative of a change in their relationship with OpenAI. Kevin Scott, the CTO of Microsoft, wrote, I'm not sure why this is news, but just to summarize the obvious, we build big supercomputers to train AI models. Our partner, OpenAI, uses these supercomputers to train frontier-defining models, and then we both make these models available in products and services so that lots of people can benefit from them. We rather like this arrangement. We've been at it for almost five years now. Each supercomputer we build for OpenAI is a lot bigger than the one that preceded it,

Starting point is 00:10:46 and each frontier model they train is a lot more powerful than its predecessors. We will continue to be on this path, building increasingly powerful supercomputers for Open AI to train the models that will set the pace for the whole field well into the future. There's no insight to the increasing impact that our work together will have. We also, for years and years and years, have built AI models in Microsoft in MSR and in our product groups. AI models turn out to be interesting things to work on, and our researchers do great work studying and building them. AI models are used in almost every one of our products, services, and operating processes at Microsoft, and the teams making and operating things on occasion need to do their own custom work.

Starting point is 00:11:18 Whether that's training a model from scratch or fine-tuning a model that someone else is built. There will be more of this in the future, too. Some of these models have names like Turing and MAI. Some, like five, for instance, we even open source. I know the way I've said it isn't all that dramatic, but it is reality. And it's a plenty exciting reality for all us geeks, given how hard all of this is to do. in practice. To basically, the argument here is that just because Microsoft is working on other models doesn't mean that it doesn't like OpenAI's models or its partnership with OpenAI. It just

Starting point is 00:11:45 has other uses and other needs for building other models. There's also been some interesting stuff coming out recently about how the relationship got started, specifically the involvement of Bill Gates. For example, a recent article reads, in 2017, just before Microsoft forged a partnership with a then-reli-un-unowned startup called OpenAI, Bill Gates shared a memo with CEO Satya Nadella and a small group of the company's top executives. A new world order, Gates predicted, would soon be brought on by what he called AI agents, digital personal assistance that could anticipate our every wanton need. These agents would be far more powerful than Siri and Alexa with godlike knowledge and supernatural intuition. Gates wrote, agents are not only going to change how everyone interacts with computers,

Starting point is 00:12:22 they're also going to upend the software industry, bringing about the biggest revolution in computing since we went from typing commands to tapping on icons. Business Insider went on, publicly Gates has been almost entirely out of the picture at Microsoft since 2021. In fact, Gates has been quietly orchestrating much of Microsoft's AI revolution from behind the scenes. Current and former executives say Gates remains intimately involved in the company's operations, advising on strategy, reviewing products, recruiting high-level executives, and nurturing Microsoft's crucial relationship with Sam Altman. One Microsoft executive said,

Starting point is 00:12:51 What you read is not what's happening in reality. Satya and the entire senior leadership team lean on Gates very significantly. His opinion is sought every time we make a major change. Another email that recently came out was one from CTO Kevin Scott, who we just heard from. It was from June 12, 2019. It was addressed to Satya Nadella and Bill Gates, and was called Thoughts on OpenAI. Kevin writes, The thing that's interesting about what OpenA.I and DeepMind and Google Brain are doing is the

Starting point is 00:13:15 scale of their ambition, and how that ambition is driving everything from data center design to compute silicon, to networks and distributed systems architectures, to numerical optimizers, compilers, programming frameworks, and the high-level abstractions that model developers have at their disposal. When all these programs were doing was competing with one another to see which RL system could achieve the most impressive gameplay stunt, I was highly dismissive of their efforts. That was a mistake. When they took all of the infrastructure that they had built to build NLP models that we couldn't easily replicate, I started to take things more seriously. And as I dug in

Starting point is 00:13:42 to try to understand where all the capability gaps were between Google and us for model training, I got very, very worried. As Bloomberg summed up, Microsoft's motivation for investing heavily in partnering with OpenAI came from a sense of falling badly behind Google. Said Scott in the email, we are multiple years behind the competition in terms of machine learning scale. Satya Nadella endorsed Scott's email forwarding it to chief financial officer Amy Hood. and saying it explains, quote, why I want us to do this. Darrow Bassenjo writes, both the competitive awareness to understand how far ahead Google was

Starting point is 00:14:11 and the realization that Microsoft's Bing teams would not be able to compete without help are great. So of course, the question becomes, have things changed so fundamentally in five years that Microsoft no longer feels behind? Semi-analysis explored this in a piece today called Open AI is doomed question mark, E2 Microsoft?

Starting point is 00:14:27 In it, they discussed the changing landscape of compute resources, the new emergence of models from China that are cheaper to run and high-levels. And then there's a section they have called, is Microsoft even committed? They write, Microsoft is spending over 10 billion in CAPEX directly for OpenAI, but they are not directing most of their GPU capacity to OpenAI. The majority of Microsoft's planned, over $50 billion annual spend on AI data centers is going to internal workloads. Much of this has been inference to deploy open AI models in their own products and services, but that's changing. Microsoft is forced

Starting point is 00:14:56 into looking for contingency plans because of OpenAI's bizarre structure. OpenAI is a nonprofit whose primary goal is creating artificial general intelligence that is safe and benefits all of humanity. OpenAI can and will break the agreement that enables Microsoft to have access to OpenAI's model with zero recourse from Microsoft. And then they quote an OpenAI document that reads, While our partnership with Microsoft includes a multi-billion dollar investment, OpenAI remains an entirely independent company governed by the OpenAI nonprofit. Microsoft is a non-voting board observer and has no control. AGI is explicitly carved out of all commercial and IP licensing agreements. The board determines when we've attained AGI. Again, by AGI,

Starting point is 00:15:31 we mean a highly autonomous system that outperforms humans at most economically valuable work. Such a system is excluded from IP licenses and other commercial terms with Microsoft, which only apply to pre-AGI technology. As semi-analysis continues, the most worrisome thing for Microsoft here is that OpenAI's board can decide at any moment, with no voting input whatsoever for Microsoft, that they have achieved AGI, and Microsoft is not entitled to the IP that was created with their investment dollars. When you stack that on top of the massive governance issues, OpenAI already has related

Starting point is 00:15:58 to their nonprofit and for-profit arms, Microsoft must put a contingency plan in place. The semi-analysis authors then draw the line between a bunch of internal initiatives leading all the way to MAI1. They call it Microsoft's first big effort at hitting GPT4 class and say that the goal is to have their own in-house from scratch GPT4 class model by the end of this month. So what to make of all of this? With no knowledge whatsoever of what's actually going on behind the scenes, it seems to me like everything is kind of right. What I mean by that is that I don't think that Kevin Scott is putting on a show of saying that they continue to plan to have a great relationship with OpenAI for years to come. I think that relationship has been extremely profitable for them.

Starting point is 00:16:35 I think they see a lot of potential in it. I think that they believe that OpenAI has some secret sauce that keeps them ahead of the curve. So I take all of what he said in that LinkedIn post at face value. At the same time, I also think that that semi-analysis analysis is correct. The simple fact of having this AGI clause in the deal terms means that Microsoft can't rely on that relationship forever. And what's more, they've seen how quickly and how seemingly capriciously the Open AI board can act. It's not surprising then that even if they are not wishing for a breakup with Open AI, they are hedging their bets more aggressively since the end of last year. Anyways, it is a fascinating time. It will continue to be a fascinating time, and I will be here to tell you all about it as it happens.

Starting point is 00:17:16 For now, however, that is going to do it for the AI Daily Brief. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - With MAI-1, Is Microsoft Trying to Compete with OpenAI?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.