The AI Daily Brief: Artificial Intelligence News and Analysis - 2X Bigger than GPT-4!? Amazon Is Training "Olympus"

Starting point is 00:00:00 Today on the AI breakdown, Amazon is apparently training a new model called Olympus that has twice the parameters of GPT4. Before that on the brief, the Screen Actors Guildstrike is finally over after coming to agreement on the most thorny issue, which was, of course, AI. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our Discord, our YouTube channel, and our newsletter. Welcome back to the AI breakdown brief. All the AI headline news you need in around five minutes. Late last night, SAG AFRA announced that they were finally, after 118 days, the longest strike in Hollywood history, coming to a deal with the studios to end the strike. Now, there were a bunch of more Hollywood things about this. This strike was never just about artificial intelligence. In their announcement, the Screen Actors Guild said that they were able to secure a contract valued at over a billion dollars, that they were able to.

Starting point is 00:00:56 able to negotiate, quote, above pattern compensation increases. But the most interesting one for our purposes at least was the announcement that they had secured, quote, unprecedented provisions for consent and compensation that will protect members from the threat of AI. Now, as Wired points out, while the strike was initially about a lot of different things, quote, as this year's negotiations between SAG and AMPTP dragged on, generative AI became the major sticking point. Back in July, studios claimed they offered a, quote, groundbreaking AI proposal that protects actors digital likenesses. SAG countered that the proposal stipulated background performers could be scanned, paid for the day, and then turned into digital characters that studios could use for the rest of

Starting point is 00:01:35 eternity. AMPTP disputed this. Continuing, the issue was volleyed back and forth until last weekend when SAG reviewed the studio's last best and final offer and rejected it, claiming, there are several essential items on which we still do not have an agreement, including AI. A follow-up story in the Hollywood reporter revealed that the AMPTP proposal sought to allow studios to pay for AI scans of what are known as Schedule F performers, and following the actor's death, allow studios to use the scans without the consent of the estate or SAG. Schedule F performers include anyone who makes more than the minimum rate for TV series regulars or feature films. The Guild wanted compensation for reuse of the scans along with consent. So basically what you're talking about here is that reportedly

Starting point is 00:02:16 Hollywood wanted to be able to scan actors, working actors, just once, and use those scans for perpetuity. Meanwhile, SAG was arguing for one compensation when those scans were used. and two, for the actors who scans were being used to have to consent. Now, as a total outsider, it's hard not to be on the side of the people who want to be able to say when their likeness gets to be used or not and want to be paid for that, but as of yet, we haven't gotten the specific final terms of the deal. As I've always said, what makes this story notable is not just its Hollywood context, but about the way that I think it presages many, many of these sort of organized labor battles that are going to be fought around artificial intelligence. My expectation,

Starting point is 00:02:54 my base case, if you will, is that organized labor is going to get significantly more powerful and be seen as significantly more important because specifically of the rise of AI. Now, moving on to a very different topic, one of the big themes in AI policy this year has, of course, been the United States desire to cut off China's access to advanced AI chips. The first set of export restrictions went into place at the end of last year, but there were still some pretty meaningful loopholes. For example, if a Chinese company had subsidiaries in other countries, those subsidiaries weren't necessarily subject to the same restrictions that the China-based corporate leader was. Now, just a couple weeks ago, we got updated rules from the Commerce Department that represent

Starting point is 00:03:34 an attempt to close some of those loopholes. And we've also heard that the U.S. has also behind the scenes been pressuring companies like Nvidia and AMD to stop or slow down shipments of chips to certain areas of the world, particularly the Middle East, from which they think they might get sent on to those Chinese companies. Yesterday we discussed about how Baidu has been shifting its purchases away from Nvidia and two local suppliers. But now the Financial Times is reporting that Nvidia is developing another power-moderated chip that would come in under U.S. mandated power restrictions.

Starting point is 00:04:03 In fact, Nvidia has actually developed three new chips focused on China. The H20, the L20, and the L2. Information about the chips was shared in a document to prospective customers. Road chip consultancy semi-analysis. Invidia is perfectly straddling the line on peak performance and performance density with these new chips to get them through the new U.S. regulations. Now, this is the second time Nvidia has had to update or reconfigure chips for the Chinese market. And the question, of course, will be whether the U.S. actually allows these sales to go through.

Starting point is 00:04:31 Moving over to an interesting topic, which we actually haven't had much of a chance to talk about here on the AI breakdown, but which I expect will come up more and more. And that is the overlap of AI and investing. In an interview with Reuters, the CEO of Norway's sovereign wealth fund said that they're using AI to help manage their investments. Now, the reason that this is notable is that Norway's sovereign wealth fund manages $1.4 trillion. It invests the state's revenues from oil and gas production and is the largest sovereign wealth fund in the world. It holds stakes in more than 9,200 companies globally and owns 1.5% of all listed stocks. Said CEO, Nikolai Tangan, we are using AI now and how we deploy the capital. We use it to

Starting point is 00:05:10 reduce the trading we do because we are a near index fund and sometimes we have to make adjustments to the portfolio and these types of models can help us trade less. Tangan went on to say that he had set the fund an internal target to boost productivity by 10% over the next 12 months using AI, and it even had a conversation about how to do that with Sam Altman, who is, of course, the CEO of OpenAI. Tangan said that when he asked Altman if he thought that they could increase their productivity by 10%, Altman said, I think 20%, you can do. I have to assume that many other investors, including sovereign wealth funds and other private market actors, are going to be watching this experiment closely. Over in the AI safety world, scale AI has launched a new AI safety lab called the Safety

Starting point is 00:05:49 Evaluations and Analysis Lab or SEAL. The goal of the unit is to build a suite of products for scale AI's existing customers that can be used to help evaluate the possible risks of AI deployments, as well as implement red team methods to help find faults and biases in software before it's launched. The new SEAL lab is being read by a former Google researcher who worked on Google Bard. Now, it seems like part of the motivation isn't just an inborn desire to see responsible AI development and deployment, although I'm sure that's a big part of it. It also seems to be targeted based on their announcement blog post at helping companies get out ahead of what are expected to be more strict compliance needs in a world of Biden executive orders on

Starting point is 00:06:26 AI and AI safety summits in the UK. In the blog post announcing seal, they wrote, every enterprise and government deploying an LLM application is facing the need to adopt and comply with the forthcoming standards and regulations that will be put in place. Now, on that front scale was one of the companies that voluntarily joined the White House Responsible Development commitments back in September, said CEO Alexander Wang, to advance our knowledge and understanding of LLMs, and therefore advance their capabilities, we must understand their weaknesses as much as their strengths. It is imperative for the tech industry to take proactive measures like these, given the rapid

Starting point is 00:06:58 pace of technological development. Over in the world of AI products, a ton of buzz today is about Humane's AI pin. Now, we've seen little premieres of this device. There was a widely shared TED talk from earlier in the year, as well as an appearance at a fashion show recently. But the official launch event for the PIN is coming later today. And in advance, it's basically seeming like this thing is a phone without the screen. The device is reported to cost $699 and come with a $24 a month subscription. We won't get too deep into that now because I anticipate that it will be a big part of the show tomorrow, but something to keep an eye on for sure. Now, the biggest news today is undoubtedly reports that Amazon is worth. working on a model they're calling Olympus that reportedly has twice the parameters of GPT4, but if you want to hear about that, you will have to come back for the main episode. That's coming up next. And now a word from today's sponsor. Are you interested in how two top-of-mind trends AI and crypto can work together?

Starting point is 00:07:55 If so, I have the perfect podcast recommendation for you. Web3 with A16Crypto, the chart-topping show brought to you by venture firm Andresen Horowitz. Web 3 with A16Z Crypto is your definitive resource for the future of the internet. Whether you're already building in these spaces or simply curious about what's next. If you need a place to start, they recently released an excellent episode with Stanford Cryptography Professor Dan Bonay and former Google Xer Aliya in conversation with host Sonal Choxi about the intersection of AI and crypto. From fighting deep fakes and proving humanity to large language models like Chatchip-T,

Starting point is 00:08:29 they cover it all. I highly recommend checking it out, especially if you'd like to learn more about how AI and crypto will impact our everyday lives. Beyond crypto and AI, this show is for creators seeking more ways to truly own their work, for business leaders trying to prepare for the future today, and for innovators exploring trending tech topics. So go ahead, listen to Web3 with A16Z Crypto, wherever you get your podcasts. Welcome back to the AI breakdown. Today we are talking about the latest news and reports out of Amazon, which is, as I said in the intro, that they are working on a model code-named Olympus that is rumored to be about twice as big in terms of parameter size as GPT-4.

Starting point is 00:09:10 So today we're going to talk about the news, as well as the larger context of competition between the big players when it comes to Foundation and Frontier models. So Reuter sourcing has two people familiar with the matter, saying that Amazon's new codenamed Olympus model has two trillion parameters. Again, that's about twice what OpenAI's GPT4 model is rumored to have. According to Reuter sources, the team inside Amazon is headed by Rohit Prasad, who's the former head of Alexa, and who is reporting directly to Amazon CEO Andy Jassy. Prasad's title is head scientist of artificial general intelligence and has been able to pull in people

Starting point is 00:09:44 from the Alexa AI and Amazon Science team to work in a unified effort. Now, when it comes to the business motivation of this, despite the fact that Amazon already has bedrock, which is their platform through which they help AWS customers take advantage of or customize existing models, including things like Anthropic, as well as open source models, they believe that having their own unique model within the larger AWS ecosystem could be a desirable offering for their existing customers. Now, to get a little bit of a sense about the larger Amazon context, let's go to a piece from the information from August called how AWS stumbled in AI giving Microsoft an opening. What the story reports is that even before ChatGPT had been announced,

Starting point is 00:10:24 AWS had been developing a similar type of LLM. They had wanted to debut the that tool, which they were referring to inside the company as Bedrock at the time, at their annual customer conference in November, however at the last moment they had to postpone it due to, as the information puts it, technical snags. Now, in many ways that ended up being fortuitous, because even as that event was happening, OpenAI released ChatGBT, which, as we know, went extremely viral and became the fastest growing software at the time in history. It didn't take Amazon long to recognize that the Bedrock model that they had been working on wasn't even close to the model powering open AIs chat GPT, but at the same time, they didn't want to do nothing.

Starting point is 00:11:01 In the scramble that ensued, Amazon threw away their old plan of leading with a model-first strategy and instead came up with the approach to build a service which would allow developers to connect with various different large language models, taking an approach where effectively they were arguing from a business standpoint there wasn't going to be a winner take-all situation, and that especially enterprise customers were going to prioritize having choice and customization options to find the right models for their particular businesses. Subsequent to that, they attached that bedrock name that had been floating around for their model to that new service and started promoting it. Now, Amazon did include some of its own models in that bedrock environment, which they referred to as

Starting point is 00:11:35 Titan, but the review of those models has been extremely mixed, say the least. Now, perhaps because of the reputation of Titan, or just the general sense of how far ahead other companies like OpenAI are, I would say that there's a fair bit of skepticism around the potential for a launch of Olympus. The information summed it up in a section of an article called The Mountain that Olympus must climb. They write, making Olympus successful could be a challenge for Amazon, at least based on the reception accorded Titan. The launch of Titan was delayed last year because it didn't even hold a tiny candle to OpenAI's chat GPT, and since making one of the Titan models generally available to AWS customers, the feedback about it has only been so-so. They also pointed to a piece from

Starting point is 00:12:14 from Hugging Face technical lead Phil Schmidt, who, as they put it, after side-by-side tests involving Titan and other models concluded that Titan is significantly worse than available open source models as well as software from OpenAI. Now, in terms of when we might get more information about Olympus, it seems like one potential is that it could come at ReInvent, which is Amazon's annual company conference, which is happening in Las Vegas at the end of the month. As the information points out, if they don't debut it there, it might make customers in the press think it's even further behind OpenAI. Now, at the same time, Amazon is one of the very few companies who has the resources to actually potentially compete on size for these models, and that's certainly where a lot

Starting point is 00:12:52 of the attention has been focused if you look at the chatter on Twitter. John Paris writes size matters when it comes to training AI, and points to a piece from the bite, Amazon reportedly training AI with twice as many parameters as GPT4. The AI wars have become a parameter measuring contest. However, as meta-chief AI scientist Jan Lacoon pointed out, back in September, quote, a model with more parameters is not necessarily better. It's generally more expensive to run and requires more RAM than a single GPU card can have. Jan also points out that when it comes to GPT4, it's rumored to be a mixture of experts, i.e. a neural net consisting of multiple specialized modules, only one of which is run on any particular prompt. So the effective number of parameters used at any one time is smaller than the

Starting point is 00:13:33 total number. Now, another line of conversation on X is around the fact that this is an inevitable move. That's the way that Boris here puts it. He says, I think it's inevitable for them to start competing and they also need a state-of-the-art model to power Alexa. Additionally, having custom GBT's for every book, author, and topic in the bookstore sounds like a no-brainer. Now, a third thing that people are talking about or asking about is what this suggests of its investment in Anthropic. Vizal Sharif asks, is that just a hedge? And I think that there are a couple things going on here. I don't think that Amazon is totally full of it when they've been talking about the idea that there are going to be multiple winners in this market and that a service like Bedrock, which gives enterprise customer's choice, is likely going to be a

Starting point is 00:14:12 valuable part of the equation. Indeed, even despite their huge investment in OpenAI, we've sort of seen Microsoft shift to a similar strategy, for example, in the form of their partnership with Databricks, which gives companies access to more models than just OpenAI models. Now, the other thing about their Anthropic investment is that they're not the only big tech player in that company. Just a few weeks after they announced their partnership, it also came out that Google would be investing a couple billion dollars in Anthropic as well. That obviously suggests a much more blended affiliation and a much less clear business relationship than, for example, OpenAI has with Microsoft. However, for many, the biggest conversation that this brings up is actually a question,

Starting point is 00:14:48 summed up best by Playground AI founder Sohail, who says, where is Google? Now, you might remember that back in September, the information in other sources suggested that Google was nearing the release of their much-anticipated Gemini model. The news back then was that they had given a small group of companies access to an early version of Gemini, and that in general, when it came to Google, giving outside developers access meant that Google was getting close to incorporating it into its other services. Now, Gemini matters for two different reasons. For one, many think that it might be the first model to actually exceed the capacities of GPT4,

Starting point is 00:15:23 which has been the benchmark for the industry ever since it came out. I've talked before in previous episodes about what a significant moment I think that will be for the industry, and really is signaling that this first phase of generative AI post- ChatGPT coming out last November, has come to a close and something new has begun. However, it's not just significant for the industry in general as a waymarker of how it's evolving. It's also really significant for Google. I think broadly speaking, there has been a lot of surprise at how far behind Google has appeared ever since ChatGPT came onto the scene. Frankly, the longer the Gemini goes unreleased, the higher the stakes are for that company. Nathan Landz writes,

Starting point is 00:16:01 What happened to Google? They seem to have completely dropped the ball. There was a lot of hype about Gemini than nothing. Meanwhile, OpenAI hit it out of the park and XAI proved they have a decent product and are iterating incredibly fast. Invidias Dr. Jim Fan writes, expectation for Google Gemini is now ridiculously high. Gemini has to check off at least one of the following, 120% IQ of textual GPT4 or 100% of GPT4 but at half the cost or 2x speed of turbo, or 100% of visual GPT4 or natively support long videos and ship the API in Q1 of 2020. Now, notably, even that 2024 number is a delay. Kevin Zhu pointed to a recent interview with Google DeepMind CEO Demas Sassabas

Starting point is 00:16:44 and writes, Google Gemini's delay to 2024 reiterated. Reorg of Google Brain and DeepMind is taking much longer to work through than the market expected. Then perhaps somewhat snarkily, but also reflecting a lot of people's perspectives, he writes, having the team's CEO go on regulatory capture missions rather than shipping product doesn't help. The Macrosift.eft.eith writes, Google Gemini still in training and fine-tuning, suggesting 2024 launch. Said Demis Hesaba's CEO of Google DeepMind in an interview with CNBC, we'll see about a 2024 launch.

Starting point is 00:17:13 Now, he points to one prediction market that has 49% betting that Gemini comes out this quarter, with another 40% betting that it comes out in Q1. Metaculous markets have the betting at February 3rd, 2024, with a lot of concentration across the Q1 dates. And even with these delays, there are also, as Dr. Jim pointed out, pretty high expectations of how this model will actually perform. Metaculous has another poll, what MMLU benchmark score will Google DeepMind's Gemini model have on release? By way of explanation, the background info writes, MMLU, massive multitask language understanding, is a test to measure a text

Starting point is 00:17:47 model's general knowledge and intelligence. The test covers 57 subjects, which range in difficulty from elementary level to advanced professional level, and it tests both informational knowledge and problem-solving ability. Now, Metaculus expects the benchmark score of 90.13, which would put Gemini ahead of GPT4. GPT4, for example, comes in an 86.5. Now, Brian Romley also talked about where Google's Gemini might try to position itself. He writes, With the release of GROC and GROC in GPT4 Turbo,

Starting point is 00:18:16 Google's Gemini will try to place itself in the middle. The focus will be to solve hallucinations by using the search graph that has high weight to the mixture of experts output. Now, what Brian is getting at is that in addition to the known competitors, like Google and like meta, Amazon and their forthcoming Olympus also have to deal with Elon's XAI and their new GROC model, along with OpenAI and whatever comes out as GPD 4.5 or GPD5. One of the critiques that I saw of OpenAI's dev day was around their prioritization,

Starting point is 00:18:44 with some like AI entrepreneur Bindu Ready, suggesting that they really should have focused on these more advanced models. Bindu wrote, rumors are they are training Gobi, a multimodal model, GPT5, that will be way superior to GPT4. It's not clear when Gobi will be available, but I suspect it won't be until. next year. For what it's worth, Metaculous is betting that OpenAI will announce GPT5 in about 11 months on October 5th, 2024. What's clear is that the speed of innovation is increasing. This has been particularly on display recently with the launch of GROC. Suhail again writes, it's interesting that it only

Starting point is 00:19:16 takes four months now to train an LLM to GPT3.5 Lama 2 level from scratch. Prior to January this year, nobody had practically replicated GPT3 still. Doesn't seem like the lead of GPT4 will last too much longer. And so, as the information put it, Olympus has a heck of a mountain to climb. But at the end of the day, it's not particularly surprising that they're going to take this on. And so really the only question is, when are they going to officially announce it? Come join us on the Breakers Discord if you want to discuss that or anything else. But for now, we'll wrap there. I appreciate you listening or watching as always. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - 2X Bigger than GPT-4!? Amazon Is Training "Olympus"

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.