The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI's Potential $5B Hole

Starting point is 00:00:00 Today on the AI Daily Brief, another example of why Open AI is perhaps the most capital-intensive startup in Silicon Valley history. Before that in the headlines, Mistral launches another open competitor to Lama 3.1405B. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, follow the Discord link in our show notes. Welcome back to the AI Daily Brief Headlines edition, all the daily AI news you need in around five minutes. Just one day after Meta released its Lama 3.4. point one family of models, Mistral has hit back with another competitive open model that they're calling Mistral Large 2. Devondra Chaplow from Mistral, writes, super excited to announce

Starting point is 00:00:44 Mistral Large 2, 123 billion parameters fits on a single H-100 node, natively multilingual, strong code and reasoning, state-of-the-art function calling, and open weights for non-commercial usage. The blog post that they announced this with was called Large Enough, and the company writes, compared to its predecessor, Mistral Large 2 is significantly more capable in code generation, mathematics, and reasoning. And indeed, the blog really focuses on the code and reasoning dimensions of this. They write, we train Mistral Large 2 on a very large proportion of code. Mistral Large two vastly outperforms the previous Mistral Large and performs on par with leading models such as GPT40, 405B. They also write that a, quote, significant effort was devoted to enhancing the

Starting point is 00:01:25 model's reasoning capabilities. One of the key focus areas during training was to minimize the model's tendency to hallucinate or generate plausible sounding but factually incorrect or irrelevant information. This was achieved by fine-tuning the model to be more cautious and discerning in its responses, ensuring that it provides reliable and accurate outputs. Now, anytime Mistral launches a model, the developer community gets very excited. Indeed, in many ways, the battle for the standard bearer flag for open source AI has been in competition between meta and Mistral. Alex Banks tweets, the pace of AI innovation is relentless. Just 24 hours after Lama 3.1405B, Mistral announced their latest model.

Starting point is 00:02:01 This is a much smaller model, one-third the parameters of Lama 3.1-405B, yet large 2 performs equal or even superior to both Lama 3.1 and GBT40 across leading benchmarks. After sharing comparison benchmarks for the MMLU, Human Eval, and GSM-A-K, Alex continues this is incredible performance given the model size. Foundation model competition has never been higher. If this isn't the catalyst for Sam Altman and OpenAI to release GPT5, I don't know what is. In other parts of the open source community, people are still wrapping their heads around the new offerings from Lama. Bindu Ready of Abacus writes,

Starting point is 00:02:33 Mistral Large 2 is good, but Lama 3.17B is insane. Bindu continues, we've updated Live Bench AI to include Lama 370B and Mistral 2. Interestingly, Lama 3.17B is the best model given its size. It's even better than Mistral 2. Another small update, we fixed a couple of bugs, and now Lama 405B beats GBT40, making it the second best model in the world. A fantastic week for open source AI. If you've been following along, you'll know that the early leaked benchmark suggested that Lama 405B was right up there in that GPT-40 Claude 3.5 Sonic class and maybe even exceeding them and that seems to be being confirmed by these independent tests. However, the mistral announcement wasn't without its detractors entirely. Artificial guy writes, this isn't a fight, this isn't shade,

Starting point is 00:03:15 but honestly, Mistral Large doesn't make much sense right now. It's a model that most people can't run locally, non-commercial, only at Mistral API, and with a price of $9 per 1 million token output, it doesn't make sense compared to Lama 405B, at $3 for $1 million token output. And the non-commercial license was a really big sticking point for many people. Andre Bercoff writes, on July 23rd, meta-released a 405 billion parameter parameter context multilingual model for commercial use. On July 24th, Mistral released a 123 billion parameter model restricted to research use only. I'm not sure what the goal of this release was.

Starting point is 00:03:49 I will not even link the model because it's pointless. Has Mistral lost the way? Jeff Flaherty ironically points out that most of the people who have those H-100s that are required for running Mistral Large 2 aren't necessarily non-commercial. He writes, mistral responds to Metas Lama 3.1 with a much smaller model that claims to perform slightly better. The license, however, doesn't allow commercial usage. Wonder what kind of research this will enable for all those non-commercial researchers with all those H-100s. For those looking for signs, though, that this is all forcing the space to move towards the open, some have pointed out to OpenAI announcing that they were offering free fine toning for GPT40 Mini for the next two months.

Starting point is 00:04:24 Many people took this as a signal of the pressure coming from these open source models. Overall, there are definitely some interesting vibes. Leaker extraordinaire Jimmy Apples writes, there's something in the air, a schizzo vibe of hope. Let's get mathy. Andrew Curran points out that we're due for a big announcement from Google and that there had recently been reports that Google's deep mind had made a big leap in math reasoning. All in all, a good week for model competition and a portent of exciting. things to come. Speaking of models, Kling, the Chinese AI video generation model that took the internet by storm about a month ago, is now widely available. Initially, Kling was only available in China and

Starting point is 00:04:58 required a Chinese phone number, but on Wednesday, July 24th, Kling tweeted the moment we've all been waiting for is here, introducing the official global launch of Kling's AI International version 1.0. Any email address gets you in, no mobile number required. Once again, this has got to put pressure on OpenAI to release something when it comes to SORA. Over in the search engine wars, Bing has gotten an AI redesign, and like what we've seen recently from Google's AI overviews, of course, in many ways feeling to imitate the example of perplexity, Bing now puts the AI generated answers right at the middle as the main part of the experience while moving traditional search results over to the side.

Starting point is 00:05:34 As the Verge points out, Bing's new layout goes beyond general summaries. It's basically much more equivalent to a customized Wikipedia page that's created on the fly, and at least that particular reviewer is worried that it will be overwhelming. That said, I'm certainly willing to give it a try, and I think from Microsoft's perspective, Bing has to make some bold moves to try to carve out space in a search market that is still obviously dominated by Google. Lastly today, former NFL quarterback Colin Kaepernick has launched a new AI startup called Loomi's Story AI.

Starting point is 00:06:04 It's backed by Reddit co-founder Alexis O'Hanian and, quote, plans to use AI's capabilities to give aspiring creators tools they might otherwise not have access to. Says Kaepernick, it allows us to help fill in the skill gaps of creators. We're now building in that job. direction to try and open that up and democratize storytelling. In his post announcing their investment, Alexis wrote, Loomi is much more than a cool AI demo. It's a place where anyone, no matter their skill set is empowered to build, publish, and actually monetize their stories. Too many new AI tools are forgetting about that last part. How do they do that? We're talking end-to-end story creation,

Starting point is 00:06:35 simple AI tools, physical and digital publishing and built-in merchandising. It's everything a creator needs to make and scale their work even without a network in Hollywood or the dollars to pay a big team. That's how we level the playing field. That's how we get more people in the room telling more diverse stories. Loomi is about to turn every storyteller into their own personal Disney. And so once again, it appears that we have here an example of the trend away from just general AI interfaces towards actual full product suites that have a vision of how to integrate AI to make things happen. That, however, is going to do it for the AI Daily Reef Headlines edition. Next up, the main episode. Today's episode is brought to you by Venice. The leading AI company store your entire conversation

Starting point is 00:07:13 history and attach it to your identity forever. That's every question you ask, every answer you receive, every image you generate, every thought you share with the machine it's all being spied on. If you trust all the company's hackers and NSA board members that will ever have access to your AI conversations, then rejoice, for you are well served. For the rest of us, Venice is an alternative. Venice is a powerful AI app for text, image, and code generation that respects you as a sovereign individual and believes privacy and free speech are not only human rights, but necessary for civilizational advancement. Private, permissionless, and uncensored, you can try it for free without an account. AIA Daily Brief listeners receive a 20% discount on Venice Pro. Visit venice.a. slash

Starting point is 00:07:51 NLW.LW. Daily Brief. That's NLW Daily Brief. All one word. Today's episode is brought to you by Super Intelligent. As you guys know, Superintelligent is a platform we are building to help everyone, individuals and teams maximize their use of AI. We help you figure out how to use AI tools, as well as what to use AI for. And this is really important. The whole goal of Superintelligent is not just to give you tutorials and lessons, but to show you how other people like you are actually getting value from AI right now. For those of you who are still out there working, learning, and grinding deep in the summer, I'm excited to share our best offer ever. If you sign up with code year 50 right now, you will get 50% off the already reduced annual price. That means you'll

Starting point is 00:08:38 get access to Super Intelligent for a full year for less than $100. Again, that code is year 50 for 50% off the already reduced annual fee. This particular code is going to expire in just a week or two, so head on over to B-Supert.a.I. And check it out. Welcome back to the AI Daily Brief. Today, nominally, we are talking about a new report from the information that suggests that Open AI could be on track to spend $5 billion this year. We'll get into their arguments in what sourcing they have for this, but we're going to situate it in the broader context of whether Wall Street is starting to turn and view AI as being in a bubble. Let's get into the reporting from the information first, though. If you listen to this show regularly, you probably know that the information is one of the best

Starting point is 00:09:23 sourced, if not just straight up the best source publication when it comes to insider information about the AI industry. Their analysis is based, they say, on undisclosed internal financial data, as well as people involved in the business. They write our conclusion pinpoints why so many investors worry about the profit prospects of conversational artificial intelligence. Our results also underline the question of whether those companies will eventually need to charge higher prices for their technology if they can't find a way to reduce the cost of developing and running AI. So, with that in mind, let's get into the details. The information argues that as of March, OpenAI was on track to spend around $4 billion this year on basically renting servers. That comes from a person with direct

Starting point is 00:10:01 knowledge of the spending. And that is, of course, just for running chat GPT. That's not about training costs. Those training costs, including their new deals where they pay for data, could be as much as $3 billion this year. A person with direct knowledge of the decision said that last year, Open AI ramped up the training of new AI faster than it had originally planned. The company had earlier planned to spend around $800 million on such costs, but ended up spending considerably more, and so the information is estimating that those costs will double this year. Next, they estimate Open AI's 1,500 strong workforce to cost them around $1.5 billion, but this seems to be one of the areas where they are least confident, calling it a guesstimate to be sure.

Starting point is 00:10:37 OpenAI had previously projected workforce costs of $500 million for 2023, while doubling headcount from $400 to $800 over the course of that year. Given that it's nearly double that workforce again and is likely to add even more people in the second half of this year, that's where that $1.5 billion estimate comes from. So that puts OpenAI's operating costs this year at around $8.5 billion, or at least as high as $8.5 billion. The revenue story is one we've heard before. ChatGPT recently was on pace to generate around $2 billion annually. Although as the information flags, the issue that OpenAI faces of people using a free version of ChatGPT, raising computing costs without generating revenue, could be exacerbated this year

Starting point is 00:11:15 when Apple begins rolling out ChatGPT on the iPhone. As of March, the information writes, OpenAI's API business was generating around $80 million per month, and so all in all, they estimate that its full year revenue could be between $3.5 and $4.5 billion depending on sales in the second half of this year. From there, it's just a matter of simple math. Potential costs of up to $8.5 billion, from revenue of up to $4.5 billion, and you get losses of between $4 and $5 billion. Now, it's not like Open AI isn't clear-eyed about this. Zammaltman has previously described the company as the, quote, most capital-intensive startup in Silicon Valley history.

Starting point is 00:11:46 But, as the information points out, quote, it means open AI will need to raise money soon. Where does this put them relative to competitors, though? Well, the piece argues that although Open AI may be burning a lot of cash, it's better off than some of its rivals. They point to a discount in what it pays Microsoft for renting its servers, and its better revenue profile than its competitors like Anthropic. The piece estimates Anthropics revenue to be between a fifth and a tenth of Open AIs, yet their burn may be around 50% of Open AIs. According to a person who saw the figures, earlier this year, Anthropic had projected

Starting point is 00:12:17 spending $2.5 billion on computing costs alone. Anthropic projects that it'll reach around $800 million in annualized revenue this year, but shares some of that with Amazon, meaning that its net could be between $400 and $600 million, leading to the information's conclusion that, quote, although Anthropic is growing faster than OpenAI, it is nowhere near as efficient. So what comes next? Well, in classic business fashion,

Starting point is 00:12:38 OpenAI is looking at ways of reducing costs and generating more revenue. The reports are that OpenAI is planning to launch a search engine as well as the computer using agent, both of which would handle different types of multi-step tasks. They also anticipate GPT5, whatever it ends up being called, due out before the end of the year, which could be another boon to growth. So that is the OpenAI story.

Starting point is 00:12:59 Nothing in there, I think, is particularly surprising, but it is expensive, and it of course then plays into the larger debate around the ROI from AI that we've been discussing over the past few months. The Wall Street Journal yesterday published a piece called Google Fails to Wow as AI Bills Mount. Advertising business faces tough growth comparisons while AI spending continues to surge. I mentioned that the interpretations of Google's recent financial results were sort of a Rorschach test, and the Wall Street Journal here is kind of trying to play both sides. They start the piece, it's good to be Google these days, but it isn't easy and it will keep getting harder. And effectively, the story that they're telling is revenue growth that is right

Starting point is 00:13:33 around expectations, although a little bit better, but increased capital expenditure, particularly on AI infrastructure, that just seems to continue to be going up. Don't expect to see a shift in Google strategy anytime soon, however. CEO Sundarpa Chai said during the earnings call, look, obviously we're at the early stage of what I view as a very transformative area. The risk of underinvesting is dramatically greater than the risk of overinvesting for us here. Mark Zuckerberg of Meta has made a very similar point. And an additional point I'll make, which I do think is germane to the whole bubble conversation, is that these are not levered companies getting into some risky area that's more smoke and mirrors than real opportunity. Alphabet is sitting on $98 billion in cash.

Starting point is 00:14:13 That gives it a lot more room to run when it comes to these sorts of big long-term bets. But the bubble talk keeps increasing. The Washington Post writes big tech says AI is booming, Wall Street is starting to see a bubble. The industry has rushed headlong into AI and stock market investors are following them, but a growing number of analysts are skeptical. Then again, the quote-unquote growing number of analysts actually are pretty much the same voices that keep saying the same thing over and over. Jim Covello, who is the main skeptic in that Goldman Sachs piece that we did a deep breakdown on recently, is the lead-quoted analyst here. What's undeniable is that this week has been rocky when it comes to public markets. The BBC writes, shares drop in US and Asia as AI stock slide. On Wednesday,

Starting point is 00:14:53 the S&P 500 lost 2.3% and NASDAQ felt 3.6. percent, which is its biggest one day fall since 2022. In the same way that for the last couple years, all the gains have been driven by big tech, and particularly big tech that's touching AI, the losses this time were also driven by those firms like Nvidia, Alphabet, Microsoft, Apple, and Tesla. Said June Bai Liu, a portfolio manager at Tribeca investment partners, investors are now becoming more concerned about all this expenditure with AI without the revenue benefit. I don't think this will mark the start of the disbelief in AI. It just simply means investors will focus more on returns in the space than just buying the whole sector. And now my friends would be a

Starting point is 00:15:26 completely reasonable thing. One of the weirdnesses of AI is the fact that Wall Street is involved so early because of the presence of big tech at such an early stage in a new technology's life. Usually there's a decade or more where something like generative AI would be incubated in the private market bastion of Silicon Valley before it came to public markets, but because of the particular dynamics of AI that's just not the case this time around, and Wall Street isn't necessarily having the easiest time figuring out how to price things. I think ultimately we need to break apart the conversation around the market bubble and valuation bubble from AI value and utility. For now, though, that is going to do it for today's AI Daily Brief.

Starting point is 00:16:02 Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI's Potential $5B Hole

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.