The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Is Officially Training GPT-5

Starting point is 00:00:00 Today on the AI breakdown, we're talking about the real value and importance of OpenAI's custom GPTs. Before that on the brief, OpenAI is officially training GPT5. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our Discord, our YouTube, and our newsletter. Welcome back to the AI breakdown brief, all the AI headline news you need in around five minutes. Today we kick off with a couple of stories around OpenAI. Big surprise, that's basically been the kickoff of every episode for the last week or so. But where we start is with a really interesting interview between Sam Altman, who is, of course,

Starting point is 00:00:42 the CEO of OpenAI and the Financial Times. There are a few things in here which, although seemingly innocuous at first, actually I think have pretty big implications. The first, and what the FT chose to focus on, was the intimation that OpenAI would be raising even more money from their partners at Microsoft. From the interview, Altman said his company's partnership with Microsoft's chief executive Satya Nadella was working really well, and that he expected to, quote, raise a lot more over time from the tech giant, among other investors, to keep up with the punishing costs of building

Starting point is 00:01:12 more sophisticated AI models. Specifically when asked if Microsoft would keep investing further in OpenAI, Altman said, I'd hope so. There's a long way to go and a lot of compute to build out between here and AGI. Training expenses are just huge. Indeed, Altman also pointed out that although the company is over a billion dollar revenue run rate at this point, they're still unprofitable because of how high those training costs are. Alman said that the Microsoft partnership would ensure that, quote, we both make money on each other's success and everybody is happy. Now, another interesting part of this conversation, although not particularly unexpected

Starting point is 00:01:44 if you listen to Sam Altman talk with any sort of frequency, is the way that he describes what OpenAI's real products are. He said, right now people say you have this research lab, you have this API software, you have the partnership with Microsoft, you have this chat GPT thing, now there is a GPT store. But those aren't really our products. Those are channels into our one single product, which is intelligence, magic intelligence in the sky. I think that's what we're about. Indeed, Altman said that his specific time is split into, one, research into how to build superintelligence, and two, how to get the computing resources to do so.

Starting point is 00:02:15 Still, one of the most significant revelations from the interview is something which perhaps was widely expected, but represents a significant shift in the tone from OpenAI, which is that the company has confirmed that it's working on GPT5. Now, the reason that this is notable is that in all previous interviews when asked about this, Altman went out of his way to say that we are not training GPT5 yet. And while he still didn't commit to any sort of timeline for release, he did talk about it in more than passing detail. Indeed, it sounds like the interviewer, although we can't be sure because we only have these excerpts, not the actual interview, delved into it with Altman, who even got into

Starting point is 00:02:49 where the data would be coming from. On the one hand, it is going to be publicly available datasets, but on the other hand, it's going to be proprietary data from companies that they partner with. This, of course, gets to what we talked about last week with OpenAI's new data partnership initiative, which is clearly an attempt to go out and proactively recruit data that they didn't have access to previously. Alman also talked about how hard it was to predict exactly how GPD5 will be better than its predecessors, even though they can have pretty strong confidence that it will be better than those predecessors. Altman said,

Starting point is 00:03:18 Until we go train that model, it's like a fun guessing game for us. We're trying to get better at it because I think it's important from a safety perspective to predict the capabilities, but I can't tell you here's exactly what it's going to do that GPT4 didn't. Alman also discussed that there would be more to getting AGI than just training better and better LLMs. He called them, quote, one of the core pieces for how to build AGI, but there will be a lot of other pieces on top of it. However, what that other pieces are, he's not really sure. He used an analogy of Isaac Newton, saying basically that he couldn't just read more textbooks and talk to more professors. He had to get out there and actually do things. However, what those things are

Starting point is 00:03:51 when it comes to the context of AGI, Altman just isn't sure. He said, said, so the question is, what is the missing idea to go generate new knowledge for humanity? I think that's the biggest thing to go work on. Now, in the meantime, for those who are a little bit more focused on the here and now and the new launch of GPTs, this interview reaffirms that they are very much focused on these as the first step towards more complex and more capable agents. Said Altman, we will make these agents more and more powerful, and the actions will get more and more complex from here. The amount of business value that will come from being able to do that in every category, I think, is pretty good. Now, as I mentioned, a lot of people are

Starting point is 00:04:25 noting the fact that this $10 billion wasn't enough to compete in this incredibly expensive industry. Well, part of that is not just the computing costs of training ever more advanced models, but also the salary costs of the team members that they are trying to recruit. The information dropped a piece over the weekend. OpenAI's new weapon in talent war with Google? $10 million pay packages for researchers. Now, basically what's happening here is that there is this interesting little window where Open AI is on the verge of effectively tripling the valuation of the company

Starting point is 00:04:53 with this employee tender offer that we've talked about in the past. Effectively, OpenAI is giving some of its employees the ability to get liquidity, and they're doing so at a price tag that's between $80 and $90 billion. Apparently then, OpenAI's recruiters are taking that fact and bringing it to top AI employees at other companies like Google and basically saying, lock in a stock package now at the current valuation of $27 billion from earlier this year, and almost immediately it'll be worth $3x what it is on paper.

Starting point is 00:05:19 Now, in addition to those big, juicy pay packages, the recruiters are also making claims that researchers would have greater access to compute. However, exactly what that's based on isn't clear, given that there is still a perception that Google has an advantage in that area, even if Sam Altman has told some colleagues that he expects that advantage to shrink next year, as Microsoft makes more chips available to OpenAI. Now, while it doesn't sound like Google is trying to match these salary offers currently, they have clearly increased their willingness to compete on that vector, given that the flow hasn't just been from Google to OpenAI, but that Google has been successfully recruiting some

Starting point is 00:05:51 OpenAI employees, such as the person who led the development of code interpreter, who joined Google from OpenAI in October. Now, speaking of compute, while Nvidia's H-100 has set the tone and tenor for this latest generation of generative AI, today they unveiled the H-200. The H-200, as you might expect, is meant to be a GPU upgrade from the H-100. CNBC writes, The key improvement with the H-200 is that it includes 141 gigabytes of next-generation HBM3 memory that will help the chip perform inference.

Starting point is 00:06:21 using a large model after it's trained to generate text, image, or predictions. NVIDIA has said that the H200 will generate output nearly twice as fast at the H100. Now, interestingly, this is the same focus that AMD's new MI300 X-CH chip is supposed to have, a real focus on inference, and both are expected to come sometime in the first half of 2024. Now, back in the land of Google, Reuters is reporting that the company is in talks to invest in very hot AI startup Character AI. Character AI, which was founded by ex-Gougalers, gives people, particularly young people, the ability to chat with virtual celebrities, anime characters, or create their own chatbots.

Starting point is 00:06:58 According to similar web, 60% of Character AI's traffic is between 18 and 24. As Reuters puts it, the demographics helping the company position itself as the purveyor of more fun personal AI companions compared to other AI chatbots from OpenAIs chat GPT in Google's Bard. The company is apparently looking for a big valuation increase from back in March when it raised $150 million at a billion dollar valuation. Character AI is now apparently looking to raise funding at around a $5 billion valuation. Last up today, from the science in AI file, space.com is reporting about a new AI that can track the melting of icebergs 10,000 times faster than can humans. Right space, scientists are turning to artificial intelligence to quickly spot giant

Starting point is 00:07:39 icebergs and satellite images with the goal of monitoring their shrinkage over time. And unlike the conventional iceberg tracking approach, which takes a human a few minutes, to outline just one of these structures in an image, AI accomplished the same task in less than 0.01 seconds. Basically, the team in question trained AI to spot large icebergs, using images from the European Space Agency's Sentinel 1 satellite, and it did so with not only accuracy, but also with fewer mistakes. Again, from space, the AI tool didn't make the same mistakes as other more conventional automated approaches, such as the error of misconstruing individual bits of ice as one collective iceberg. And of course, basically with the observation taken out of the hands of

Starting point is 00:08:16 individual scientists, there's more time to spend focusing on actual solutions. Anyways, guys, that is going to do it for today's AI breakdown brief. Next up, the main AI breakdown. And now a word from today's sponsor. Are you interested in how two top-of-mind trends AI and crypto can work together? If so, I have the perfect podcast recommendation for you. Web3 with A16Crypto, the chart-topping show brought to you by venture firm and Dries in Horowitz.

Starting point is 00:08:44 Web 3 with A16Z Crypto is your definitive resource for the future of the internet. Whether you're already building in these spaces or simply curious about what's next. If you need a place to start, they recently released an excellent episode with Stanford Cryptography Professor Dan Bonay and former Google Xer Aliya in conversation with host Sonal Choxi about the intersection of AI and crypto. From fighting deep fakes and proving humanity to large language models like ChatGBT, they cover it all. I highly recommend checking it out, especially if you'd like to learn more about how

Starting point is 00:09:14 AI and crypto will impact our everyday lives. Beyond crypto and AI, this show is for creators seeking more ways to truly own their work, for business leaders trying to prepare for the future today, and for innovators exploring trending tech topics. So go ahead, listen to Web3 with A16Z crypto wherever you get your podcasts. Welcome back to the AI breakdown. Well, a week ago, at OpenAI's Dev Day, one of, if not the biggest announcement was around what they were calling GPTs or custom GPTs. GPs, they said, were customized versions of chat GPT that could combine instructions, extra knowledge, and take actions on behalf of the person interacting with it. They wrote, GPs are a new way for anyone to create a tailored version of chat GPT to be more

Starting point is 00:09:56 helpful in their daily life, at specific tasks, at work, or at home, and then share that creation with other. GPs can help you learn the rules to any board game, help teach your kids math, or design stickers. Now, a couple more things to note. One is that GPTs required no code. There was a builder, the chat GPT released at the same time, and two, there was the promise of a GPT store rolling out later this month in November. They wrote, starting today, you can create GPTs and share them publicly. Later this month, we're launching the GPT store featuring creations by verified builders. Once in the store, GPs become searchable and may climb the leaderboards. We will also spotlight the most useful and delightful GPDs we come across in categories like

Starting point is 00:10:34 productivity, education, and just for fun. In the coming months, you'll also be able to earn money based on how many people are using your GPT. So all of this was initially greeted with tons of excitement, but then, of course, the AI hypefluencers on Twitter slash X started doing their thing, and every other post that anyone who touches AI saw was some version of here's 10 mind-blowing examples of GPTs, which necessarily creates its own sort of fatigue. Staff engineer writes,

Starting point is 00:11:01 I'm already sick of seeing the word GPTs, when will it end? And on the one hand, I totally understand this point of view. It's why I've sort of railed against that type of content in the past. But at the same time, I also started to see many people posting critiques like this one from Daniel Lossie. I'm going to say it, custom GBT's just a fancy system prompt, lull. AI entrepreneur Bindu Reddy said the same thing. Most Gptys are glorified instruction prompts.

Starting point is 00:11:27 While these are fun to play with, I suspect the novelty will soon wear off. Most users will go back to using core chat GPT features versus messing around with these GPs. Data Chaz posted the Scooby-Doo reveal meme, where so-called mind-blowing custom GPT is actually revealed to be a fancy one-liner GPT4 prompt. Now, it is my contention that dismissing these as fancy system prompts completely misses the point

Starting point is 00:11:50 of what GPs are trying to do and where their real value lies. TLDR, they are just a fancy system prompt, but that's exactly the point. So let's talk about what GPs are not. One, they are not agents. OpenAI made it very clear that this was the case. Open AI does view them as their very first step towards the world of agents in which AIs have more context about their user and can actually go accomplish tasks, but they are intentionally and explicitly just a very small step in that direction.

Starting point is 00:12:21 At OpenAI Dev Day, Sam Altman used the introduction of custom GBT's to reinforce the company's belief that when it comes to a technology that has as many upsides but also downsides as autonomous AI agents, OpenAI believes that the only way to approach it is to actually see how things work in the real world rather than just trying to speculate on them in the lab. The reason it matters that GBT's are not agents is that the label of agent comes with it the baggage of inflated expectations. People expect more from these GPs than they're actually used. useful for. And because they don't achieve that idea that people had in their minds, they are dismissed as not

Starting point is 00:12:56 very useful, even though they were never intended to be that thing that people were imagining them to be. I think this is much the same with the idea of them as apps. Now, the comparison to apps comes not from OpenAI in any way, but simply because when we think of a store for stuff that users create that a platform cuts people in on, what we think of is the app store. And since these GPTs are going to be in some cases useful for similar types of things as are apps currently, it's a fairly natural jump to make. Apps, however, of course, are a totally different phenomenon. They're coded and built from the ground up with a specific purpose. They are not meant to be an excess point to an AI model, at least not in most cases. And more than anything, they're just more sophisticated and complex

Starting point is 00:13:35 than what custom GPs are trying to be. So again, the reason that it matters to identify what they're not is that I think part of the latent problem with these critiques is that they start from a standpoint of inflated expectations that it doesn't make any sense to actually apply to custom GPTs. But if that's what GPTs are not, they are not apps and they are not agents, then what are they? Well, one, as we said, in OpenAI's view, they are a first step towards agents. Logan, who does developer relations at OpenAI, actually responded to Daniel in that post that I just read and said, it's our first step towards agents. GPTs will get much better over time. Many people have been asking for the ability to customize chat GPT with files, tools,

Starting point is 00:14:15 and instructions. Excited we deliver that, but this is only the first step. So much more to come. So one part of what GPTs are is a first step towards agents. But that's not the thing that I actually think is important right now in terms of how users consider it. I said a moment ago that I think that the critique that GPTs are just glorified prompts is wrong because they are glorified prompts, but that's exactly the point. GPTs are workflows. As often happens, Professor Ethan Malik from Wharton sums it up really crisply and acutely. He writes, I think GPTs may have have actually unlocked the potential of structured prompts for many people. I see tons of people sharing GPTs where before, almost no one shared links to prompts. It's not really a capability

Starting point is 00:14:55 change, but a user experience change. What he means is that the most useful GPs, right now, are the ones that have come to embody the specific prompts that power users have figured out how to unlock value from, just created in a shareable way. Instead of a Twitter thread or a blog post or a video or some downloadable PDF in exchange for your email that tells you all about prompt engineering and how to copy someone's prompting, instead you can now just hand them one of these custom GPs. Have you perfectly figured out a way to get great writing critique for technical writing? Well, again, instead of teaching someone how to prompt chat GPT in the same way that you did to get those results, you can just give them your custom GPT that has all of that structure

Starting point is 00:15:36 built into it. GPTs are just saved, shareable, structured prompts. But the fact that they are saved and shareable is what makes them powerful, because that gets us to the next thing that Gptis are, which is use case simplifiers. For the vast majority of people, when they look at the blinking button of the main page of chat CPT that asks, how can I help you today? Their answer is, well, I'm not sure. That's why GPT has starter prompts. Mine currently read, design a database schema. Tell me a fun fact about the Roman Empire. Recommend a dish to impress a date. Explain options trading. Maybe randomly one of those things will be useful, or maybe I came to chat GPT because I had an idea of exactly what I wanted. But I think that the vast majority of people instead

Starting point is 00:16:18 come because they've heard this technology is incredibly powerful, and they want to see if it's powerful for them. There is nothing more intimidating in many ways than a blank page. And the vast majority of people who learn about artificial intelligence and generative AI tools, learn by one, seeing what other people are doing, and then two, trying that out themselves and taking it in slightly different directions that are more relevant for them personally. When I say that GPTs are use case simplifiers, what I mean is that instead of having to browse around X to get an idea of what other people are using chat GPD for, you can just look at the names of these GPs, all of which have some specific focus, and hone in directly on what's most useful for you.

Starting point is 00:16:57 their use-case simplifiers again, like Ethan said, not because they've added some fundamental new capacity, but because they've created a new user experience that makes it much easier to see what chat GPT can actually be useful for. And that gets us to one more thing that GPTs are, a way to transfer knowledge from the most to the least sophisticated chat GPT users. In the world that we live in, when it comes to tools of any type, a very, very tiny fraction of the population that uses those tools are going to be the innovators

Starting point is 00:17:28 that actually figure out how to use them. The vast majority of people instead will just copy what they see and really only learn a few workflows that ultimately capture a big chunk of what they need. On the other hand, there are a small handful of users who sit at the top of that power distribution and come up with the vast majority of use cases that ultimately people will copy, imitate, and adapt. These GBTs are the way that that very small cohort, that always an inevitably smaller portion of the larger chat chbtu using population is going to get what works and what is useful to everybody else. Again, as Ethan wrote, it's not really a capability change

Starting point is 00:18:03 but a user experience change, but that doesn't mean it's insignificant. In fact, in many ways, it seems likely to me that user experience changes more than capability changes are what is going to be most important to unlocking the value of generative AI to the next 100 million or billion users. Now, this is not to say that there aren't reasonable

Starting point is 00:18:22 critiques of custom GPTs. One thing that many have noted is that it appears that there are ways to reverse engineer the custom instructions underlying GPTs. That includes potentially getting access to the files that were uploaded as part of building the GPT. Those are real meaningful concerns. They are security holes, they are privacy considerations, and those are absolutely things to consider when using these tools, or in particular, building these tools. They also suffer for many of the same challenges that just generic use of Chad GPT does. For example, hallucinations, and just how compliant with these custom instructions GPTs really are. Some like Jerry Liu, the CEO of Lama Index, also argues that they exist in a weird space where they are perhaps

Starting point is 00:19:01 too complicated for the average user to create, but also not sophisticated enough for developers who use who might opt to use the API instead. There's also the question of what the store will actually look like. I think Boris did a really great summary here. He wrote, because of the word store, many people think that it'll be like the App Store and get excited about the money-making opportunity. I have a feeling that the better analogy is a Spotify-type subscription platform. Millions of interchangeable easy-to-make GPTs. The vast majority of them will be free with the $20 per month-loss subscription. There will be paid GPs, but not many people will buy them. Of course, a few hits will sell big numbers. Overall, the store will be one, great for the users,

Starting point is 00:19:38 two, wonderful for OpenAI, three, better than nothing for the most new, quote-unquote, GPT developers. Just like the vast majority of the musicians in Spotify, make pennies from streaming, the same will be true with the vast majority of the GPT developers. Most developers will publish quality GPs and use them as a free lead magnet to monetize in other ways. Some will make good money but nothing big from the revenue share when their GPs go viral. Overall, I think it'll be a net positive for everyone, or at least for most, and a good next step in the AI adoption wave. Now, I think that there are some reasons to think that Boris is right when it comes to the idea that GPs will be monetized not via the creators of GPT setting their

Starting point is 00:20:12 own price, but through something more like a subscription. If you go back to how OpenAI describe the GPT store, they write, in the coming months you'll be able to earn money based on how many people are using your GPT. That's a weird way of saying you'll be able to set the price and get paid each time someone uses it, meaning it seems much more like a revenue sharing type of agreement based on some black box usage formula that we probably won't have access to. Now all of that said, I don't want to give the impression that this critique is the main threat of conversation surrounding GPTs. Instead, it is definitely excitement and

Starting point is 00:20:44 entrepreneurial creativity. Replit Builder and Residence Martin Bowling writes, excited to unveil the custom GPT starter kit. It's designed to empower creators with the tools to build advanced GPTs with ease. And basically, this is a toolkit for an advanced custom GPT that can do things like integrate your own API, create shortcuts, and embed custom knowledge directly into that custom GPT. GPT's Dex is a new product that is racing up the product hunt charts right now that is, as you might expect, a third-party database of interesting GPs. Now, going back to my claim that GPs are use case simplifiers. One of the things that will or won't make that so is what OpenAI does or third parties do to make it easy for users to actually go out and find

Starting point is 00:21:25 the right ones for them. Yes, this is great that there's a visual interface for searching through thousands of GPs, but it's still an overwhelming amount of information. And there isn't really any sort of sophisticated search that allows me to go find exactly what I might need for a use case right now, which still kind of leaves people in the position of having to just scroll and see if anything triggers an idea of what might be useful for them. One of the major questions to me for what comes next with these GPTs is what sort of search and browse interface OpenAI puts around them. I think that'll be as significant as the monetization approach

Starting point is 00:21:54 when it comes to whatever they're planning for the GPT store. Plus, even though I'm more focused on the here and now, there is certainly an understanding of OpenAI's perspective that these are the beginning of personalized AI agents. Kevin Ruth's from the New York Times wrote a piece this weekend called Personalized AI Agents Are Here? Is the world ready for them? The piece concludes, if Open AI is right, we may be transitioning to a world in which AIs are less our creative partners than silicon-based extensions of us, artificial satellite

Starting point is 00:22:19 brains that can move throughout the world, gathering information and taking actions on our behalf. I'm not fully prepared for that world yet, but by the look of things, I better start getting ready. So, friends, that is my take about what GPs are and are not. I think when viewed through the lens, of being better ways of capturing structured prompts and making them shareable. It is an extremely use case expanding and community expanding feature update, and one that I think we're just barely scratching the surface on.

Starting point is 00:22:47 I'm sure later this week I will do some rundown of my favorite GPTs that I found so far. Now, if you are interested in talking more about GPTs specifically, you can either join us in the AI breakdown Discord, which is at bit.ly. Or I've also created another experimental discord just for focus on GPT building, called, as you might guess, GPT builders. You can find a link to that one at bit.ly slash gpt builders. In either case, I'm excited to learn alongside you. And of course, as always, I appreciate you listening or watching. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - OpenAI Is Officially Training GPT-5

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.