Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 271: OpenAI Releases GPT-4o. 12 things you need to know

Episode Date: May 13, 2024

Win a free year of ChatGPT or other prizes! Find out out.Just when we thought the AI world couldn't get more exciting, OpenAI has dropped its new GPT-4o model! We're breaking down what this ...model is and 12 things that you need to know about OpenAI's first omnimodel, GPT-4o.Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Ask Jordan questions on GPT-4oUpcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:1. OpenAI announces GPT-4o model2. Key Features of GPT-4o3. GPT-4o Rollout Plan4. What GPT-4o means for GoogleTimestamps:01:50 New GPT-4o model is Omni modal03:52 GPT-4 paid users get 5x capacity.10:09 ChatGPT desktop app for easy AI-assisted workflow.14:11 Google's marketing video not entirely truthful.16:25 Mistrust in AI marketing created by Google.18:40 OpenAI's rollout plan21:47 GPT-4 combines transcription, intelligence, text-to-speech.Keywords:OpenAI, GPT 4 o, Everyday AI, model, Jordan Wilson, account, access, upgrade, livestream podcast, free daily newsletter, AI news, observations, hot takes tuesday, Omni model, free and paid users, paid account, capacity limit, GPT store, transcription intelligence, text to speech, desktop assistant, API, reduced cost, live view mode, voice to voice communication, human feel, Google Gemini model, marketing stunt, rollout, IO developer conference.Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. Open AI has just released a new model called GPT4.
Starting point is 00:00:56 So this is actually our second episode of Everyday AI. We normally just do this one today. But it was a big enough announcement from OpenAI just about an hour ago that we had to come back for a second time today. So you can stay up to date. So we're going to tell you not just what this new GPT4 model is, but we're also going to give you quickly 12 things that you absolutely need to know. All right. Well, hey, if you're tuning in, maybe for the first time ever or for the second time today,
Starting point is 00:01:29 my name is Jordan Wilson. I'm the host of Everyday AI. We're a daily live stream podcast and free daily newsletter helping everyday people learn and leverage generative AI to grow their companies and to grow their careers. So, hey, thanks for tuning in if it's your second time. time. But if it's your very first time, make sure to go to your everyday AI.com. Sign up for the free daily newsletter. Much later today, right? Because we have a lot of news that we're trying to sort through. So earlier today, we brought you the AI news that matters for
Starting point is 00:01:55 the week. So now let's just dive straight into this new announcement from OpenAI, a little bit about what it is and why it definitely matters. All right. So today, we're just sticking with the facts and some observations. But tomorrow, we're going to have a much more in-depth episode. full of hot takes as always. It's hot take Tuesday. So you know we're going to come with it. All right. So here's what you need to know. Well, right now, the new version GPT4O is what it's called. And if you log into your account right now into your chat GPT account, you may already have access to this new model called GPT4. All right. So it's pretty, pretty exciting. That was a fast turnaround, pretty quick release. Also, you will need to make sure you select that model. So if you are,
Starting point is 00:02:43 an avid chat GPT users such as myself, make sure when you log in that you select the newest model GPT4O. All right. So here are 12 things that you need to know. So first, the basics. So the new model is called GPT40, which stands for Omni model, right? So essentially what that means is the ability to kind of work with it in reason, in text, video, audio, and more.
Starting point is 00:03:13 So more on that here in a minute. All right, second thing you need to know, GPT40 will be available to free and paid users, right? I kind of found that interesting. I don't know if now there is as much of a reason to have a paid account. So we'll get into some of the differences between what a paid account actually gets you. But a big part of Open AI's announcement today was really about, their desire to make this technology available to all. So pretty big, pretty big kind of thought
Starting point is 00:03:47 there, pretty big move by Open AI. I'm actually, you know, like I said, this is brand new. So I'm checking this as I even type to see if you have access. So it looks like at least right now, you can only access this model if you are on the paid plan. Right now, if you are on a free plan, even without logging in, you can still use the old model, GPT 3.5, but it doesn't look like everyone, you know, not even, you know, users who aren't logged in, at least cannot access this, at least as of, you know, Monday afternoon at 2 p.m. Central Standard Time. Things could move quickly. All right. Third thing that you need to know about this new GPT40 is that paid users will have five X the capacity limit as free users. So again, we don't know, at least right now,
Starting point is 00:04:34 what other differentiators there will be between the free account and paid account. If nothing else, this is huge news, I think, for free users. To have the ability to use the exact same model as paid users is actually something we haven't really seen out of the big, large language model players. So as an example, up until this recent announcement, chat GPT, so OpenAI's model had you know, big difference in the model between free and paid tier. Google with their Gemini 1.5, big difference between the free and the paid tier. Claude 3, you know, big, big kind of jump between, you know,
Starting point is 00:05:16 haiku and sonnet versus the paid version in Opus. So this is the first large language model that we've seen, aside from Matas Lama 3, which is kind of free and pretty much open source. But aside from this, this is the first kind of big play from a large language model worker that gives free and paid users the exact same model and level of access. Again, at least now, that's what they said. Maybe there might be future capabilities or future features for the paid version of chat GPT Plus.
Starting point is 00:05:49 But right now, it looks like the only differentiator might just be the limits. So having five times the capacity if you are a paid user. All right. Number four, well, speaking of more things for free users. Even free users will soon be able to access the GPT store, which was previously limited to chat GPT plus paid users only. So if you don't know anything about that, so essentially chat GPT and OpenAI have what's called a GPT store, which is a very simple way for people to go and create essentially a custom, pretty simple version of chat GPT. So you can give it kind of custom instructions, some custom configurations, you can drop in your own documents, things like that.
Starting point is 00:06:39 So now all of a sudden, you're going to open that up to every single free user out there. So I don't believe you will be able to use this if you're just in an incognito or not logged in. But Open AI did say today that even if you are a free user, you will soon have access to the chat GPT. store. So pretty big news there. And I am checking live in real time, y'all. This is like literally as quickly as we could get a podcast episode out. So it looks like right now I'm checking and even in a free account because we have multiple paid accounts and multiple free accounts. So I just just went into a free account and don't have access to the new GPT40 model yet, but we do have access to it. Obviously, both on our iOS, so the mobile app as well as obviously in the browser.
Starting point is 00:07:29 in the paid account. So yeah, let me know. Did anyone out there? I know we're going live here as well, as well as the podcast, but wondering if anyone here live has jumped in. Number five thing that you need to know is, well, GPD40 combines transcription, intelligence, and text to speech, essentially all in one mode. So opening I kind of went through kind of a historical look at how they've traditionally handled, you know, conversations with an AI, right? So conversations with a large language model. In a lot of it, you know, there's a lot of things going on, especially if you're using your voice to talk to chat GPT or if chat GPT is talking back to you, which is the read allowed option has been available for quite some time. But, you know, we're going to leave
Starting point is 00:08:15 some demos in our newsletter today, which should be going out here, hopefully in an hour or two. But, you know, the ability now to have all of this happen in one instance without a bunch of latency is pretty impressive. All right. The sixth thing you need to know, and this is actually, I think, maybe the biggest is there will be a new desktop assistant that can, quote unquote, here and see what you're working on. So again, and we're going to talk about this more in our hot take Tuesday tomorrow, but opening
Starting point is 00:08:48 I did what they said were live demos, and they looked pretty live, unlike Google's kind of snafu back from December, when they brought. preview their Gemini model for the first time. And then we kind of came to find out much later that it was all a marketing stunt and none of it was actually live. Today's demo did truly look live and it was confirmed by, you know, at least one or two Open AI senior staff members that it was live. It wasn't a pre-recorded demo and it, you know, I watched it myself live. It looked like it was live. There were a couple hiccups as well. But the new desktop app, I think, is huge. There were also some some videos that OpenAI put out on their YouTube channel,
Starting point is 00:09:32 about 10 or so short videos showing some of the different capabilities. And I like that they do a split screen. So you can kind of see what's happening inside of the GPT4 interface, as well as presumably what is a live recording of the user or users who are interacting with this kind of new desktop assistant and on the app. The desktop assistant, I think, is going to be huge, at least if the way that they demoed it, if that actually comes to fruition, where you can actually have a desktop app. You can be working on something on your computer.
Starting point is 00:10:09 You know, an overlay on the screen is what it looked like. But the chat GPT app would come up. You can click to speak to it and say, like, hey, you know, look at this on my computer screen. You click one button. It gets what's on your computer screen. Help me solve this problem. help me improve this code, help me finish this email, right? So pretty cool, you know, it's technically not an AI agent in the form that we've thought of it.
Starting point is 00:10:34 But this is kind of agent capabilities, right? Like maybe not autonomous agents, but one-click agents, right? So think with this desktop app, no matter really what you're working on, being able to instantly have a one-click conversation with this new, GPT4O and to be able to share with it within what look like one click, kind of what's happening on your screen and for it to be able to help walk you through it, maybe help you know, create a blog, you know, finish a blog post. You know, I think that changes really how we work because before, you know,
Starting point is 00:11:10 even people who are great at chat, GPD, you would have to do a lot of things, you know, uploading files, screenshots, documents, etc. And now it seems like this new desktop app whenever it does or maybe release is really going to make that experience much more seamless. So if you haven't already gone through the process, whether actually physically working with AIs in your day-to-day or even in your mind, right? So we're going to have more on that tomorrow. But you're going to most people out there, right? And presumably, you know, Open AIs is kind of kind of the pace car here. And everyone else, I'm guessing, we'll be scrambling to catch up. But, you know, if you haven't already come to the
Starting point is 00:11:52 conclusion. And, you know, we, in our 2024 prediction show back in December, we predicted this, that, you know, most people are going to be working with agent workflows in 2024. And this is, I think, the first example of that is just having essentially a dedicated agent in the form of a desktop app that can see what you're working on. You can talk to it quickly, and it can help you work in real time. At least to me, that was probably the most impressive piece of the demo. There are a lot of other things that were really cool, but that was one of the most impressive to me. Number seven, and hey, if you do have your question, get it in. We don't have a ton of time, but we'll try to get any of your comments.
Starting point is 00:12:33 You know, Douglas said he doesn't have 4-0 yet. You might have to, you know, if you are listening to this live or on the podcast, you usually, this is just, I'm an internet dork. So try logging out, clearing your cash, clearing your cookies, logging back in. And presumably you might have it at that point. but I'm guessing this, like most updates from not just OpenAI, chat GPT, but any large language model. You know, they're iterative.
Starting point is 00:12:57 They're slow to roll out. They go in phases. Sometimes it's available to everyone. Sometimes, you know, you might have to wait a couple hours, a couple days. So, yeah, make sure to go check that out. All right. So here's another big piece. Number seven, GPT40 is rolling out the API at a reduced cost.
Starting point is 00:13:14 So OpenAI said it is much faster and much cheaper to use the, API as well. So what that means is there are literally thousands and probably tens of thousands or even more of products that you probably use every single day that are connected to chat GPT via OpenAI's API. So presumably what that means is those programs are going to be getting faster and better because they will have access to this new model. And Open AI said it is much cheaper and much faster. So presumably also, right, especially if you're an enterprise company and if you're paying a lot of money, Maybe you, you know, gotten to a contract with a company maybe a year ago and it was pretty expensive.
Starting point is 00:13:55 You should be revisiting that contract because, especially over the last year, you know, the API has gotten much more affordable and faster. But now even with this, it is getting even less expensive. So pretty big news, especially if within your company's tech stack, you're working with multiple, you know, pieces of enterprise software that have a open AI GPT connection via their API, which is, you know, nowadays, it's like any, it seems like just about any enterprise software, whether it's marketing, advertising, communications, CRMs, etc. It seems like they all have some connection to GPT.
Starting point is 00:14:33 All right. Number eight. So OpenAI demoed a live view mode, presumably being able to use vision in real time. I say presumably, right, because I never really want to report on things even when they look true because Google really just, you know, kind of, I won't say that they straight up lied to everyone, but they were not very truthful in their original Gemini marketing video in December. But it did look very legit and very live from Open AI, but essentially where you had some people on the stage, some developers, you know, literally just turn on a camera.
Starting point is 00:15:08 And it is a live view on the camera and to say, hey, chat, JVT, what is my reaction? You know, and the developer was smiling and chat GPT said it looks like you're happy. So the new model was able to literally recognize video in real time, which was crazy. Another one was solving a math problem, a simple equation in real time. Again, the developer was showing the camera and, you know, literally working out a math problem by hand and asking chat GPT for directions on how to solve. This is essentially, it looks like OpenAI is delivering on what Google T's, with its marketing, but never actually had working six months ago.
Starting point is 00:15:48 So pretty impressive there. Number nine, reduced latency with a real-time feel in voice-to-voice communication. This piece was huge. So normally, if you've used any voice model, even if you were using the voice mode on chat GPT's app previously or anything, right? So the Google, you know, Google Assistant, Siri from Apple, you know, Alexa from from Amazon. Most of these systems have a noticeable delay.
Starting point is 00:16:18 Even if it's maybe only a second, it's pretty noticeable. It doesn't seem like real-time conversation. At least in the demos with Open AI, didn't seem like that. The latency was very low. At times, not even noticeable, right?
Starting point is 00:16:31 Like I was listening and observing, and I'm like, wait, that's probably faster than I can respond if someone asked me a question, right? Sometimes, like, I take a second to, like, actually process something and think about it. And I'm like, this is a pretty quick response time.
Starting point is 00:16:46 The latency was super low. Another thing is you can kind of cut off chat GPT or GPD40 when it's responding to you. So if it looks like it's going in the wrong direction, you can just speak and essentially cut it off and correct course. So the latency was, it seemed pretty, pretty low. All right. Number 10, a couple of more things. 10 just was a much more human feel, including, yes, there were some mistakes, which let me to believe, all right, maybe this was actually live and not real, right? Yeah, unfortunately,
Starting point is 00:17:16 with Google Gemini, I think a lot of people, you know, have a mistrust of AI models, right? And I think the Google Gemini kind of marketing stunt, so to speak, in December with their Gemini model, only increased people's mistrust in, you know, these big tech companies who are saying, like, oh, look at our AI does this, this, this and this. And, you know, it turns out with Gemini, none of that was really the case. It was all kind of manufactured behind the scenes. But with this, there were some mistakes. There were some mistakes in OpenAI's demo, which I actually liked because that told me like,
Starting point is 00:17:51 okay, this is believable, right? So at one time, the developer asked Open AI question, and it was responding about something else. It was responding about, you know, oh, the wood in the table. So this was probably from a previous response. The developer just said, oh, no, not that. That was our last conversation. I'm asking about this and course corrected.
Starting point is 00:18:11 And then Chad GPT instantly got the question correct. So it did have a much more human feel, including that, yeah, it was getting a thing or two wrong, which obviously over time and when millions of people are using it and providing feedback in real time, like, yes, this is good. No, this is not good. You know, presumably this model is going to get smarter. So next, next piece here, 11, the 12 things that you need to know about the new GPT-40. it will start to roll out to users in the quote coming weeks.
Starting point is 00:18:42 All right. So like I said, a lot of people, myself included, already have access to this new model. If you are a paid user, go ahead and try now. Love what Liz said here. She said, Jordan is every IT manager. Try turning it off and back on again. Yeah, exactly. So if you don't have access yet, don't worry, you'll probably get access in the coming days.
Starting point is 00:19:03 But not all of these features are available yet, right? Right now, this new model, so it's two different things. You have to think of it as that. There is a new model, and then there are all of these features that work with the new model. So it looks like Open AI is probably first going to be rolling out just access to the new GPT4O model. Again, O stands for Omni model or the, you know, kind of what they're hoping is, or will be referred to as the Everything model and the Omni model. So it does, most people are going to have probably access to that first before all of these other
Starting point is 00:19:36 features, but do pay attention. So we don't know what's going to be rolling out first. As an example, will all of these updates first be coming to the app? Will the desktop app, you know, kind of the smart desktop system be rolling out, you know, in the coming weeks as well, not sure, but do check. And obviously, if you tune in to everyday AI, we do this show literally every single weekday. We go live at 7.30 a.m. Central.
Starting point is 00:20:01 So maybe this is your first time listening. So, you know, as these new updates get rolled out, we'll obviously talk about them on the show. And then we have Google. Our last thing to know is Google is likely in trouble. So, you know, we have a commenter here, CyberS from YouTube saying, is anyone excited about Google tomorrow? Yeah. So the timing on this one is interesting, right? So Open AI just officially announced this event about three days ago, whereas we've known for a couple of months that Google has their I.O. developer conference tomorrow. So Open AI just kind of swooped in here and maybe potentially stole the limelight.
Starting point is 00:20:44 I mean, we'll see what Google announces tomorrow. But man, if I'm sitting in the seat at Google, I'm not feeling super great, right? Number one, you know, a lot of these kind of features or capabilities were teased by Google, like I said, six months ago in their original Gemini marketing video. And it turned out that a lot of it was manufactured behind. the scenes, kind of this ability to interact with an AI in real time. It wasn't real. None of it existed, right? Google later shared a research paper that said, oh, here's how we actually put it all together. It was multiple steps and a human was involved. We took the video. We grabbed frames. Then we
Starting point is 00:21:20 prompted and reprompted the AI. And then, you know, we spit out this result. So it wasn't actually true. So if I'm Google, I am not feeling good heading into the IO developer conference tomorrow. I'm being honest because Open AI just came and kind of took their lunch and their dinner. All right. So tune in for that. Actually, tomorrow we're going to be coming in with hot takes on what this actually means. All right. So today is kind of extra addition, just bringing you the facts on this new GPT4 model.
Starting point is 00:21:52 So again, we're going to go over it very quickly here. Here are the things you need to know. The new model is a GPT4 variation called GPT4O, which stands for Omni. model. GPT4O will be available to free and paid users. Paid users will have 5x the capacity limit as free users. We don't know what other differences there will be. Four, even free users will soon be able to access the GPT store. Five, GPD40 combines transcription, intelligence, and text to speech all in one mode.
Starting point is 00:22:22 Six, the new desktop assistant will be coming out that can hear and see what you're working on. Seven, GPD40 will be rolling out to the API at a reduced cost. 8. Open AI demoed a live view mode, presumably being able to use vision in real time. Nine, we saw a reduced latency with a very real-time feel in voice-to-voice communication with the new model. 10, it had a much more human feel, even including mistakes and the ability to cut off the model. 11, it will start rolling out to users in the coming weeks. A lot of people have access to the model. Features will be rolling out.
Starting point is 00:22:55 And 12, I personally think Google is in trouble. So we'll be talking about this tomorrow. If this was helpful, let me know in the comments. Hit that repo, share this with your friends. If you're listening on the podcast, maybe for the second time today, thank you for your support. If you want to leave a review on Spotify or Apple, we super appreciate that. So we hope to see you back tomorrow and every day for more everyday AI. Thanks, y'all.
Starting point is 00:23:26 Meet Firefly AI Assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com.
Starting point is 00:23:56 And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.