Everyday AI Podcast – An AI and ChatGPT Podcast - EP 441: OpenAI’s o1 Pro: What is it and is it worth $200 a month?

Episode Date: January 16, 2025

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

ChatGPT's new 'Task' mode is here. Is this some gimmicky feature that'll get glazed over? Or a small step toward Operator, OpenAI's upcoming agentic release? We dive in to find out.

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Ask Jordan questions on the o1 pro model
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:
1. Difference between transformer models and reasoners
2. OpenAI's new reasoning model, o1 Pro
3. Different variations of o1
4. Use-cases for o1 pro
5. ChatGPT Plus Overview: What's included for $200

Timestamps:
00:00 Exploring OpenAI's o1 Pro
05:23 Pay-Per-Use Microsoft AI Unveiled
11:18 GPT Subscription Plan Comparison
17:01 OpenAI's New Model Releases
22:48 GPT-4.0 vs. Pro: Speed vs. Tools
26:05 OpenAI's Affordable Pricing Debate
36:25 Podcast Content Longevity Analysis
37:12 "Evaluating AI Use Case Data"
46:06 AI Milestones and Google's Misstep
52:37 AI Usage Efficiency Comparison
57:10 Multitasking in AI-Driven Work
01:01:14 AGI Prediction and Tool Access
01:03:50 "o1's Efficiency Advantage"

Keywords: chatgpt, chatgpt podcast, learn chatgpt, o1, o3, o1 pro, openai podcast, openai o1 pro, agentic, chatgpt tasks, gemini, copilot, ai news, generative ai

Get more out of ChatGPT by learning our PPP method in this live, interactive and free training! Sign up now: https://youreverydayai.com/ppp-registration/

Start Here ▶️ Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Circle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist.

Transcript
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. There's a new breed of large language models that you probably haven't used.
Starting point is 00:00:53 Even the dorky among us that follow generative AI every day, so many people haven't really used or figured out or found use cases for these new reasoning models like OpenAI's o1 and o1 Pro. So today I'm going to be specifically going into o1 Pro, telling you exactly what it is, telling you who it's for, how it works, and ultimately, if it's worth the price tag. Yeah, you've seen the price tag on this new o1 Pro? $200 a month. There's other things that are included in that, but I think we've kind of been spoiled by how cheap available large language models have been, especially as prices continue to plummet.
Starting point is 00:01:46 And then you see something like a $200 a month subscription for a model like o1 Pro. And you're like, what is this and is it worth it? All right. We're going to be tackling that and hopefully a lot more on today's edition of Everyday AI. What's going on, y'all? My name's Jordan Wilson. I'm the host of Everyday AI.
Starting point is 00:02:08 This thing's yours. This is your cheat code. This is a daily live stream, podcast, and free daily newsletter helping us all not just keep up with AI, but actually use it to grow our companies and our careers. This is how you become the smartest person in AI at your company, in your department. All right. And we all need it, whether you know it or not, right? I've been saying this for many years: even if you don't think that you're a dorky or a techie person, we all have to learn how to get the most out of AI. And where you start doing that is, well, here, you're listening, but also at
Starting point is 00:02:40 YourEverydayAI.com. That is our website. There, you need to go sign up for the free daily newsletter. Each and every day, we recap our show for the day, as well as a lot of other information in that daily newsletter. You know, the latest news, trends, fresh finds from across the internet, tutorials, everything you need to know. It is your guide every single day. Go read it. Doesn't take long. Also on our website, there's more than 430 episodes that you can go listen to from the world's leading experts, all for free, sorted by category. So go click on AI learning tracks or episodes on our website. Go find whatever you care about.
Starting point is 00:03:19 It can be marketing, legal, tech, governance. I don't care. It's all there for you for free. So make sure you go check that out. Speaking of checking things out, Monday, January 20th, mark your calendars, y'all. Next week we are going to be doing five episodes. These are not just our 2025 AI predictions, but they are more like a roadmap on how to deal with everything that's coming.
Starting point is 00:03:42 Yeah, I literally spent thousands of hours in 2024, talking to smart people, thinking about AI, reading about it, writing about it. This is the culmination of all that. You are not going to want to miss it. Get your board together, get your team together. You need to tune in. All right. Before we get into today's show, I'm excited about it.
Starting point is 00:04:01 Let's first go over the AI news for the day. Live stream audience, thanks for joining. Got a couple questions for you. Let me know. All right. So first, Google is partnering with the Associated Press to enhance Gemini with real-time news updates. So Google's AI chatbot, Gemini, is set to integrate real-time news from the Associated Press, marking a pretty
Starting point is 00:04:20 significant collaboration between the tech giant and a major news publisher. So the deal allows AP to deliver a continuous feed of real-time information to the Gemini app. AP's chief revenue officer emphasized the importance of this collaboration, highlighting a commitment to nonpartisan reporting and accurate journalism. Financial terms of the agreement have not been disclosed, raising questions about compensation and how AP's content will be credited within the Gemini app. There is another piece of news related to this. This Google and AP deal comes literally at the same time as OpenAI announcing a new partnership
Starting point is 00:05:01 with Axios to expand local newsrooms. So OpenAI will fund Axios' local newsroom expansion into a few cities, Pittsburgh, Kansas City, Boulder, and Huntsville, marking the first time it has directly funded newsrooms in a publisher deal. So this three-year partnership allows OpenAI to use Axios journalism for ChatGPT responses while providing Axios access to AI tools for content creation and distribution systems. Yeah, I'm actually excited. We're going to have a specific episode coming up very soon, so make sure you tune in, on AI's impact specifically on journalism.
Starting point is 00:05:38 I'm a former journalist. So, you know, it's going to be a conversation near and dear to my heart. Last piece of AI news: Microsoft has unveiled a new usage-based, AI-powered Copilot Chat for corporate users. So Satya Nadella, Microsoft's chairman and CEO, introduced a new category of PCs and of Copilot with built-in generative AI tools at a recent event. So the newly launched Microsoft 365 Copilot Chat offers an alternative to the existing Copilot service, which costs organizations $30 per employee per month.
Starting point is 00:06:14 So now this is based on usage. So yes, Microsoft 365 Copilot Chat, based on usage. So the new model allows organizations to pay based on actual usage, right? So if you have thousands of employees at your company, maybe everyone's not ready to, you know, throw down a couple million dollars. So the new model, like I said, allows organizations to be charged for actual use, with charges calculated per message sent, starting at just one cent per message, which could encourage wider adoption among companies. So Copilot Chat, like normal Copilot, can summarize documents, fetch web information, and create task-performing agents. So unlike the traditional Microsoft 365 Copilot, which is also integrated into applications like Word and Excel, Copilot Chat is accessible via the Microsoft
Starting point is 00:07:01 365 Copilot app on various platforms. All right. So let me know. Do you guys want to see more on Copilot? Copilot Chat? Let me know. All right. I'm excited to get going.
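To make the pricing comparison above concrete, here is a rough back-of-the-envelope sketch in Python. It uses only the two figures quoted in the episode ($30 per employee per month for the seat license, and a starting price of one cent per message); the function names and the sample headcount are made up for illustration, and real billing may well work differently.

```python
# Figures quoted in the episode, stored in cents to avoid float rounding.
SEAT_PRICE_CENTS = 3000      # $30 per employee per month (seat license)
MESSAGE_PRICE_CENTS = 1      # "starting at just one cent per message"


def seat_cost_cents(employees: int) -> int:
    """Monthly cost if every employee gets a flat-rate seat."""
    return employees * SEAT_PRICE_CENTS


def usage_cost_cents(employees: int, messages_per_employee: int) -> int:
    """Monthly cost if the company pays per message instead."""
    return employees * messages_per_employee * MESSAGE_PRICE_CENTS


def break_even_messages() -> int:
    """Messages per employee per month at which both options cost the same."""
    return SEAT_PRICE_CENTS // MESSAGE_PRICE_CENTS


if __name__ == "__main__":
    employees = 1000  # hypothetical headcount
    print("Seats:", seat_cost_cents(employees) / 100, "dollars per month")
    print("Usage:", usage_cost_cents(employees, 200) / 100, "dollars per month")
    print("Break-even messages per employee:", break_even_messages())
```

Under these assumptions, a light user sending a few hundred messages a month costs far less on the usage plan, and the two options only meet at 3,000 messages per employee per month.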
Starting point is 00:07:14 So let's talk about it. OpenAI's o1 Pro. Live stream audience, thanks for joining us. Have you all used this? Same thing, podcast peeps. I always say check the show notes. You can reach out to me on LinkedIn, email the show.
Starting point is 00:07:27 I want to hear from you all, right? I can only make this better the more information you tell me, but I'm curious for our live stream audience. Are y'all using OpenAI's o1, o1 Pro? Let me know. But let's just dive straight into it. Enough chit-chat. So here's the gist of o1 Pro.
Starting point is 00:07:49 All right. So it is a reasoner model. Okay. It's two different classes. So the GPT family of models, those are transformer models. All right. This is just the easiest way to separate the two. And a lot of other companies, you know, Google came out with their kind of Flash Thinking,
Starting point is 00:08:08 which is a reasoning model. Amazon Nova has a reasoning version. DeepSeek, the Chinese AI company. So just in the last like four weeks, just about all the big tech companies have said, all right, we need a reasoning model because we see how powerful it is. So o1 is a reasoner model, different than a GPT model, right? GPT-4o, you know, all these other models that we've been using for, you know, now two plus years, they've been these kind of transformer models. And they're completely different.
Starting point is 00:08:39 And we're going to talk about the difference between the two. But that's the biggest thing. It uses chain of thought thinking kind of under the hood. And I like to say when you're working with ChatGPT, like, do any of you have different types of colleagues, right, for those of us that may be still going in office or are hybrid? You have those colleagues, you know, you're getting work done, but you have those colleagues that work by talking, right? You're just going back and forth, back and forth.
Starting point is 00:09:02 And then you have those employees or, you know, coworkers that work by putting their headphones on. You know, you talk to them once and then you check in with them at the end of the day, right? Those are those deep work employees that put their headphones on. They just crank it. Think of that as like those two different types of large language models, right? There's some people that you work with, right, that require a lot of talking, right?
Starting point is 00:09:26 They require a lot of conversation, require a lot of collaboration, you know, and that's how they work best. And then there's others. It's just like, they don't want to talk. Dump all the information on them, give them their instructions, let them ask questions up front if they have any. But then they go to work and you see them later. So the latter, that is what these new o1 models are. All right. So as an example for our live stream audience here, you probably see it on the screen. I took a screenshot of this.
Starting point is 00:09:52 But yeah, generally when I ask these models questions, it's going to take anywhere from four to 15 minutes to get a reply, at least how I use o1, or at least o1 Pro. So I do use the ChatGPT Pro account, right? So that is $200 a month. So I'll share what that includes. But for the most part, when I'm using o1 Pro, I know this is my deep work colleague. I'm still using GPT-4o all the time, all day, every day. I like using it in canvas mode.
Starting point is 00:10:24 I just started really using the new task mode, which, if you listened to our show yesterday, we went over that new mode. By the way, for those of you that shared the episode yesterday, I spent probably about three hours on a document that showed you how to use tasks and goes over this concept of task stacking. If I'm being honest, it's one of the best documents I've ever created in my life. So if you haven't gone and shared that task episode, go do that and I'll share that document with you. All right.
Starting point is 00:10:57 So that's the gist of it. This is a reasoning model. This is not a GPT model. It takes a long time to think. It uses this chain of thought process under the hood, right? So you really have to use it at the right time, for the right purpose, for the right reason. All right.
Starting point is 00:12:15 Let's just take a quick look at the tiers or subscriptions. So there is a free version of ChatGPT. I've had a lot of ChatGPT shows this week because, you know, it's early in the year. A lot of people are asking. And, you know, we had some popular episodes like a year ago. And I'm like, yo, these are old.
Starting point is 00:12:51 So literally this week we did an episode on ChatGPT Free versus Plus. All right. So you have your free ChatGPT, which is actually pretty good now, right? I used to tell people don't touch it with a 10-foot pole. It's dangerous. It's not like that anymore. The free version is pretty good. You have the $20 Plus plan that I think a lot of people are on.
Starting point is 00:13:08 That just gives you essentially almost all the features except the o1 Pro model. So even on ChatGPT Plus, you have o1. It's much more limited in terms of messaging, and you have o1 mini. But on the Pro plan, so if you want to use o1 Pro, which is technically OpenAI's most powerful large language model, you do have to pay $200 a month to be on that Pro plan. But there's a lot of other, I guess, features and benefits. So when we talk about Sora, you have way more usage in Sora. You have unlimited usage of GPT-4o, whereas, you know, normally, even on a ChatGPT Plus plan, you run into limits.
Starting point is 00:13:49 So essentially, you get higher limits on the Pro plan. It's unlimited GPT-4o. It's unlimited advanced voice mode, where on the Plus plan it's a little limited. And then you get access to o1 Pro, which you can only access via that $200 a month subscription. Yes. It is confusing. Yeah, Allison's kind of talking about the naming here.
Starting point is 00:14:14 It is weird because Plus is that $20 a month plan, where a lot of companies like Microsoft, their $20 a month plan is called Pro. So even I get confused all the time. But yeah, so ChatGPT Plus, $20. Pro, $200. Not to be confused with all these other ones that say Pro for the $20 tier. Yeah. All right. So let's talk about how OpenAI describes their model.
Starting point is 00:14:41 So they say more thinking power for more difficult problems. So they say ChatGPT Pro, in this instance they're talking about o1 Pro actually, provides access to a version of our most intelligent model that thinks longer for the most reliable responses. In evaluations from external expert testers, o1 Pro mode produces more reliably accurate and comprehensive responses, especially in areas like data science, programming, and case law analysis, compared to both
Starting point is 00:15:08 o1 and o1 preview. o1 Pro mode performs better on challenging machine learning benchmarks across math, science, and coding. All right. So yeah, speaking of benchmarks, essentially, o1 Pro, it's PhD level, right? It's no longer where you really have to work. So that's the other thing. I think with a model like GPT-4o,
Starting point is 00:15:33 you can get these, you know, quote unquote PhD-level responses. You just have to have a master's degree level to get it there, right? It's different with o1, especially with o1 Pro. You don't have to have a lot of experience to get it to that kind of, we'll just say, quote unquote, PhD level. It kind of can do it on its own because it uses this kind of under-the-hood, you know, step-by-step chain of thought reasoning. Right. So it's weird. I've talked about this all the time. And if you've taken our free Prime Prompt Polish course, you understand. The GPT-4o family of models is extremely capable. But to get the most out of it, you've got to know the basics of, like, prompt engineering, right? Without getting too technical, there's things called shots. Right. And when working with a transformer model or a transformer family of models, regardless of if you're talking, you know, ChatGPT, Gemini, Claude, etc. Right. If you do some basics of prompt engineering,
Starting point is 00:16:33 it's going to be better. So a five-shot prompt, as an example, is always going to do better than a zero-shot prompt. So what that means, and I'm going to oversimplify it here, so sorry, machine learning PhDs: a shot is when you give a model an example. An input and an output, and you tell it good, bad, why. That's what I like to say. Input-output pairing, good, bad, why. So you are essentially shotting this model. So that's what the o1 and technically o3, right, like OpenAI teased the o3 model. It's not out. I don't think it'll be out anytime soon. But this o family of models kind of goes through that process on its own, right?
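As a rough illustration of the "input-output pairing, good, bad, why" idea described above, here is a minimal Python sketch that assembles a few-shot prompt as plain text. The `Shot` class, the `build_few_shot_prompt` function, and the prompt layout are all hypothetical names chosen for this example; they are not taken from the show or from any particular library.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Shot:
    """One example: an input, an output, and a good/bad verdict with a reason."""
    input_text: str
    output_text: str
    good: bool
    why: str


def build_few_shot_prompt(task: str, shots: List[Shot], new_input: str) -> str:
    """Join the task, the example shots, and the new input into one prompt string."""
    parts = [task.strip(), ""]
    for number, shot in enumerate(shots, start=1):
        verdict = "GOOD" if shot.good else "BAD"
        parts += [
            f"Example {number}",
            f"Input: {shot.input_text}",
            f"Output: {shot.output_text}",
            f"Verdict: {verdict} because {shot.why}",
            "",
        ]
    parts += ["Now do the same for this input.", f"Input: {new_input}", "Output:"]
    return "\n".join(parts)


if __name__ == "__main__":
    shots = [
        Shot("Q3 revenue up 4%", "Revenue grew modestly in Q3.", True,
             "it is short and accurate"),
        Shot("Q3 revenue up 4%", "Revenue exploded!", False,
             "it exaggerates the number"),
    ]
    print(build_few_shot_prompt("Summarize each update in one sentence.",
                                shots, "Churn fell from 5% to 3%"))
```

Passing an empty list of shots gives the zero-shot version of the same prompt, which makes it easy to compare the two approaches side by side.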
Starting point is 00:17:11 It doesn't give itself examples, but it goes through that chain of thought thinking. And I'm going to show some examples for you live on the screen. But from a benchmark perspective, the gains are huge, right? So some of the biggest gains between, like, the o1 family and the GPT-4 family are in math. I mean, you're automatically like Olympiad, you know, math Olympics, like gold, silver medal, right? So it's smarter than 99.99999995% of humans in math. Physics, same thing. Huge jump from 4o to, or sorry, from GPT-4o to o1. Getting ahead of myself, skipping to o3. You know, 4o, o3, o1, alphabet soup already. Other categories.
Starting point is 00:17:56 Yeah, mathematics, physics, LSATs, right? So, you know, they have the actual models take exams. So, huge. So obviously, if you are in software development, if you are in research, if you work in anything that has to do with complex math, complex equations, business intelligence, right? If you are essentially working with numbers, working with research, I think o1 makes the case for itself.
Starting point is 00:18:25 But I'm going to keep going and we're going to talk about even some everyday use cases. So first, you might have gotten confused, right? Because I'm dropping all these different words on your head. o1 this, o1 that, o1, right? Because technically, o1 has been out for a while. So we saw o1 preview and o1 mini in September. So yeah, all those other big companies that are releasing these kind of quote unquote reasoning models, that is just the last few weeks.
Starting point is 00:18:55 OpenAI's been here for a few months, since September. They released o1 preview and o1 mini. And then in December, they essentially knocked the preview off of it and said, okay, now this is o1. So technically, if you think of power, and not all these models are here, but you have o1 mini, o1 preview, which is now no longer there, o1, and o1 Pro. So o1 Pro and the kind of full version of o1 are newish.
Starting point is 00:19:22 They've only been out for a couple of weeks. I've been using it. I didn't get it right away, probably about a week or so after it was released. So I've been using it now for about three weeks pretty heavily. All right. So let's just go over it quick. And live stream audience, yeah, keep getting your questions in. I'm going to try to tackle some of these at the end.
Starting point is 00:19:46 Mark says it looked like a lot of work, Jordan. Thanks for sending it. Oh, yeah, that's the task thing from yesterday. Task stacking. Everyone slept on the ChatGPT tasks. Even I think OpenAI missed the point. All right. So let's go over the bullet point details.
Starting point is 00:20:00 I'm going to go through this quickly because I'm going to show you here at the end of the show. For the podcast audience, I'm going to try to walk you through it, o1 in action, for even non-technical reasons. So what is o1 Pro? All right. Let's go through all the bullet points here. I want to make sure I give you all the details. So o1 Pro is OpenAI's premium model, available to ChatGPT users for $200 a month. Also, there's other third-party platforms where you can just pay by usage, right?
Starting point is 00:20:32 You can't get access to all the other tools and all that. But here's the thing, at least right now: the o1 model, it doesn't have all the other tools anyways, right? It doesn't have internet access, right? The o1 Pro model. With o1 Pro, you can actually upload files, which is nice. And you can do that on o1 as well, whereas previously with the o1 family of models, you couldn't upload files.
Starting point is 00:21:01 And still, on o1 mini, you can't upload files anyways. But o1 Pro is described as an AI colleague for complex tasks. And it's more about reasoning than collaboration. So how does it work? Well, like we talked about, enhanced reasoning. It has this chain of thought processing to kind of enable better logical breakdowns, accuracy, and reliability. So we shared about this on the show before.
Starting point is 00:21:26 OpenAI kind of went over this, like, four-of-four reliability concept, right? Where, you know, there's a little more variability in the kind of GPT family of models. With the o series of models, there's much more reliability. And also, this is specialized for, you know, professionals. If you are a specialized professional, o1 is for you. So it's strong in STEM, coding, legal, and data science. Advantages.
Starting point is 00:21:57 What the heck are the advantages of this? Well, like we said, enhanced reasoning. So the chain of thought processing under the hood enables better logical breakdowns. We talked about the four-of-four reliability. So that's essentially, you know, when OpenAI did their internal benchmarks, they wouldn't just do it once and be like, oh, okay, yeah, this passes. They would actually do it four times, for more consistent results. All right.
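The four-of-four idea described above can be sketched in a few lines of Python. This is only an illustration of the scoring rule as the episode describes it (a question counts as solved only when all four attempts are correct); the function names and the sample results are invented, and this is not OpenAI's evaluation code.

```python
from typing import Dict, List


def solved_four_of_four(attempts: List[bool]) -> bool:
    """A question counts only if there are exactly four attempts and all are correct."""
    return len(attempts) == 4 and all(attempts)


def four_of_four_score(results: Dict[str, List[bool]]) -> float:
    """Fraction of questions answered correctly on every one of four attempts."""
    if not results:
        return 0.0
    solved = sum(solved_four_of_four(attempts) for attempts in results.values())
    return solved / len(results)


def average_accuracy(results: Dict[str, List[bool]]) -> float:
    """Ordinary per-attempt accuracy, shown for comparison with the stricter score."""
    attempts = [ok for per_question in results.values() for ok in per_question]
    return sum(attempts) / len(attempts) if attempts else 0.0


if __name__ == "__main__":
    # Made-up results for two questions, four attempts each.
    results = {
        "q1": [True, True, True, True],
        "q2": [True, True, False, True],
    }
    print("Per-attempt accuracy:", average_accuracy(results))
    print("Four-of-four score:", four_of_four_score(results))
```

The point of the stricter rule is visible in the sample data: one slip on one attempt barely moves the per-attempt accuracy, but it removes that question from the four-of-four score entirely.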
Starting point is 00:22:23 So where does it excel? So I said, hey, here's where you can use it. There's strong use cases here. It excels in scientific research. So analyzing data sets, developing hypotheses, designing experiments. Financial modeling, forecasts, complex calculations. Legal workflows, right? Analyzing case law, summarizing documents. It excels at anything STEM related, right?
Starting point is 00:22:47 Anything. So it specializes in kind of synthesizing and analyzing dense data sources. And here's the thing. And I'm going to show you some more non-technical use cases at the end and how I'm using it. We all have access to data, right? Over the past five to 10 years, data used to be something for the geeks. Now we all have access to data.
Starting point is 00:23:09 There's more and more data being collected, which is why I think there's actually broad use cases for people to be using the o1 Pro models. All right. So what is it good for? Kind of already talked about this, but this is what people are always asking. Like, right? Where does it excel? What's it,
Starting point is 00:23:26 like, what is it good for? Who should use it? So I want to tackle this from all areas. So who is it good for? So professionals in STEM, finance, law, and health care. Great health care use cases as well. So any users with high-stakes tasks requiring accuracy and advanced reasoning. So developers. Also great for anyone in software development, coding.
Starting point is 00:23:49 So handling intricate coding and debugging requirements. Professionals in fields like medicine that require precision. All right. Now, let's do the breakdown. People are always asking, well, what's the difference?
Starting point is 00:24:02 Should I just be using 4o? Should I be using o1 Pro? The way I like to say that, think of these two chatty colleagues, right? For the most part, we've been kind of spoiled by having these transformer models that are highly capable.
Starting point is 00:24:14 If you know how to use them. I would still say 80% of the business world has no clue how to use something like ChatGPT. Something that has now become synonymous with AI and has name recognition like Google, people still don't know how to use it, right? And people in positions of leadership, kind of scary. But o1 Pro excels in complex reasoning. GPT-4o excels in, well, being fast. Speed, general tasks, right? Also, GPT-4o has access to more tools, which is important.
Starting point is 00:24:46 So you can upload files in o1 Pro, but you can't use things like canvas. You can't use things like tasks. You can't use things like ChatGPT search, connect it to the internet. That's an important thing, because when you are using a non-connected model, you have to keep in mind, you better be feeding it a ton of up-to-date data, or whatever you are asking it should hopefully not require a lot of up-to-the-minute real-world information, because that model does not have it. All right. So how should you prompt o1? This is where it's completely different. And again, think of my comparison.
Starting point is 00:25:28 Does anyone else have that chatty coworker and then the coworker that just has the headphones on, right? I'm the latter. You never would have guessed, right? Someone that just talks about AI nonstop and sometimes talks for way too long. But speaking of, like, even my journalist days, right? I used to be the guy. I would go in there after an assignment. I'd go talk to my editor, get all the information I would need.
Starting point is 00:25:51 Right. Then I would go sit in the back, put my, you know, headphones on. They weren't even headphones. I don't know if this makes me weird. I would have, like, the, you know, sound canceling, there's no music. I would just put those things on. I would just go to work, do the whole thing, right? But then there's other people, you know, hey, they want to check in at every single point, right? But that's the difference. So with o1, you have to begin with, number one, a lot of data, a lot of context. You need to have clear and structured prompts to define task parameters effectively. You need to provide examples or templates to guide the model's output format, and you need to use concise yet informative phrasing to maximize response relevance.
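The prompting advice above (lots of context up front, a clearly stated task, examples or templates, and a defined output format) can be captured in a small prompt-building helper. The sketch below is one possible layout, assuming plain-text section headers; the function name, the section titles, and the sample content are all made up for illustration and are not an official OpenAI format.

```python
from typing import Optional, Sequence


def build_reasoning_prompt(
    context: str,
    task: str,
    output_format: str,
    examples: Optional[Sequence[str]] = None,
) -> str:
    """Assemble one front-loaded prompt: context, task, optional examples, format."""
    sections = [("CONTEXT", context), ("TASK", task)]
    if examples:
        # Each example becomes one bullet so the model can see the expected style.
        sections.append(("EXAMPLES", "\n".join(f"- {item}" for item in examples)))
    sections.append(("OUTPUT FORMAT", output_format))
    return "\n\n".join(f"## {title}\n{body.strip()}" for title, body in sections)


if __name__ == "__main__":
    prompt = build_reasoning_prompt(
        context="Exported stats for recent podcast episodes (pasted below).",
        task="Find which topics hold listener attention longest and explain why.",
        output_format="A ranked list of five topics, one sentence of reasoning each.",
        examples=["1. Pricing explainers: listeners stay because the stakes are concrete."],
    )
    print(prompt)
```

Because a reasoning model may take several minutes per reply, putting everything into a single structured message like this saves the back-and-forth that works fine with a faster chat model.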
Starting point is 00:26:32 If you all have taken our free Prime Prompt Polish course, yes, we're going to have new ones in 2025. Give me a minute. I'll explain why later. But we walk through something called Refine Q. So if you have taken our free course, and there's been like, I don't know, 8,000 of you, use that Refine Q method for setting up your first prompt to o1. You're still going to have to answer a question or two because that's, you know, how we set up that Refine Q. But try that out. It's going to work fairly well.
Starting point is 00:27:04 All right. So let's get to the big question. Is it worth $200 a month? So let's talk about the pros and the cons. Well, the pros are high accuracy and reliability in complex domains. Unique reasoning capabilities, especially if you are in some of those more technical professions: software development, engineering, anything with math, research, data, science, STEM, right?
Starting point is 00:27:33 If you're there, yeah, it's probably a no-brainer. What about for everyone else? Because there's cons. $200, it's not cheap. Although, if I'm being honest, I think we've been spoiled by these free and $20 a month world-class, state-of-the-art models that essentially now have mini-rag, right? We've been spoiled, right, because the big companies, they know a lot of them are losing money, right? Like OpenAI reportedly lost, I don't know, four or five billion dollars in 2024 because they're not worried about making money.
Starting point is 00:28:03 What we are getting, if you're a power user, you're getting way more than that $20 a month, right? OpenAI CEO Sam Altman said even on this $200 Pro plan, they're losing a ton of money, is what he said. This is still relatively cheap, whether we're talking about the $20 a month, or, if you have a use case for it, I think even the $200 a month is pretty affordable, right, all things considered. And we're going to see an example of that. So ultimately, there's the pros and the cons. It's having a PhD-level companion that will think about things, give you better results, higher accuracy if you know how to direct it, but it is much slower, right?
Starting point is 00:28:51 So if you're used to just jabbing back and forth, and that's how you like working with large language models and you don't see any problems right now with outputs, then it's not for you. But I think it's actually for more people than you think. I think people are literally just thinking, oh, o1, that's for, you know, engineers, data scientists, researchers, etc. I don't think so.
Starting point is 00:29:14 So you also need to just ask yourself. Yeah. Do you need the advanced reasoning and professional tools? Are there use cases in your domain that are worth that premium price? So you really have to ask those questions. There's no blanket answer. I think if anyone asked me, you know, ChatGPT Free or ChatGPT Plus, it's easy. I don't care what you're doing. ChatGPT Plus, $20. It's a steal, right? I've always said, all along, if ChatGPT Plus was $200, I would still pay for it, right? Obviously, I have a ChatGPT Pro plan. All right. So let's look live. All right. And please keep getting your questions in. I'm just scrolling through the comments. So thanks everyone for getting your questions in. I'm going to try
Starting point is 00:29:58 to tackle them at the end. I'm just scrolling through all the comments looking for question marks. So yeah, make sure if you do have a question, then, you know, get it in. Douglas talking about our show yesterday. Great task write-up. I think yesterday was the first day I saw the link to your post on your website. Yeah. All right. Let's get after it. Let's do some stuff live here. So bear with me, y'all. All right. Here's what we're going to do. Live stream audience, as always, I never know if this works or if my audio is still coming through. Can you let me know? Can you all see my screen? And can you see what's going on here? So I'm going to explain to you what I'm doing after I get this started.
Starting point is 00:30:49 Okay. So I'm going to be copying and pasting a bunch of information in here. All right. Give me a second. There we go. All right. So this is information that I've exported from my podcast stats, right? I really want to make sure I have.
Starting point is 00:31:08 Okay. Good. Thanks, y'all. All right. Everyone says they can see. Thanks, y'all. Thanks, y'all. Okay.
Starting point is 00:31:17 So I am in o1 Pro mode. All right. I'm going to tell you what these, I'm going to read this to you, but I'm going to get it going first because like I said, this might take a couple of minutes. All right. So here's my first kind of tip, right? Provide a lot of context.
Starting point is 00:31:34 I'm going to walk you through the context that I provided. But also something that's changed recently, I don't know what it was, probably a year ago. You can run concurrent chats now. Generative AI, even o1 Pro, it's generative. You can run the same prompt, even when you give it a ton of information, you might get very different things. You might get similar things. So I'm going to go ahead, even though it might slow it down, I'm going to follow my best practices. If I'm waiting, I'm just going to wait.
Starting point is 00:32:03 So I'm literally running the exact same prompt in another tab. All right. So let's go ahead. Let's check on it here. Okay. And we're going to walk through. So sometimes it will give you details. Sometimes it'll tell you what it's doing under the hood.
Starting point is 00:32:19 right? And I know I have my token counter here, but the context window is much different. I probably should have mentioned the context window differences because that's important. That's important as well, right? So essentially, o1 Pro has a much, much larger context window. So let's go ahead. And I'm going to read now, I'm going to read now what I actually put in. All right. I'm going to try to go quick. But like I said, I exported some recent podcast episodes.
Starting point is 00:32:58 There's a ton of stats. This is an example. And I want you to think, what data or what large amounts of context do you have? Because this is, I found myself when o1 came out, I'm like, okay, I may not be in STEM. I may not be in data analysis, but I have access to a lot of data. And either I don't have time or when I am analyzing it, I'm really just looking for the low hanging fruit.
Starting point is 00:33:23 And there's probably so much deeper and so many more channels I could go into if I had time. All right. So I'm going to go fast here. All right. So I'm saying these are my podcast stats. So remember when I said, when you prompt o1, think of it like that coworker that wants all the information and then they're going to go in the corner. So I'm saying these are my podcast stats.
Starting point is 00:33:46 Keep in mind, today's date is January 16, 2025. For all questions, always exclude the top 2% and bottom 2% of episodes unless otherwise noted, right? I have a bunch of episodes, their downloads, some other stats. And sometimes there's anomalies, right? Sometimes there's just problems. And I don't want those problems to be included. All right.
Starting point is 00:34:06 So already you're seeing that could be a lot of manual work, even if you're good with business intelligence, you're good in spreadsheets, right? I'm saying also always give the episode number and name, never just one. Keeping that in mind, please carefully answer and tell me. Question one, give me the average downloads per episode. Question two, give me the complete list of episodes with the new performance percentage over/under of the adjusted average. So I'm asking it, find the adjusted average number of downloads, take out the top 2%,
Starting point is 00:34:37 take out the bottom 2%, then go give me for each one. I want how it compares versus the average. So let's just say the average was, I don't know, 4,000 downloads, right? I want to see the percentage once you take out the top two, bottom two. I want to see each episode. Is it higher than that kind of median? I don't know. What's the math term?
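The math term for this kind of adjusted average is a trimmed mean: sort the values, drop a fixed share from each end, and average what is left. The sketch below shows that calculation and the over/under percentage from questions one through three. It is a minimal illustration, not what o1 Pro ran internally; the function names, the sample numbers, and the choice to round the 2% cut up to a whole episode are all assumptions.

```python
def adjusted_average(downloads, trim_percent=2):
    """Mean of downloads after dropping the top and bottom trim_percent of episodes."""
    ordered = sorted(downloads)
    n = len(ordered)
    # Episodes cut from each end, rounded up to a whole episode.
    # Assumption: the prompt does not say how to round a fractional cut.
    k = (n * trim_percent + 99) // 100
    kept = ordered[k:n - k]
    if not kept:
        raise ValueError("not enough episodes left after trimming")
    return sum(kept) / len(kept)


def percent_vs_average(downloads, average):
    """Signed percentage over (+) or under (-) the adjusted average."""
    return (downloads - average) * 100 / average


# Made-up numbers for illustration only: 48 typical episodes plus two anomalies.
sample = [4000] * 48 + [150, 60000]
avg = adjusted_average(sample)
print(f"adjusted average: {avg:.0f}")
print(f"5,120 downloads is {percent_vs_average(5120, avg):+.1f}% vs the adjusted average")
```

Trimming before averaging keeps a featured-episode spike or a hosting glitch from dragging the baseline, which is the reason given in the prompt for excluding the top and bottom 2%.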
Starting point is 00:35:00 Right? Is it higher or lower than that? All right. Then I'm saying question three, give me the top 10 and bottom 10 episodes and their respective percentages that they're over or under the adjusted average. All right. Question four. For the top 10 from question three above, the adjusted average, please suggest three
Starting point is 00:35:23 slightly adjusted episode titles for each if I were to rerun them. So every once in a while, I'd say maybe, you know, I don't know, depends. Anywhere from one to five times a month, I'll rerun an episode. Yeah. Sometimes I get sick. Sometimes I can't be here with y'all live, you know, at 7:30 every single day, although I try to. So I'm essentially saying for the 10 episodes that performed highest above the adjusted average, suggest three additional titles. But don't just like look at it and
Starting point is 00:35:55 randomly suggest them. Look at the trends, right? Find common themes. So here's where we're really working with structured and unstructured data. This is where it's great to work with a large language model with natural language processing, right? So I'm like, yo, here's hundreds of episodes, go find the ones that are really good. And then those top percentage, you know, try to develop, you know, some way to see what's working and what's not and then apply that to some of these top percentages. All right.
Starting point is 00:36:24 And then I'm saying, like, look, find common naming trends. Example, title length, psychological marketing angles, superlatives, word choice, etc. Be exhaustive in your pursuit of spotting common and hidden trends. Question five. What are the most common patterns among underperforming episodes, and how can I avoid them in the future? Question six.
Starting point is 00:36:45 How does title length or structure correlate with episode performance? Break it down from every angle you can think of, be pinpoint specific. Question seven. How does release day impact episode performance? Please exclude Mondays as that is usually our AI News That Matters Day, and we don't usually run other types of shows on those days for needed context. So even though the large language model should know, I'm, you know, giving it, hey, this date was a Friday.
Starting point is 00:37:11 So you can make sure you have it correct. Question eight, how does the release time or hour affect episode performance? Do not group them together. Go individual by hour. Be exhaustively precise and give me a chart that shows hourly performance, right? Sometimes we get our episode out by 8:15. If I stop yapping, today's not going to be that. It's already 8:06 a.m.
Starting point is 00:37:33 Sometimes something comes up and we might not get it published until 11 a.m. So I want to know hourly, how does that impact it? Then I'm saying here's one that would take a long time to figure out. I'm saying staying power and average decay. So in this document, in, you know, when I pasted all of this in there, I didn't upload a spreadsheet, I just pasted it in there. Essentially, it was information from a CSV. But for hundreds of podcast episodes, it gave seven-day downloads, 30-day downloads,
Starting point is 00:38:03 90-day downloads, all-time downloads. So what I'm asking here is to essentially figure out staying power and average decay. So saying, hey, when do average like across hundreds of episodes, when do they normally, quote unquote, go stale, right? When do they stop really, you know, getting listened to? Because people are searching for these all the time. It's not just people like hopefully you subscribe and thank you if you do, right? But everyone else is searching for podcasts and they're discovering.
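One way to turn those four columns (7-day, 30-day, 90-day, and all-time downloads) into a staying-power measure is to ask what share of an episode's all-time downloads arrived in each window. The sketch below does that and ranks episodes by the share that came in after day 90. The `Episode` fields, the sample episodes, and the "after day 90" definition of evergreen are assumptions for illustration, not the method o1 Pro used.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    title: str
    d7: int        # downloads in the first 7 days
    d30: int       # downloads in the first 30 days
    d90: int       # downloads in the first 90 days
    all_time: int  # downloads to date


def window_shares(ep):
    """Fraction of all-time downloads that arrived in each window."""
    return {
        "first 7 days": ep.d7 / ep.all_time,
        "days 8-30": (ep.d30 - ep.d7) / ep.all_time,
        "days 31-90": (ep.d90 - ep.d30) / ep.all_time,
        "after day 90": (ep.all_time - ep.d90) / ep.all_time,
    }


def long_tail_share(ep):
    """Share of downloads that came in after day 90 (higher means more evergreen)."""
    return (ep.all_time - ep.d90) / ep.all_time


def rank_by_staying_power(episodes):
    """Episodes sorted from most to least evergreen."""
    return sorted(episodes, key=long_tail_share, reverse=True)


# Hypothetical episodes, not real show data.
episodes = [
    Episode("News recap", d7=2000, d30=3000, d90=3500, all_time=4000),
    Episode("Evergreen how-to", d7=1000, d30=2000, d90=3000, all_time=6000),
]
for ep in rank_by_staying_power(episodes):
    print(ep.title, f"{long_tail_share(ep):.1%} of downloads after day 90")
```

Episodes published fewer than 90 days ago will show a long-tail share of zero, so a fair comparison would only include episodes old enough to have a tail.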
Starting point is 00:38:27 So I'm trying to see which ones have staying power, which ones are more evergreen. And then I'm asking it to show me the top ones because then I can develop new episodes based off that. Question 10. How do episodes featuring specific brands or keywords, example, OpenAI, ChatGPT, GPT, Google, large language model, AI, Claude, compare in performance? Question 11.
Starting point is 00:38:51 Please also categorize all of these episodes according to what you can gather from the titles. Example, marketing, ChatGPT, enterprise, AI use cases, etc. Only put one episode in a category and try to create at least 20 different categories. In doing so, please also give me the average, the category performance versus the averages that we identified earlier, right? A lot of this, I want to see what sticks. What do y'all like? What do listeners actually care about? Right. And I think after you have, you know, tens of thousands of pieces of data points, yes, I can go figure out some of these things with some simple calculations in, you know, Microsoft Excel, Google Sheets, etc. But this is where
Starting point is 00:39:33 we're really combining a lot of data, but with also unstructured. This is bringing in unstructured data, structured data. So structured data are numbers, things that you can plot on a graph, unstructured data is words, right? You can't necessarily plot them. So we're combining structured data, unstructured data with a reasoning model, right? And I gave it a ton of information. All right. And then I'm saying essentially, you know, I'm giving it some additional encouragement, like how to format it, all that stuff. And then I'm also saying at the end, give me essentially a quick summary. And then here's all the data. So I pasted in about 13 pages of those questions and data. All right. That was a lot. Marie said, how long did it take you to come up with these amazing, with these detailed questions? Amazing. I'm a fast typer, I think, all the time. Probably took me, I don't know, 12, 12, 13 minutes to type all these up.
Starting point is 00:40:40 So, yeah, there's no AI in helping me formulate these questions. We always talk about human in the loop, right? What role do humans have? And one of the things that AI, and I think especially these reasoning models like o1, o1 Pro, it allows you to really let your expertise shine, right? One of my areas of expertise, I think, is my background. I have a background in journalism.
Starting point is 00:41:03 I have a background in marketing and advertising. And I don't know, maybe you saw some of that in play there, right? This is how my brain works. I'm like, y'all, we got so much data. I need to be able to identify trends and to build something better to help you all, right? All right. So let's now jump back in and see how our chats are doing. So you'll see it's been...
Starting point is 00:41:25 Oh. Live stream audience, can you still hear me? I got, I got something that said, can't hear. But let me know if you can. I got something on my screen just said, I lost audio. So we'll see. All right. So here is our details.
Starting point is 00:41:48 All right. Thank you, Sam, Sarah, from YouTube. All right. So I can click details. So essentially, you can see kind of slash sometimes how the o1 model is actually thinking about this. All right. It's weird.
Starting point is 00:42:04 I've done similar prompts like this and I'll always do A-B testing, right? I'll run the same prompts on o1 Pro twice. I'll run, you know, the prompt on o1 Pro versus o1 normal. I'll run the same prompt on o1 Pro, right? I do a lot of testing. And even on o1 Pro, sometimes it'll give you all of the details. Sometimes it won't. And then it says, oh, sometimes o1 does better when it doesn't share the details with you.
Starting point is 00:42:27 So is there ultimate transparency? Not really. But I'd say more times than not, you do kind of get to click that details, and you can see kind of what's happening under the hood. So in my other one, it looks like, it looks like I timed out in my other one. Bummer. So maybe I shouldn't have been doing two at once because this one thought for about nine minutes. And then it said, oh, I'm done for. All right. But luckily, all right, luckily we finished. We finished over here in our first chat. So I'm going to go ahead,
Starting point is 00:43:03 why not, and regenerate the other one. That one thought for about 10 minutes, ran out of steam. So this one, let's see if I can see exactly how long this one thought. Let me go up.
Starting point is 00:43:20 A lot of information here, y'all, a lot of information. My gosh. All right, this one thought for 11 minutes and 22 seconds. So I think my,
Starting point is 00:43:30 I think my record is maybe like 15 or 18 minutes. I give it a lot. I give it a lot. All right. So I'm not going to read all of these one by one because it's going to take a while and I don't want this to go too long. But let's just see very quick overview how well it did. So it's saying below is a comprehensive step-by-step response that follows all of your instructions precisely.
Starting point is 00:43:53 You know, I've counted all the episodes. So it's telling me what it did. So it took out the top 2% and bottom 2%, which in this case was six total episodes. computed and listed everything after removing those six, right? So pretty good. It kind of first gave me an overview of how it did it. Then it gave me the preliminary step. So it went through, identified the total episode count,
Starting point is 00:44:17 identified the top two and bottom two percentages, right? Listed them all. That's good. And then it kind of said, hey, then there is 122. I didn't give this all 400 of our episodes because I did testing, and it worked fine, but it took way too long and it timed out too much. So I only uploaded like the last probably six months of episodes. Okay.
Starting point is 00:44:41 So here we go. Question one. So now it's getting. I told it to label it. So here we go. Question one, average downloads per episode. So here it did some, a little bit of math.
Starting point is 00:44:51 So thank you for that. I don't like math. All right. And then it says answer to question one. Oh, I was about right. About 4,000 downloads per episode. The downloads are weird. Download streams.
Starting point is 00:45:02 Everyone looks at it differently. So yeah, I think we're almost at like two million downloads. So thank you all for listening. All right. So question two, list of all remaining episodes versus adjusted average. This is what I wanted. All right. So here, it says below is the performance calculation for each of the 122 remaining episodes.
Starting point is 00:45:23 Is there anyone smart in math? I don't even know what this means. I don't even know how to read it. I don't know. It created some kind of formula. Sometimes I ask ChatGPT to create algorithms. I just give it a bunch of data. I'm like, create new algorithms for me and tell me things that, you know, I can't find out in a spreadsheet.
Starting point is 00:45:41 That's fun to do. I didn't do that here. So there's a calculation. So let's see if it gave me the full table. Sometimes it does. Sometimes it doesn't, right? So full table. Okay.
Starting point is 00:45:54 Here we go. There we go. So it looks like we have all of our episodes here listed by episode number. It gave me the all-time downloads. It did the performance. So I can see this one right here was about 0.6% below average, right? So I can go through here. I could ask it or I could copy and paste this and give it to, as an example,
Starting point is 00:46:19 o1-mini or GPT-4o and have it turn it into an actual spreadsheet. One thing I realize o1 Pro isn't great at is creating documents. I don't even know if it technically has that, you know, that capability or functionality, but the advanced data analysis mode inside GPT-4o is great. GPT-4o is great at creating different types of documents. So if I wanted to, I could copy and paste this, but let's see. Yep, there we go. It's giving me, you know, 28% above adjusted average, 18% below, 3% below, 15% below, right?
Starting point is 00:46:49 So this is great. Oh, let's see. It said, it looks like it might have truncated. I had a feeling that even the o1 model was not going to complete this in its entirety. Because it says, and so on. So it didn't do all 160. It says due to the length of this list to fully comply with your request, this table would extend for well over 100 lines. I have demonstrated the exact calculation method and the format above.
Starting point is 00:47:19 The same format applies for every single remaining episode. Below, I continue the listing in concise bullet form. Each line follows the same pattern. Episode number and title all time, then the resulting percentage. Okay, so it did actually go through and do it. It just didn't show me the math for each one, which is fine. I didn't need that. All right, here we go.
Starting point is 00:47:40 Question three, top 10 and bottom 10 episodes and their percentage over. This is what I wanted to know. So here's the top 10 overs with their adjusted average over. So when will we achieve AGI that performed well? AI agents, everything you need to know, top AI tools and features of 2024, how AI agents can bridge the gap. the future of enterprise work. Google's one trillion
Starting point is 00:48:06 dollar AI mistake, right? So there we go. This is good. I mean, again, I could have sorted this by downloads. And I could have found out some of this, but I wanted to see how much higher. Because there's some anomaly episodes where I'm like, okay, were these actually, you know, is this a bug? Sometimes, you know, as an example, Apple podcast or Spotify podcast will, you know,
Starting point is 00:48:28 feature an episode, you know, if their algorithm says it's good, and it'll put it on like a top episodes and technology page. So I know sometimes some of our episodes get way more downloads, but I'm like, I don't really want those. I want to just focus on the guts there. So it did a pretty good job. Bottom 10 episodes, there we go. Slightly adjusted title names for each of the top 10.
Starting point is 00:48:51 There we go. It's giving me all of those. Yep. For each of them, it's giving me adjusted episode titles and also why, right? That's interesting. I didn't even say why, but it gave me for each of the 10, it gave me some, some other episodes. Question five, common patterns among underperforming episodes and how to avoid them.
Starting point is 00:49:13 So it says titles that are too generic or vague, overly long titles without a clear hook, insufficient mention of strong keywords. And in each of these, it's giving me very specific examples, right? It's not just giving me these general guidelines. It's telling me how to avoid it. Then question six, how does title length or structure correlate with episode performance? It's all good. Question seven.
Starting point is 00:49:34 How does release day impact episode performance? Let's see. Tuesdays show moderate to good performance. Wednesdays and Thursdays, engagement, because listeners have midweek energy. Fridays, it says, can be hit or miss. All right. So maybe I shouldn't schedule big shows for Friday since they can be hit or miss. Yeah, sometimes people check out.
Starting point is 00:49:56 Let's see. I did specifically ask for a table for time or hour. So for question eight, let's see here. Good, it did it. So it gave me the release hour and then the average all-time download. So I can see, yeah, apparently sometimes I've released some late. That's weird. Some of those might have been bugs.
Starting point is 00:50:14 I should have asked in this one to also give me the total number of episodes that have been published in that release hour. Because, yeah, sometimes there's bugs with our host. We use Buzzsprout. Yeah, sometimes there's just weird anomalies. So I should have asked for the number of episodes, but it looks like for the most part, it looks like maybe our sweet spot is when it gets published by 9 a.m. So maybe when we do publish it super early, maybe it misses people.
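The missing episode count matters because an hour with only one or two releases can look like a winner or a loser by chance. A minimal sketch of the hourly table with that count included follows; the `(release_hour, downloads)` input shape and the sample rows are assumptions, not the real export format.

```python
from collections import defaultdict


def hourly_performance(episodes):
    """Group (release_hour, downloads) pairs and return {hour: (episode_count, average_downloads)}."""
    buckets = defaultdict(list)
    for hour, downloads in episodes:
        buckets[hour].append(downloads)
    return {
        hour: (len(values), sum(values) / len(values))
        for hour, values in sorted(buckets.items())
    }


# Hypothetical rows: release hour in local time (24-hour clock) and all-time downloads.
rows = [(8, 4000), (8, 5000), (9, 4600), (11, 3000)]
for hour, (count, average) in hourly_performance(rows).items():
    print(f"{hour:02d}:00  episodes={count}  average downloads={average:.0f}")
```

Reading the count next to the average makes it easier to tell a real sweet spot from an hour that only had a single, possibly anomalous, release.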
Starting point is 00:50:42 Maybe people are listening on their commute to work. But it looks like for whatever reason, it looks like that sweet spot is episode, sorry, releasing the episode by 9 a.m. This is all our local time. All right. Question nine. Staying power and average decay. This is one I was really looking forward to.
Starting point is 00:51:01 So I'll go through and read this if you're interested. You know, you can let me know. But it did good. It gave me kind of the average 7, 30, 90, and total. It identified certain episodes that extended past that. You know, some of the agent episodes. All right. So did a pretty good job.
Starting point is 00:51:23 I wish it would have given me a little more depth on this. But again, what I would do in theory, I would look at the responses and I would update that prompt that I did and I would just run it again, right? Because I can see some of these things. I'm like, ah, I forgot a little bit here. I should probably go back and add some. Question 10 episodes featuring specific brands or keywords. So there we go.
Starting point is 00:51:44 Obviously, OpenAI, ChatGPT typically see an average of about plus 10 to 30 percent higher than your overall adjusted average. Google, Gemini, or Claude are about 5 to 15 percent higher. Large language models, it doesn't really show, you know, any strong difference. Okay, that's pretty good. And then here's the one that would have taken me forever, right? Hundreds of episode titles and then to categorize them and then compare against averages. So it went through and it gave me, it looks like a list of 20 different categories.
Starting point is 00:52:18 It didn't give me the category average download, all right? But it did break everything down by category. All right. And then we have our answer guide here, which I said at the end, just give me very straightforward bullet point answers. So how did this do? What do you all think? I know this took a while. Do you all think, o1 Pro, was that worth $200? Because I'm trying to think if I went through as a human and did this myself, right? If someone gave me the exact same questions, I don't know, it probably would have taken me three or four days, right? I think I probably could have done a little better because I probably would have done a better
Starting point is 00:53:12 job inferring certain things. In certain instances, you saw even o1 Pro truncated responses or didn't give me full things, right? That's frustrating. So what I probably would have done in the future probably would have broken this up. Right. I gave it 11 extremely difficult tasks. And if I was using GPT-4o as an example, I would have done each of those as dedicated chats or taking them, tackling them one by one and going back and forth with ChatGPT at least probably three to 10 times on each of those 11 questions.
Starting point is 00:53:49 Right. So from a time savings perspective, I think absolutely. Would it beat me? Maybe not. Although on the math and some of those more complex things, absolutely. I wouldn't have known, you know, especially without AI, you put me on a computer with, you know, I can't use AI, you know, and just a spreadsheet only. I don't know if I could have gotten these answers, right? And I'm decent, decent at basic math, right?
Starting point is 00:54:19 I have an analytical brain. Obviously, me knowing this is Everyday AI, I run it, right? But if someone else came to me with this same data and said, you can't use a large language model or they said, you can just use GPT-4o, I think if I would have to do it by myself, it would have been at least three or four days. If I would have used GPT-4o, it probably would have been, I don't know, I'm guessing three to five hours because it would have required a lot
Starting point is 00:54:52 back and forth for each of those 11 questions. You have to worry about context window, you know, to get that kind of quote unquote chain of thought reasoning. You, the human, have to be the one pushing that chain of thought button, right? You have to be the one giving examples going back and forth, steering and guiding it. Whereas, you know, o1 is more of like that full self-driving car, kind of guides itself, right? With the GPT family of models, you have to do that. So if I'm being honest, so we got this done in 10 minutes with o1, it probably would have
Starting point is 00:55:20 taken me three or four hours with GPT-4o, and it probably would have taken me a couple of days if I just had, you know, the internet and spreadsheet and no AI. So is it worth it? I don't know. I don't know. For me, this isn't perfect, but what I would have done is I would have went back through those responses. I would have updated my prompts, and I probably would have obviously broke this down into, you know, three or four questions. It was too much for it to handle, even though it was within the context window, you know, I didn't kind of, you know, put in too much context. It was a little too much thinking, right? Or, you know, there's probably something in OpenAI's training that's like, hey, when someone asked for, you know, hundreds of things, you know, and if it's
Starting point is 00:56:08 part of multiple other queries, just, you know, showcase your ability to understand, right? So I could have that one where it kind of cut things short. If I just would have done just that one question and given it to o1, it probably could have done it. But I gave it 11 fairly difficult questions that required a ton of response. So I do think this isn't a capabilities thing. This is more of a compute and training. o1 Pro probably could have done this, right, in its entirety. But I'm sure there's some things that OpenAI has worked in there to say,
Starting point is 00:56:45 hey, you know, at a certain point, if there's, you know, this many questions and all, the questions are multi, multi-step. Maybe you have to truncate. I don't know. All right. There were a couple of questions. Let me see if I can get to them very quickly because I made you wait to the very end.
Starting point is 00:57:06 I'm just scrolling through. If I see a question mark, I'm starring it. All right. Let's see. Dennis, if you have Teams, can you upgrade a single user to Pro? No, as far as I know. I'll ask my contacts at OpenAI. I did ask them about this
Starting point is 00:57:22 three weeks ago because I have free, Plus, Teams, Enterprise, and Pro accounts. I don't have the option to upgrade anything on Teams. So as far as I know right now, the $200 Pro, which gives you o1 Pro, is only available for individual users. Actually, the last time I checked on that was like a week or two ago, so I should go back and double check. But previously, there was no option to upgrade Teams.
Starting point is 00:57:49 And I'm not sure about Enterprise accounts because any Enterprise account I'm on, I'm not the kind of admin of that, but, you know, I'm an individual Enterprise user. So, yeah, people don't know that. Someone DMed me on LinkedIn. They're like, oh, you do trainings. I'm like, yeah, that's what we do. So if your team, whether you're on, you know, ChatGPT Teams or ChatGPT Enterprise or Copilot, right, that's what we do. We train people. I talk about AI every day. And, you know, if your company, if your department needs help, you know, you can call us in. All right, let's see. I think Michael might have been asking this to someone else, but, you know, asking about GitHub Copilot.
Starting point is 00:58:23 Yeah, there's other, you know, Cursor, GitHub Copilot. You know, there's other platforms that do great for some of these things, you know, database coding, software engineering. Yeah, I think Cursor, Microsoft GitHub Copilot, great. Kieran says, isn't the time taken to respond a con? Absolutely, right? But that's why I'm generally, I'm not just giving an 11-minute task to ChatGPT and then, you know, sitting there, sipping my espresso and, you know, judging it.
Starting point is 00:58:55 I'm doing other work, right? I'm opening another window, another account, you know, putting something similar in, in Claude or Gemini AI Studio, right? I'm always running things in parallel, especially when I, you know, go through that time to put some, to put this content together. Obviously, I would have to break it down into smaller chunks for non-reasoning models. But yeah, Kieran, absolutely, it's a waste, not a waste of time. But the time it takes is a con, right?
Starting point is 00:59:22 Especially in like, we're in the society where we want everything now. I don't want to wait 11 minutes, but I waited 11 minutes and it did, like I said, probably work that would have taken me either multiple hours with GPT-4o or days without any AI. So is the time worth it? Patience is a virtue, and doing things right pays off in the age of, you know, this instant gratification. Right? Because now if I wanted to, I would go back, like I said, I would improve that info that I gave o1, I would probably break it down into, you know,
Starting point is 00:59:55 two or three. And I'm sure it would go, I don't know, if I had to grade it, I would give it an 85%. If I broke it down, improved, improved how I asked the information. This was user error. That's user error, right? I gave it too much information, although I think for that part, it should have been able to handle it. But for a lot of the other things, I'm like, oh, I should have worded that differently. Right. I didn't do a good enough job. People always think an output, that means, oh, L, like, ChatGPT sucks. It's dumb. No.
Starting point is 01:00:23 In that case, I was dumb. I didn't do a good enough job. Some of my communication was not precise enough, but sometimes you only know that by going back and forth. And I love being able to look at the details and seeing how ChatGPT... that's a cheat code. If you are using o1, even on the ChatGPT Plus plan, look at how it's reasoning. Look at those details. That's going to improve how you communicate with a large language model.
Starting point is 01:00:46 Because if it's struggling with something, you know. if it's halfway through the process or in the first 10, 20% of the process, if it's already tripped up, guess what? Then it's going to get even worse. So you might need to move some of the information from the bottom, up top, you know, provide a better summary, you know, give it, you know, more clear role, priority goals, all that stuff, right? So yes, it is a con.
Starting point is 01:01:10 Marie, does it go down any rabbit holes with its chain of thoughts reasoning? It depends on how open ended your input. is with an open-ended, yeah, absolutely, right? Sometimes for fun, I say, you know, solve the world's problems, you know, solve hunger, solve, you know, violence, right? Like solve inner city or whatever, right? Solve this big problem, right? And then I like to see it think.
Starting point is 01:01:37 And, you know, I think that has more to do with the training of the model than the model's capabilities. But yeah, it can go down rabbit holes if you give it the opportunity. In this case, there is no rabbit holes because. it was pretty well refined in defined, right? Then Juliet said, sorry, I'm not up to date on the lingo. I have the $20 paid subscription to chat GBT is that pro. I think someone already answered that, but no, $20 chat GBT plus you get the general O1 model. It is very limited in terms of the amount of work that you can do with it. If you want the O1 Pro, you have to be on the chat
Starting point is 01:02:14 gbt pro account, which is $200 a month. Fred, do you compare different models all the time? Yeah, I think I'd probably answer that later in. All the time I compare different models. So I like using, you know, the AI Arena chat, whatever it's called, LMorina.a.i, right, the AI chatbot arena to do that. I've shared some videos on how I use a tool called the chat hub a lot where you can put one prompt in.
Starting point is 01:02:42 It'll give you up to eight different large language models. So yeah, I compare model responses all the time. Ada, can it access your website and do the analysis from your website? So the o1 series of models, at least right now, do not have access to the internet. They also do not have access to the full suite of tools. And that's a good way to end this, Ada, because I'm going to say this. I have a prediction coming next week in one of my shows on the future of the o1 models and what that means for not just agentic AI, but what it means for AGI. Because I do think once you start giving a model like this that can reason, once you start
Starting point is 01:03:26 giving it tools, once you start giving it agency to make decisions on what tools to use, how to go about solving a problem. Right now, what o1 Pro can do, it's kind of in a box. And I get what OpenAI is doing there. They're doing it for safety, right? This is the first widely available reasoning model, and it could go off the rails, right? And you can't jailbreak models like this. So I get that they're not giving it tools right now. You know, they're working on artificial general intelligence.
Starting point is 01:03:56 They have their sights set on artificial superintelligence. So I get why they're keeping it in the box right now. But as soon as this o1 Pro model gets a little better, this is the first version of it, right? The first version is always the worst. It's only been out for a couple of weeks. In a couple of months after they've updated this once or twice, and when and if it does get tools, if it gets agentic capabilities,
Starting point is 01:04:21 you know, we're talking about OpenAI's Operator when that's coming out, the new Tasks. It's an extremely exciting time to be on the cutting edge of AI, and that's what you're doing here. So thank you for joining me. I hope this is helpful. I know this was a longer show, but there you go. Let me answer this. Is it worth $200?
Starting point is 01:04:38 I'm going to go ahead, OpenAI's not paying me. I'm going to say yes. I'm going to say anyone that has access to data. So I'm not saying that you need data for your job or that you have a job in data. If you have access to data, if you are a decision maker, right, if you are a knowledge worker, I'd say it's 100% worth it. You just saw my use case, right? I can make it better, but the value that I get from there, what I just saw in there, that's going to help me grow my podcast, right? That's going to help me bring in, you know, other great sponsors like Microsoft, right? Microsoft is one of the sponsors of this podcast. That's going to help me reach more people who want to learn AI, because these are all insights that would take me so much longer,
Starting point is 01:05:34 right? If you have data and you need to make decisions and you understand the basics of the o1 reasoning model, 100% worth it. It's a hot take. People are going to disagree with me, but I think it is. People are going to say, oh, GPT-4o is enough. Try doing something similar with GPT-4o. Time is money, y'all. Can it accomplish the same things that I just showed you in o1? Yes, but like I said, it probably would have taken me three to four hours. And even during that time, I couldn't have done anything else, right? During the 11 minutes I had to wait, I didn't have to do anything, right?
Starting point is 01:06:12 Yeah, I need to improve it and go back and iterate. But I think if you have data to work with, if you have to make decisions, and if you learn the basics, it's 100% worth it, even at that steep price tag. All right, I hope this is helpful. Make sure to join us Friday, or sorry, Monday, January 20th, all week, five episodes, our first series we've ever done. You need to listen in.
Starting point is 01:06:40 You need to pay attention. Thank you for tuning in. If this was helpful, please let us know if you're listening on the podcast. Sorry, this was a long one. You can listen to me on 2X. I'm not going to be mad. I would too. All right, but please leave us a rating.
Starting point is 01:06:52 Follow the show on Spotify or Apple Podcasts, wherever you get your podcasts. If this was helpful and you're listening on LinkedIn, please share, repost with your friends, someone who needs it. Thank you for tuning in. Go to our website, YourEverydayAI.com. So I'll see you back tomorrow and every day for more Everyday AI. Thanks, y'all. Meet Firefly AI Assistant.
Starting point is 01:07:17 Now live in Adobe Firefly, the all-in-one creative AI studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere, Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stay in control with the ability to step in and refine at any time. See it today at firefly.adobe.com. And that's a wrap for today's edition of Everyday AI.
Starting point is 01:07:53 Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit YourEverydayAI.com and sign up for our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.
