Everyday AI Podcast – An AI and ChatGPT Podcast - EP 459: OpenAI’s Best AI Agent? The correct way to use ChatGPT’s operator agent

Episode Date: February 11, 2025

OpenAI's Operator agent is a glimpse into the future of work.Even if you don't have access to the Computer-Using Agent now, you will soon. Once the whole world gets access, you'll need ...to know the best practices to get ahead. We'll be sharing those with you and doing a breakdown of OpenAI's newest (and potentially best) agent to date. Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Ask Jordan questions on OpenAIUpcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:1. OpenAI Operator explained2. Operator vs. Competition3. Use Cases for Operator4. Managing Tasks in OperatorTimestamps:00:00 "OpenAI's Smart Research Agents"08:28 "Future Rollout of $20 Plan"11:14 "Google's Mariner: Automated Browser Tasks"17:45 Frustration with AI Use Cases25:57 "Prompt Engineering Challenges"27:09 Operator Limitations with Virtual Tools35:04 "Guide to Using Google Gemini"38:30 "Steps for Google Deep Research"46:45 Optimizing Task Efficiency and Quality52:04 Skip Prepackaged Dining Experiences58:33 Automating Tasks with GPT-401:04:30 AI Revolution: Enhancing Productivity01:05:44 "LinkedIn Repost Giveaway Announcement"Keywords:Generative AI, large language models, OpenAI's operator, GPT-4, AI agents, AI predictions, livestream podcast, AI newsletter, business growth, task automation, deep research, AI tools, chat GPT tasks, AI news, screen sharing, live demos, autonomous AI, computer use agent, AI capabilities, virtual machine, AI research, AI-driven workflows, AI models, AI-enhanced productivity, AI-powered research, task scheduling, AI automation, AI ecosystem, Google Gemini, AI ethics.Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live and Adobe Firefly, the all-in-one creative AI studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. There's always been this enormous upside of generative AI and large language models, right?
Starting point is 00:00:58 From the early times of the chat GPT moment of generative AI to all these new updates in between. But I think many of us have realized that potential, right? Especially if you're a daily listener of the show. but I think a lot of the rest of the business world hasn't. There's always those doubters out there that are like, okay, cool. This AI can go to this, but when will it just go and do my work for me? Well, with Open AI's operator, that's exactly what can happen.
Starting point is 00:01:33 And that's exactly what we're going to be showing you today. All right, what's going on, y'all? My name's Jordan Wilson and welcome to Everyday AI. This is your daily live stream podcast. and free daily newsletter, helping us all not just keep up with AI, but how we can use it to get ahead to grow our companies and our careers. Because, you know, efficiencies and optimizations are one thing, but when we can actually use them to grow,
Starting point is 00:02:00 that's the whole next step that we all need to take. And you can take that next step if you haven't already on our website. So if you're new here, please go to your everyday AI.com, sign up for the free daily newsletter. So we recap, we're going to be recapping this very show, have a nice write-up with some additional resources, but we're also going to keep you in the loop with everything else going on in the world of AI so you can be the smartest person in your company when it comes to generative AI and large language models. Also on our site, you have to go check out the 2025 AI
Starting point is 00:02:37 predictions and roadmap series. It's actually been so helpful to so many people, even though it's from a couple of weeks ago. I think we're going to do some slight updates and run it again because I think it is that important that you all listen to this. So it was five very short episodes on our website at Your EverydayaI.com. Look for that 2025 AI predictions and roadmap series. All right. Normally we start off each and every day by going over what's new and noteworthy in the AI
Starting point is 00:03:07 news. Today's show is going to be a very detailed one with a lot of screen sharing. We're doing live demos. I'm going to literally show you on today's show how Operator is doing my work. All right. So if you want the AI news, make sure to go check that out in the newsletter. Also, if this show is helpful, I'm going to remind you of this again at the end, make sure to repost this on LinkedIn or Twitter.
Starting point is 00:03:31 I'm going to give you all, whoever repost this, the complete instruction set that I use inside of Operator, which takes a long time to configure and get right, as well. as for anyone that does repost this on LinkedIn. I'm going to be entering y'all into a drawing for a free 90-minute consult. So I can help you set up operator for your team, answer your generative AI questions, teach you chat, GPT, whatever it is. And we're going to be giving that away in our newsletter. All right, that's enough chit-chat.
Starting point is 00:03:59 Let's get straight into it. Is this Open AI's best AI agent? And a lot of people are using operator for the exact wrong use cases. All right. So let me just answer this. I think this is OpenAI's best AI agent. And people are using it for the exact wrong reasons, right? So operator actually came out before deep research.
Starting point is 00:04:25 So, you know, a lot of people just flooded, you know, all the, all the thread boys, I think, what they're called on Twitter and LinkedIn, right? They go gather all these use cases and they're like, oh, you know, operators amazing. It's going to, you know, change the game and blah, blah, blah, right? But because this was before Open AI's deep research, a lot of these use cases were two things. One, it was very much what Open AI demoed, which I think was wrong. So I'll get to that in a couple of minutes. And then it was just doing a lot of research.
Starting point is 00:04:55 But Open AI's deep research came out shortly thereafter. I believe deep research came out on January 31st. So you shouldn't be using operator to just go research. This is an agent, a very smart agent that can work across multiple web, websites, copy and paste things, you know, across different products. You can give it credentials to log in to whatever you're using. So I think people are using this in probably like the worst way possible, right? So you have to keep in mind OpenAI's other tools. But I do think, you know, Open AI has officially said they've released two agents. So one in operator, one in deep research,
Starting point is 00:05:34 but I'm all I'm almost going to call it like two and a half because I think tasks, chat GPUT tasks where you can essentially schedule anything in chat EBT, I think that actually has some agentic capability because when you work with it the correct way, right? And when you use your brain, we taught you all how to do that. We did a great show. I thought it was a great show on chat, you bett task.
Starting point is 00:05:54 So when you do something called task stacking and use the context of the chat, it does have agentic capabilities. It can have agency. It can make decisions. It can create new things. for you autonomously. So, you know, Open AI will say they've released two agents, one in operator, one in deep research. I'll say it's 2.5 because like I said, I think Chachvita Tas is pretty much there as well. All right. So let's get into the definitions first,
Starting point is 00:06:24 y'all. And then we're going to get to doing this live. And hey, good morning. Good morning to everyone joining us. So Pedro, joining the show from Madrid, Jason, from Florida, Douglas. We're Mondo, Atcham, Harvey, Castro, Christopher, everyone else, Michael, big bogey face. Thanks for tuning in. If you guys have questions about operator, get them in now. You know what? And if we have time, I might be able to run an operator question or two. All right.
Starting point is 00:06:54 We'll see. So what the heck is operator? All right. So this is from Open AI. So they said this is a research preview. Keep that in mind. This is the worst it's ever going to be. So this is a research preview of an agent that can use its own
Starting point is 00:07:07 browser to perform tasks for you. All right. So it is first, this was their first official agent release. And if you keep hearing the word Cua, all right, that's what this is. This is a computer use agent or Cua. All right. So it uses GPT40 to quote unquote see screenshots and then it operates a virtual computer. So it is in a slightly different interface than the normal chat. GPD, although it is essentially the same thing. So it has its own dedicated interface. You talk to operator just like you would chatGBT. It essentially takes a lot of screenshots.
Starting point is 00:07:49 It uses computer vision. And then it essentially controls a mouse and a keyboard on a virtual machine. And you can take control at any time. Keep in mind, there's obviously some limits, right? And I'm going to walk you through some of those things. The virtual machine is not very. powerful, right? So if you want to go, you know, render, you know, a video editing program or something in the, on this virtual machine that would normally require local computer power, it's not going to work
Starting point is 00:08:19 very well, right? Also, in the same way that my computer will slow down if you have 30 tabs open, so too will compute this operator from open AI. So keep that in mind. There's there's some limitations. you are using a virtual machine. However, it may slow down if you're trying to do too many things at the same time. All right. Let's talk about access and availability. Well, right now it's available to anyone with the $200 a month pro plan. So that is what I'm using.
Starting point is 00:08:52 And OpenAI CEO, Sam Altman did say that this will be rolling out to plus users. So that is the $20 a month plan in the coming months. So there is no, you know, if that means one, two months, three, four months, eight months, we don't know, right? We could see a very long, like Sora-ask rollout where it was eight months or we could see it drop in a couple of weeks. So right now it is only available for those on the $200 a month pro plan, which is what I'm using.
Starting point is 00:09:22 But like I said, it is going to be coming out at some point in the near future. So how the heck does this thing work, right? Well, according to Open AI, this is how it works. So it says operator uses a model called computer use using agent or COA built on GPT40 to interpret screenshots and interact with sites using typical browser controls like a cursor and mouse. You describe the task, example, book of flight, order groceries, and operator executes the necessary steps. If it encounters a challenge like a CAPTCHA or password field, it will pause and prompt you to take over. ensuring you stay in control. I mean, just call this out right now.
Starting point is 00:10:06 These are the absolute worst things to do with operator, right? And I'll tell you why here in a minute, but we're not going to do any of these things that Open AI suggests because it's a terrible use of your time. And I think it's a terrible use of their technology to use it how they've suggested both on their website and when they demoed it. All right, let's talk about limits.
Starting point is 00:10:28 Everyone always wants to know. All right, well, if I'm going to pay $200 a month, Can I just have like 80 instances of this thing going at once? Well, like I said, it will slow down, right? Just like a normal computer would. Each time you start a new operator chat, think of it like this. Think that you're running an old computer from 10 years ago, right? You should probably only be doing a couple of things at once.
Starting point is 00:10:54 You should probably have a couple tabs open at once. But each new operator chat that you start, it is essentially starting a new virtual machine. However, keep this in mind right now, you have to have the tab or the window active for it to keep going. So I've been trying some workarounds, right? So as an example, if I'm using Chrome or Edge, you know, launching a new profile and continuing to work. So hopefully it'll actually work here, you know, because I'm in the same instance. I'm using Chrome right now. So we might only be able to do one thing at a time for that very reason. However, you know, there are some nice workarounds, but you do have to have it active and open.
Starting point is 00:11:34 Right. If you listened, y'all to my 2025 AI roadmap series, I said virtual machines and second computers are going to be huge in 2025. And here we are a couple of weeks after that show debuted. And yeah, now you see why, right? Now I'm happy I have a stockpile of extra computers because I can just, you know, launch maybe two operators, give them extremely detailed multi-step tasks and have them literally do my work for me, right? But I have to wait. So presumably we'll see the same thing with Google's
Starting point is 00:12:04 Mariner, which is essentially their computer using agent that will be hopefully rolling out in the coming weeks and months. I did talk about that a little bit with Google's Logan Kilpatrick on Friday if you want to go back and listen to that conversation. But now's a great time if you have that extra machine to go ahead and do this because like operator with Google's Mariner, which will work as a Chrome extension. It needs an active window or an active tab because it's essentially using the instance of your browser to use a virtual machine.
Starting point is 00:12:39 That's how operator works. Mariner works. It's literally using your browser. So you can't do anything else. All right. So now, if you're like me and your slight computer quarter, this is where it pays off, right? Because you can go, you know, set it up and, you know,
Starting point is 00:12:52 essentially have one computer just always doing your work over there in the corner. But you've got to put the work into it. and you have to know how this works and how it doesn't. All right. So can operator handle multiple tasks at once? So yes, operator does allow you to run multiple tasks in parallel. However, for security reasons, operator places dynamic limits on the number of simultaneous tasks
Starting point is 00:13:13 in open conversations you can have at any given time. And these limits may change. Yeah. So there's no like hard limit. It's not like, oh, you can run two things at once or three things at once. It's dynamic, which is going to make a lie demo kind of tricky. because we might run into limits. You know, I tested everything.
Starting point is 00:13:30 Last night, everything was going well. But, I mean, we'll see how it actually works, right? All right. So let's talk about how to actually use it. So you don't, it's not in the same chat GPT interface. So you can go to operator dot chatgpt.com. Again, you have to be on that $200 pro plan. Otherwise, this isn't going to work.
Starting point is 00:13:52 Or you can log into your normal chat GPT account. And there will be an operator. in the left-hand corner where you would normally see your GBT's. And then, like I said, it has to be an active window or tab. So what the heck should you use this for? Right. Here's where I'm going to enjoy. I'm going to enjoy this slightly hot take Tuesday, right?
Starting point is 00:14:16 I actually might have a hot take Wednesday for y'all tomorrow if you want it. So number one on what types of tasks should you be giving to? operator. Probably not what you would think, okay? Because first, you have to know and understand Open AI's full tool set. So here's what I mean by that. You have to understand chat GPD tasks, all. And please, please, y'all, go listen to my chat GBT tasks show. All right. It's funny. I actually had someone from Open AI reach out after that show and they're like, you know, this was great. Like I learned so much from this. listening to this, which I was like, I was like kind of shocked on.
Starting point is 00:15:00 Right. So you need to go listen to that task show because I don't think people understand how powerful chat GPT tasks is. So that's episode 440. Go listen to it. So again, before using operator, you have to understand tasks. And you have to understand task stacking. All right. So you can literally go back and reshare that show, put together a huge guide on task stacking. All right. Then you have to understand. chat GPT, their new mode, O3 Mini plus chat GPT search. So a reasoning model that has access to the internet because that can also change what you think you might want to use operator for, right?
Starting point is 00:15:40 So a lot of the things that you're thinking, oh, I'll use operator to go do A, B, and C, it's probably already available and you just didn't know how to use it. So go listen to our O3 mini show as well, right? And I'm not just saying this, like, you know, I don't get paid 20. every time you go listen to a podcast. I get paid nothing. All right. I'm doing this to save you time, all right, and to help you get the most, you and your company get the most out of generative AI. So go listen to episode 456 on 03 mini high. And then you have to understand deep research. All right, we covered that in episode 454. All right. So open AI's deep research is outstanding. That is their other
Starting point is 00:16:22 agent. They released it. I believe on the last day of January. So just about, less than two weeks ago. So you have to understand those kind of three or four kind of tools or modes within chat chbt because a lot of the things that I see people, right, I go out and I read people's reviews or watch people's videos and I'm like, y'all are using this wrong. This is like the absolute worst thing to do because operator is slow, right? It is slow. In many instances, it is slower than a human. So you have to keep that in mind on the type of agency you are handing over to an agent. Don't hand it over something that is actually going to take the agent longer.
Starting point is 00:17:04 Okay. So what type of task should you give them? So like I said, don't give them anything OpenAI use in their demo. In their demo, Open AI, and on their blog posts, they leaned very heavily, which I don't know why. Maybe because talking about these things, I don't know, helps you imagine a future where everyone has a Jarvis, right? So they're like trying to order, you know, like tickets.
Starting point is 00:17:27 to an MBA game and trying to order groceries, right? Don't do that. Don't do that. Just because you can, right? I think they're trying to perform transactions and they're trying to show everyone, oh, you can go buy things on the internet, right? And let operator do that. Number one, it's way too time intensive.
Starting point is 00:17:47 You are not going to win back your time doing that. Because unfortunately, even when you try to over prompt it, operator is still going to ask you a lot of questions. All right. An agent is not an agent. If it has to ask you more questions, then, and if it takes more time, then it would take for you to do it on its own. So yeah, in Open AI's demos of, you know, reserving a table at a restaurant, ordering tickets, ordering groceries, those, in my opinion, those are terrible use cases because those are things that the human can probably do two to three times faster. And it's actually a quite frustrating experience. I think one of the reasons, right,
Starting point is 00:18:26 about being honest. What are the reasons why they probably demo that is open AI is using this as training data, right? And I get it. We need all of that training data in order to build the next version of operators in the next agenic system. So I get it. I get why they're probably pushing those things. And sure, maybe someone might find, you know, it's a nice party trick. But I don't know. I don't want to sit there and answer, you know, four to nine questions just to reserve a table at a restaurant, right? It doesn't make sense to me. I want to hand off operator as much of my day-to-day work as possible. Sit back, go warm up my coffee, and go do something else, right? That's the point of having an agent. So you should be used, you should not be using the pre-packaged prompt
Starting point is 00:19:14 ideas. Do not use them. All right. What you should be doing is any basic research, reading, writing, summarizing data analysis task that cannot be done in chat GPT's deep or chat GPT or deep research, right? So you should be doing these knowledge work tasks that involve you going into multiple websites, multiple software services. That's what you should be focusing on. All right. So like I said, reading and writing across different domains and services. That's number one. That's something a large language model is better at. It's faster at. It can summarize and synthesize much better, much faster than any human. All right.
Starting point is 00:19:57 So any knowledge work task connecting multiple services or any manual repetitive tasks that are time consuming and happen across multiple domains. All right. Let's look live. Are you guys ready? This could go horribly, if I'm being honest. Let's see how this works. And one of the main reasons is because I have to always have this active.
Starting point is 00:20:22 tab. So even when I'm like trying to copy and paste some stuff over, it might not work very well. All right. So live stream audience, if you could, please let me know when you can see my screen. All right. So right now I have operator open. I'm going to start on this right away. All right. And then I'm going to walk you through what's happening. Podcast audience. I always put the link to this show. So this is going to be a very visual process. I'm going to try to do my best to describe to you what's going on. But if you want to actually see it with your eyes, all right,
Starting point is 00:20:57 we always leave the link to our website. On the website, we put the YouTube video or you can go watch it on LinkedIn. All right. So I just pasted a prompt in. All right. And I'm going to, all right,
Starting point is 00:21:08 let me, thanks, thanks a live stream audience that you can see. All right. So what's happening? I'm going to go ahead and click this button here that says expand. Well, actually,
Starting point is 00:21:18 I'm not. So first I'm going to write down what, you know what? I have multiple screens here. Let's do this. Let's do this. All right. Hopefully, hopefully, of course it did this. I literally just signed in. I signed in to my Gmail account before this started. Tested it. It worked fine. So sometimes you have to enter in your credentials multiple times. So I was hoping I wouldn't have to do this. And I was hoping that we can do this, do this whole thing autonomously. All right.
Starting point is 00:21:55 So give me a second. I'm logging into my, this is my personal Gmail. So please don't spam me. I guess you can if you want. All right. All right. So now I am super, super zoomed in here.
Starting point is 00:22:14 And I can't zoom out. So give me a second. All right. There we go. So now the screen sharing should be back. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite. into one conversational experience.
Starting point is 00:22:39 Meet Firefly AI Assistant, now live in the Adobe Firefly app, the All In One Creative AI Studio. Powered by Adobe's Creative Agent, Firefly AI Assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the Assistant. The Assistant orchestrates multi-step workflows, drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator Premier, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills,
Starting point is 00:23:12 a growing library of pre-built workflows for common creative tasks, like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible, so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adobie.com.
Starting point is 00:23:43 I told you all. I am not always a fan of doing this live, even though I know y'all love doing these things live. So let's see if I can get this to work. Because of course it worked one shot when I demoed it last night. All right, here we go. So here's what I told operator to do. So I copy and pasted this in.
Starting point is 00:24:08 All right, and all I did so far is I had to log into my Gmail. I'll tell you why. So I said, step one, go to gemini.com and ask it to complete a very basic swat report for the Everyday AI podcast by Jordan Wilson. Then hit enter into our live stream audience. You see it's working on its own right now. My hands are right here. I'm not typing this in.
Starting point is 00:24:29 All right. I said step two. Then go to Google slides and copy and paste the input and outputs from that Google Gemini prompt in response. And then I'm explaining, this is after using it a little bit, I'm saying sometimes it may They ask you to install an extension for copying and pasting. If so, allow it. If not, go about your copying and pasting.
Starting point is 00:24:49 Use your best discretion on formatting. The Google slides doc should only be five pages long. Number one, title page, two, strength. So this is SWAT, right? So essentially a title page and then a page for SWOT, strength, weaknesses, opportunity threats. Step three, export the Google slides as a PDF doc. Step four, log into my Gmail and then send that PDF reports to info at your everyday AI.com, write a short subject line and a one sentence email summary.
Starting point is 00:25:22 And then I'm saying, do not in this part is important, y'all. I've been playing with operator a lot over the past like two weeks. So I'm saying, do not ask me for permission for anything. Use your best judgment. please complete this autonomously. If you run into any issues, try a second time. If your second attempt doesn't work, then try another route or get creative in accomplishing the goal.
Starting point is 00:25:46 The only important thing for you to do is to finish all four steps without human input. Please complete this task autonomously. So yeah, you'll notice that I did multiple times remind operator like, yo, don't talk to me. Right. I'm not here to be your friend, right? You have a job to do, go do this autonomously. I gave you detailed directions. Take your time.
Starting point is 00:26:11 Make sure you get this done correctly. All right. So you'll see over here for my live stream audience, I'm kind of clicking through this and a couple of things to know. So you can see a kind of summarized chain of thought on what operator is doing. So remember, this is based off of GPT4, but we almost get these, this O level. of the O series, the reasoning models, we almost get that kind of under the hood look of what it's doing. Also, no, at any time, you can go back and replay this if you want, right? And I would highly encourage you to do this, right?
Starting point is 00:26:52 So even if you don't have the $200 a month pro plan right now, you need to access this. When this does come out to maybe the plus plan, I encourage you, you have to always look at this kind of summarized chain of thought. You have to see and understand what it's doing. So you get to that by clicking this expand button. Okay, so otherwise, you can't really follow along. So I click this expand browser window button. Again, I'm in the operator interface.
Starting point is 00:27:23 And you'll see right here, it says one task in progress. I didn't want to do two simultaneous tasks, all right? So we could hopefully really walk and talk through. So you'll see also when I hover over my virtual screen here, it says take control. So at any point, if something is going wrong, I can click take control. Right now, I don't need to. I had to log in, even though literally right before I hit record, this was working fine. But there's always human in the loop, right?
Starting point is 00:27:59 But in my prompting, I really pushed and requested operating. to do this all on its own, right? There's no point in using an agent, you know, to do a task that would take a, you know, take you five minutes that, oh, working with, you know, operator takes me eight minutes. That makes no sense, right? So you are going to have to put a little bit of work into, you know, prompt engineering 101. All right. You're going to have to put in some work into learning.
Starting point is 00:28:27 All right. So now, as an example, I'm looking down and I'm seeing. what's happening here, right? I can see the actual step by step how this is thinking. So right now, I can see it was struggling to scroll down on the page. So it's about, it's about halfway done with this task. So it was struggling to find the opportunity section of the SWAT report that I asked it to generate. So again, let's even back up. So we started an operator. And then I had operator log in to Google Gemini, right? So unfortunately, operator right now can't use operator, right?
Starting point is 00:29:09 But it can use a lot of other tools that you would log into, which is great. Some websites right now, and I would assume that as computer using agents become more and more prominent, that they're going to figure out how to block these virtual machines, how to block this virtual traffic, right? at least for me, it was showing up as like a device in Iowa. I know I read back a couple of months ago that Open AI and Microsoft and others were looking at data centers in Iowa. So I'm not sure if that's what it is or if it's always just going to dynamically show up
Starting point is 00:29:42 in a new place. So you will probably have to do a lot of two-factor authentication if you are logging into sites that require your credentials. But in my opinion, that's what you should be doing. So I wouldn't be, again, I wouldn't be uploading sensitive, proprietary documents, anything like that. Right now, this is just my personal Gmail account. But I'm having it.
Starting point is 00:30:05 Go in, open Google Gemini, all right? Run a research task, right? This is something that I would normally be doing. And you'll see it's already done. So right now, it completed the presentation. It looks like it's downloading it right now. And again, I'll walk everyone through this. I want to get the second prompt started.
Starting point is 00:30:27 But it's already downloaded the file. All right. So I asked it. I said, hey, operator, go out, use Google Gemini. Then go create. So it's working between Google Gemini and Google slides. It's copying and pacing all this information. It was even resizing text, right?
Starting point is 00:30:48 Because it would enter a text box and it didn't fit. So it was resizing it all. And it's pretty impressive because it's doing this all with screenshots. All right. Let's see. So it looks like it might have stopped there. So yeah, unfortunately, it did not complete the entire task because the rest of the task, let's see.
Starting point is 00:31:13 Let's see if I can just reenter this and have it continue on. Again, y'all, like maybe I'll share the video, but it literally did this entire thing last night. But generative AI is generative. It's a roll of the dice. It's going to be a little bit different. So it looks like it didn't do step three and four, which was emailing this to myself. So now I just repasted that in there.
Starting point is 00:31:40 So it's going into my Gmail account. It's clicking on Compose. All right. So now let's see. It looks like it's finding, oh, pretty quickly there. It entered my email, the info at everyday AI. This is where it generally struggles, is attaching files.
Starting point is 00:31:59 So it essentially has this right here, a file system. And I told it over time, I found out where operator kind of shares its or keeps its files that it downloads because it's on a virtual machine. I'll probably have to fine tune those instructions a little bit because I know it's in that OAI, that open AI folder and a shared folder. So for whatever reason, I need to add a little bit more detailed instructions about where to find it because right now operator is struggling to remember.
Starting point is 00:32:30 So it's in that share folder. So we'll see if it kind of double clicks in there. So yeah, for whatever reason, it is struggling right now to find files. But that's fine. All right. So I'm going to go ahead. I'm going to stop this task. So we'll give it maybe a, I don't know, a B or a C on that one.
Starting point is 00:32:50 But let's do something even more difficult, right? That makes sense. You know, if it fails at a task that's a, you know, three out of ten, let's give it something that's extremely even harder to do, right? That makes sense. All right. So now, live stream audience, you see this. I am, this is very long.
Starting point is 00:33:10 This is very long. All right. I'm giving it a very, very difficult task. So this is something I do all the time, right? I'm not asking it to go order my pizza or, you know, go. to, you know, go find me tickets to the Warriors game, whatever. All right. So I'm telling it. Here's what I'm doing. And I'm also intentionally being a little vague. All right. So I said, for this task, you will find a trending topic in generative AI in research potential hot take
Starting point is 00:33:50 Tuesday topics for an everyday AI podcast. So I'm saying, Before I give it its steps, I'm kind of walking it through what's happening. In live stream audience, you can already see. It's on my website. It's searching, but I'm going to walk our podcast audience through how we got there. So I'm saying you will research a Google URL identifying an interesting trend or story that will be a good podcast episode. Then you will use Google's Google Gemini's deep research tool to conduct up more in-depth research on that topic. Also, you will make sure to look at the context of this chat.
Starting point is 00:34:21 That is important, y'all, right? lights, lights, gem, gem, right? Because what I'm going to do is I'm going to run this task probably a couple of times a week. And I don't want it to keep suggesting the same thing over and over. So I'm telling it, yo, look back at the context of this chat. So don't suggest something to me. You've already done. All right.
Starting point is 00:34:44 So then I'm saying step one. First, you will go to the Everyday AI podcast episodes page. So I didn't give it the URL. I wanted to see. So what it did is it went to Bing. It typed in everyday AI podcast. It went to the homepage. Then it went to the episode page.
Starting point is 00:35:01 It did this on its own. And it clicked. It clicked the search button. I wasn't looking at it closely because I was looking at my prompt here on the other screen. Let me just go through, kind of check my chain of thought a little bit. Let's see what it did. Yep.
Starting point is 00:35:15 Okay. So then it clicked the search button and it searched for Hot Take Tuesday. Right. So those are my Tuesday episodes where sometimes I bring in hot. takes. All right. So now, all right, it's working this time, y'all with no hands. This is good. So then I'm saying you need to go look at all of my hot take Tuesday episodes so you understand the type of topics. Then I gave it essentially a Boolean search on Google, right? In this Boolean search, it essentially, it's a little complex, but it essentially brings up AI news over the last 24 hours from a bunch of
Starting point is 00:35:51 big companies. So there's, it's, it's a very advanced Google search. So I copy and pasted that long URL string in there. All right. And then I said, this shows you when you paste this into Google, this shows you some of the top AI news stories for the week. Step three, you will identify one trending topic that could make a good episode idea for everyday AI. Again, play, pay close attention to the types of Hot Take Tuesday episodes that we've already covered. Step four, you will research that topic. This is what's happening on the screen now, and it's going to take a couple of minutes. You will research that topic using Google Gemini's deep research feature.
Starting point is 00:36:34 All right, you will go to jemini.com, sign in with the account that is on the screen. It did that. I said, do not skip that part. So this time, without me typing it in, it properly logged into my Google Gemini account. I have a paid account. And then I said, Google Gemini's deep research is an AI tool that performs research. You will need to click the model selector drop down in the upper left hand corner and select 1.5 pro with deep research.
Starting point is 00:37:01 You will write a prompt instructing that mode to research the hot take Tuesday topic that you selected and include any relevant information that is needed to properly research that topic for the hot take Tuesday show. And then I gave it an example. You should always be walking this through step by step. because again, this is a human process that would take me probably about 20 or 30 minutes without distraction. All right.
Starting point is 00:37:28 And you might be saying, okay, Jordan, it looks like it's already taken five to 10 minutes. Yes. Right. But I can let this run autonomously. And I do believe that there will be a way to schedule these as well in the near future. All right. So now after that, I gave an example of the type of prompt that it should put in. not going to read that because it's kind of long.
Starting point is 00:37:49 But essentially, I'm saying when you use Google deep research, you need to put in this type of prompt. So just like you would give a large language model shots, right, a five shot prompt, five shot is better than a no shot prompt. I'm giving it some examples of what's good and what's bad when it's using deep research. All right. And then I'm saying, please be, please be exhaustive in your search, making sure to tackle this from every angle.
Starting point is 00:38:18 And then I'm saying step five, Google Deep Research will give you a content plan and you will click select the blue button that says start research. Right. So there's actually multiple steps inside Google Deep Research. So it first needed to look at my example of a prompt, apply that to the essentially the Boolean research that it went off and did on its own, right? So are you looking at the number of steps here, y'all? And essentially the agency that I'm giving this agent, right?
Starting point is 00:38:47 I'm saying, yo, go look at my hot take Tuesday. Essentially think like me, see what I cover. Then go do all my research. I believe it went through about 40 to 50 search results using that Boolean, essentially search URL that I shared with it. So it's looking at all these different news stories, trying to identify trends based on things that I already cover. All right.
Starting point is 00:39:09 This is great. Then on top of that, without, you know, my hands have been in the air the whole time, more or less, right? then without any other instruction, it is going straight into Google Gemini's deep research. I gave it an example of how to use it. Otherwise, it's going to stink. It had to verify, right? That's the other thing.
Starting point is 00:39:27 Google deep research essentially, it starts and puts this plan together for you. And then it had to click to verify it. And then I told it. I think I told it or maybe I told it in a different one. Okay. So I didn't even. Okay, I did. Okay.
Starting point is 00:39:45 So I did say step seven, you will have to wait two to ten minutes for this to finish, right? And you'll see on my screen right now, it keeps, operator essentially keeps taking a screenshot. And it keeps saying awaiting completion of research analysis, right, waiting for research analysis completion. But I told it, I said, you will have to wait two to ten minutes for it to finish. There is a small icon that looks like two windows and a purple-ish statish indicator. All right, you will need to be patient for this to finish. And then I said, eventually, on the left hand side, it will say something like,
Starting point is 00:40:22 I've completed your research. Then on the upper right-hand portion of the screen, there will be a light blue button that says, open in docks. Please click that button. So you'll see right now in Google Deep Research, it's research 76 websites already, right? I hope in the future, right, that you will be able to use, which I'm sure you'll be able to, that you'll be able to use Open AI's operator with tasks with Open AI deep research, but right now you can't, right?
Starting point is 00:40:53 But this is the literal process that I always do. So you'll see right now, live stream audience, it finished. It finished completing the document. So it looks like it's trying to open the document. And for whatever, okay, there we go. it had to try it a couple of times, but it put together, it put together this document. So what it was, what it decided the hot take Tuesday to be was kind of the ethical, the impact of AI on pricing and its ethical implications, which is actually pretty, pretty fascinating,
Starting point is 00:41:26 right? Because when intelligence becomes cheaper and cheaper, what happens to humans and the ethics behind that, right? So pretty cool topic there that it decided to put together. All right. So now I told it, I said, please save this document as a PDF. So it looks like it saved it as a PDF. So that's good. Then I also said, before exiting this Google Doc, we want to copy all the text. You can do that by clicking and dragging or just by pressing Command A or Control A, then Command A or Control C. Then I told it, please go to Notebook LM, right? If it does not log you in, click on the notebook outline button. If it does log you in, click on the blue create new button in the upper left hand side of the screen, which is what it's doing now. Then I said click on ad source. It's literally doing this in real time. And then I said paste in all that information. Bam, it just did that.
Starting point is 00:42:23 Let's see if it does the next step here. This is pretty, pretty impressive. Good. It just clicked generate. So it's generating an audio overview for me at the same time. Are you guys seeing what's happening here. This is what I do. This is what I do all the time, right? I look on my website. I'm like, all right, I got to plan a show for this week. Let me see what I've covered recently, right? I might go look at stats from our podcast as well, which I could do this, right? I could do this. All right. Let's see. It looks like I was hoping it would finish it all. Let's see if it's going to. But this is what I would do. I would go look on my website. I would go do a bunch of research on, you know, Google or, you know, deep, deep research, honestly, from Open AI, but I can't do that
Starting point is 00:43:09 right now. And then I would go in, I would go into deep research. I would take that topic, have it do a bunch of research. I would copy and paste that, put it into notebook L.M, generate an audio overview. This is literally what I would do, all right? And now, hopefully, it's, wouldn't this be weird? Let's see. Oh, it said, it paused while I was away, because I wasn't clicked on there. So I'm not going to count that as anything because I was just clicked on my other window. All right.
Starting point is 00:43:37 So isn't this wild? So now it's going to, let's see if it can actually finish this task because the first time it failed a little bit. All right. So my last parts of the task of the task are to go to my Gmail, send this to info at your everyday AI.com. Put a subject line in a brief.
Starting point is 00:43:59 Love this. Oh, look at that. It actually did it. It did it correctly on the second time there. It found the attachment right away. Bam. Look at that. It did the entire thing, right?
Starting point is 00:44:13 It did the entire thing. All right. So now, just to hopefully prove to everyone, I'm going to go ahead and open my email account. All right. There's a reason. There's a reason I did this on my old camera here. I'm sure no one really noticed, but I have to have my phone, my phone available here for all the two FAs, because now my computer, because I was essentially using a browser from a probably another state here in the U.S.
Starting point is 00:44:46 It's getting a little confused, and I'm having to re-log into everything, which is a little annoying, but that's fine. So, all right, let's see. Let's go ahead and share my screen here, y'all. Look at this. Email from myself. Look at this. Here's the email, y'all. Hello, please find the attached PDF document detailing the impact of AI on pricing strategies and ethical considerations surrounding its use. This report highlights key points such as AI's potential to lower prices, ethical concerns like bias and lack of transparency, and the importance of regulatory measures. Best regards. I love that I just best regarded myself.
Starting point is 00:45:33 live here on the everyday AI show, then I can click. Here is the deep research. So look at this. There we go. And then I would probably take it one step further and have it also download the MP3 from from notebook LM and attach that as well. But I wanted to show you an example of this is what I actually do, right? this task would have probably taken me, like I said, 20 minutes.
Starting point is 00:46:07 I should have timed it. I can go back and look. And you know what? We're going to go share that screen anyways. So we can go back and look and see exactly what happened. All right. So if I go up here, all right. So it says worked for 11 minutes.
Starting point is 00:46:26 All right. There we go. Worked for 11 minutes. So this process by myself would probably, probably, like I said, probably takes me about 20 minutes. So you might be thinking, okay, Jordan, well, a two for one tradeoff. What's the big deal? Right.
Starting point is 00:46:39 Number one, this is something I can go be doing other things, right? I did get this working last night when I'm not doing a live stream where I was doing my own work just in another Chrome or Edge profile and it was working perfectly, right? So it just did my work at a very high level. Right. And this was essentially my first time doing this. And as I always tell you, anyone that's taken our, you know, free prime prompt polish course. And I know it's been like two months since we did that. I'm sorry. We're going to have new dates coming up. I'm getting a ton of emails on that. Essentially our, you know, hosting provider changed their plan. So we're moving it. We're rebuilding it
Starting point is 00:47:23 literally from scratch. It's, I think it is going to be the best basic chat GPT course on the internet. I think it's going to be better than courses that cost, you know, $1,000. It's all going to be for free. So even if you've taken our PPP course like five times, you're going to want to take this new updated one, FYI. So anyways, this is a task that I would do in getting a two to one. And anyways, what I was getting back to, I'm going to go back and I'm going to look. I'm going to look at this kind of chain of thought. I'm going to see what worked well and what didn't.
Starting point is 00:47:56 Right. All right. So doing this one time doesn't mean a whole lot, right? That's just to get the process down. So I want you to think, what are those manual time-consuming tasks that you do across different domains, across different websites that you maybe have to be logged into? I just gave you an example of a task that I do fairly often, right? I'm going back. I'm looking at my old episodes. I'm doing some research on Google. I'm using my brain. I'm thinking, right? But now I can go back and look at this kind of chain of thought on operator, see, see what I like
Starting point is 00:48:39 that it did because I can literally go back and watch the recording, which is great. And I can see step by step. So then I can kind of save my set of instructions, change them, improve them, right? So maybe that 11 minutes will get down to eight minutes. But not only that, but then I can look at increasing the quality of the output. now I can not only do it in half the time, but I could do it even better, right? I can maybe make that task, oh, this is something that would now take me 30 or 40 minutes and maybe I can still do it in 10 minutes while I'm doing something else.
Starting point is 00:49:13 And then think, think of these three, five, 10 ongoing little projects or tasks that you do all the time. And maybe right now there's no other way to automate that, right? Maybe right now you're just automating the pieces, but you can't automate the whole. This is where operator changes that, right? So, yes, some of these things were already, you know, you could already do by using something maybe like Zapier, by using some APIs or make.com or something like that, right? And speaking of, we have to talk about APIs, right?
Starting point is 00:49:48 This is how like 1% of the internet talks to each other, right? But what about for the other 99%? This is where CUA or computer use agents comes into play. You also have to tip your hat to the Anthropic team that came out with their computer using agent. I think it was back in October. It just wasn't usable, right? You had to download like Docker, which is an extremely, you know, compute intensive program on your desktop. You had to go into a GitHub repo, you know, and it timed out like every five.
Starting point is 00:50:22 seconds. There you just saw it did in 11 minute task all on its own. I didn't, you know, limit out or anything like that. Granted, I am on that $200 a month pro plan. All right. I do want to show you a couple other things on the operator interface. Okay. So like I said, this does look kind of like a chat GPT. A couple of things. I wish you could rename, rename these kind of operator tasks. So you can't write. now, you can only delete them. That's one thing to keep in mind. Another thing is you're always going to have your active tasks. So I have run up to three at the same time. I don't know if that actually slowed it down or not. But keep in mind, there's limits that are dynamic, so you don't
Starting point is 00:51:09 know what that actually means. Let's go into the settings because this is kind of important. So you can go in here to save tasks. So I'm going to go into the one that we just did. And then I'm to go click save tasks. All right, it's going to auto generate a title, the detailed instructions. So in this case, I would not use these detailed instructions. It's the same kind of piece of advice that I gave you guys for chat GPT tasks. Never let chat GPT save instructions on its own. It's not going to work. So it really just abbreviated those instructions. So I'm going to paste all of these in. So it has it. And then so it says title, research, trending, AI topic, the detailed instructions.
Starting point is 00:51:57 I copy and paste those in manually. And then it says websites, right? So it's going to use, you know, Gmail.com. It's going to use your everyday AI.com. So if it ever starts going in the wrong direction, you can put that there, right? Gemini. dot Google.com. And then we had notebook l.m.
Starting point is 00:52:19 Right. So now, if I am running into issues, I can essentially save this as a task first. Let's see. It doesn't look like it saved it. Let me just double check that there. I'm so zoomed in on my interface here. I think I just had to zoom out. There we go. All right. So then, yeah, I can go here, type in the URLs, whatever. I'm just showing you all that. Oh, here's the downside. So it looks like this is why I didn't save. The instructions cannot exceed a thousand characters, which that stinks. So let's just show you what this looks like. So this wouldn't work now. All right. Well, let's just go.
Starting point is 00:53:12 Let's just click Save Task. Sorry, y'all. All right. I'm going to save that. So now that is going to show up in my saved task right there. So then at any time, I can go in and modify that as well. All right, a couple other things. And these are things I don't even think you should pay much attention to if I'm being honest, right?
Starting point is 00:53:33 So when you do go to the homepage here, so now I have my saved task. And I can click that. I can edit it or I can click it and it will launch it right there. But don't pay attention. These are the things that OpenAI demo, don't pay attention to these, these dining and events. So these are essentially pre-packed. packaged prompts. And, you know, it does look like Open AI partnered with some of these websites and
Starting point is 00:54:01 companies to provide a more seamless experience. Like I would never use operator for any of these tasks because it requires too much human in the loop. Like when I'm using an agent, I want to save time. I don't want to sit there and just be like, oh, cool. And then like answer a question every 45 seconds. That's a waste of time. Right.
Starting point is 00:54:20 So you can go through here and, you know, use open table to reserve a table. or Stubhubhub to do tickets, you know, Uber Eats, Instacart, right? All these things, thumbtack, Uber, like, no, I'm not going to use operator to do in Uber. I'm going to use my Uber app, right? But a couple of other things to keep in mind, you can go in here into your websites. So for all of these, you can give them custom instructions. So for booking.com, I can go in and say, set instructions. I could say, you know, like I like, you know, modern, modern interiors and
Starting point is 00:55:03 outdoor spaces, right? So then if I'm using booking.com or whatever, it will take those preferences into mind. So I wish, so you can do that for all of those websites that they work with as well as news. So these are all the news organizations that OpenAI has partnered with. So I can go to, you know, the Associated Press. I can click Edit and, you know, type in custom instructions for the Associated Press as one example. So I hope and wish that in the future, you'll be able to add your own websites, that you'll be able to store your credentials for all of those, right? That would be extremely helpful.
Starting point is 00:55:44 All right, y'all, that was a lot. So I think there's a couple of questions. I know that this is already an extremely long episode. And she just said, holy sh. All right. Sandra said she was blown away. All right. That's good.
Starting point is 00:55:59 So this was helpful. All right. That's good. So, yeah, even though this was a little bit of a longer, of a longer process here, y'all. Thank you. So, all right. I see a couple of questions. I'm going to try to answer some of these as quickly as possible.
Starting point is 00:56:15 All right. Just scrolling through. Let's look at some questions. Douglas, have you checked out any open source operator solutions? Yes. So there's browser use. There's a couple of other ones that have become extremely popular. I've done a couple tests, but I'm using operator more, right?
Starting point is 00:56:35 The reason why there's, yes, there's other great kind of open source-esque and fully open source projects that do this. The reason I'm not doing them is because you have to think of the future, right? The future is operator is probably within hopefully weeks or months going to be able to work with chat GPT tasks. It's going to be able to work with open research. So in my mind, it is not worth, like, I think you have to choose your ecosystem, right? And I'm choosing, right, for at least when I'm on my Mac, right, I have my Windows computer,
Starting point is 00:57:09 my Windows copilot plus PC. I still got to get set up and using. But for the most part, I'm using in my day to day, I'm using chat chitbd, right? I have free plans, plus plans, team plans. pro plans, enterprise plans, because we train companies, obviously, right? This is my business operating system. So I'm not, even though there are, you know, some other better, or I won't say better, there's some alternatives that may be cheaper.
Starting point is 00:57:38 But I'm working for the future here, Douglas. I'm not working for today, right? Because in the coming probably weeks, months, operator is probably going to start working with everything else. So I am currently building skills and using operator that are going to pay off as number one, operator gets better. And number two, it starts to work with all the other products and tools in Open AI's ecosystem. Woozy, what's the coolest use case you've seen anyone do with it, Jordan?
Starting point is 00:58:06 What's up, Woozy? Hey, I'm sorry about your chiefs, buddy. I'm sorry. Caught a beating there. All right. So what's the coolest use case you've seen? I mean, it's limited, right? It's limited because right now the virtual machines that this use, they don't have a lot of computing power.
Starting point is 00:58:24 So I don't know. If I'm being honest, some of the coolest stuff is what I showed you guys, right? Using deep research, using other large language models, I think is great. I think it would be cool when it can consistently handle using something like a cursor or something like GitHub co-pilot. But right now it's not there because you still have to have kind of. of that quote unquote virtual machine compute and it doesn't have. So anytime you try to do anything that's a little too, you know, power intensive, you're going to get a warning.
Starting point is 00:59:00 Sandra, one of your prompt classes resuming, hopefully in March. Pedro, how could you prompt the model to be iterative with other AI models? So yeah, I kind of just showed you an example of that, right? It was using Gemini. So, and I did give it an example of a prompt to do. do the deep research. So you have to give it examples, you know, in your instructions, essentially. Another question, Pedro, would you use this to dive deep into X using GROC to search for news and hot topics and process the data as you did? Maybe. I personally think GROC stinks.
Starting point is 00:59:38 The only thing that I think GROC is decent at is searching X or Twitter. And in many instances, for what I want to use it for, it doesn't do well. So a lot of times I'll say, like, Today's, you know, February, let's say today's February 11th, right? I'll say, hey, give me the top AI news for February 11th, and it'll bring in things from two weeks ago, right? So I don't think Croc is a good model. I wouldn't recommend businesses use it. So I'm not using operator to, you know, do anything. Big Bogey says, looks like it needs to prove it.
Starting point is 01:00:07 Hot take. How do you rate it? It's an A, right? Especially after using some of these open source tools and Claude's computer use, it's an A, right? A lot of times what I find is once you go through and you improve, you run something once, you look step by step and see what it does, and then you improve your instructions. In most cases, it's going to do it extremely well. I mean, in my use case, I had it query something, click on my website, click on the search
Starting point is 01:00:38 bar, search for something, go back, use the pageination or pageination, right? Look at multiple pages of my website, understand trends. then go use Boolean search, research something, find what it thought was helpful. Go in, then in deep research, which requires multiple steps. Right? Like, you saw what it did. That is amazing. And maybe I'm just blown away because these are the, what I feel are mundane, repetitive
Starting point is 01:01:05 tasks that I do over and over. And now I can just be like, yo, operator, you go do this. And then it's going to get better at it than me. Because guess what? It is using the GBT40 model. So it will be able to summarize, synthesize, and understand information better than I can't, period. Right. So how do I rate it, A?
Starting point is 01:01:24 Right. If I look at this in six months because it's probably going to improve, I will probably look back at it and be like, yo, that was a D. But right now, it's extremely exciting. Cecilia, how are your passwords protected when you have the agent log into your accounts? So that's a good question, Cecilia. I read that last night. I thought I took a screenshot of it and put it in my account.
Starting point is 01:01:45 presentation, I didn't. So I'll make sure to put that in the newsletter. Pedro, should companies set agent accounts? Yeah. I mean, companies need to be using agents, period. Marie says, I see it can save the task. Does it also save the sidebar commentary? That is saved by default. So you don't have to click save task to save that sidebar commentary. So I can go through at any time, anything that I've run in operator. I can literally go and rewatch the entire process with the commentary. So you just have to click that expand window. And I can go, just like you can kind of see that chain of thought. I can see the entire step-by-step process in there. All right. Sandra says, can it use Can it use Canva? I don't know. Should we find out? Should we find out here? Well, actually, no, that's going to take too long. I'm going to have to 2FA it. But I believe, yes, it can from what I remember in my research, Sandra. But it's not going to work very well. Right. It's not for anything you want it to do that's extremely visual. It's not going to work very well. Because essentially, what it does, even to click and type things in, it takes a screenshot. So if you were like, oh, go, you know, update this template or, you know, update this template.
Starting point is 01:03:11 or create a design that's not really what it's for right now, at least, maybe in the future, it will do a good job at that. But you saw it put together a very, I mean, albeit plain, it put together a PDF presentation for me. It resized the font. You know, it's not going to win any design awards, but it at least went to Google slides and copy and paste it all that information over there that it did for the SWAT analysis.
Starting point is 01:03:34 All right, Doug was asking, does the refine Q principles work here? Yes, it does. your basic prompt engineering basics are always going to work. It's always going to improve it. You always need to iterate on the result. Don't run it once and say, oh, this is the best it's going to be. No, run it once, watch it, right? It's very tempting to just let it run and then go do something else.
Starting point is 01:03:56 But again, think of that task that you do every single day that takes you 30 minutes, takes you two hours. It might take you way more than that to automate this and to make it, you know, a solid operator workflow, but think if then you can get that two-hour task to you don't have to do it. That's amazing. But you're going to have to reiterate. So, yes, the refined Q approach that we teach in our free prime prompt polish PVP
Starting point is 01:04:24 course does work fairly well. And yes, basic prompt engineering, you know, works well. Give it examples. Tell it what's good and what's bad, right? Provide feedback, you know, improve your set of instructions. each time, rerun it, tweak it, right? You need to be doing these things. It's not, you know, agenic systems are not one shot.
Starting point is 01:04:47 They require human in the loop. They require constant improvement, constant refinement, because they're going to get better and better as we go. All right, looks like I tackled all the questions. So I hope this was helpful, but let me just recap it. Is Open AI's best AI agent operator? Yes, it is. Is it the one I'm going to use the most?
Starting point is 01:05:08 Probably not, right? If I'm being honest, I'm using deep research a ton. I'm using tasks a ton because they're running, they're scheduled. They're running autonomously. But I do think operator is the best because like I started the show out with, right? I think a lot of people are seeing these individual, these fragmented use cases of AI, right? But they're like, I still have to take these 20 pieces and put them together. Right. So a lot of people say, okay, it's not just doing my work yet. I thought that's what, you know, the future of AI in large language models, it was just going to do our work. Well, here we are, you know, going from the reasoners step to the agent. We're there. Right. I just showed you. That is a task that I do over and over and over and over again. I just trained live here on the show. I just trained operator to do it.
Starting point is 01:06:06 for me. And I'm going to go in and I'm going to improve it, right? I'm going to have them send me that, you know, notebook, LM, deep dive, or maybe send me a link to it, right? But now I can do better, right? I can do better. Instead of maybe looking at one of those reports, I can have a do three. And then I can sit. I can read the report. I can listen to the deep dive and I can use more of my brain, more of my creative ability, more of my kind of strategic decision making, right? I can leave, some of those mundane, repetitive, manual tasks that up until operator, I could not fully automate, but now I can't. So that's why I do think, I'm not saying, I don't say these things lightly. This is a revolutionary step. This is a giant leap for the future of AI because the future of
Starting point is 01:06:57 AI, like we've been saying for a long time, it's agentic, right? It is working in a multi-agent environments, giving agency, decision making, passwords, right, giving everything to an AI system, keeping the human in the loop, but then changing what we as humans work on. All right, I hope this was helpful, y'all. If so, if you want to put this in practice, I'm going to send you an example of exactly what I did. I will send you my instructions. So just go click repost if this was helpful.
Starting point is 01:07:26 If you're listening on LinkedIn or Twitter, just click that repost button. You can tag me in the post and, you know, to make sure I'll send this to you. you know, also for anyone that does repost this, I'm putting this out there. I don't know what we charge anymore for like a 90 minute consult. I think it's, I don't know, like $350 or $400, something like that, right? Anyone that goes and shares this on LinkedIn, I'm going to enter you into a little giveaway. I'm going to announce it in the newsletter probably next week. So then that way our podcast audience, you all have time to go click the LinkedIn.
Starting point is 01:08:02 show for this, go click repost, right? So I don't know, whether there's two people or 50 people that reshare this. I'm going to put all your names in a digital hat. I'm going to draw one and then give you all whoever does win this a 90 minute consult. All right. So whether you want me to help walk your team through operator, whether you have questions about chat GPT, whatever it may be, you get 90 minutes, all right. I'm not going to put together anything for you. You essentially just get get my time, right? Talk to me. I'll answer questions. Whatever it is you need, I'll do that. So make sure to share and repost this if this was helpful. Also, go make sure you check out that AI predictions and roadmap series. Thank you for tuning in. I know this was a long
Starting point is 01:08:46 one. I hope it was helpful. I hope I see you back tomorrow and every day for more everyday AI. Thanks, y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest. creating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any time. See it today at firefly.adobie.com.
Starting point is 01:09:29 And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. more AI magic. Visit your everyday AI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.