Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 649: The 7 Types of AI Agents and the 10 Top Agents for Businesses to Grow

Episode Date: November 7, 2025

There's like a bajillion AI agents. 🤖But most of the REAL agents fall into these 7 categories that you need to understand.Oh.... and don't worry. We'll break down the top 10 AI Agent...s for business growth.Join us as we go over Agents 101, the 7 categories of AI agents, and the 10 you should be paying most attention to.Ep 649: The 7 Types of AI Agents and the 10 Top Agents for Businesses to GrowNewsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Thoughts on this? Join the convo and connect with other AI leaders on LinkedIn.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:AI Agent Market Growth OverviewDefining AI Agents vs. ChatbotsAI Agents vs. Large Language ModelsSeven Types of AI Agent CategoriesAI Agent Adoption in Enterprise WorkflowsRisks and Pitfalls of AI Agent UsageTop 10 AI Agents for Business GrowthKey Features of Leading AI AgentsSelecting the Right AI Agent StrategyFive-Day Plan for AI Agent ImplementationTimestamps:00:00 "Agentic AI: Hype vs Reality"04:19 "AI Agents Need Human Oversight"07:26 "AI Agents Embedded in Workflows"11:23 "AI Agents vs Agentic Browsers"15:14 "Generative AI Expertise & Training"19:32 "AI Costs: Endless Loops Warning"20:49 "ChatGPT Agent Mode Overview"24:19 "Microsoft Copilot Permissions Challenges"29:20 "Agent Force: Salesforce Automation Tool"31:19 "Mastering AI Tools for Leaders"35:11 "Replit’s AI: Empowering Non-Tech Users"36:35 Zapier Agents Automate Workflows41:13 "Robust Cloud Autonomy Explained"45:51 Safe AI Implementation Guidelines46:26 Human Errors and AI Mistakes49:59 AI Agents & Enterprise FutureKeywords:AI agent, AI agents, agentic AI, agentic AI market, autonomous AI agent, enterprise AI agent, business AI agent, generative AI, agent washing, agent mode, ChatGPT agent mode, Microsoft Copilot Studio, Claude Code, Anthropic Claude, Google Project Mariner, Project Astra, AWS Bedrock agents, large language model, reasoning models, sub agents, coding models, software engineering agents, enterprise workflow automators, specialized research agents, anSend Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Start Here ▶️Not sure where to start when it comes to AI? Start with our Start Here Series. You can listen to the first drop -- Episode 691 -- or get free access to our Inner Cricle community and all episodes: StartHereSeries.com Also, here's a link to the entire series on a Spotify playlist. 

Transcript
Discussion (0)
Starting point is 00:00:00 This is the Everyday AI Show, the Everyday Podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio. Just describe what you want to create and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome. The assistant accelerates execution. There's literally hundreds of options out there when it comes to AI agent options.
Starting point is 00:00:54 It's overwhelming. Someone that covers generative AI every single day. I almost like get tired of seeing new AI agents. Part of that is, well, one of the reasons is Gartner came out with some recent research going over the agent washing epidemic, where they found that nine. 95% of companies even promoting AI agents weren't actually AI agents. So what do you do? Do you need to use a bunch of them and find out what's good for you?
Starting point is 00:01:23 How do you separate the real from the fake and how do you actually use them? Well, we're going to be hopefully answering those questions and a lot more today on everyday AI as we go over AI agents. From automation to super agents, 10 AI agents you should know. I'm excited for today's conversation. Hope you are too. What's going on y'all? My name's Jordan Wilson. Welcome to Everyday AI. This is your daily live stream podcast and free daily news that are helping everyday business leaders like you and me. Cut through the fluff. Understand what's going on, not just learn what's real and what's not,
Starting point is 00:01:56 but how we can all actually leverage that to grow our companies in our careers. If that's like, yo, that's what I'm trying to do. You're in the right place. It starts here with the unedited, unscripted, live stream podcast. But if you want to take it to the next level, you got to go to our website, your everyday AI.com. There, go sign up for the free daily newsletter. We're going to be recapping all the highlights from today's show in case you missed anything. I're out walking your dog or wishing I would shut up so you could finish your walk on the treadmill, right? But we're also going to keep you up to date with everything else that's happening in the world of AI news.
Starting point is 00:02:28 There's a lot over the past 24 hours. So make sure you go check that out in the newsletter. So AI agents, what's real, what's not? What should you be paying attention to? It seems like almost an endless onslaught. Like agents are being shoved down our throats. But what's the difference between an agent in chat GPT? Should we be using agents?
Starting point is 00:02:51 How? And for what purpose is. That's what we're going to be talking about on today's show. So we're going to be going over and diving into the explosive growth of the agentic AI market. We're going to uncover the hidden challenges and potential pitfalls in agentic AI adoption. Tell you the real reasons you should be paying attention. AI agents in 2025, not just because they're shiny and new, but the real reasons. And we're going to break down the 10 most popular and widely used AI agents across different
Starting point is 00:03:18 disciplines. Y'all wanted this. All right. So, yeah, if you listen to the podcast, when I say, go sign out for the newsletter, it's for reasons like this. I work for you guys, right? In the newsletter, I said, hey, are we talking about agents too much or not enough? And you guys overwhelmingly said, feed us more agent content.
Starting point is 00:03:37 So if you're a listener of the podcast and you haven't subscribed to the newsletter, that's the only way I can ask you guys what you want to know more of, but you said you wanted more agents. And at the end, I've been doing this. I've been putting together these little kind of bonus guides because I can't fit everything into, you know, 30, 40 minute episode. This one's good. All right.
Starting point is 00:03:58 I've, I'm not going to brag. I've pulled out some banger, like extra bonus content this week. So if you want, if you want access to it, there's 20 total agents. There's 10 that were really good, but I couldn't just make the cut. So go repost this show on LinkedIn. The link is always in the podcast show notes.
Starting point is 00:04:18 We put the link in the newsletter. And so make sure you go repost that and I will send it to you. It is done. It is ready to go. It is a comprehensive guide on 20 AI agents. So let's talk. What the heck is an AI agent? Well, an AI agent can plan, act,
Starting point is 00:04:35 and also self-correct. That's the biggest thing to get work done. So an AI chatbot, you're just chatting with it. An AI agent can plan on its own. It can reiterate. And ultimately, it takes actions on your behalf. The whole human in the loop thing that I hate. I think human in the loop is bad.
Starting point is 00:04:54 I think expertise-driven loops are what we should be focusing on. Anyways, you've got to have humans overseeing AI agents because they can make decisions and, and actions on your behalf, which is both really cool and extremely useful for enterprises, but also extremely dangerous. Because if you don't have the right humans driving that loop,
Starting point is 00:05:16 right, the feedback self-correcting loop, yeah, agents can be bad, right? And I've talked about that plenty of the show. But the AI agent or sorry, the AI agent market is projected to explode past $7.5 billion this year. I would expect it goes well beyond that. And this represents,
Starting point is 00:05:34 I think a pivotal shift from AI, right? That just answers our questions to get things done, right? And we've seen whether they're advanced or simple offerings, right? Because, you know, at this time last year, there weren't really no AI agents available from the big players to the general public. Now there is Microsoft. Co-Pilot Studio. chat gbtte their agent mode claude athropics clod has multiple different agents a lot of them for coding but they have a computer using agent as well google they have their project mariner project astra hopefully they release their agent space to the masses soon right but every single big player has an agent that's available today are they all great no are some of them really really really
Starting point is 00:06:31 good? Absolutely. But whether you want to know it or not, the future of work in AI native workplaces isn't talking to a chat pot. It isn't just like, all right, well, well, now we have a chatbot plus our business, you know, info. We have rag. We have, you know, these connectors. No, it's smart humans like you and me, overseeing agents, right? And right now, studies show that over 80, percent of enterprises are adopting agents right now. So there's massive opportunities for all of us tuning in because if you're tuning in and listening to this, you're ahead of the curve. But it's all about what are you going to do with this information? And you have to be able to understand the risks. So like I said, what does, how is this getting to the point now? Because it's more than just talk.
Starting point is 00:07:25 Like I said, we've been talking about AI agents on this show for multiple years. But the good thing is, I got the receipts two years ago. I'm like, nope, we're not ready. 24, too early. Now we're ready because agents are already showing up where we work. That's the thing. Maybe 18 months ago, two years ago, you had to really go out of your way and try really hard to make anything on the AI agent side work.
Starting point is 00:07:51 You had to have a lot of duct tape, hope, and dreams, and maybe you got a little juice out of it. Now agents are showing up in the actual enterprise systems where we've worked. So that's in our documents, email, CRN, browsers, Ide's, etc. And the tooling has obviously matured because the models, right, most AI agents are powered by one or a series of underlying models, right? A lot of them have five, 10 plus models running kind of the agendasic capabilities and a main agent will break off a complex multi-step tasks to multiple sub-agents. And all of those sub-agents are running, you know,
Starting point is 00:08:27 maybe one of them is running a coding model, from Anthropic. Maybe one is running a reasoning model from Google. Maybe another one's running, you know, like a, you know, GPT4.1 from open AI or something like that. So these models themselves are much more capable, much more robust than they were six to 12 months ago, especially now that we have these reasoning models and, you know, all of the scaffolding that comes along with these models. And that's another reason why they are not just explosively growing right now, but are actually explosively useful if you know what you're doing, right? The tooling has matured and big vendors have shipped platform agents as well.
Starting point is 00:09:07 Focused startups have shipped specialized agents. Everyone is shipping them. And they are now more than just co-pilots. They are pilots. They can go out and do work on your own, right? Very simple use case. I have chat GPT, their agent mode, probably one of the easiest to use, not the most useful unless you are decent.
Starting point is 00:09:29 enough at prompt engineering, but once you get it, I have my agent. It's on a scheduled run. I don't do anything. It just goes out, execute tasks for me every single day at a certain time. So again, what's the difference? What's the difference between an AI agent versus a model? All right. And hey, live stream audience, love to see you.
Starting point is 00:09:51 Tune it in. Sorry, she'll give you a shout out a little more. But let me know if you have any questions. Someone said I would love to see an episode on Perplexity's agentic browser comment. Already did it. LinkedIn user. So go go check that out on our website. It's free. Go go go check it out. But yeah, live stream audience, if you have any questions, please, please let me know. I'll try to answer them. Just put a question in your in your comment. So I can make sure I can see
Starting point is 00:10:17 it just because we have comments rolling in from all the different platforms. But what's the difference between an AI agent in a large language model? I've done an entire episode on this, but to put it simply, the lines are blurring because now the base model, are agentic in nature, whereas nine months ago, they weren't. Right? So essentially, we had our, let's say, rewind the clock a year ago. The models, the most powerful models in the world were transformer models. They didn't really have a bunch of useful tools, right? All the scaffolding that make models really good. And they couldn't, they weren't reasoning models. So they couldn't plan and think and retrace their steps and come to a fork on the road and try a couple
Starting point is 00:11:02 flip things on the left side of the road and figure out, oh, that's wrong. I need to go back to the middle and use some different tools, right? That's an agentic model with tool use. We didn't have that, right? So the lines between what is an AI agent and what is an agentic model, they are kind of blurring, but for the most part, if you think of traditional old school AI models, large language models, they're chatbots, right? They're obviously becoming much more robust and they're becoming entire operating systems now, but you know, you are still having as the human to take that information and do something with it. Whereas now AI agents, you give them a goal. You make sure they have access to the data that they need and then they go figure it out. Right. Whereas I'd say
Starting point is 00:11:45 more chat box kind of self-service, right? AI agents, autonomous cars, they do it for you. An agent holds a goal, chooses its own tools, takes its own action, and checks its own action and checks its own work. And a lot of times, it's not just taking the best available road. It is really in real time building its own role. Right. So let's talk about categories of AI agents. Because there's a lot. And this is also where it starts to get a little confusing. Because like someone already said, you also have your agentic browsers now, right? Which I actually think, at least for, hey, what has more utility today? I would say it's almost in some use case. is agentic browsers, right?
Starting point is 00:12:34 If you look at top to bottom, if you pick a middle of the road, AI agent versus an agentic browser, in the short run, I think agentic browsers have much more utility for the average business leader right now. But we're not talking about agentic browsers. We're not talking about agentic models.
Starting point is 00:12:53 We are actually talking about AI agents, complete systems. And I would break them into these seven categories. So number one would be autonomous software developers. developers such as Devon, right? So these are agents designed to handle end-to-end software engineering tasks from writing and debugging code to testing and in deployment. Then you have your general purpose task agents, which is like chat GPT agent mode. So these are platforms that act as a versatile digital assistant that can plan and execute a wide variety of complex, multi-step tasks for professionals and consumers. You have your enterprise workflow automators. That would be kind of the
Starting point is 00:13:29 third category. These are like Microsoft co-pilot studio, right? They can automate company-wide workflows and they're all connected to your dynamic enterprise data. Then we have specialized research and analysis agents such as Gen Spark, all right, in a kind of an AI agent startup. So these are agents that focus more on gathering, synthesizing, and analyzing information from various sources to generate comprehensive reports, summaries, or insights. All right. Then you have your foundational platforms in frameworks. This is more like Amazon Web Services, Bedrock agents.
Starting point is 00:14:06 So these are the underlying toolkits, cloud services, and open source libraries that developers use to build, deploy, and manage their own custom AI agents. Then you have kind of your UI and web automation agents. There's a lot of more of these recently than there were, you know, maybe six months ago. So these are agents like, you know, the UIPAP. agentic offering. So these are kind of like GUI. They work in a GUI way.
Starting point is 00:14:32 So they work on the graphical user interface, but they're agents that specialize in interacting with software through its graphical user interface enabling automation of applications that lack modern APIs. So technically, right, this is kind of where agentic browsers would fit in, but I do think they're a different category. They're not agents. They're agentic browsers.
Starting point is 00:14:52 Then you have your conversational, last but not least, you have your conversational companion agent. This is more for agents who kind of like personal agents. So it's almost like a blend between a standard AI chatbot, but can also go out and do things for you. Right. So this category is for agents whose primary interface is empathetic dialogue, but they still perform simple, goal-oriented actions like setting reminders or finding information to assist you personally. An example of that would be inflection AI's pipe. All right.
Starting point is 00:15:24 Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative AI studio. Powered by Adobe's Creative Agent, Firefly AI Assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the Assistant. The assistant orchestrates multi-step workflows, drawing on 60-plus pro-grade tools across Adobe Creative Cloud apps, including Photoshop, Illustrator, Premiere, Lightroom Express, and more to help bring your ideas to life. You can also get started with creative skills, a growing library of pre-built workflows for common creative tasks, like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible, so you can refine,
Starting point is 00:16:25 redirect or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI assistant now in public beta. See it today at firefly.adobie.com. What do all these AI agents have in common? Right. I broke down kind of seven different categories because they're all very different. A lot of people think an agent, all agents are just general use case agents.
Starting point is 00:16:56 And that's not the case. Right. And when you're choosing, hey, what should our company be doing or what should I personally be doing when it comes to AI agents? You have to think of, well, what's your goal? But one, some core functionality that most AI agents share. They break big goals into smaller steps. They call on and use different tools. And then they click through or use some type of user interface and then fix mistakes as they go along, right?
Starting point is 00:17:27 that's the other thing. AI agents are iterative. A lot of them will test and deploy things on their own. Go back, which I know it sounds weird, go back and fix the own errors that they created, right? Just like a human would or a team of humans. They pull live info from your data sources, from connectors, from search, from your company knowledge, sometimes from rag databases, right? But they also keep humans in the loop for risky steps and leave a clean paper trail. Right.
Starting point is 00:17:55 So almost every single AI agent, we're going to be done. talking about there is a level of traceability, right? And that's so important when we talk about AI agents, right? Because let's be honest. When we think of, oh, human in the loop, I think we think of, you know, kicking our feet up on the beach with a margarita and our AI agents are doing everything for us. That's not good, right? But you always have to keep observability and traceability in mind. That's number one, can you observe the agent as it works? But traceability is after the fact, can you go back and see step by step what the agent did? So as an example in chat chbtee agents mode, you can.
Starting point is 00:18:31 You can click the three little dots. You can either watch it in real time, observability or traceability. After the fact, you can always go, click on kind of the reasoning data. It'll extend it. And then you can literally see step by step, a screenshot and a description of what the agent did. That is a very simple example of traceability. So most AI agents do have those things. But it's not all fun in games.
Starting point is 00:18:55 The pitfalls are enormous, right? I think especially, and this is maybe another conversation for another day, I think AI in general, because so many people have become over-reliant on it, it's taken our normal human guards way down, which actually makes these hidden challenges and pitfalls way more consequential than they, in theory, should be. Sometimes when you're using an AI agent, what you start to do is, right, kind of like,
Starting point is 00:19:24 what I'm doing. You're just sipping the coffee and you're like, oh yeah, good job. Good job. This would take me five hours. You did in five minutes. I'm going to spend five seconds looking over it. Perfect. You can't do that. You have to be vigilant. All right, but some of the most common pitfalls is something just looks done, but it isn't actually done until you go through and go through the step level logs and, you know, past failed checks. Permissions and sometimes, in some cases, are too wide. AIA agents sometimes have too much power and humans aren't doing enough job to rein that in or control it on the front end. Sometimes there's no per agent identity, there's no accountability, but you need to be able to add approvals and change control. Also, costs, right? Cost
Starting point is 00:20:11 can creep up when agents keep looping forever, right? So a very simple example of this, if you've ever gone into chat GPT and you have the wrong mode selected, right? And you give it a very simple question And, you know, now there, you know, as of last night, there's even new, you know, thinking levers inside chat, GPT, right? There's the light standard heavy and, you know, light standard extended and heavy. Right. So in the same way, a normal large language model that can reason, you might choose the wrong one. And it's thinking for 12 minutes on like, you know, when was the last time the Chicago
Starting point is 00:20:42 Bulls won an NBA championship, right? The same thing, the same kind of mistakes in terms of these endless loops, uh, can happen and for AI agents, the difference is, depending on how you set them up, you might be paying for usage, right? So you might accidentally be like, oh my gosh, it's just this, this never-ending loop that you're not paying attention to. And then you go look at your, your API bill. And you're like, oh, my gosh, this, it would have been cheaper to have a human do this, right? You see that all the time. So you need to have clear SOPs and ownership and fast rollback as well. All right. Enough to chat, top 10 agents that you need to know. Here we go.
Starting point is 00:21:22 Open AIs chat GBT agent mode. This is more of a generalist virtual computer, right? Chat ChitpT agent mode has access to a virtual computer, a sandbox. It can run code like it would on a computer, a browser, etc. So what it does, it runs multi-step tasks end to end. It can research, draft, analyze, and build files inside of chat. How it works, it uses a secure cloud computer with a browser, code, and adds files, narrates each step and ask before high-impact actions.
Starting point is 00:21:51 You can also log in to different websites on a browser. You can open, you know, you can temporarily take control, open up different browsers, log in to different services if you like, and then hand it back over. So at any point, you can take over, which a lot of these AI agents, you can do something similar. The pros and the cons of Chad GPD agent, well, it's one of the easiest to use and it's extremely versatile. It's in one place, right? 700 million weekly active users to chat, GPT have access. Well, not all of them because you have to be on a paid plan.
Starting point is 00:22:18 But, you know, tens of millions of people. right now have access to an agent that is the learning curve is essentially nothing. Is it the best? Absolutely not. If I'm being honest, I'd say out of these 10, it's probably only better than maybe two of them. Right. But you have to pay attention to Chad GPT's agent mode because it's going to get way better in the future. Obviously, they have their agents SDK.
Starting point is 00:22:46 So companies are building on top of it. And you should be using it anyways. I think it's one of the easiest ways, and that's why I put it first on the list. It's one of the easiest ways to understand how an agent works. You have to go back, look at the observability, traceability that I talked about. You're going to learn so much just how an agent calls its tools, how you should be talking to an AI agent versus an AI chat bot. So much can be learned when you go back on the observability, traceability side and, you know, track it.
Starting point is 00:23:15 So some of the unique features, it chains multiple tools in one persistent session, So context and outputs stay together. So yeah, you can keep the context going inside of one agent chat, just like you would a normal conversation inside chat chad chivity. This is best for busy professionals who want one agent to go from question to finish deliverable. It's not great right now. Let me just say that first and foremost. It's not great, but it is the easiest to use.
Starting point is 00:23:44 It is the most widely available. And I think out of most of these, it probably has one of the higher ceilings. Next, probably one of the most used here would be Microsoft co-pilot studio agents. And these are more, the category would be kind of a governed enterprise orchestration. So what does, and this is different than Microsoft co-pilot. People get confused. They're like, oh, co-pilot studio.
Starting point is 00:24:08 I use that. And I'm like, okay, show me how. I'm like, no, that's co-pilot. You're just chatting with co-pilot. Or you're using co-pilot in one of the 62 places that it exists within, you know, Windows 365 copilot. Microsoft copilot studio is a dedicated autonomous AI agent builder that you can build something with no code. So it lets non-technical people as well build agents in natural language for Microsoft 365. So it taps into the Microsoft Graft, intra-identity and DLP. Admins can add
Starting point is 00:24:39 approval in guardrails if they want, they should. And agents live where office users already work. That is the big key. Pros and cons. It has enterprise grade governance and deep app hookups. And it's tied to Microsoft's ecosystem. And it needs setup time. That's the thing. Everything, like one of the biggest downsides to Microsoft 365 co-pilot in general is it's extremely difficult to handle permissions, access, things like that.
Starting point is 00:25:09 The learning curve is actually, I think, way steeper than it needs to be. I hope Microsoft improves on that in the future. What's unique about this? well, each agent gets an Azure Entra identity with full audit. So IT orgs or IT departments can treat agents like real users. They can go track anything an AI agent does across their entire organization as if it was an actual employee because different people within your organization can use the same agent, right? Sometimes people are collaborating with the same agent at the same time.
Starting point is 00:25:38 You know, one agent might be used by just one person. It might be used by an entire team. It might be used by thousands of employees across the organization. So that's a good, unique feature for the co-pilot studio agents. And it's best for organizations, obviously, who are already deeply ingrained inside the Microsoft or co-pilot ecosystem. Next, Claude code. So this is kind of safety first coding execution agent. So what it does, it plans at its runs and test code changes, turning clear tickets into reviewable patches.
Starting point is 00:26:10 So if you're in software engineering, anything on the dev side, you probably use Claudecode. It has been one of the more popular agentic coders and one of the first now. I think there's maybe some that might end up being more useful in the long run. But Claude Code is one of the most popular and most widely used coding agents in the world right now. And it's really, really good. As much, yeah, I know people like realize and I get messages all the time. People are like, oh, Jordan, you always, you know, poo on Claude. Claude's great for coding, software engineering.
Starting point is 00:26:40 I say it all the time. That's not the majority of our audience, right? the majority of our audience is non-technical. So if you're using clod. A.I in the front-end chatbot, not very useful, not very helpful. If you're software engineer, developer, Claude, absolutely amazing, right? And Claude code also has sub-agents where it can, you know, break these tasks up. It can work for, I think they said, up to 90 minutes on its own.
Starting point is 00:27:02 So how it works, it modifies files and runs tests and loops, logging each step for easy review. Some of the pros and cons, it's safer, test guided changes and explainability. But on the downside, it needs a lot of testing. QA, a lot of times you'll ask for one small change, right? If you're working across an entire repo, you know, an entire app, you ask for one change and it might change absolutely everything. In at least over the last couple of weeks,
Starting point is 00:27:30 Claude has had some problems. The service has not been as good. Everyone's kind of jumping ship over to OpenAI's Codex, which did not even make the list. But regardless, Claude Code is one of the best. So some of unique things, it's a purpose-built coding loop for small and safe edits and clean diffs and it's separate from the chat interface. Obviously, Claudecode is not going into clod.AI and coding in there.
Starting point is 00:27:55 It is a separate product, you know, like I said, with the agent, sub-agent architecture. And this is best for teams automating refactors, upgrades, and fixes where automatic build and test pipelines are already in place. All right. Next AI agent, you need to know, AWS bedrock agents and agent court. This is more on the governed enterprise integration side. So what is this? Well, it's a cloud runtime.
Starting point is 00:28:17 So this is an agent that runs in the cloud for building and scaling custom enterprise AI agents. How it works, agent court adds session isolation, memory, identity, identity, and observability. It's kind of like Lego blocks for agents. So obviously, if you're a large enterprise using AWS, AWS bedrock agents, right, where do you start? You always start where your data lives. So for, I don't know how big of a market share, AWS has in the cloud, right?
Starting point is 00:28:46 They're one of the three big. You could say Oracle now is in the big four of cloud AI. But you know, you're always like, where do I start? Well, so many, I think so many organizations are on AWS, at least maybe on the smaller side and aren't fully investing in bedrock agents, which you probably should. So pros and cons, it's highly modular, highly modular and framework agnostic. Another good thing about AWS is they have so many. platforms. Yes, they have a strong partnership, obviously, with Anthropic and their investment there.
Starting point is 00:29:15 So using the Claude models, but you can use just about any different models, proprietary, open source, closed source, et cetera. So what's unique? Well, it's plug and play to run many different agent patterns securely on AWS. And this is best for builders who want company-wide standard agents on top of their current existing AWS infrastructure. Next, another company that sometimes I poo-poo on. Salesful. But if you are a dedicated Salesforce organization, if you're using Salesforce for your CRM, which so many people in the world are, I'll say this. Asian force, in some cases, if you don't care about costs, it's better than not using it, right? It's better than probably manually having to go through all of your Salesforce information.
Starting point is 00:30:03 So what is agent force? Well, it automates sales and service steps inside of the Salesforce CRM. teams already live in. So it uses the CRM context to plan actions. It has a command center that provides approvals, oversights, and outcome tracking. It can send emails to, you know, prospects that have gone cold based on your data, based on your conversational history. So some of the pros and cons, well, it's trusted, data grounded actions and it shines most if you're already deep in Salesforce. So like I said, if Salesforce is not, you know, obviously for sales teams that are very dedicated and use Salesforce all day, Salesforce, Agent Force is a no-brainer.
Starting point is 00:30:41 If you're a company that, you know, let's say you use Microsoft 365 co-pilot, you're just in Salesforce every so often. Agent Force might not be the one that gives you, might not be the agent that gives you the best ROI. So who is this best for? Well, revenue and support teams that want measurable impact inside their system of record. Next, Google Project Mariner. This on the list is probably the least used AI,
Starting point is 00:31:06 agent. All right, but Google is with Open AI. They're running ahead of everyone else. So even though their Project Mariner, not that great, right, it's a Chrome extension. You have to be on a higher tiered paid plan to use it. You still have to pay attention because eventually Google will release agent space to the match masses. This is their kind of premier AI agent platform. However, it's been hardly released to anyone, right? So I was at Google Cloud Next when it first started rolling out back in May. Here we are five or six months later. And still very few organizations have access to agent space. I know we have a lot of people, you know, listening at Google. So Google, please start rolling this out to SMBs and, you know, enterprise companies in general. I think
Starting point is 00:31:56 it's a fantastic product, but hardly no one has used it. So I can't really talk about agent space because I haven't even personally used it. But I personally use Project Mariners. So I can talk about that. So this is more of a lives in the browser, right? But you like, if you have access to Project Mariner or if you can get access and you are a leader on your team when it comes to AI, this is one of those things. You got to get the reps in, right? Because Google and Open AI are going to continue to run away with everything. So you need to get used to their agent tools with where it's available now, even though most people, right, Google's Project Mariner might get rolled up with Project Astra and agent space. Who knows if it's going to be its own
Starting point is 00:32:35 dedicated offering in the future, but you should get use of the technology and how it works. Like I said, one of the easiest ways to learn AI agents right now is to go use chat GPT agent mode and go observe its steps. You're going to get better at prompting AI agents. It's completely different than prompting a large language model. The same thing can be said for using Project Mariner. So here's what it does. It clicks through real websites for you at scale without manual scripts, right?
Starting point is 00:32:59 So it's essentially a browser using agent. It's not a computer using agent. where chatyp t's agent mode has use of essentially a virtual computer including like things that you can do outside of the web so google's project mariner is more of just web-based so it runs isolated cloud browser sessions plan steps fills forms and can work in parallel for speed so pros and cons it has great obviously real web execution but the packaging and scope are still evolving so what's unique here. Well, it can run up to 10 concurrent browser sessions to finish web tasks faster, which you don't have that in like chat, GPT agent mode. You're kind of restricted to one run at a time.
Starting point is 00:33:42 So you have this kind of parallel advantage here with Google's Project Mariner. And this is best for teams that need repeatable web workflows done in handoffs reliably. One thing I like about Project Mariner is it has this teach and repeat thing. So if you have a process that you manually do over and over. I should have used this for a project that I was, uh, I was working on yesterday. I was trying to do it inside, uh, both perplexities comment and chat GPT agent mode. They weren't doing the best. I probably should have done it in Project Meritor because it has this teach and repeat pattern, right? If you have, you know, five different websites that you all use concurrently, you're grabbing something or some information from one, carrying it over to two,
Starting point is 00:34:22 making changes, bringing it over to three. That's a great use case for Google's project Mariner. But come on, Google, give us agent space. Next, and this is a more recent one, and one, at least on the list that I've probably been the most impressed by recently, and that's Replit Agent 3. And this is an autonomous software creating agent. So here's what it does. It turns plain English into working apps, talking full scaffolding, coding, running tests, fixing everything. Front end, back end. It is a absolute workhorse. So you'll notice that some platforms didn't make the list. I didn't put cursor on here. I didn't put lovable. I didn't put both all great all great tools. But I think Replit agent
Starting point is 00:35:08 three outshines them. All right. So it also uses a reflection loop to execute tasks, tests in a browser, and auto repair issues inside of replet's IDE. So some of the pros and cons, super fast idea to app, but complex code still benefits. from human review and controls. This is one of those. Can a non-technical person use Replit Agent 3? Sure. But your outcomes are going to be much better if you know what the heck is going on, right?
Starting point is 00:35:37 It's kind of like watching a movie in a different language. You don't speak without subtitles, right? You're like, okay, I think I know what's going on because I can see everything. And I think I'm following the storyline of turning plain English into a working app with a database and a backend authentication and payment processing, right? what Replit Asian 3 can do is extremely impressive, but it probably makes more sense if you speak the language, right? So it's almost like some of its most,
Starting point is 00:36:07 I would say, appealing use cases are for non-technical people because technical people have plenty of AI IDs they can go work with. Like I said, cursor, lovable, bolt, you know, Claude Code. I think Replit is one of those that's probably going to separate itself over the next year and become wildly even more popular than it already is. So what's unique about this? Y'all three hour runtime. An AI agent can go run for three hours.
Starting point is 00:36:37 And I've seen some videos on this, right? Unedited before and after of what it can do in two to three hours. Wildly impress it. So yes, some of the other platforms, the lovable, the bolts, right? It can accomplish the same things. But with a lot of back and forth iterative prompting, right? So if you get it right on the front end and if you know what you're talking about, again, think of the movie analogy.
Starting point is 00:36:58 If you can direct the movie, Replit Agent 3 can go do it. Three hours. So this is best for founders, educators, and developers who want prototypes live and fast. Next, Zapier agents. I've been very impressed. Zapier agents are really good. So what it does, this is no code agents that automate work across the more than 7,000 enterprise apps that Zapier connects with.
Starting point is 00:37:21 So I'm not even going to get into like the difference between, well, I'm, I will shortly, actually, right? I think a lot of people when they think AI agents, they're thinking AI powered workflows. And yes, Zapier has that as well, but their agents offering is completely different. So yes, you can still have your, you know, kind of your visual builder, building different, you know, marketing automation workflows and enter in AI. Because a lot of people think of, you know, like, oh, N8N, that's an agent. I'm like, no, that's an AI powered workflow. So Zapier does have AI powered workflows, but they have their actual agent as well. So completely different product.
Starting point is 00:37:58 And it's extremely impressive. So all you do is you describe the outcome you want. The agent selects the app actions. It runs them and that it can hand certain things off to other agents. So some of the pros and cons, well, there's massive app coverage more than $7,000. So you don't even have to worry. There's support for MCP, but you don't even have to worry about creating your own custom servers because Zapier connects to the internet.
Starting point is 00:38:20 It is the glue that holds the internet together. So that is a huge advantage of Zapier agents. So some of the downsides, the cost and complexity can grow, right, especially with more open-ended flows. What's unique? Well, there's agent-to-agent calling, live knowledge sources with drive, box, drop-box, dashboards, and ready to use templates. And this is best for ops teams, marketing sales, and support teams,
Starting point is 00:38:42 wanting practical cross-app automation without code, right? So I think Zapier agents have a place. It's not like you choose one AI agent. I think Zapier is ultimately going to continue to rise in relevance outside of those in the marketing world. Right. I've been using Zapier for more than 10 years. I love it. It's traditionally been for marketers.
Starting point is 00:39:09 I think non-marketers are eventually going to catch on to Zapier because of Zapier agents. Miguel said, would you say Zapier? is better than any and yes absolutely n a n is free it's open source zap here's better right don't be wrong n n's great a lot of people right stop stop seeing right a lot of people see like what's on social media and that you know it's all these n8n workflows you know comment follow me repost this and do this 30 things and i'll send you this agent that prints money it prints a million dollars right it's no it's just it's an it's an i power workflow you know part of the reason i think at a gets a bad name because people are abusing it and selling it as snake oil.
Starting point is 00:39:51 So, but no, Zapier's better. All right. Next, we have Jen Spark, Super Agent. And this is an orchestrated research and creation. So here's what it does. It produces research pages, summary, slides, and media fast. The research and creation process in Jen Spark, extremely impressive. And both Jen Spark and Manus, to their credit, keep churning out updates a lot faster than as an
Starting point is 00:40:15 example. co-pilot studio or chat GPT agent mode. They're really shipping fast. So here's how GenSpark's super agent works. It routes subtasks across multiple models and tools. I think the last time I checked, it uses nine different large language models. And then it stitches together a clean result. So essentially, you know, you have a routing agent.
Starting point is 00:40:35 And then there's specialized sub-agents. And they all kind of work together. They all go off and do their own thing, report back to the main agent and then give it back to the user. So the pros and cons, well, is broad and quick. You can verify key outputs before publishing in a professional setting. So what's unique about GenSpark? Well, it has these thing called Spark pages.
Starting point is 00:40:54 Think of them is essentially like almost a cross between like a website and a PowerPoint, right? But they are shareable pages that you can go in and update. And this is where all of this information can live. So instead of just having, you know, a bunch of text or some random, you know, oh, it spits out a PDF. it's like a way for your content that you create with the Gen Spark super agent to live. And this is best for analysts and marketers who need synthesized deliverables without heavy lit. All right. Manus AI.
Starting point is 00:41:26 And this is more of a hands-off autonomous executor. So what does Manus do? So GenSpark and Manus are both out of China, which is why maybe you've heard of them and maybe you haven't, right? But they are obviously, I think, starting to creep their way into the U.S. ecosystem. them as well. So what does Manus AI do? Well, it handles long multi-step projects and end then shows you every step it took. So how it works is it works in a cloud desktop and browser environment similar to chat GBT's agent mode, but it plans action and it logs everything and keeps
Starting point is 00:41:59 going even if you disconnect. So I will say it is much more robust right now than chat Chb-T's agent mode. So, but they work very much in a same way in terms of the user interface and user experience. So some of the pros and cons, well, there's deep autonomy with transparent trace, but also there's mind privacy and access policies. So what's unique? Well, persistent cloud sessions work continue even when you go do something else, right? So that's nice. Chad GPT's agent mode is hit or miss with that. So essentially, you can log in or keep these instances kind of live and running across multiple days, multiple weeks inside of Manus AI. So this is best for I'd say smaller teams, creative teams or consultants and researchers who want an agent that just
Starting point is 00:42:46 kind of does the work. Right. So I'd say Manus is a great kind of starter agent for a lot of people. All right. That's our 10. We're going to keep this thing going. A couple things to wrap up here. So what's the difference?
Starting point is 00:43:01 Well, I gave you the different categories, but let me start breaking these now because each platform offers different runtime primitives, including identity, memory, isolation, observability, and parallelism capabilities. I told you. Some of these agents don't run in parallel. Some of them do. Some of them have persistent cloud instances. Some of them don't. So, you know, every single thing like a car, right? Some have Ford war. Some have two. Some are hybrid. Some are electric. Some are four-seaters. Some are vans, right? So it depends on what you need and what you want. But the platforms vary in domain grounding depth. You know, some are CRM native. Some are IDE native. Some are browser native, etc.
Starting point is 00:43:35 And the delivery approaches range from no code studios like Microsoft copilot studio to framework run times to embedded application agents, right? Some are great at certain tasks right now. And I do think, you know, as we go on, there's like marketing specific agents, there's sales specific agents, there's HR specific agents. There's legal AI agents, right? Harvey, right? There's so many different agents for different categories as well. But then there's also different genres, even with. within those categories, which is why it is sometimes hard to keep up.
Starting point is 00:44:09 But cost structures also significantly differ. Some of them are subscription models and have pretty generous usage. Some of them you're just paying for the actual API costs in addition to your usage. So you also have to keep cost in mind. Also, the integration philosophies span from ecosystem lock-in, more like AWS bedrocks, to universal connectivity, right? Like Zapier. So it's also like what?
Starting point is 00:44:35 What systems do you ultimately need these agents to pull data from and talk to? That's the other thing. You have to think of agents as like a mind map, right? It has to be able to pull in dynamic data from multiple places. And it has to be able to execute tasks in multiple ways. All right. So what's your agent use case? Here's what I don't want you to do.
Starting point is 00:45:02 I don't want you to look back at these 10 AI agents that I told you say, oh, this one's my favorite. This one's sound. it's good and go get started. Think of it the exact opposite. You need to think about your automatable problems. I don't know. Is that a word automatable?
Starting point is 00:45:19 That's what you need to do. Think about your automatable problems. Because again, if you have a hammer, everything looks like a nail and you're going to get screwed. Think like so much of getting the most out of large language models, out of AI powered workflows, out of AI powered workflows, out of AI agents is really doing an honest audit of your day-to-day responsibilities or your team's day-to-day responsibilities and how you actually fulfill those.
Starting point is 00:45:49 I know it sounds boring. I tell people, use time trackers. What are you doing? What is your team doing? What are you actually doing? What is the work you're actually performing hands-on keyboard, hands-on mouse inside of large language models today? And then you need to find the automatable problem first, not the shiny tool.
Starting point is 00:46:07 Here's what I would do. Start small, pick one workflow people hate, and then define a what done looks like. A lot of people don't define what counts as agentic success first. And they're just like, oh, this is good enough. You need to lock it down. Right. So you need the least privilege access approvals, logs, and fast rollback. You need to make sure that you're using it in a safe way.
Starting point is 00:46:30 If you're giving an AI agent access to your company's dynamic data, you need to make sure it's auditable, traceable, guardrails, etc. You also need to be able to measure weekly, not just on a per task completion, but you need to be able to measure completion rates across the board, fixes needed, time and cost, say, need to understand how much does it take humans to do this? Guess what? Humans make errors. Humans go through revisions. I think people think of large language models and AI agents as something they're not.
Starting point is 00:47:02 Nothing will ever be perfect, just like there's never been a perfect, you know, human worker. No worker has gone through their entire career and never made a mistake in that spreadsheet, never made a mistake in that presentation. You know, they've no, no one's ever killed absolutely, you know, 10,000 out of 10,000 sales meetings. Human workers make mistakes. So do AI agents, right?
Starting point is 00:47:23 So you have to be able to calculate the cost from start to finish. And then once you have and you see the positive R.O.I. Right? Maybe you think, oh, Microsoft 365 Copilot Studio, this is where we're going to go. Maybe you're going to get great ROI out of it. Maybe that's for six months from now. Maybe you're going to get something better with using, I don't know,
Starting point is 00:47:46 chatypte agent mode or Manus or, you know, Zapier agents. So you need to pick one that solves your automatable problem, be able to measure it, and then what can scale with you. Five days. This is what I want you to do. Ready? Write this down. Get out your pencils.
Starting point is 00:48:04 Take a screenshot. Whatever. Day one, you have to define the win, right, the automatable problem, and then pick a lane. One lane outcome. So whether that's Agent Force, co-pilot studio, et cetera, don't test one thing across five different platforms. It's not going to work. Okay. Day two, you need a short list and smoke test.
Starting point is 00:48:28 So try two platforms. All right. So not five, two. All right. eventually get in one lane. We need to test. Try two platforms on one real world example. You need to note the setup, the accuracy, the workflow.
Starting point is 00:48:43 Go back, look at the observability, traceability. Day three, build the MVP. So connect only the needed data and apps. Okay. And then confirm usable by owner. Make sure has the right access to the data. Make sure the inputs are correct. Make sure it can output or create whatever deliverable you actually need.
Starting point is 00:49:03 All right, day four, run five cases. Time the runs, note the fixes and the limits, and then compare it to the baseline of what your humans do. That's how you calculate the ROI. Then day five, you decide and expand. You confirm the platform. You write a one page, how it works, and then you start sharing with the rest of the team,
Starting point is 00:49:22 the rest of the organization, and then you can add adjacent use cases across different teams or across your organization based on what already works. Very small scopes. There it is, your five-day plan. we covered a lot. I went over 10 AI agents.
Starting point is 00:49:41 I talked about some of the problems and opportunities as this space continues to blow up. All right. But I will tell you this. Even though this is a longer episode and I apologize for that, but hopefully you all found some value. There's more. All right. So if you found value in this episode as we wrap up here, please share this with your community. Go click the repost button on the LinkedIn post.
Starting point is 00:50:04 So if you're listening on the podcast, always put the link to the LinkedIn post, go repost this, and I will share, I put together a crazy useful guide on 20 different, there are some other AI agents that I really like that didn't make the top 10, right? So make sure you go repost this. I will share that with you. Also, we've had a lot of great agent content and some good interviews over the last year or so. So go check out episode 590, which is agents, LLMs, or algorithms,
Starting point is 00:50:34 a playbook for choosing AI. Go listen to episode 452, which is AI agents, the future of enterprise work, and then go listen to episode 422, licensing AI agents. I like this episode. What is it and do we need it?
Starting point is 00:50:49 All right, so learn from other people, not just me, some great experts that I've had on the show. All right, I hope this was helpful. If so, please go to your everyday AI.com. Sign up for the free daily newsletter.
Starting point is 00:51:01 Thank you for tuning in. Hope see you back tomorrow. And every day for more. everyday AI. Thanks y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman One Creative AI Studio. Just describe what you want to create in your own words and the assistant handles the rest, orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stand control with the ability to step in and refine at any
Starting point is 00:51:36 time. See it today at firefly.adobie.com. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.