The AI Daily Brief: Artificial Intelligence News and Analysis - The Latest on AutoGPT and BabyAGI: Semi-Autonomous Specialized Agents (SASAs)

Starting point is 00:00:00 The AI breakdown you're about to hear originally came out as a YouTube video on Thursday, April 27th. In it, we look at the latest in Auto GPT and Baby AGI, which is semi-autonomous specialized agents, a more discreet and useful implementation. One month in, Auto-GPT and Baby AGI have been through a full hype cycle from peak excitement to, hey, are these things even really useful, to now people finally understanding where their utility lies, at least in the short term. And it appears it's semi-autonomous specialized agents. What's going on, guys?

Starting point is 00:00:39 Welcome back to another AI breakdown. Today we are talking about seemingly everyone's favorite topic, auto-GPTs. Now, auto-GPTs and their related technology, baby AGI, are about a month old. You actually have Yohei here who created baby AGI saying, happy one-month birthday. This is just from a couple days ago. Now, there has been a lot of ink spilled on what these are and why they're, so interesting. There's a million articles like this one from TechCrunch right here. What is AutoGPT? And why does it matter? So there are a few key things. One, unlike ChatGPT, although that's changing

Starting point is 00:01:14 right now, auto GPTs can search the internet. A second difference is that they have more memory, so they can engage in complex tasks. And then that's really the third piece is when you assign an Auto GPD a task, theoretically what it does is it figures out all the steps that are needed to complete that task and then can even spin up other AIs that can help them complete that task. At least that's the goal. Now, this is a topic that has captured tons of attention. I did this video 13 days ago, five ways auto GPs are already being used, and people were just so excited about these use cases.

Starting point is 00:01:48 It was things like the do-everything machine and the self-executing task list. And it really was a group of people who very quickly saw the implication of this technology and raced out to see how much they could do. However, over the last week or so, there has been a shift in narrative, let's call it. And I think Rachel Woods captures it here. She says the example is auto-GPT. Yes, impressive proof of concept. No, not actually useful yet.

Starting point is 00:02:15 That nuance is very difficult to communicate. And I've seen lots and lots of pieces like this. But of course, that doesn't mean that people aren't excited about it. Omar Para, an AI product lead at Meta, says auto-GPT hype is a lot. unreal, 85K plus stars on GitHub, AI autonomously completing tasks based on a overhyped, maybe, more wow than useful, yes, a peek into the future, 100%, a catalyst for innovation, no doubt. So this is roughly the state of the conversation coming into today. Well, when I signed onto Twitter this morning, I saw this thread from Nate Chan, who's been playing

Starting point is 00:02:51 around a lot with auto-GPTs and baby AGI. And I think he did a great job of pointing to who where the tinkerers who have been experimenting are starting to actually see where the short-term utility and usefulness of auto-GPTs might come from. Nate writes, today auto-GPTs are thought of as general autonomous agents, a tool that can do anything and even thought of as a precursor to AGI. When you build something that can do everything, it turns out it can't do anything well or at all. But I think we're starting to notice a shift in the auto-GPT space.

Starting point is 00:03:25 Specialized agents are being. being built, and we're now seeing more useful and narrowly focused autonomous agents hit the market. P.S. this is not a criticism towards the OG Auto-GPT creators and projects. Now, Nate from there gives a few different examples that he's seen of these specialized agents. One he suggests is autonomous agents for research. Nate writes, research can't rely on AI-generated content, so this agent browses the internet for what you're researching and only extracts relevant info from trustworthy sources. He points to a project called AOMI, which launched on April 17th and is an agent specifically designed for research.

Starting point is 00:04:01 Another specialized auto-GPT he points to is one for medical research that can call medical APIs and site sources. Another was a specialized agent for podcast preparation that could research recent events, prepare a podcast outline, layout topics and discussion points, or even write a cold open. Another is a specialized agent to control your web browser. And Nate says there's some nuance here, writing, Though some see this as a do-everything agent, its abilities are limited to certain web actions. I think we'll move to seeing this as a specialized agent simply to automate tasks in the browser. Order a pizza, book a flight, open tabs for 10 Craigslist posts of couches near me under $100, that sort of thing.

Starting point is 00:04:41 Nate concludes his thread, There's a ton of greenfield opportunity right now in building for specific types of tasks and industries by using a robust chain of GPT calls, optionally to fine-tune models, with narrowly focused, all to get one category of things done and done very well. Yohei, the creator of Baby IGI, actually responded to the thread, saying, I agree, I think something like this is what we'll see. First, specialized agents that work for one person, then specialized agents as a service, then slightly more generalized agents, then increasingly generalized agents. So this is a theory of how this might evolve from something more general like we saw people try initially,

Starting point is 00:05:20 but starting from the standpoint of these highly specialized agents which seem to be coming to the four now. Now in this thread, I asked Yohei basically what types of specialized tasks he was seeing there be a lot of value in, especially as compared to a more human-mediated chat GPT style experience. His answer was, for me, I have a couple of what I'd call semi-autonomous specialized agents that run in the background that would take me maybe 20 prompts to do in chat GPT. Having it automatically kick off from CRM activity saves me the time. time it would have taken to do that in chat GPT. So effectively, I'm kind of imagining a string of tasks which an auto-GPT can put all together without him having to mediate each one of those tasks

Starting point is 00:06:02 along the way. I followed up and said, how routinize and task-based are they versus creative and production-based? His answer was some of both. The easy stuff is pinging APIs and scraping websites and summarizing, but you can do some creative stuff. For example, if you have a podcast about AI and you see layoff news from big tech, you could ask, and then he tweeted a picture of him doing this. His prompt was, you're generating content for a blog about the future of AI, generate a discussion specifically referencing this news in a way that is relevant to the audience. News, Dropbox lays off 500 employees, 16% of staff. The discussion idea that came back from the auto-GPT started, the news that Dropbox has laid off 500 employees has sparked debates about the future of AI. On the one hand,

Starting point is 00:06:50 AI has the potential to automate many processes, making them more efficient. On the other, the automation of certain processes could lead to job losses in the future. This news from Dropbox serves as a reminder of the potential effects of AI on the job market. Nate Chan responded to all of this and said, that is awesome, semi-autonomous specialized agents. I think abbreviating to Sasa has legs. I've got my Sossas running. Give me five minutes. So as you can see, the cool thing here is that people are taking this technology, which again, was so hyped and so exciting when it first came a month ago, and then maybe had the patino wear off it just a little bit. But now people are figuring out how to hone in on where its actual utility lies, and they keep building these

Starting point is 00:07:30 really interesting tools. I mentioned AOMNI before, which is an agent specifically designed for research, and I took it for a test spin earlier today. I did a couple of tests. The first, I asked it to find the most relevant and discuss news stories in AI. It brought back a bunch of interesting things, certainly not about today, but you could see how this would be a really helpful, useful tool, and then I also asked it to write a crypto podcast outline. For that task, it lays out its subtasks, including search, find popular topics in the crypto world to include in the podcast. Browse extract information from the top search results. Browse extract information from the second search result. You see where this is going. Process, combine the information from popular

Starting point is 00:08:09 topics in recent news. Process, create a podcast outline based on the combined information. It's a very expansive, suggested podcast that it came back with, but of course I didn't define its strictly at all. I could definitely see integrating something like AOMNI into my research flows. Another tool that a lot of people have been excited about is this new implementation of AutoGPT for mobile. It's effectively a Telegram bot that allows you to use your own version of AutoGPT right inside Telegram. Anias Callio put it together on GitHub and of course I had to try it out. So here we are in Telegram and I ask, I'd like some help with some research. NLW agents says, sure, what topic are you researching? I say AI.

Starting point is 00:08:49 NLW agent says, great, artificial intelligence is a vast field. Could you please narrow it down? And I said, yep, I'm doing a podcast about AutoGPT. NLW agents started to write about the basic steps for implementing AutoGPT. Then I tried to define the task a little bit closer. I said I'd like to find three potential topics for a podcast on AI. It gave me AI on the future of work, AI in healthcare, AI and society. Now, you'll notice this is much more like a chat GPT style experience in Telegram versus an AutoGPT type experience.

Starting point is 00:09:18 And so eventually I asked, are you an auto GPT? An NLW agent said, no, I'm not an auto GPT. I'm an AI language model that uses machine learning and natural language processing algorithms to generate responses to user input. I'm more similar to chat GPT than auto GPT. Now, I just started to try this right before I filmed the video. I'm sure there's a lot more to do. And frankly, having chat GPT sitting in my telegram, that's my own little NLW agent, is still

Starting point is 00:09:41 very cool. So I think to sum up, what's cool to see is that people are starting to hone in on the use cases that make auto GPDs actually useful for them, that actually fit into their workflows. And this is kind of what you'd expect. Things start like a toy and with people understanding the broadest possible implications, and then they get narrowed to where they can actually drive and provide value, whatever those use cases might be. However, to the extent that they are competing in some ways with a more human-mediated experience of just using an LLM like chat GPT, they now even have some extra competition. As I was having these conversations earlier this

Starting point is 00:10:16 morning, I started to notice that people were talking about the fact that chat GPT default mode now includes browsing mode enabled for many plus users. It's still an alpha and hasn't been rolled out to everyone, but this is chat GPT allowing you to also access the internet. Now we're ignoring all of the AI safety implications of all of these AI tools being hooked right into the entire data of the internet. But if we do hold that aside, it's an extraordinarily exciting time to see how these agents can change our workflows, make us more efficient. and allow us to do more of whatever it is we want to do. I like this direction for AutoGPT of these semi-autonomous specialized agents,

Starting point is 00:10:54 and it's certainly where I'm going to be experimenting in the near future. All right, guys, that's it for another AI breakdown. Please subscribe to this channel. If you haven't yet, go check out the podcast. And until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - The Latest on AutoGPT and BabyAGI: Semi-Autonomous Specialized Agents (SASAs)

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.