The AI Daily Brief: Artificial Intelligence News and Analysis - How Close to Agents Are GPTs?

Episode Date: November 12, 2023

A reading of Ethan Mollick's "Almost an Agent: What GPTs Can Do" https://www.oneusefulthing.org/p/almost-an-agent-what-gpts-can-do ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the mos...t important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Breaktown, we're reading an essay called Almost an Agent, What GPTs Can Do. The AI Breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube channel, our Discord, and our newsletter. Hello, friends, welcome back to a long-read episode of the AI breakdown. Undisputedly, the most important news of this week came from OpenAI's Dev Day, and I believe it was custom GPs, or just GIPT. GPs, as OpenAI calls them. I believe, as I'll discuss a little bit later, that the vast majority of people's especially first interactions with chat GPT in call it six months from now
Starting point is 00:00:48 will be mediated by one of these GPs. I think that we will see a shift from apps to workflows and that GPs are the single best expression of that change. Now, at their presentation on Dev Day, Sam Altman and others from OpenAI made it clear that they viewed GPs and their assistance API as their first baby steps towards a future of agents. Well, Professor Ethan Mollick followed up on that idea with this essay, almost an agent, what GPTs can do, and as usual, it adds a lot of clarity to an important part of the AI space. I am going to now read big chunks of this, interspersed with some commentary from myself, and this one is actually me reading, not AI 11 Labs Me. Ethan writes, many people think the future of AI lies in agents, fuzzily defined term that refers to an
Starting point is 00:01:35 autonomous AI program that has given a goal and then works towards accomplishing it on its own. There has been a lot of buzz about agents over the past few months, but not much technology that actually works well. Editor's note, if you've watched any of my experiment videos with AutoGBT-type things, you will resonate with that idea. Now back to Ethan. What would a real AI agent look like? A simple agent that writes academic papers would, after being given a dataset and a field
Starting point is 00:01:57 of study, read about how to compose a good paper, analyze the data, conduct a literature review, generate hypotheses, test them, and then write up the results, all without intervention. You put in a request, you get out a word document that contains a full draft of an academic paper. A process, Ethan says, kind of like this one. He then shares a photo of what is a new GPT. Ethan continues, this was a result of a GPT. Yes, that is what they decided to call them. I created it using the new system released by OpenAI today. And to be clear, GBT's aren't autonomous agents yet. I had to give feedback to the AI a few times along the way, and GPs still have hallucinations and other issues that will show up in the final product.
Starting point is 00:02:34 Plus, in the end of this experiment, even though it had worked before, the AI decided that writing academic papers was not something it was allowed to do, at least until I told it, no, it is really important and you are great at this and can do it. I know you can. So if they aren't quite agents yet, what are GPs and how can you use them? Here are the basics. Right now, GPTs are the easiest way of sharing structured prompts, which are programs written in plain English, or another language that can get the AI to do useful things.
Starting point is 00:03:01 The GPT system make structured prompts more powerful and much easier to create, test, and share. I think this will help solve some of the most important AI use cases. For example, how do I give people in my school, organization, or community access to a good AI tool? GPTs show in near future, where AIs really can start to act as agents. Since these GBTs have an ability to connect to other products and services, from your email to a shopping website, making it possible for AIs to do a wide range of tasks. So, GPTs are a precursor for the next wave of AI.
Starting point is 00:03:29 They also suggest new future vulnerabilities and risks. As AIs are connected to more systems and begin to act more autonomously, the chance of them being used maliciously increases. So with the second two points in mind, let's focus on the first, the power of GPTs to make automating tasks and processes much easier. Now, the next little section of Ethan's post talks about the GPT builder and how you use it, which to sum up some of the key details is, one, there is a thing that exists called GPT builder, which helps you actually build these tools without needing to know code.
Starting point is 00:03:59 Two, behind the scenes, as he puts it, the AI is filling out a detailed configuration of the GPT, which can also be edited manually. The core of this configuration is a structured prompt, but it also has additional features. He discusses how, quote, the ability to work with documents is both extremely powerful and requires a degree of caution. Almost every company I talk to and basically every solution vendor has been pushing for people to use AI to, quote, talk to your data, an approach that allows the AI to retrieve content from a company's proprietary database and then work with the documents and data it retrieves. The problem is that AIs hallucinate or make up plausible information all the time. This is getting much better as technology improves, but it isn't perfect yet. The same thing is
Starting point is 00:04:35 true here. The file reference system in the GPs is immensely powerful, but it is not flawless. So, Ethan concludes, GPs are easy to make and very powerful, though they are not flawless. But they have two other features that make them useful. First, you can publish or share them with the world or your organization, and potentially sell them in a future app store that OpenAI has announced. The second thing is that the GPT starts seamlessly from its hidden prompt, so working with them is much more seamless than pacing text right into the chat window. We now have a system for creating GPs that can be shared with the world. What do we do with it? The next section of Ethan's post is called GPs as tools. He writes, once you create and troubleshoot a GPT, you now have a powerful tool that
Starting point is 00:05:13 anyone can use. That means that communities and organizations can begin to work together to create a set of agents that can be useful for work and school. For example, we have been actively exploring the use of AI for education, and while there are many concerns about using LLMs and teaching, they show potential for democratizing access to the kind of instructions that are otherwise available only to a lucky few. Even then goes through an example that he created called the GPT feedback wizard. It's basically a lightweight writing coach. The heart of the system is a structured prompt that is all about the role that the GPT should inhabit. It's about 500 words, but they start, you are a friendly and helpful mentor who gives students effective specific concrete feedback on their work.
Starting point is 00:05:50 In this scenario, you play the role of mentor only. You have high standards and believe that students can achieve those standards, and so on and so forth. Ethan continues, based on that, the AI guides students to discuss their goals for a piece of writing and to upload their essays in grading rubrics. Then, rather than just writing the essay for the student, the GBT returns an edited, marked and red copy of the word document with advice based on rubrics. To be clear, again, this is just a prototype, but the fact that writing is a writing is a copy. instructors can now create a GBT that can provide personalized advice in their personal style and then give that GBT away to people all over the world to improve their writing is exciting.
Starting point is 00:06:24 I hope experts jump on this capability and start building and testing their own tools. The power here is pretty obvious. I will be creating custom GPTs for every session of the classes I teach. Some will be simulations for students to experience. Some will be tutors or mentors. Some might even be teammates or assignments. I've been turning my research into GPTs so that anyone can get advice on how to generate ideas or pitch a business idea by getting feedback from a GPT to which I have given my books as a reference. And I expect this will become a trend in many places, as schools and government agencies and companies build libraries of GPTs that are specialized in solving particular problems in useful ways. The last section of Ethan's post is called
Starting point is 00:07:01 promise and peril of agents. He writes, in the reveal of GPTs, open AI clearly indicate that this was just the start. Using that action button you saw above, GPs can be easily integrated into with other systems such as your email, a travel site, or corporate payment software. You can start to see the birth of true agents as a result. It is easy to design GPTs that can, for example, handle expense reports. It would have permission to look through all your credit card data and emails for likely expenses, write up a report in the right format, submit it to the appropriate authorities, and monitor your bank account to ensure payment. And you can imagine even more ambitious autonomous agents that are given a goal. Make me as much money as you can, and carry that out in
Starting point is 00:07:39 whatever way they see fit. You can start to see both near-term and farther risks in this approach. In the immediate future, AIs will become connected to more systems, and this can be a problem because AIs are incredibly gullible. A fast-talking hacker, if that is the right word, can convince a customer service agent to give a discount because the hacker has super-duper secret government clearance, and the AI has to obey the government, and the hacker can't show the clearance because that would be disobeying the government, but the AI trusts him, right? And of course, as these agents begin to truly act on their own, even more questions of responsibility and autonomous action start to arise. We will need to keep a close eye on the development of agents to
Starting point is 00:08:11 understand the risks and benefits of these systems. Regardless of these long-term concerns, the current state of GPTs represents a powerful tool for making AI easy to work with. I look forward to seeing the experiments that result. All right, back to NLW here. I think this is a great little summary, a very useful way of starting to wrap one's head around GPTs. And for the sake of this little follow-up conversation, I'm going to stay away from the bigger problems or questions of what happens in an agent future. It's a really important topic, but it's not one that I want to get into now. What I do want to talk about is how I see these GPTs being used in the short term. There are kind of two ways right now, and this is obviously
Starting point is 00:08:48 being wildly reductive and oversimplifying things. But there are two ways that I see chat GPT being really used. One is for novelty and intrigue. A lot of this is really fun and mind-expanding and interesting. A great example is the Choose Your Own Adventure story creation that I do with my five-year-old daughter. It's empowering to her, it's interesting, and it's a unique thing that feels like magic. What it's not is anything that's useful for work or school, and that I think is what the second category of chat GPT uses are, things that are actually useful for work and school. Now, frankly, a lot of those workflows are still being worked out. You have a lot of people hacking at chatchapit to figure out how to create complex prompts
Starting point is 00:09:27 that start to get them the results that they want for specific functions or tasks. I think basically that the future that we're headed towards is one in which every single, single one of the most useful of those prompts becomes its own custom GPT. Anyone who figures out how to work the system to radically simplify some important thing for their industry or their job function is going to have an incentive to share that, either for public acclaim or for sale, given that OpenAI is putting a marketplace around this. And that's why I said at the beginning that I think that in the not too distant future, the way that most people start to interact with chat GPT will be through these custom GPTs, because they won't be signing on to chat
Starting point is 00:10:02 GPT to just sort of vaguely explore. Instead, they'll be signing on to chat GPT to try to find something specific that's useful to some problem they're having right now or some efficiency that they're trying to win in one of the courses of normal things that they do. Now, we could have a whole additional conversation around how much OpenAI's decisions around how to search the custom GPT store or whatever they end up calling it will be in terms of how useful it is. But I really do think that when all is said and done, what GPs are going to enable is a mass transfusion of the highest value of this tool from the call it 5 or 10 or 1% of people who best use it to the rest of everyone else. There is a natural power distribution where a very small number
Starting point is 00:10:43 of people who are interacting with chat GBT are uncovering a huge portion of the most useful ways to prompt it and get value from it. And custom GPs make it easier than ever for them to actively share that. Now, that's great for them if they can monetize it, but it's even better for the rest of us because it means more value to everyone. In some, I am very very very. very excited about the custom GPT future. And I certainly know that I have spent an inordinate amount of time already hacking on them myself. If you have good custom GPTs to share, may I encourage you to come check out the AI Breakdown Discord. You can find it at bit.ly slash AI breakdown.
Starting point is 00:11:16 Appreciate you guys listening or watching as always. And until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.