The AI Daily Brief: Artificial Intelligence News and Analysis - AI Agents Coming to OpenAI Dev Event?

Starting point is 00:00:00 Today on the AI breakdown, we're looking at reports that OpenAI might be bringing AI agents to their November developer event. Before that on the brief, a set of seemingly AI-related layoffs and a million-dollar contest to use AI to read ancient scrolls thought lost a time in history. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our YouTube channel, our Discord, and our newsletter. Welcome back to the AI breakdown brief. All the AI headline news you need in around five minutes. We start today with something that is not useful, not going to change the way you work or learn, but is just a mind-blowing example of things that we never would have thought possible,

Starting point is 00:00:45 revealing themselves to be with ingenuity, creativity, and artificial intelligence possible. So basically, the Vesuvius challenge is a million-dollar competition to try to use, use AI to read papyrus scrolls that were burned in the eruption of Mount Vesuvius and buried in Pompeii. When Vesuvius erupted, in addition to the human toll, there was a villa with a vast library of papyrus scrolls that was also buried and lost the time. Now, these scrolls were carbonized by the heat of the volcanic debris, which was kind of a mixed bag. On the one hand, they are technically preserved, but on the other hand, almost as soon as they are exposed to air, they immediately decay and disappear. The villa was discovered in 1730.

Starting point is 00:01:26 and scholars immediately realized just how significant the find was. Indeed, some people estimate that if we were able to open and read all these scrolls, it would more than double the amount of literature we had from that time. Of course, upon those first attempts, most of the scrolls that were attempted to be opened were destroyed. A few were unrolled by an Italian monk over a period of literally decades, but soon the project was abandoned with more than 600 unopened and, to all accounts, unreadable. However, zoomed for over 265 years,

Starting point is 00:01:55 And technology has advanced such that we now have the tools to virtually unwrap scrolls that would otherwise have been lost to antiquity. A version of this was used on a scroll found at the Dead Sea region of Israel that was found to contain text from the Bible without ever having to unroll it. And since then, people have been working on machine learning models that can detect ink that would otherwise be invisible from x-rays. This involves using a particle accelerator to scan these scrolls in order to pick up these extremely subtle surface patterns that theoretically AI models could interpret and decipher.

Starting point is 00:02:25 Now, once this was shown as possible, a group of funders from Silicon Valley, including former GitHub CEO Nat Friedman, put together this challenge to get people working on actually reading these scrolls. Just last week, the competition announced that the first world in the scroll had been discovered. This was actually a multi-part process. In August, a contestant named Casey Hanmer, found and wrote about what he called a crackle pattern that looked like ink. And from there, another contestant named Luke Ferriter, who was a 21-year-old college student

Starting point is 00:02:52 and SpaceX summer intern, started training an AI model on that crackle pattern. This led ultimately to Luke's model being able to identify a first word, meaning purple. Another contestant use of Nader, an Egyptian biorobotics grad student in Berlin, used a totally different model to identify another set of letters in a similar area. I'll include a link for you guys to learn more about this, but since we're already going so long in what is supposed to be the brief, I'll move on. But I think it's just a really cool example of how AI, when combined with other technologies, in this case particle accelerators can do things that literally seemed impossible until we actually figured out how to do them. Speaking of the ramifications of artificial intelligence, unfortunately a couple

Starting point is 00:03:30 sets of layoff announcements today. Stack Overflow has laid off 100 people, which means that over the course of the last year, it's laid off about 28% of its staff. Now, Stack Overflow has been one of the best examples of a company whose model was challenged by AI. Stack Overflow is a personal coding help forum where developers can post their questions and get responses or go search through other people's old problems to try to see if anyone else has encountered similar issues, but now the developers are relying more on AI-assisted coding, the use case has dropped pretty precipitously. LinkedIn has also announced layoffs of nearly 700 employees, that represents about 3% of the company's staff and is part of a larger organizational restructuring effort.

Starting point is 00:04:10 From what information has been shared, this will impact roles across all sorts of different departments from engineering to talent to product. Now, while LinkedIn has not confirmed that these layoffs have to do with AI specifically. That has been the working assumption of press who have reported on it, with a LinkedIn spokesperson saying to Gizmodo, this restructuring is to support LinkedIn's future broadly. Over in the world of new features, one of the big themes of this fall has been seeing AI actually integrated into the products that people are already using, and we got a couple of announcements like that over the last day or so. The first comes from Capcut, which is a video editor that is used most primarily by power users of TikTok. Now, just to get a sense of

Starting point is 00:04:46 scale, this video editing platform was BytDance's second product to hit 100 million consumer revenue after, of course, TikTok, and now they are introducing a set of new AI tools. One is an AI powered script generator that helps advertisers come up with new ideas based on however they describe their product or their business, and their new CapCup for Business Suite also has AI generated presenters who can be used for demos or explainer videos, and even a virtual try-on featuring AI models. Now, some of these features may remind you of what YouTube has announced over the past few months, but YouTube has also announced another set of AI tools, specifically they announced a new advertising package that they're calling Spotlight moments. Basically,

Starting point is 00:05:24 the product uses AI to automatically flag the most popular YouTube videos related to specific cultural moments, such as Halloween, award shows like the Oscars, or sporting events, and then give the advertiser the ability to serve ads across videos referencing that topic or event. Now, as someone who has a fair bit of experience in the advertising field, this is an absolute no-brainer and feels like a guaranteed hit of a product. So much of what you do as an advertiser is trying to connect your brand to these cultural moments. And so having the ability to automatically target content that relates to those moments is a huge upgrade over just the traditional keyword style advertising. Between these announcements this week from Capcut and YouTube and the absolute slew of tools

Starting point is 00:06:04 announced by Adobe at Adobe Max last week, it is very clear that the theme for this fall continues to be the integration of AI into real business workflows. But my friends, we will end there for the day. Next up, the main AI breakdown. Welcome back to the AI breakdown. Today, we are discussing a set of information about OpenAI, some things that are actually known, some things that are just rumors. And I think the context for this is that obviously this fall, we have seen a lot of announcements from many of the big players in the space about their next big moves in AI. Now, of course, makes OpenAI different is that ever since the launch of ChatGBTGT, they have been in the pole position in the industry. GPD4 really reinforced that, and as of today, there just aren't

Starting point is 00:06:52 any other models that can compete with GPT4. Now, many anticipate that Google Gemini might be the first to actually do so or even exceed GPT4, but as of yet, all the information we have about Gemini is mostly in the realm of rumor. Last month, OpenAI got people excited when they announced that on November 6 they would be holding their first ever developer conference. OpenAI CEO Sam Altman in his announcement post made sure to make it clear that there was not going to be any sort of GPT4.5 or GPT5 announcement, but that they would still have, quote, great stuff to show developers. The announcement that they posted on their website didn't give any more information either. They just said it would be a one-day event that would bring together hundreds of developers

Starting point is 00:07:28 from around the world and that they would be previewing new tools. Now, towards the end of last week, Reuters posted some sources suggesting a few things about the update. The way that they framed it was that OpenAI plans major updates to lower developers with lower costs. So again, these were, quote, sources briefed on the plans. And the updates that they discussed with Reuters included one, the addition of memory storage to developer tools, which Reuters said could theoretically cut costs for application makers by up to 20x. Another development would be the addition of vision capabilities to the API. Now, obviously, GPT4 vision has been one of the biggest feature updates of the year, and really in many ways a fundamental change to the chat GPT

Starting point is 00:08:06 experience. If you're interested, I just recently did a video about the seven categories of use cases for GPT Vision, so go check that out. But it would only make sense for those capabilities to be brought to the developer tools as well. Another update was around Stateful and Vision API releases, as summed up by Linus Ekinsdam. The Stateful API will make applications more cost-effective by recalling conversation history. But the big one was this little line. The new features are designed to encourage companies to use OpenAI's technology to build AI powered chatbots and autonomous agents that can perform tasks without human intervention. This is definitely the part of the announcement that has gotten the most attention, specifically the idea of OpenAI Native Autonomous

Starting point is 00:08:44 Agents. Now, of course, agents have been one of the biggest focuses for this show, a theme that we've been watching ever since April when AutoGPT and Baby AGI started to jump into people's attention. As of right now, of course, there isn't any real breakout winner in the AI agent space. Indeed, for the average observer, the promise and potential still far away the actual applications in practice right now. Now, of course, that doesn't change the fact that there are an incredible number of developers exploring this exact space, and so the introduction of developer tools specifically focused around building autonomous agents in the open AI ecosystem would be a just massive development in that space. Now, the way that a lot of folks are framing this is in terms of

Starting point is 00:09:21 an evolution for open AI, from a company that is just chatchip-T to a real platform for developers. Reuter sums it up, the new features mark the company's ambition to expand beyond a consumer sensation into one also offering a hit developer platform as its chief executive Sam Altman has envisioned. Now, of course, just how new this is, I think is a reasonable question. Altman, for example, has clearly been focused on developers for some time, and indeed, one of the big questions for developers has been trying to understand where OpenAI is itself likely to compete with them. Basically, people don't want to build tools on OpenAI's APIs that OpenAI is going to turn around and offer a native version of. There is a lot of competition for OpenAI, especially

Starting point is 00:10:01 when you get beyond just a consumer chatbot and chat GPT itself. On the enterprise side, you have companies like Microsoft, Amazon, who are all leveraging their existing relationships and the very high burden of trust that comes with providing an AI platform with access to enterprise data to secure customers that might otherwise have flowed to a new startup like OpenAI. And so in many ways, on the one hand, open AI is dealing with that sort of competitive pressure from enterprise software companies, while they're also dealing with skepticism from developers around whether the OpenAI ecosystem is a good one to build in, given that it's not exactly clear what OpenAI will want to build themselves in the future, and given that there just isn't that much of a history

Starting point is 00:10:39 that developers can content themselves with in terms of where those competitive dynamics might play out. Now, adding to that pressure is, of course, the fact that meta has taken a totally open approach and is racing to catch up to the state of the art and give developers the ability to build things with AI tools that are close to as good as what they can get in the open open AI ecosystem with potentially fewer of those concerns. The point I'm trying to make is that in many ways this is more than a developer event that's coming up next month. It is sort of a referendum and reflection of, if not a shifting strategy for OpenAI, a clear rededication, at least, to appealing to that developer audience. It kind of makes sense then that it's happening

Starting point is 00:11:15 almost a year exactly after ChatGPT was released, as OpenAI figures out what its next big act is going to be. There are other indications that OpenAI is consolidating around its focuses heading into next year. The company recently updated its core values on its website. They used to be audacious, We make bold bets and aren't afraid to go against established norms. Thoughtful, we thoroughly consider the consequences of our work and welcome diversity of thought. Unpretentious. We're not deterred by the boring work and not motivated to prove we have the best ideas. Impact-driven. We're a company of builders who care deeply about real-world implications and applications. Collaborative, our biggest advances grow from work done across multiple teams and growth-oriented.

Starting point is 00:11:52 We believe in the power of feedback and encourage a mindset of continuous learning and growth. Now, at the end of last week, the company very publicly updated those core values. Logan, who does developer relations over at OpenAI, posted the new values, which are led with AGI focus. We are committed to building safe, beneficial AGI that will have a massive positive impact on humanity's future. Anything that doesn't help with that is out of scope. Next value is intense and scrappy. Building something exceptional requires hard work, often on unglamorous stuff and urgency. Everything that we choose to do is important. Be unpretentious and do what works, find the best ideas wherever they come from. Third is scale. We believe that scale in our models,

Starting point is 00:12:28 our systems, ourselves, our processes, and our ambitions is magic. When in doubt, scale it up. Make something people love. This one borrowed, of course, from Y Combinator. Our technology and product should have a transformatively positive effect on people's lives. Finally, team spirit. Our biggest advances in differentiation come from effective collaboration in and across teams. Although our teams have increasingly different identities and priorities, our overall purpose and goals have to remain perfectly aligned. Nothing is someone else's problem. Now, interestingly, I've seen some people suggest that this is a commercialization of OpenAI's core goals. I don't exactly see it like that. Of course, you hear echoes of that first set of goals and

Starting point is 00:13:02 these newer updated goals, but frankly, the first ones were much more wishy-washy and kind of just generic about how Open AI imagines its teams working together. The new core values, particularly with the anchor of AGI focus, is a much more tangible and clear articulation of what this company is trying to do. To the extent that these core values are about attracting a certain type of person, which I believe that is their whole purpose, anchoring around the actual objective. of the company, that AGI focus, the fact that anything that doesn't help with that is out of scope feels to me a lot clearer and a lot more transparent than what that first set of goals represented. Now, it's worth here as we round the corner actually checking in on what OpenAI's timelines for AGI really are.

Starting point is 00:13:43 Sam Altman was recently on the Joe Rogan experience, and when Rogan asked him what sort of timeline he imagined for AGI and how it had changed since the beginning of OpenAI, Altman said, AGI isn't the endpoint. To accomplish ASI artificial superintelligence, that'll take us until 2030 or 2031. That has felt to me all the way through kind of a reasonable estimate with huge error bars. I think we're on the trajectory I would have assumed. Now, as the AI Safety Memes account points out, however, it seems likely that OpenAI actually thinks that superintelligence could arrive even sooner than that,

Starting point is 00:14:14 given that their super alignment team that was announced earlier this year, or as this account calls it their AI not kill-Everyoneism team, set a four-year deadline for solving alignment, which would of course bring us to 2027. The AI Safety Memes account also points out that in the wake of GBT4 Vision, prediction markets have increased how fast they think AGI is coming. Metaculous markets now put a 25% chance at AGI in the next 1.5 years. For me, I am looking forward to this November 6th Developer Day

Starting point is 00:14:41 as a way to better understand, one, how Open AI views its relationship with developers and how it sees its most important competitive challenges in the year to come, and two, for anything it suggests about change timelines or updated senses of the with which things are developing. I'm sure as November 6 approaches, we will get even more leaks and hints and eventually schedules, and I will, of course, keep you posted on all of those as they come up. For now, I want to say a big thank you to you guys for hanging out and watching or listening, however you are consuming this content. If you are enjoying it, I would so appreciate it if you

Starting point is 00:15:12 would take a time to leave a rating or review. Silly as it may seem, it actually makes a huge difference for people discovering the show. And until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - AI Agents Coming to OpenAI Dev Event?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.