The AI Daily Brief: Artificial Intelligence News and Analysis - What's Next in Phase Two of Generative AI?

Starting point is 00:00:01 Today on the AI breakdown, we're talking about what's next in phase two of generative AI. The AI breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.network for more information about our Discord, our YouTube, and our newsletter. Welcome back to the AI breakdown. Today, we are doing another trend-type story and exploration that I think aligns with a lot of the things that we've been talking about recently, and that in many ways is trying to sum up some of where we actually are. Now, if you listened to yesterday's show, you heard me talk about Professor Ethan Mollick's essay, The Shape of the Shadow of the Thing. The way that he started was that he saw us reaching,

Starting point is 00:00:40 quote, the culmination of the first phase of the AI era that started with the launch of Chad GBT. We talked a lot about what that meant, about my belief that we had been sort of converging around unleashing and exploring the full range of capacities of GPT4, and that a lot of the big questions were what comes next. Now, the other big thing, though, that I've been talking about a lot over the past couple weeks based on the news of autumn AI product announcements represented not the race to bigger, better foundation models, but instead the integration of AI into our workflows, into our spaces. So just to quickly go through some of the examples of that, there was, of course,

Starting point is 00:01:16 Meta's announcement, which they framed literally as introducing new AI experiences across our family of apps and devices. This included AI stickers in messaging. It included an advanced conversational assistant that would live inside WhatsApp, Messenger, and Instagram, and of course, it included their 28 characters that were played by a variety of cultural icons that were supposed to bring fun AI personalities to the experience of different users. Then, of course, there was YouTube. At its made-on- YouTube event now about three weeks ago, the company once again released a set of AI-powered tools that would live inside its software

Starting point is 00:01:48 and would make it trivial for video creators to use AI, both to be more productive, but also to do new things. Dreams Create is a feature which can include AI-generated images and video as the background of YouTube shorts. The company, is integrating an AI-powered music recommendation engine. Really powerfully, there will be a new AI dubbing feature that will give creators the ability to put their content into other languages, which if it works well, could totally break down how we think about linguistic barriers.

Starting point is 00:02:12 And there's even a new assistant style feature, which will help people generate ideas and even outlines for new videos based on what's working with their audience, as well as the other types of videos outside of their content that their audience is watching. Again, the consistent theme here is this AI integration era. These are really powerful features, but what makes them extra useful is that they are so fully enmeshed in this creator suite that people are already using. LinkedIn has also announced a set of new AI integrations.

Starting point is 00:02:39 There's a personal assistant for learning, new AI-powered recommendations around recruitment, including suggesting candidates that might not necessarily fit with what a recruiter thinks that they're looking for, as well as AI integrated writing tools for marketing and sales roles. Just this past week, Google Assistant got a borrowed upgrade, bringing the power of generative AI into that popular mobile assistant experience, Canva had an absolute slew of new AI tools integrated into their software. Again, in a video earlier this week, I showed off a number of what they call their magic editing tools, their magic media tools. One really exciting thing is their partnership with runway, which brings text to video generation directly into this suite. And again, just to really beat this dead horse. Integration era, integration era, integration era.

Starting point is 00:03:20 Canva has 150 million users. And now at the very center of their creative process is this suite of AI tools, which while now kind of presented as shiny new things, things almost certainly are just going to start to blend into the background as the way that you interface with Canva. Finally, there is ChatGBTGPT vision. The new upgrade to ChatGPT makes it multimodal in a meaningful way. Images have become inputs, which opens up a variety of new use cases, and users can also speak to ChatGPT and have it respond via voice. This isn't an upgrade to the underlying GBT4 model, but the modalities that it opens up are so significantly different that it feels like an entirely different experience. And that

Starting point is 00:03:58 brings us back to Ethan's piece, this idea of reaching the culmination of a first phase of the post-Chatschapit AI era, and it gets me to this recent post by Investor Sequoia called Generative AI's Act 2. This is just a couple weeks old, and the framing of the piece is this. One year ago, we published a hypothesis that generative AI would become a profound platform shift in technology. Then came the firestorm. So in some ways, this is a reflection on that first post-chatchapit phase that Professor Malik was also talking about. Sequoia writes, ChatGBTGPT's rise was a spark that lit the fuse, unleashing a density and fervor of innovation that we have not seen in years, perhaps since the early days of the internet. And yet they said, quickly, AI excitement turned to borderline hysteria. Suddenly, every company was an AI co-pilot.

Starting point is 00:04:41 Our inboxes got filled up with undifferentiated pitches for AI Salesforce, AI Adobe, and AI Instagram. The $100 million pre-product seed round returned. We found ourselves in an unsustainable, feeding frenzy of fundraising, talent wars, and GPU procurement. And sure enough, the crack started to show. Artisan writers and singers challenged the legitimacy of machine-generated IP. Debates over ethics, regulation, and looming superintelligence consumed Washington. And perhaps most worryingly, a whisper began to spread within Silicon Valley that generative AI was not actually useful.

Starting point is 00:05:10 Now, Sequoia goes on to compare the crooning from critics that followed to the very early days of the internet, referencing Paul Krugman's famous declaration in 1998, that by 2005, quote, it will become clear that the Internet's impact on the economy has been no greater than the fax machines. So then how does Sequoia? see where we are now. Well, they say, Genitive AI's first year out of the gate, Act 1, came from the technology out. We discovered a new Hammer Foundation models

Starting point is 00:05:34 and unleashed a wave of novelty apps that were lightweight demonstrations of cool new technology. They go on? We now believe that the market is entering Act 2, which will be from the customer back. Act 2 will solve human problems end to end. These applications are different in nature than the first apps out of the gate. They tend to use foundation

Starting point is 00:05:50 models as a piece of a more comprehensive solution rather than the entire solution. They introduce new editing interfaces, making the workflow stickier and the outputs better. They are often multimodal. Now I'll pause here to quickly go through again the announcements that we have recently seen. YouTube Dreamscreen, AI suggested content ideas, and AI power dubbing, Canvas Magic Studio with editing and media creation, LinkedIn's AI tools for writing and marketing, meta's in-app conversational assistant that lives inside WhatsApp Messenger and Instagram. Pretty good evidence of this thesis of customer back, right? Now, this piece is really

Starting point is 00:06:20 comprehensive from Sequoia and is well worth a read on your own, but I'm going to skip over some parts, including the generative AI market map, the generative AI infrastructure stack, and get to some of their reflections on what they got wrong and what they got right. Under what they got wrong, they list that things happened much faster than they thought. Last year they write, we anticipated it would be nearly a decade before we had intern-level code generation, human quality videos or human-quality speech that didn't sound mechanical. Obviously, we have all of these things, and by the way, although maybe a little bit out of scope of this video, this I think is one of the most compelling arguments from the AI safety folks

Starting point is 00:06:51 that the fact that no one seems to have a sense of how fast these things are going to move and that we have to keep revising our goalposts because things keep moving faster than anyone thinks is possible is a compelling reason to try to take a moment, to take a breath even, and ask if they're going too fast. But back to Sequoia's reflections, they were surprised that access to GPU was a bigger problem than end user demand. They note that it reintroduced consumers paying as an actual business model, which could be a net positive in the long run. There's a couple more, but the other one that I wanted to mention is number five. the moats are in the customers, not the data. They wrote, We predicted that the best generative AI models, companies could generate a sustainable

Starting point is 00:07:26 competitive advantage through a data flywheel. More usage, more data, better model, more usage. While this is still somewhat true, especially in domains of very specialized and hard to get data, the data moats are on shaky ground. The data that application companies generate does not create an insurmountable moat. And the next generation of foundation models may very well obliterate any data moats that startups generate. Rather, workflows and user networks seem to be creating more durable sources of competitive advantage. This again is something we've talked a lot about on this show. That if the choice is Canva's integrated version of these tools versus a totally new design suite that requires you to go change all of your workflows, even if it's 10 or

Starting point is 00:07:59 15% better in terms of outputs, might not be sufficient to get users to change their behavior. That's even more true on the enterprise level, where we see a lot of companies wanting to use providers that they already trust with their data rather than trusting a whole new set of startups. But what about where things are now? One thing that Sequoia notes is that although these tools have grown faster than basically any other type of tools in internet history, a lot of it seems to be driven by novelty. They have a chart which you might have seen floating around Twitter or X, around the retention of social media giants versus the new AI apps. Retention of companies in the old school, like YouTube, Instagram, TikTok, Snapchat, WhatsApp, etc. has a 63% 30-day median. In other

Starting point is 00:08:35 words, one month after they first started using those tools, 63% of them stick around. That goes all the way up to 85% for YouTube, by the way. Now for AI-first companies, that average retention is only at 42%. ChatGPT is the highest of that cohort at 56%. But at 56% that's lower than Roblox, WhatsApp, Snapchat, TikTok, Instagram, and YouTube. Now, I think it's fair to ask if that has to do something with not just the novelty of these new tools, but also the fact that they are tools. They're largely for productivity versus these social experiences like YouTube and Instagram, which are much more about content consumption.

Starting point is 00:09:05 But it's still an interesting thing to see written out this way. Sequoia also notes that user engagement is lower. Consumer companies tend to have 60 to 65% daily-active and monthly active users, whereas the median for AI first companies is only 14%. I could probably spend an entire show speculating about why this is and how much of a problem I think it is, which TLDR I'm less convinced than Sequoia seems to be, but I still think they're really interesting statistics. And finally, where the piece leaves off is with an estimation of how entrepreneurs and developers

Starting point is 00:09:31 are converging around a shared sense of how to actually build these new applications for maximum utility. They reference, for example, a number of what they call emerging product blueprints. They discuss new generative interfaces and point out that new form factors. are entering the mainstream. They talk about an entirely new set of editing experiences that a variety of different AI companies are getting people used to, and one that will be of interest to many people on this channel. They discuss increasingly sophisticated agentic systems. Generative AI applications they write are increasingly not just auto-complete or first drafts

Starting point is 00:10:00 for human review. They now have the autonomy to problem-solve, access external tools, and solve problems and end on our behalf. We are steadily progressing from level zero to level five autonomy. Based solely on download figures and viewer numbers, I think there's a a strong argument, the thing that people are most interested in whatever this Act 2 is, is the move from just tools to actual AI agents. Anyway, trying to sum up a bit, what's interesting to me about this is this growing sense among lots of different parties that some first phase of this new world that shifted the moment that ChatGPT was released is coming to its culmination. And a next thing that comes after that is in the early stages of being born. It is inherently a liminal

Starting point is 00:10:39 period. And part of the chaos right now is that so much does feel up in the air. I think that that's reflected positively in entrepreneurial energy and negatively in public anxiety. But in any case, generative AI at its first birthday is screaming forward. And what comes next, whatever it is, seems likely to be even more interwoven into our lives than what we've seen over the past 12 months. Anyways, guys, interesting food for thought for your weekend. I hope wherever you are, you are having a great one. Until next time, peace.

The AI Daily Brief: Artificial Intelligence News and Analysis - What's Next in Phase Two of Generative AI?

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.