The AI Daily Brief: Artificial Intelligence News and Analysis - Who Won the Week: Google or OpenAI?

Episode Date: May 15, 2024

Explore the recent AI announcements from OpenAI and Google, comparing their new products and innovations. OpenAI’s GPT-4o and Google’s Project Astra aim to redefine AI interactions, but who came o...ut on top? This analysis dives into the details of each announcement, evaluating their impact and potential. ** Check out the hit podcast from HBS Managing the Future of Work https://www.hbs.edu/managing-the-future-of-work/podcast/Pages/default.aspx Join Superintelligent at https://besuper.ai/ -- Practical, useful, hands on AI education through tutorials and step-by-step how-tos. Use code podcast for 50% off your first month! ** ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://aidailybrief.beehiiv.com/ Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@AIDailyBrief Join the community: bit.ly/aibreakdown

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Daily Brief, everything announced at Google I.O. And a look at the battle between Google and OpenAI. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. To join the conversation, hop on the Discord with a link in our show notes. Hello, friends. One more day of non-traditional formats, we're just going to be talking about Google I.O. And the comparison to OpenAI, as that's where a ton of conversation is right now. The other big, obvious story that we will cover is Ilya formally leaving, Open AI, so keep your ears peeled for that in the next day or so. But for now, let's talk about Google I.O.
Starting point is 00:00:44 Welcome back to the AI Daily Brief. Yesterday was the main keynote day of Google I.O. This is Google's big annual conference where they announce all the stuff that they're working on, and at the surprise of no one, AI, was even more center stage than it was last year. Now, the timing of this event was no coincidence. Or more specifically, the timing of OpenAI's event just before this was no coincidence. It was pretty clear then and is definitely clear now that OpenAI was trying to front-run Google and capture excitement and attention before I.O. happened. What we're going to do today is talk about the difference between these two announcements, who the AI Twitter sphere thinks won, but we have to start with just everything that Google
Starting point is 00:01:18 actually announced. This is in no particular order, other than broadly speaking, the biggest, most discussed things being up top. First of all, we got a new version of Gemini. This is called Gemini 1.5 Flash. The team at Google said that this multimodal model was as powerful as Gemini 1.5 Pro, but that it had been optimized for, quote, narrow high-frequency low-latency tasks, making it better at generating fast responses. They also announced that the context window was moving from the already industry leading 1 million up to 2 million in the coming months. Second and perhaps most relevant for the demos that we saw a couple days ago from OpenAI, we got Project Astra. This is their version of a personal assistant. On screen is the demo video they
Starting point is 00:01:57 shared both live and on socials. Which makes sound. What is that part of the speaker called? That is the tweeter. It produces high frequency sounds. The Verge writes, Project Astra is powering many of the most impressive demos from I.O. this year, and the company's aim for it is to be an honest-to-goodness AI agent that can't just talk to you, but also actually does things on your behalf. We'll come back to that in a moment,
Starting point is 00:02:21 given that it is such a strong and clear through line across both the OpenAI presentation and this Google presentation, that these companies are betting on a totally new mode of interaction between people and AI that they're literally betting their whole enterprise on. We also got Google's answer to SORA. A couple months ago there had been a question of whether OpenAI was starting to lose the lead that it had held for so long, but then they dropped a set of demos for SORA, and honestly everyone was just slack-jod.
Starting point is 00:02:46 It was so far ahead of what was available currently from other text to video generators that it honestly shifted many people's opinions on whether specialized models can ever beat a big generalist model. Then again, uncommonly for OpenAI, SORA hasn't been available for people to test. Part of that seems to be economics, with generations from SORA just cost. an absolute boatload to create right now. Instead, OpenAI has been pursuing relationships with Hollywood and professional filmmakers, but with VO, Google has produced an answer. Like OpenAI's SORA, VEO is not available to the public, and instead, Google has been working with a set of high-profile
Starting point is 00:03:17 YouTube creators as well as Hollywood filmmakers to try to start integrating the technology. I.O. also saw the announcement of a competitor to OpenAI's custom GPs, something that Google is calling gems. They are basically pitching it in exactly the same way that those custom GPs were created. Their announcement tweet reads, whether you need a yoga bestie or a calculus tutor, in the coming months, you'll be able to customize Gemini saving time when you have specific ways you interact with Gemini again and again. Gemini, like GPD4, got a conversational upgrade for voice, adding personality, the ability to interrupt it, and lower latency, to make conversations hopefully feel more natural. And then from there, there were a ton of announcements of where
Starting point is 00:03:54 Gemini was being integrated far more deeply into Google products than it had been before. For example, Gemini is getting much more deeply integrated into workspace. The Verge again writes, Google is rolling Gemini 1.5 Pro into the sidebar for docksheets, slides, drive, and Gmail. When it rolls out to paid subscribers next month, it will turn into more of a general purpose assistant within workspace that can fetch info from any and all of the content from your drive no matter where you are. It will also be able to do things for you like write emails
Starting point is 00:04:18 and incorporate info from a document you're currently looking at or remind you later to respond to an email you're perusing. Interestingly, this integration is starting to get so deep that it sort of appears to me that Google is making a bet on a future where our default interaction with computers is talking to our AI assistant that then goes and does the stuff that we do now for us. So instead of writing an email, you talk to your assistant, which writes the email. Instead of analyzing a document, you ask your AI assistant to analyze a document, etc., etc. This one is coming pretty soon, so we'll see just how ubiquitous it actually becomes
Starting point is 00:04:48 and whether people really start to shift those behaviors. Another big important integration is around search. Thank goodness we're losing the language of search. generative experiences, which is an incredible mouthful, and instead they're being replaced with what they're calling AI overviews. Basically, every search now is going to have a perplexity-style summarization first, rather than just the classic blue links. It is not hyperbolic to call this the most significant shift to Google search since the product was launched 20-plus years ago. Relatedly, Google Chrome is getting an AI assistant, which will basically place text generation
Starting point is 00:05:20 natively in the browser. As a listener of this show, I have a strong feeling you like to stay up to date on all things artificial intelligence, including its impact on the workforce, which is why I highly recommend checking out managing the future of work, the chart-topping business podcast from Harvard Business School. HBS professors Bill Kerr and Joe Fuller talked to business leaders, technologists, and policymakers grappling with the forces like AI, globalization, and demographic shifts that are reshaping the nature of work. Recent guests include IBM's CHRO, Nicol-Lamuro on how Big Blue is adopting AI, Morningstar
Starting point is 00:05:52 CEO, Kunal Kapoor, on how AI can raise the investment IQ. Microsoft Corporate Vice President Jared Spatero on how the tech giant is experimenting its way from AI assistance to autonomous agents and many other prominent movers in business and the workforce ecosystem. So don't miss out. Follow managing the future of work on Apple Podcasts, Spotify, or wherever you're listening now. Hello, friends, quick note before we get back to the show, I'm so excited to share that Super Intelligent is now live. Superintelligent is a platform for fast, fun, and super practical, useful AI learning. We have something like 300 video tutorials adding 30 to 50 each week, covering every topic in AI you can imagine from LLMs to image generators to case studies, use cases, basically everything that tells you
Starting point is 00:06:37 how to use AI and what to use it on in short, fast four to seven minute tutorial videos, which are paired with step-by-step instructions that help you actually use these tools as well. It's $20 a month for unlimited access, and I would love to see you there. Check it out at B-super.aI. that's B-Super.AI. So let's talk about impressions. First of all, I think Jan Peleck nailed what a lot of people were feeling when he wrote, Google is out for blood.
Starting point is 00:07:05 What a bombardment. In many ways, that's what it felt like to me, too. As opposed to OpenAI's tight 30-minute announcement, Google's I-O keynote went on for a couple of hours. In fact, in its hugeness, it sort of gave the impression that Google wasn't trying to compete one-to-one between Project Astra, their AI assistant and OpenAI's GPT4O assistant, but instead to show just how many different vectors of AI Google is competing on.
Starting point is 00:07:28 Still, there was a lot of focus on comparing Project Astra and GPT-40. Professor Ethan Malik writes, This video does a great job highlighting the practical side of ubiquitous multimodal AI agents. It's fascinating to see the neck-and-neck technology race between Google and OpenAI around the same set of problems and approaches. Robert Lukosko writes, guys, that's really insane. I rewatch this IO agent's demo three times already. Tell me Google will not win agent space.
Starting point is 00:07:51 It can schedule UPS pickup from a same. single photo. However, the biggest critique was that we're still comparing what was a slightly janky live demo in the case of OpenAI from extremely high production demo videos prepared for Google. Amir from Dewist writes, there's a big difference between OpenAI and Google. Open AI ships useful stuff. For example, the GPT4O model is incredibly fast and 50% cheaper, and the voice improvement is incredible. It's widely available. Google announces stuff that looks great in I.O. presentations, but isn't really usable or even released in many cases. It's mostly vaporware. It's fantastic to see a company with real potential to disrupt Google. DC investor writes,
Starting point is 00:08:26 Google simply lacks credibility for any kind of AI demo at this point, like Project Astra or similar. Promo Reel's even worse. Meanwhile, OpenAI does a semi-scripted event for GBT4-0, but enough flaws come through to show it's real. Google needs a full culture reset at this point. What he's referring to in part is that people got extremely hyped back last December when Google announced Gemini Ultra, only to realize that the videos that had been most impressive were really pretty doctored to show exactly what they wanted to show versus where the actual capacity was. Some people also pointed out that even these overly prepared demos weren't as impressive as what OpenAI showed. Bindu Reddy, who was a little bit harsh on OpenAI, frankly, wrote,
Starting point is 00:09:03 after watching Google I.O., it's safe to say what OpenAI showed yesterday was mind-blowing. Astra is a prototype voice assistant and seemed like a two-year-old baby compared to OAI's Scarlett Johansson. Danny Rodriguez said, well, that's quite a turnaround from yesterday. To which Bin Du replied, it definitely is, lull. Google I.O. gave me a comparison and comparatively, the OpenAI version was very, very good. Unfortunately, the people who were at I.O. and got their hands on Project Astra also weren't all that complimentary. Santiago on Twitter said, I tried Project Astra, and the demo wasn't great. One of the things that was totally unignorable was the difference in approach.
Starting point is 00:09:37 Google's I.O. presentation was, like I said, extremely high production value, and it also had celebrity integrations woven throughout. OpenAI's presentation was intimate and quaint by comparison. However, I'm not so sure that does exactly what Google wanted. it. I asked my Twitter followers, did Google's high production values and celebrity integrations make you more or less stoked on their AI announcements? Notably, 42.6% of people said it didn't impact their opinion, but of those who did have it affect their opinion, about 90% were less stoked rather than more stoked. Broadly speaking, other polls I saw confirmed a pretty similar sensibility about the two events. Dentalist asked, did Google Cook open AI? 75.9% of respondents
Starting point is 00:10:14 said nah. Stanford's Andrew Gow wrote, Who wins AI today? Among nearly 1600 voters, 60.4% said OpenAI compared to just 16.1% who said Google. Jenny AI, who does research at Microsoft, wrote so who won the week. And of her 400 respondents, 87.5% said OpenAI. It is worth noting, however, that there may be an audience bias issue here. Open AI is very much competing for the state of the art. They are, as they have been, leading the pack from a technology perspective. A significant portion of the people who are discussing this on AI Twitter are themselves technologists, developers, and entrepreneurs. They are the type of people, in other words, who value state-of-the-art more than anything else.
Starting point is 00:10:54 That's different, I think, than the audience that Google is going for and the strategy that Google is taking. Google is taking an approach of putting AI everywhere with their incredibly large and diverse install base. OpenAI does not have users that they are bringing to their party. Google has everyone using Gmail, everyone using Maps, everyone using search, everyone using docs, everyone using slides, etc., etc., etc. While they are clearly not interested in seeding state of the art in any way, the strategy that they're actually deploying doesn't technically rely on being state at the art. It requires being good enough for these products to actually be useful for the people who are
Starting point is 00:11:29 already their users. It's frankly a much lower burden. And one of the reasons that even for an open AI, it's so incredibly hard to compete in this space. So to the extent that you are trying to find signal in these results, I don't think it's about who is necessarily leading AI in the real. world, but it does seem to be clear that people think that what OpenAI is producing is still a little bit at least ahead of what Google's labs are producing. For some, though, they're not ready to sign off on any of
Starting point is 00:11:53 these yet. Robert Scoble writes, I'm in a judgey mood, so here's some things I'm going to do with my AI personal assistant, since that is what Open AI and Google are trying to bring to us. Scobel then puts together a list of 30 questions by which he will judge an AI assistant. For example, he writes, does it do the basics? Like when you ask Siri to take you back to your car, it doesn't work. But if you basket to take you back to your parked car, it does. If you ask it a thousand questions, how many does it answer accurately? In my perfect world, it would have no mistakes, etc, etc, etc., and I think, hold aside Scobal specifics, what he's getting at is that ultimately, AI assistance will only be a thing if they're good enough for them to be a thing.
Starting point is 00:12:28 The general mass market of consumers isn't going to use something because it's AI and cool. They're going to use something strictly on the basis of whether it's useful. We are still at the very early stages of these products coming to market, and it's not at all clear to me yet that either Project Astra or the new GPT-40 is actually going to clear that threshold. Still, after all of this, the competitive landscape is getting a little bit clearer. DC investor again pointed out OpenAI's stakes and why the news about them potentially getting a deal with Apple is even more interesting today than it was a couple of days ago. He writes,
Starting point is 00:12:58 The big thing I see with Google's AI announcement yesterday is they are driving for much deeper integration with your data and your surroundings. OpenAI better hope Apple can do the same for them. I could see an argument that OpenAI needs Apple as much as Apple needs Open AI. At the same time, there are clearly some forces that have been unleashed by AI that all of these companies now feel subject to. Daryl Bostenjo writes, the real problem OpenAI has created for Google is that it has damaged the virtuous cycle of websites making free content for Google to index so Google sends them traffic in return. Now Google's AI results will provide answers instead of sending sites traffic and the web withers. P. P. P. P. Pashel put it more simply. It's official. Generative search is now
Starting point is 00:13:35 the norm. Before the Open AI event, we had so many rumors that they were going to announce, a search product, which they of course didn't, but I would be very surprised if something like it wasn't still in the works. Still, maybe the tweet of the day went to Andre Carpathy formerly of OpenAI, who, in reflecting on the new OpenAI voice assistant, which reminded so many people of Samantha from her, wrote the killer app of LLMs is Scarlett Johansson. You all thought it was math or something. AI is more competitive than ever, and next week we're getting a Microsoft event as well. So stay tuned for all of this, as there is going to be a lot to discuss. If you made it this far and want to dig deeper into how to actually use these tools, please go check out Super Intelligent.
Starting point is 00:14:13 It's our platform for fast, fun, and useful AI learning. You can find it at Besuper.a. For now, though, that's going to do it for the AI Daily Brief. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.