The AI Daily Brief: Artificial Intelligence News and Analysis - The AI Assistant and Agent Battle Heats Up

Episode Date: January 13, 2024

Note: This episode was produced on Wednesday January 10th, but due to an ice storm never came out! Apologies for the delay. Today's Sponsors: Listen to the chart-topping podcast 'web3 with a16z cryp...to' wherever you get your podcasts or here: https://link.chtbl.com/xz5kFVEK?sid=AIBreakdown  ABOUT THE AI BREAKDOWN The AI Breakdown helps you understand the most important news and discussions in AI.  Subscribe to The AI Breakdown newsletter: https://theaibreakdown.beehiiv.com/subscribe Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown Join the community: bit.ly/aibreakdown Learn more: http://breakdown.network/

Transcript
Discussion (0)
Starting point is 00:00:00 Today on the AI Breakdown, we're looking at a big day in AI agents and assistance and how the competition is shaking out for the future. The AI Breakdown is a daily podcast and video about the most important news and discussions in AI. Go to Breakdown.net network for more information about our YouTube, our newsletter, and our Discord. Hello, friends, one quick note. Over here on the East Coast, we have been dealing with an ice storm for the last couple days, and my internet has been barely working, so much so that I've spent the last six hours or so offline, and was only able to get a main part of the episode done. So apologies that this episode does not have a brief, we'll be back.
Starting point is 00:00:41 Hopefully, fingers crossed to our normal format tomorrow. Welcome back to the AI breakdown. Today, we are talking about a theme that has promised to be a big one for 2024 and is coming out of the gate in an even more significant way than we thought. That is the convergence of AI assistance, in other words, incredibly powerful personal assistant applications that can actually go out and do tasks on one's behalf and AI hardware. Yesterday saw the launch of the Rabbit R1. The company had been hyping the launch for the past several weeks and unveiled yesterday what people immediately recognized as a very
Starting point is 00:01:16 job-se-in-style keynote presentation about 20 minutes long, explaining exactly what the Rabbit R-1 was all about. Now, there are actually two parts of Rabbit's presentation. The first was what lies underneath, which they call Rabbit OS, which is underpinned by something they're calling a large action model. They write, the large action model is a new type of foundation model that understands human intentions on computers. With LAM, Rabbit OS understands what you say and gets things done. And so basically a big part of their presentation was trying to draw a distinction between large language models and large action models, pointing out, for example, that while chat Chb-T is very good at finding information, it's not set up currently, really, to take actions on a user's behalf. The second thing
Starting point is 00:02:00 that Rabbit revealed, however, was the R1. A total. new type of device that does away with many of the conventions of personal computing and is another contender for what a next generation AI native mobile device is going to look like. So the R1 has a few different features. The main way that you interact with the R1 is by talking. There's a single push to talk button, which as soon as you press it, you're off to the races. There's a 360 degree rotational eye, which gives the device the ability to see and interact with the world visually. There's an analog scroll wheel and a USBC port and a SIM card slot. Now, a lot of the way that the R1 will work is that you'll use a web-based portal to connect various services that you use, think,
Starting point is 00:02:38 for example, Spotify, to the R1 so that this device itself becomes kind of like a voice-activated control center for all of the applications that you might otherwise use. Now, inherent in the presentation was a critique of the existing form factor, the way that applications are organized currently and have been for the last 15 years of smartphones. Here's how the CEO described it. Like iPhone and Android phones. These guys being here, for years and we've grown tired of them. The problem with these devices, however, is not the hardware phone factor, is what's inside.
Starting point is 00:03:13 The app-based operating system. Each time you want to do something, you fumble through multiple pages and folders to find the app you want to use, and there are always endless buttons that you need to click. The smartphone was supposed to be intuitive, but with hundreds of apps on your phone today, that don't work together, it no longer is. If you look at the top of ranking apps on App Stores today, you'll find that most of them focus on entertainment.
Starting point is 00:03:43 Our smartphones has become the best device to kill time instead of saving them. Recent achievements in large language models, however, or LLMs, a type of AI technology, have made it much easier for machines to understand you. The popularity of LLM's, LM chatbos over the past years has shown that the natural language-based experience is the path forward. Quickly a brief word from today's sponsor.
Starting point is 00:04:14 As a listener of this show, I suspect you like to stay up to date on all things AI and tech, which is why you have to check out the chart-topping podcast Web3 with A16Z crypto. Produced by venture firm Andresen Horowitz, Web3 with A16Z is the perfect companion podcast to the AI breakdown. Web3 with A16Z crypto is your definitive. resource for the future of the internet, whether you're interested in the convergence of AI and crypto or simply curious about what's next. If you need a place to start, they recently released an excellent episode with Stanford Cryptography Professor Dan Bonae and former Google X engineer
Starting point is 00:04:47 Aliyaa in conversation with host Sonal Choxi about the intersection of AI and crypto. From fighting deepfakes and proving humanity to large language models like chat GPT, they cover it all. I highly recommend checking it out, especially if you'd like to learn more about how AI and crypto will impact our everyday lives. Beyond Crypto and AI, the show is for creators seeking more ways to truly own their work, for business leaders trying to prepare for the future today, and for innovators exploring trending tech topics. Don't miss out. Follow Web3 with A16Z Crypto on Apple Podcasts, Spotify, or your favorite listening app. Now, one of the things that I was most impressed with when it came to their approach was that they sort of put their money where their mouth is when it
Starting point is 00:05:29 came to the critique of how existing mobile services work. So, where are you? So, where are you? As, for example, the humane pin is going to be something like $699 plus a cell phone plan on top of that, the Rabbit is $199 for the device with no subscription required. Now, when it came to the community's reaction, there were a couple different categories. Some thought their presentation was great. Jason Levin said, I've done marketing for 30-plus startups. Rabbit's launch of their AI hardware was genius.
Starting point is 00:05:56 To be honest, the founder might be the next Steve Jobs. Here's the hidden marketing tricks you definitely didn't notice. He goes on to point out the critique of the endless buttons on smartphones. as well as the critique of LLM's understanding language but not being able to take actions. He also pointed out that it was a super savvy move to collaborate with what he called Silicon Valley nerds favorite product designers teenage engineering. Teenage engineering are better known for their pocket and portable synthesizers. They're totally reimagining the music creation process, and they really do have kind of a cult following. Now, the biggest category of critique that I
Starting point is 00:06:25 saw is why couldn't this just be an iPhone app? Sergey Galgan put a little rabbit icon at the bottom of an iPhone and said, is there a universe in which I have this and don't have to spend $199 to carry around another device in my pocket? Gabe Schnitzel created a little app icon and said, I just saved you guys a bunch of time, plastic, and spinny wheels. And this is just the tip of the iceberg of that particular critique. Matthew Berman, however, offered a defense, which again echoed a lot of themes that I saw. He wrote why I'm excited about Rabbit R1. New device have a techie, so I love it. Natural language-based interface removes complexity in the form of apps. Rabbit R1 offers a preview of the future, one where apps.
Starting point is 00:07:01 are gone and operating systems are abstracted away by LLMs. Could this have just been an app today? Maybe. If so, it would have to navigate current App Store restrictions. Apple will likely launch something like this built into iPhone. It'll be good. But I enjoy trying new form factors and the price point is fantastic, and I love the design by teenage engineering. Ravo to a company willing to try something completely new. I think Matthew's point that this offers a preview of the future in which apps are gone and operating systems are abstracted away by LLMs is an extremely salient one. It may be that there isn't room in the mainstream for yet another category of consumer device that sits alongside a phone. But it also might be that the design experiments that show up in things
Starting point is 00:07:39 like the Rabbit ends up shaping what the iPhone 20 looks like a few years down the line. Now, like I said, what made yesterday interesting was that the news around AI assistance and AI hardware wasn't just from Rabbit. First of all, Amazon Alexa got a set of new generative AI powered experiences. Frankly, the updates were kind of minor. They were debuted at CES. And they just showed off a set of three examples of how different AI companies like Character AI are collaborating with Amazon Alexa for a new generation of skills and experiences. But to me, this felt very much like prelude to much bigger changes that I think are probably coming down the pipeline for Amazon in the future. Meanwhile, Humane, which had been super buzzy and which unveiled itself last November,
Starting point is 00:08:16 announced that it was laying off 4% of its employees before the release of its AI pin. A lot of the commentary was fairly incredulous. Pucco Capital Guy writes, Humane already beginning layoffs before the product is even shipped. Now, the founders of the company argued that this was a restructuring based on shifting phases of the company's life. But I would say that broadly the sense on Twitter was that it might be positioning for a world in which the pin hadn't been quite as well received as they would have hoped. I think, of course, ultimately that 4% is a fairly small part of the workforce and that smart startups today are constantly in a re-evaluate and revisit tash flow management kind of perspective. But the signal it sends is at least a little bit rough. Now, one of the David versus
Starting point is 00:08:54 Goliath upstarts in the AI assistant slash hardware space is of course Tab. Yesterday, the company announced that it had raised a $1.9 million seed round. So what does this superpower Tomogachi actually do? Basically, the Tab's approach to AI is to listen to everything that you say and everything that people say around you and to try to give you access to basically a second brain. It's closer to a pensive from Harry Potter than a personal assistant. In fact, in an interview with Fast Company, Schiffman said, tab is not an assistant period.
Starting point is 00:09:22 I'm not building something that's going to connect to Notion or your emails anytime soon. I'm solely building a friend that morphs into your creative partner, life coach, or therapist as needed. What I'm trying to do is create a new relationship in your life, radical transparency without concern of judgment. I think this is a relationship people used to have with God, but is lacking in the modern world. You gotta love the ambition there. And yet still, there was more news in this broadly defined agent or assistant space.
Starting point is 00:09:46 Multion founder Div Garg tweets, We came out of stealth today, very excited to officially announce ourselves Multion, vision for AI and agents and thank all of our many backers and investors. In their first official blog post, they write, Maltion is an AI agent that takes actions and interacts with the digital world to tackle mundane tasks that people would rather delegate to an assistant. Maltion frees up the human race from the mundane, boring, but necessary tasks that would take hours or days to accomplish so we can more fully enjoy and live our lives. Maltion will book your dream, travel itinerary in minutes, or reserve a table with six of your best friends at your favorite
Starting point is 00:10:16 restaurant and confirm the best date and time for everyone in just minutes. We used to take hours and even not be possible to do, is now handled by Multion in a matter of minutes. In many ways, I think it's reasonable to look at Multion as a software-driven approach to some of the same problems that Rabbit is trying to solve, although the very presence of hardware obviously makes it a different type of thing. Still, if 2023 was the year that language models became a mainstream force, if entrepreneurs have their say, 2024 is going to be all about AI agents and assistance. Exciting stuff to watch as we kick off this January. But for now, that is going to do it for today's AI breakdown. Until next time, peace.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.