Bankless - LIMITLESS: Aravind Srinivas: Perplexity CEO’s All-In Gamble to Take Down Google

Episode Date: August 19, 2025

Perplexity CEO Aravind Srinivas joins the show to explore how AI agents, personalization, and new browsing models could transform the internet experience.  From replacing mundane workflows with p...roactive assistance to envisioning a fully AI-native operating system, Aravind shares his long-term vision, the thinking behind Perplexity’s new Comet browser, and why curiosity will remain humanity’s most valuable skill in the AI era. We also dive into competing with Big Tech, protecting users from AI-generated “slop,” and the architectural choices that make AI-assisted browsing faster, more reliable, and more personal. ------ 💫 LIMITLESS | SUBSCRIBE & FOLLOW https://limitless.bankless.com/ https://x.com/LimitlessFT ------ BANKLESS SPONSOR TOOLS: 🪙FRAX | SELF SUFFICIENT DeFi https://bankless.cc/Frax 🦄UNISWAP | SWAP ON UNICHAIN https://bankless.cc/unichain 🛞MANTLE | MODULAR LAYER 2 NETWORK https://bankless.cc/Mantle ------ TIMESTAMPS 0:00 Intro 9:58 Curiosity as the Key Human Advantage 15:26 From Keywords to Questions 21:35 Personalization and Agency in Browsing 27:13 Keeping Humans in the Loop 32:30 Why People Love Perplexity 38:35 Comet - The AI Browser 47:24 The Browser vs. Agent Debate 57:16 Competing with Big Tech 1:01:26 The OS Endgame 1:08:48 Closing Thoughts ------ RESOURCES Aravind Srinivas https://x.com/AravSrinivas  Perplexity https://www.perplexity.ai/  ------ Not financial or tax advice. See our investment disclosures here: https://www.bankless.com/disclosures⁠

Transcript
Discussion (0)
Starting point is 00:00:00 Comet is, we basically call it browser the speed of thought. So we all have a lot of thoughts while we are on the browser. And we don't get to actually finish all of them because every task that we have in our head takes a lot of time. So Comet is meant to unify perplexity on the browser in a very native way, where perplexity evolves from just giving you answers to performing actions for you. And perplexity evolves from just pulling context from the web, pooling all context. If you were listening to this, that means you were a subscriber to Bankless,
Starting point is 00:00:38 the greatest crypto podcast in the world. But there is another show you should know about named Limitless. Limitless, you could think of kind of like Bankless, except instead of crypto, we are exploring the frontiers of artificial intelligence. It's everything you wish you knew about AI condensed into these really short and neat 25-minute episodes. Think how to optimally extract value out of chat GPT. What model to use when if you want to offload a lot of the productivity and the work that
Starting point is 00:00:59 you do on a day-to-day basis? And then we're also talking to the people who are making that reality. Today's episode is no different. We are joined by the CEO of Perplexity, one of the most popular AI consumer applications in the world who is telling us that basically Google Chrome and Safari are dead. You can throw them out. You don't need them anymore. He is building the future of browsers. In fact, Perplexity actually just made an offer to buy out Google Chrome. So it's this really high stakes game on how we engage with the internet, thinking through the lens of how to optimize for an AI first experience. I found this totally fascinating and I hope that you will too. So if you enjoy this episode, do me a favor. It really means a lot. a lot. If you can go into the description, click the link, go subscribe to Limitless, check us out on socials, share it with your friends. It really goes a long way in supporting the mission. Today, I'm joined by David, who you know on love, myself Josh, as well as our co-host EJaz, where we drop some alpha on how to get ahead of these trends, on how to get ahead of the AI browser trend and offload a lot of your productivity to AI. So you have more time to
Starting point is 00:01:53 do the stuff that you want to do. So without further ado, let's get right into the episode with the CEO of Perplexity. In the wild west of Defi, stability and innovation are everything, which is why you should check out Frax Finance. The protocol revolutionizing stable coins, D-Fi, and Rolex. The core of Frax Finance is FraxUSD, which is backed by BlackRock's institutional biddle fund. FRAX designed FRAXUSD for besting class yields
Starting point is 00:02:14 across D-Fi, T-Bills, and carry trade returns all in one. Just head to FRAX.com, then stake it to earn some of the best yields in D-Fi. Want even more? Bridge your FRAXUSD over to the FRAXTL layer 2 for the same yield plus FRAXTL points and explore FRACTL's diverse layer-2 ecosystem with protocols like curve,
Starting point is 00:02:31 of convex and more, all rewarding early adopters. Frax isn't just a protocol. It's a digital nation, powered by the FXS token and governed by its global community. Acquire FXS through Frax.com or your go-to decks, stake it and help shape Frax Nation's future. Ready to join the forefront of Defi, visit Frax.com now to start earning with FRAXUSD and staked FRAXUSD. And for bankless listeners, you can use FRAX.com slash R-slash-Bankless when bridging to FRAXL for exclusive FRAXL perks and boosted rewards.
Starting point is 00:03:00 Imagine a world where traditional finance meets the power of blockchain seamlessly. That's what Mantle is pioneering with blockchain for banking, a revolutionary new category at the intersection of TradFi and Web3. At the heart is U.R, the world's first money app built fully on chain. It gives you a Swiss I-Ban account, blending fiat currencies like the Euro, the Swiss franc, the United States dollar or the Rimbi with crypto, all in one place. Enjoy real world usability and blockchain's trust and programmability. Transactions post directly to the blockchain, compatible with Tradfai Rails, and packed with integrated DFI future. UR transforms Mantle Network into the ultimate platform for on-chain financial services, unifying payments, trading, and assets like the MI4, the M-Eath protocol, and functions FBTC,
Starting point is 00:03:41 backed by developer grants, ecosystem incentives, and top distribution through the UR app, reward stations, and by-bit launch pool. For M&T holders, every economic activity in UR drives value back to you, embodying the entire stack and future growth of this super app ecosystem. Follow Mantle on X at Mantle underscore official. for the latest updates on blockchain for banking. That's X.com slash mantle underscore official. Ethereum's layer two universe is exploding with choices.
Starting point is 00:04:06 But if you're looking for the best place to park and move your tokens, make your next stop Unichain. First, liquidity. Unichane hosts the most liquid Uniswap V4 deployment on any layer two, giving you deeper pools for flagship pairs like ETHUSDC. More liquidity means better prices, less slippage, and smoother swaps, exactly what traders crave. The numbers back it up.
Starting point is 00:04:24 Unichain leads all layer twos in total value locked for Uniswap V4. And it's not just deep. It's fast and fully transparent. Purpose built to be the home base for defy and cross-chain liquidity. When it comes to costs, Unichain is a no-brainer. Transaction fees come in about 95% cheaper than Ethereum main net, slashing the price of creating or accessing liquidity. Want to stay in the loop on Unichain?
Starting point is 00:04:44 Visit unichane.org or follow at Unichane on X for all the updates. We are here with Arvind, Serenov, from Perplexity. Arvind, welcome to Limitless. Thank you for having me. Arvin, we want to learn something from you. We generally understand on Limitless that the future is going to be very different from what it looks like today, most likely starting with the internet. So we want to hear how you think the internet will be different in, you know, five,
Starting point is 00:05:08 five years and how that thesis of yours has informed your leadership of your company perplexity. It's never easy to imagine the world like five years from now, even in normal circumstances. And we live in like a pretty different world where AI is advancing at a pace that we just not used to. So I would be lying if I said I really know how the world's going to look like Firesmo, and I actually don't think anybody really does. Because it's so hard to work around scenarios where the capabilities of AI are constantly evolving every few months. Three months before, all the agenda capabilities where chain of actions and chain of tool calls were not reliable.
Starting point is 00:05:58 We're not, like, you could, you could see, like, the contours of it, but it was not as well-defined as it is today. And even today, it's not as reliable as it needs to be for really having, like, like, one employee per person, this is an AI. So, definitely, I'm only making, like, you know, bets here that there'll be a lot of agents that serve the web, serve the internet, and go and do tasks for people. we are no longer going to like browse the internet for things that don't feel fun. In the sense, like no one likes paying their credit cards.
Starting point is 00:06:38 No one likes like, you know, moving money from one bank account to another. No one likes like using legacy websites to book hotels. No one likes like using these really archaic UI for like finding, you know, a rental, a car rental or like a last minute dollar. appointment somewhere in a new city that you are in. These are all like stuff where like, you know, the different websites don't actually have good websites to deal with. They're a buggy. There's no customer support.
Starting point is 00:07:09 You've got to go find their numbers somewhere. You've got to find details from four or five different places. Finding a lawyer for something. Like these are all like very difficult ways in which the web is designed today. And AI hasn't solved that so far. AIS managed to assimilate information from. many different sources and summarize it. That's what perplexity essentially started.
Starting point is 00:07:31 But next step for AI is honestly to take away the mundane, boring aspects of having to do the real work for you. And so then what happens is you just go and like browse the internet for fun on websites that feel delightful. And different website owners can actually make their websites look delightful because AI can write a lot of code. And so my bet on the future is it's going to be glorious. It's going to be like, I have a very optimistic view that people are going to like have a lot more fun.
Starting point is 00:08:04 And entertainment is going to be a way in which like monetization happens even more in the coming years. Creators like podcasts like you like many, many other formats of like communicating and like sharing information are going to like spread. and so people are also going to have more time on their hands. Like a lot of the ways in the AI companies are motivating their tools is like, hey, people hardly have time, so let me just have the AI do the work for you. That's only partially true. People actually have a lot of time. It's just that they find the work boring.
Starting point is 00:08:43 That's why time spreads to fill the gap. Like if you allocate two hours to do something that only takes like 15 minutes, if the work for 15 minutes is so boring and you have to deal with so many boring workflows, you still spend two hours doing it. And that's kind of like why people hate doing work and they get tired and then they don't want to do anything anymore. So I think that part would go away hopefully
Starting point is 00:09:08 and that means that we're all going to have a lot more fun with each other and consume the web in our own personalized fashion. We're going to have AI's proactively tell us what to go and consume. We can also consume ourselves. You can tune on and off how much AI dependence you want in your life.
Starting point is 00:09:29 So it's going to be a very high agency, high curiosity-driven world. That's one of the reasons why we have built a brand around curiosity because we think that's the one human emotion that, sorry, human characteristic,
Starting point is 00:09:42 I wouldn't say emotion, but human characteristic that is even more important in the age of AI, especially knowing how to use AI as well as knowing what to do in a world where AIs are able to do a lot of things we used to do. Go into that a little bit more, curiosity as a leading human drive
Starting point is 00:10:03 for how we will navigate the internet. Why is curiosity so important? And maybe what will human curiosity look different when we have all these amazing tools at our disposal? So I think the world has more AI than the amount of scale required to make use of it today, right? Most AIs are pretty good.
Starting point is 00:10:24 Like, yes, there are mistakes being made by all JIs, hallucinations, sometimes like chatbots are too psychophantic. You know, like they're not able to reliably complete tasks, yes. But keep even the current state at which it can write code for you, build websites for you, do your research for you, answer whatever questions you. have is already like incredible but compared to the amount of people using it on a daily basis, it's not as high. Most people are still doing work in the traditional way.
Starting point is 00:11:04 We have a browser called Comet. It's basically able to let you watch a YouTube video with help of an AI. I don't even have to watch the full videos anymore. It's able to help me draft emails, LinkedIn posts, recruiting emails, recruiting messages, pull candidates to like reach out to, pull up all these old emails. I don't want to read.
Starting point is 00:11:26 Unsubscribe me from spam. It's already able to help me do all these things. But I do need to sometimes be curious in terms of how to actually make new users out of it that I haven't tried yet. It's on my creativity. It's on myself to actually like channelize that extra agency that it gives me.
Starting point is 00:11:45 So I would say that's the current, aspect in which curiosity is short-term useful, long-term, when these are all like, you just take it for granted that, you know, by default, you have to have the AI with you for everything you do. Let's assume that's the state. You still have to be curious in terms of what do you even work on. What is the next project you start? What are the kind of like seed prompts you can provide to AI, even if it helps you do the
Starting point is 00:12:12 task? It's you channelizing like what questions to ask. what original project to like initially start working on, that imagination, like, okay, let's assume Einstein had all the scientific tools, right? It is still on him to question, like, what happens if we travel the speed of light? Do all the Newtonian mechanics break at that regime? Should we build a completely new understanding of the world? Okay, why is this even a useful question to ask?
Starting point is 00:12:44 No, even if that, even if you didn't know the answer to that, like, asking these questions and going deeper, or for that matter, like questioning Einstein's wisdom and going at the particle level and drawing a distinction between the particle and wave nature, all these things are like stuff that physicists used to do just out of pure curiosity. And Jeffrey Hinton had a lot of curiosity on like what would happen if we build computers that simulated the brain, even though computer science was all about deterministic programs. fundamentally AI is basically stochastic programs. Like you cannot guarantee an output in AI even today.
Starting point is 00:13:25 So no LLM decoding is always the same. So people had their own sign of curiosity to go and explore things. But that was purely an academic exercise until now. With the access to all these AI tools, it's no longer going to be restricted to the elites. It's no longer going to be restricted to the professors, the scientists. So anyone with the curiosity that's childlike, when you have a child very early on,
Starting point is 00:13:52 or you have someone in your family as a child and you go hang out with them, they ask you all the most basic questions that stumble you, right? Like it feels great to be able to answer them, but also it feels like, oh, down, like I never actually thought deeply about it. I just took it for granted what people told me
Starting point is 00:14:09 or what I read on the internet about. So I think that sort of world is what we are heading to. and as you go deeper and deeper and as you ask, ask more questions, and if AI actually helps you ask more questions, like it's not just answering your question, but it's actually suggesting you more questions to ask and taking you into rabbit holes.
Starting point is 00:14:30 You get that kind of joy that the early web adopters got with hyperlinks and Wikipedia and like embedded web pages. The internet used to actually attract the librarians and like historians and like the intellectuals and the academics. That was how it started. That's kind of why also Amazon started selling books because it wanted to cater to the early audience of the internet, which happened to be people interested in books.
Starting point is 00:14:57 So I think like that's kind of how AI feels like today to me. It's really like massively used by the early adopters who are thinkers and programmers and like intellectuals and academics. But as the tools get easier and easier to use, and as the tools get a lot of agency, it's really going to be way more accessible. And so the normal person who's curious is going to be actually having a lot of superpowers.
Starting point is 00:15:23 And that's hopefully going to change the world in a very positive way. I love the way you frame the internet and AI as a vehicle to pursue curiosity and creativity. I think back to when I bootloaded my first CD-ROM or when I went on RuneScape for the first time and I could just kind of explore this new, internet native world.
Starting point is 00:15:45 I'm curious, AI is typically branded as something that will automate the world and your kind of like pitch of like curiosity and creativity. Do you think there's like a fine line between kind of AI constraining what an individual can look at, an explorer and search for,
Starting point is 00:16:05 versus something that they can use as a tool to create? I'm kind of thinking how you tread that line when you build the products that you build. Our product is built for helping you navigate the web and search more effectively than consuming through organized 10 blue links, right? And in my opinion, like, the skill of asking a question was not even there. We all, like, I would say using the machine learning terminology, we all overfitted to the skill of typing in a couple of keywords and opening the links. and reading those links and then
Starting point is 00:16:44 synthesizing and summarizing the relevant information to our original question in our minds and arriving at whatever conclusions we have. That's kind of what
Starting point is 00:16:54 we did in the last two decades because we just couldn't have a tool where we just directly go and drop our questions. Now that's there. So what are we doing more of is like asking more questions. The first question,
Starting point is 00:17:08 but also like a lot of follow-up questions. And so that's leading to a very different way in which you start. Like, for example, let me give you an example of how my own life changed. Earlier, when I want to read and understand a topic, I would like go and read the papers, the blog posts in a very linear fashion. I would, I would, like, go do the literature review. I would collect a bunch of, like, resources to read that particular source of material.
Starting point is 00:17:37 And then I would, after reading all of that, I would come to my own conclusion. I would still have a lot of questions, but this is how I would do it. This is how everybody did it. Now it's very different. Let's say I'm the CEO of the company. I don't have time to go deep into anything anymore. But I do want to learn,
Starting point is 00:17:57 what is this new thing everyone's talking about? MCP was a buzzword, and I wanted to know what it is. I don't have to go and read Anthropics documentation and blog posts to understand. I can actually ignore the writer's perspective and just go directly to perplexity and ask like what the hell
Starting point is 00:18:16 is this MCP thing? Right? Like, you know, just, is this a buzzword? Is this like a pro? All it does is a different way of moving jons around between like, servers and like the models or is there something more to it? Like, why is everyone calling it the USBC for the AI or internet?
Starting point is 00:18:38 Like, what's the big deal about it? Like, I can ask things in a way. where questions guide my learning rather than like the material and the blog post guiding my learning. Because anyway, after reading five different pages, I would still have like tons of questions. So why not I start with the question?
Starting point is 00:18:58 And then after like 20 questions, I got a lot of it. And now I can go and read the material in full. Like it's a very, it's kind of flipping the order. This is a personal taste. I don't recommend everybody to do this, but because I don't have, have a lot of time. This is how I learn things at this point. Same thing happens. Like,
Starting point is 00:19:16 whatever happens to my body, whatever, like, I'm doing my workouts, the foods they consume. I don't actually go and, like, watch YouTube videos of, like, diets recommended by bodybuilders and, like, like, how to, like, lose fat without, like, losing muscle. Like, like, I don't have to watch, like, 20 videos. I can just actually ask critical questions about the videos using the comment browser, tell me what is actually like contrarian on you here. Like I can ask it to like reference check a bunch of papers. So it's leading to a very different way for me to consume knowledge out there. And I only think this is going to be even more awesome, like going forward.
Starting point is 00:19:56 Like kids don't have to like consume the web or the internet like the way we did when we were kids. And then voice more interactions are going to make it feel even more natural. And then having, like, ability to pull context from whatever you're seeing and asking questions is going to make it feel like even more device-free and, like, more natural. So I'm actually more excited for the next generation because they're very lucky. Like, like, I feel like I don't know your ages, but I'm guessing you're all in my age group. So us, we were fine. Like, we at least enjoy the early web. I feel like the next
Starting point is 00:20:38 the generation right after us really got affected by this whole social media push and they got a lot of knowledge just by watching reels and like charts and you know all these things and I don't think that's like
Starting point is 00:20:53 very good because fundamentally it's net negative I think the generation after that is going to be like you know go watch the reels for me and tell me like what's likely interesting to me I trust my relationship with my agent who truly understands me and knows my preferences, knows my goals and objectives to go, like consume the internet for me and give it to me the way I want.
Starting point is 00:21:19 So that agency and that trust you get with your AI to do things for you and to filter out the noise for you and help you seek the truth and help you stay curious, that's the kind of what we want to help create through our products in this company. I love that. You're describing essentially a new kind of online or browsing experience, right? When I showed a comment to Mark and Reason,
Starting point is 00:21:46 the prompt he wanted to know was like go to X.com, scroll through my like 100 tweets on my feed, based on my browsing history, filter out of the noise and just show me like the 20 relevant ones. And it did an amazing job. Like what if this is like all? almost real time where I go to a website, I click a button, you know, I don't even have to write all this prompt, and then it just renders the website and the way I want to consume it.
Starting point is 00:22:14 No website owner or like algorithm builder has a time to like customize everything to every person at a fine grain level, right? There are like Elon Musk will make a change to the X algorithm and then you start seeing a lot of political posts or you start seeing a lot of memes. You start seeing a lot of random videos or anime content. You have no idea why you're suddenly seeing all this. And then it's not his fault either. He's just trying to maximize a few metrics for the company or maybe for himself. Like it's his property. And that's the, well, like, I don't want us to continue living in. We need to have the agency to do like things in the way we want.
Starting point is 00:23:01 It kind of sounds like you're describing the current world that we browse and look online is kind of overfitting for skills that we don't need right now. So what do you think are the important skills that we should focus on now then, as like that new younger audience that is entering the internet today? I would say critical thinking. So here's one skill I have learned to acquire over time, which is anytime you read a book or about, biography of somebody, unless it's written by someone pretty neutral, usually it's a
Starting point is 00:23:37 buff piece on them, right? Like, usually it's like something that they collaborated with and, you know, meant to show the company or themselves in a very positive light. So you don't, you end up like getting a very biased perspective. So I would love to like, you know, have an AI, when I'm reading a book together and and critically review any chapter and tell me, like, perspectives that are contrary to what the author has as well. So that's what I do when I read books right now.
Starting point is 00:24:12 Like, I just have a sidecar assistant on my browser, and I just ask it for like, hey, just based on what you read in this chapter along with me, like, just tell me, like, what are some things I should look out for that the author could be wrong here about. and that's just me being looking for like more perspectives like slightly the peter tealish like contrarian ideology again not to be contrarian for the heck of it like I just want to know everything possible and so that's that's that critical thinking will definitely be essential and then like the reason that you can consume everything in the way you want definitely helps
Starting point is 00:24:53 you to not get into echo chambers. So hopefully that opens up the mind for most people to like just question a lot of the things they see on the web. And I think the web is going to be filled with a lot of AI slop too. I don't want to just like give you this impression that AI is like all so amazing technology. Just sit back and enjoy the ride. Like there's going to be a lot of slop. There's going to be a lot of AI generated misinformation.
Starting point is 00:25:18 AI generated videos that are like photorealistic that you can't even say if it is real or not. and then like content will be written a lot more by AIs and humans on the internet. So the only way to fight this is actually with help of an AI like ours or like other people are building where that are helping you seek the truth and guide you towards it
Starting point is 00:25:41 even without much effort from you through the right kind of prompts that are like already cached and help you consume things the way you want and like honestly work for you like truly like keep your interests. Imagine a world where agents are doing shopping and like travel booking for you, right? And there might be a world where like
Starting point is 00:26:04 whichever companies choose to do this, try to have advertisers like trying to get the attention of the agent instead of yourself. Like ads at the agent level. Where do you like protect the user then? Who doesn't want this to happen? The way potentially it could work is the user and the agent have their own contract, a handshake,
Starting point is 00:26:27 it's all in the form of a system prompt, and that prompt is protected. You cannot inject it. This doesn't exist today. You can do prompt injection to anything. So this would not work today. But imagine a future where we can reliably do this. Then, like, no matter what the advertiser tries to do
Starting point is 00:26:45 to get the agent to prefer them over some other merchant, the user's prompt to the agent protects them. from the sort of like advertising mechanism. So we need all the sort of like develop versions of like the current systems. It's very nascent today. It feels like the early days of the internet. But that's what I would like to like ensure
Starting point is 00:27:08 to make sure like people are protected even against AI slop and advertising and all that going forward. There's this notion in UX design that better UX involves fewer clicks on behalf of the humans. Like if we can just get them to tap fewer times to get what they want,
Starting point is 00:27:22 that's generally considered to be good U.S. I see some of this, that same kind of sentiment being applied towards AI agents where, oh, we can actually just get AI agents to do things on behalf of their users. Yeah. And that really puts humans into a very passive role. And I see pros and cons of that, right? Like, you know, sometimes I just don't want to think that hard and I just want to be entertained. And that makes me feel good.
Starting point is 00:27:46 I'm also worried about the costs of that as my brain turns off more frequently as my default mode. How do you think about, when you're designing perplexity, how do you think about like this active versus passive human in the driver's seat when we can automate things? But then also like maybe we also want to encourage a more active driver when it comes to like managing these tools.
Starting point is 00:28:11 How do you think about that tradeoff? Yeah, it's a good question. So we think about it in the sense of keeping the user active in the process. in the process. So at least for agentic queries, where you go and ask perplexity to go do a deep research for you about like
Starting point is 00:28:30 GLP ones. We do have the agent coming back and asking clarifying questions to the user where the user can provide more input. I think chat GPT is also doing this. ChatGPT explicitly forces you to reply. Perplexity does not force a user to reply.
Starting point is 00:28:48 I think ours is a better design because sometimes when you don't know anything about a topic, like, you don't know enough to even reply anything. So your reply doesn't matter. So you don't need to be blocked on the user to reply. But this is at least one way in which you can stay along and guide the agent towards like doing something that you want, right? And then certain other ways in which we do it on the comment browser, for example,
Starting point is 00:29:09 if you go and ask it to buy something, it'll still ask you for confirmation before like proceeding. It'll give you like warnings. Oh, like, this is going to be $100. Like, are you sure you want to spend? So it's still going to keep your brain active as it does to work. But I'm taking a more, I'm zooming out and looking at your question at a more philosophical level. Like if agents are, at some point you do trust them, they're smarter than you.
Starting point is 00:29:36 It's kind of like you hired some person smarter than you and like why do you even need to micromanage them anymore? So where do you apply your brain power? It's not very dissimilar to like, you know, like you're running a company and you hired like two people and they're like, world class and they're just doing everything. And like, even if you don't turn up for work, like, your company runs fine. Like, what do you do then? Either you do another company or you start another business adjacent to your current business that helps you grow the current business even more.
Starting point is 00:30:04 Or you start like doing, taking more bets within the same company by hiring a different set of people and trying to like, like amplify what you can do within the company. Right. So I guess like that's how I see it. If you stay idle and do nothing, yes, like I think there's going to be cognitive. of decline for sure. And I think that applies even more in the age of AI, where if AIs are able to do a lot of things for us,
Starting point is 00:30:33 and we therefore take it for granted. And Bill Gates has this thing of, like, people have, like, three-day work weeks or two-day work weeks in a world where AIs work really well. I think that's okay. Like, by the way, I'm not against such a future where people only work two, three days a week and chill for the remaining four days.
Starting point is 00:30:51 you know, like this whole five days a week thing was, the Industrial Revolution did it for us. Like Henry Ford was one of the main reasons that happened. Because at that time, like the only way to maximize production efficiency is people turned up to the factories and did the work. Then machines started doing a lot more things and like people started finding different kinds of jobs and software and internet.
Starting point is 00:31:14 All these things are like, like, you know, how we started evolving to deal with all these changes. So I'm sure we're, find more ways to keep ourselves occupied. At the same time, I'm sure, like, there'll be some people who just retire and pursue other passions, like, like, hiking and, like, photography and, like, you know, content creation, podcasting.
Starting point is 00:31:34 There's a lot of, like, so many different ways in which you can, you can just have your own life. And that's kind of like making the world more multidimensional. Like, some people tell me, you know, San Francisco is two, one-dimensional. You come here, you just meet tech pros. Like, they all talk about AI and cafes and, No one has, like, it feels like, you know, it feels great.
Starting point is 00:31:56 The energy is amazing. But people who like living in New York or London and, you know, you go into a bar and they're like people, someone, you meet someone who's like playing an instrument. You meet someone who's an artist. You meet someone who does like stage shows or like stand-up comedy and then engineers. There's like a lot of different types of people in New York and that attracts, you know, a lot of people to that city. I do think that like AI is getting better and better
Starting point is 00:32:24 might make the society feel more that way globally across the world, not just restricted to few cities. So, Arvin, people love perplexity. When I told some of my friends you were coming on, they were stoked. They use it all the time for like sports scores, for weather, for gambling suggestions. And I find people have this affinity with perplexity.
Starting point is 00:32:42 And I'd like for you to help me unpack why. So if I'm a user, if I'm someone who's listening to this podcast, who uses Chad Shoebt, who uses Gemini, and isn't quite sure what the advantage is to perplexity. Can you describe the unique advantage you have why people would want to use the service and what you're doing in the background to actually deliver on that promise?
Starting point is 00:32:59 Number one, we established the brand around accuracy and knowledge. And so it's not meant to be an assistant that is broadly like, you know, or rather it's not meant to be an AI chatbot that's meant to be chatting with you on and everything. So you can go to chat chip and like just have a bad day can he motivate me
Starting point is 00:33:21 that perplexity is not meant for that and so we're not trying to build a product that's good for search and research and knowledge and facts and good for being your chat buddy or like a companion
Starting point is 00:33:37 all in one Gemini and Chattepe are trying to do that so as a result of really optimizing for one thing which is knowledge and facts and research and giving the answer to the user in the most consumable fashion, like the highest density, you know, per pixel,
Starting point is 00:33:56 like in terms of information bandwidth, we try to, we do a better job, and then we also are faster to just give you the same answer. So we really care about, like, you know, what the user needs to like, like, say. Even if the user doesn't have to be very precise in their prompts, we kind of understand their intent and give the answer faster and better.
Starting point is 00:34:17 So the sports scores, you're saying that we did a lot of work on that because when you're asking the score of a game, you don't always return the answer in a wall of text. It's not fun. You do want these widgets. Your brain is used to consuming those pixels. You don't want live updates, stock graphs. Like you do want sometimes do deep dives on a company's revenue or the financials. We build a lot of dashboards for that. You don't want to be able to compare two different stocks.
Starting point is 00:34:45 you don't want to be able to like go deeper into like the past scores or like you know like the different like Formula One for example like you want to be able to like track live updates in the game so we did a lot of work towards like just giving the information in the highest possible information bandwidth like consumable pixels and we still haven't completed like we haven't done good job at tennis yet like I think we're still lagging behind on like soccer So there's still a lot of work to do, but we at least care enough about this. And we want people to be able to come to us for asking questions about anything and everything in the world. And that's the way we think about building the ultimate answer engine.
Starting point is 00:35:30 And how are you doing this? What's happening behind the scenes? So when I place a query with perplexity, what is the magic that's happening behind? Are you routing? We recently had the CEO of OpenRodera on who kind of described how you can route queries to different models. Are you just kind of aggregating the data yourself? Are you scraping the web and serving it along your own model? what's going on when I hit enter on that search box?
Starting point is 00:35:48 Yeah, so every query gets classified. So sometimes it's a sports query, sometimes the weather finance or a regular query that doesn't need widgets. So every query gets classified, and then depending on the classifier, different UIs, like we call it generative UIs, different UIs are generated per query.
Starting point is 00:36:08 And then for certain queries that require really accurate facts, you don't want to just use web links, You actually want to use a data provider that gives you a real-time data dump. That's what you need for finance. That's what you need for sports. It's what you need for weather. So we do that.
Starting point is 00:36:27 For some queries, you actually need like merchants or hotel inventory or those kind of things. So we do that for travel and like commerce. For some queries, you need like data providers for local restaurants. We do that with Yelp, for example. And for other queries, you just need the regular web
Starting point is 00:36:46 where you pull a bunch of links and you summarize the content in them. So that's what we do for most of the queries. That's a long tail. And you want to like decide if you want to format it and markdown or tables or like just one paragraph or two paragraphs. And if the query came on the phone
Starting point is 00:37:03 or if it came on the web, if it came on the phone, try to be a little more concise because people don't want to read like a lot of text on the phone. And then you also want to decide of like you want to reason and think longer for certain queries that are a little more ambiguous. Example, let's say you want to ask like,
Starting point is 00:37:21 what's the age gap between the top five billionaires and their wives or something like that, right? You want to like, who are the top five billionaires, so and so, okay, like who are their wives, so and so, like what is it ages like these 10 people, their birth dates, and then you want to calculate the differences. So you actually have to do some reasoning and then give the answer in the form of a table.
Starting point is 00:37:41 So the model has to automatically, adapt based on the query, how much reasoning and how many steps of reasoning to apply. So that's all based on classifier decisions too. So we have essentially think of us as building this gigantic, complicated information router for humanities like curiosity and knowledge needs. That's basically what we're doing. And if we can do this at scale for all languages, all types of queries, all types of articles, all types of like basic day-to-day tasks, there's like tremendous
Starting point is 00:38:13 value in that. It doesn't even matter if we own the model or not. Like, just the value of the router in terms of knowing which models to use for what queries and like what kind of UIs to use and how much of compute to apply per query and getting like majority of the answers right and doing it with delightful latency and like UI is basically our goal. Okay. So you have, now we have this tool set. You're taking the complexities. You're merging it into one dataset. And it seems like you're really exceptional at a few things. We mentioned sports. I know a lot of people also use it. A lot of people have been begging on Twitter actually for a perplexity to replace Bloomberg in terms of financials because it can do a lot of
Starting point is 00:38:52 charts. And it seems like you're really strong at some of these categories. But where you're putting a lot of your time and effort is actually into the browser itself with Comet. And I want to introduce Comet for the people who don't understand. It is your new AI browser. I'd love for you to share it with us because it seems like perplexity for a while. You've kind of been living on rented land in order to use perplexity. Normally I'd have to go to Chrome or I'd go to Safari or I'd use like a different browser that is not native to you. But now what you're doing is you're actually creating the full stack, right? You're creating the browser from your desktop.
Starting point is 00:39:19 You run the application. You control the entire stack. Can you just introduce what Comet is and kind of how it works? Yeah. So Comet is, we basically call it browser the speed of thought. So we all have a lot of thoughts while we're on the browser. And we don't get to actually finish all of them because every, every task that we have in our head takes a lot of time.
Starting point is 00:39:41 So Comet is meant to unify perplexity on the browser in a very native way, where perplexity evolves from just giving you answers to performing actions for you. And perplexity evolves from just pulling context from the web to pulling all context. Your browser history, your Google Calendar, your Gmail, other tabs that you might have had open once upon a time, your Slack, your other workspace tools. And so it can pull all relevant personal context and the web context and have the agency to go take actions for you. And available with you everywhere on the search bar, on the sidecar, on the new tab page. So whatever webpage you are on, contextually helping you, right?
Starting point is 00:40:28 That's the most important thing. Your work starts with some context. You're on a Google doc and you're asking for help to edit the doc. You're on a Google sheet and you're asking for help to source information from the web to help you fill the sheet. you are doing some work and you're trying to pull relevant context from the past that you might have exchanged on emails
Starting point is 00:40:47 with your colleagues to help you like draft something. You're just about to interview someone and like you want to pull all the background materials for them. You just want to do it like, hey, like prepare me for my day tomorrow and it's going to do it for you as part of the prompt. So we just wanted to be a lot more intuitive,
Starting point is 00:41:06 a lot more personal searches, a lot more like personal context. And actually like just, taking away the mundane aspects of dealing with boring websites. So that's what Comet was meant to do. And it got up to a really great start. And it's right at the sweet spot where it's almost there, but not quite there yet. And I think that's where you want to be so that you want to write the wave of like the models
Starting point is 00:41:30 getting better and then closing the loop on like full reliability. Yeah, I want to talk about the form factor and kind of design choices for Comet because I think a lot about intelligence. It's going to improve over time and more importantly, how we're going to engage with it as we kind of ascend this growth curve. And when I think about the conclusion that I reached, it seems to be a little bit different than the browser. So when I think about a browser, and you mentioned this earlier, there's kind of two uses for it, right?
Starting point is 00:41:56 There's two buckets. There's productivity, and then there's leisure. And productivity is kind of the work you do. It's me doing the agenda prep for the episodes. It's me shopping for laundry detergent or booking trips. And then for the leisure, it's like, I'm watching YouTube videos, I'm watching Netflix, I'm scrolling my X timeline. I love that.
Starting point is 00:42:11 That feels very uniquely human to me. And I kind of want to keep that. That feels special to me. So what I imagine is that productivity bucket kind of gets abstracted away through agents. And it seemed a little far-fetched a few weeks ago. But then I tried Open AI's agent and I was like, wait, this is kind of cool. It kind of obvious gets away all the interfaces, the complexities of the browser, and it just gives me the answer. It understands my preference stack.
Starting point is 00:42:32 It kind of knows everything. And I'm curious the design decision you made to actually preserve the form factor of the browser versus just going direct to the agent workflow that kind of, kind of takes away all the interfaces, the advertising, and then just serves you the answer that you're looking for. So the work begins from where you are, not like from an empty chatbot. Like for example, you're actually like in the middle of
Starting point is 00:42:54 drafting a note for a memo and you want to pull context from what you already discussed with your colleagues on Slack. You don't even want to like copy paste the memo and ask, hey, like, can you pull all the context from the past that I might have discussed with David and yes or something like that, right? Like, you literally want to just have the assistant right next to you and just say, can you pull relevant context that I might be missing here? And you don't even have to say pull it from Slack notion. It'll just automatically know what to pull and edit it for you right in place.
Starting point is 00:43:32 The other other advantages are that like this constant like switching tabs and copy pasting context here. and there and there, and then taking the outputs from one place, putting it back in another place, all that stuff saved for you when you just have it natively embedded in wherever you are. And in terms of architectural decisions, like the Chachity Agent is way slower than the comment browser. People have done comparisons and whatever takes you like 11 minutes to do on Chachickey agent will probably take you like less than a minute on comet. because there's a lot of advantages
Starting point is 00:44:07 in just parsing information on the client side and just using the server side for the frontier model reasoning but not having to create an entire server side session of your client browser and then doing all the compute there. There's again another round trip between that and where the models are actually hosted
Starting point is 00:44:27 and then like actually like sending the result back to you to the client. It's just like very slow and unreliable. and sometimes get stuck and retries, and you don't even know what's going on. Compared to having full control on the client, which is a lot more secure. Pass passwords don't need to be communicated. Everything is stored locally.
Starting point is 00:44:46 All your content is local. You don't ever have to worry about like a server-side session of whatever you're doing. And like everything's much faster because there's only like one way, like two-way communication between like whatever information on the client and whatever models are running on the server. But that's all about it.
Starting point is 00:45:02 So, and like, For example, you might want to take help even on your ex or Netflix or YouTube, right? Like, I'm on YouTube. I just want to say, hey, like, there's a podcast these guys did with Arvin, and I just want to get that exactly what Arvin said about a chat GPT agent. Can you pull it up from, can you exactly like edit the clip out where he only talks about this and upload it as a separate video to YouTube and help me watch it? We're not able to do all these things yet,
Starting point is 00:45:34 very much on the horizons of like, you know, happening. You can exactly pull it from the right timestamp and like you don't have to go show transcript, command F, chat, GPT agent, and then again, like, move around those, the playback slider in terms of where I exactly start speaking. All that's not needed. It's just much better.
Starting point is 00:45:53 So it can also help you with personal tasks. Like a lot of like work, like sometimes you're just watching YouTube and you can, you just may want to pull the whole transcript and use it for your, next thing later or send it to somebody pretty quickly. Or while you're watching YouTube, you might want to book a dinner reservation on the side. Everything can just be,
Starting point is 00:46:15 and you might just want to see if the agent's making progress and you can just consume your content. Everything is just much more seamless and integrated in one environment. It's the stickiest product that humanity has built so far since the last 20 years or like almost 30 years, like we've been using browsers. Yes, like it's changed a little.
Starting point is 00:46:36 Firefox innovated on the concept of tabs. Google innovated on the concept of like tabs as separate processes. But other than that, there's not been much changes. So for the first time, we're able to like give it to you in a familiar front end, familiar UI, but give you a lot more agency. That's basically what we decided to do. And it's okay. Like eventually the agency is so reliable that like you don't need to actually like, you know,
Starting point is 00:47:02 open your browser at all, you just have to type into the new tab page and it does everything for you. It's completely fine. But we think of a future where people will still be doing work, but they'll just be doing it with a lot more AI help, but they'll still retain all the agency. I think that's the kind of future we believe in. And I think embedding the AI directly in the browser is a better approach. Yeah, I think the browser is 35 years old, 1990. So we've been using it for a long time. It's clearly very sicky. And when you mentioned the perks of using comment browser now. I agree. We actually graciously got access to it. We were able to test it out and try it. And it is so
Starting point is 00:47:37 much faster than using the agent feature because it has all the integrations built in. It had my Google accounts. It had all of my login integrations. But my question to you is, what happens when eventually they do get faster, when the agents kind of collapsed that time and latency where you don't have to spin up a virtual machine. It doesn't take that long. And it really truly is just like a browserless experience. I know people are working on hardware devices to kind of just bring that into reality and remove a lot of the interface. So do you see the browser being the continued form factor as we move forward? Or do you eventually see Comet evolving into something a little more abstract than just a box with a little tab on top?
Starting point is 00:48:14 Look, I'm not particular on the browser remaining the front end for information consumption. Like I don't think that's necessary for the browser to be relevant. That's the whole point. the time it takes for the agent to actually, like the abstracted out agent to actually, like, do the work for you is not a bottleneck by the models getting more intelligent. It's purely an architectural choice of like spinning up server side sessions for each of your like browsing like tabs or third-party services. And models will get more intelligent and reliable in terms of controlling these websites,
Starting point is 00:48:53 but fundamentally what's happening is you're just, spinning up a browser session on the server side. That's all that's happening. And you still need the infrastructure of doing a browser, whether it's on the client or the server, whether it's headless or like with the front end, you still need all the infrastructure to do this, right? Like when you're asking on Comet,
Starting point is 00:49:19 go and buy this on DoorDash for me, we're not actually opening DoorDash and have the agent, like rendering it on pixels and have the agent like click on things. It's actually done a much more efficient way, but just directly consuming the JavaScript and taking actions there. We give you the front end in terms of the progress bar to see what's going on. That's just more meant for transparency and like user reliability.
Starting point is 00:49:45 But the agent doesn't need to consume it in the way you consume it. So that's not really like a server or a client decision. It's more that where do you have? actually start. Where are you already on most of the time? Are you going to be mostly on the chatbot? Like, is that where you're going to be spending most of your time on? In that case, it makes sense to move the browser back to the cloud and, and keep you on the chat. But that's not how we are, right? We are actually most of the time on the browser. We're opening the chatbot as another tab, or Google as another tab, or perplexity is another tab, but we are mostly on the browser.
Starting point is 00:50:19 We're like, you and I are on Riverside now. But the Riverside, I'm recording. I'm recording you don't comment right now. It earlier used to only work on Chrome. But now it works. Okay, yeah, we fixed the bug. So look, here's the thing. I'm on Riverside we're talking. I might want to have Comet listen to us
Starting point is 00:50:40 while we're talking and just loop it into our conversation. And, you know, it can also, like, come and do a podcast together with us or, like, answer our questions. That you'll miss out on all these experiences when you're just, like, stuck in this single chatbot window all the time. And it feels empty and there's no like new context all the time. Whereas on the browser, you just open like Twitter or like link,
Starting point is 00:51:03 you know, you go to Twitter and like scroll through a few feeds and then your world is already like chaotic and interesting. And that you miss out all that by just staying in the chatbot and always have to think about what prompt to add to the chatbot. I think that's why we think the browser is more interesting because context keeps coming. So it's like no limit to your curiosity in terms of what you can do with it.
Starting point is 00:51:31 What I'm thinking, I'm seeing happening here is there's this notion that AI is just going to come and improve all of our lives in all these different ways. And it's going to come via these assistants. And what I'm seeing with the browser, with this browser model, the common model, is what you are doing with perplexity is you are making a bet that the browser form factor is the most useful assistant form factor to take the unbridled intelligence of these LLM models from Open AI
Starting point is 00:52:00 and all these things. And you're just making a bet that like, okay, we'll make the assistant actually just the browser. And there are other maybe competitors out there that maybe you... The assistant needs to have browsing, whether it retains the front of the browser or not.
Starting point is 00:52:17 Right. And I think on mobile, you're not going to actually use it as a user on the web. On mobile, you're not actually going to be opening tabs
Starting point is 00:52:25 on the mobile browser. You're actually going to just go to the individual apps. You're not going to go to X.com on my mobile browser. You're going to open X as an app. So on mobile, the way the assistant
Starting point is 00:52:40 takes advantage of the browser functionality is calling the third-party apps, which you cannot, the OS restricts you from calling the third-party apps. I cannot open DoorDash. I cannot open,
Starting point is 00:52:52 Uber, Amazon, or like Twitter or LinkedIn, and go do stuff for you there. The OS does not let me do that as another app. Siri can potentially do that, but that's because it's not even an app. It's native to the OS. So that's where having the browser as an explicit standalone app
Starting point is 00:53:13 and helping me either run a cloud server side session of that or like doing it on your client as a background process, has a lot of flexibility in terms of what all I can let the assistant do beyond just answering questions. Right. And I suppose there's going to be a handful of products that are like this where they are trying to make just a useful form factor for AI around your person. And one of them is a browser because like you said,
Starting point is 00:53:39 we spend so much for our time in a browser. Another one is, you know, maybe people aren't really intuitively thinking of this as like a competitor, but I see it in the same category as like those pendant things, those physical devices where it's just another form factor and it's supposed to assist you using AI. And this one is a, it's not a browser, but it's this thing that's with you in real life. You're away from your desktop.
Starting point is 00:54:02 You're away from your phone. Your phone's in your pocket. But it's another form factor of something that's supposed to assist you and make your life better. And is that how you see the category of what you're building in? You're just trying to make the best form factor possible to create a useful AI assistant tool. Yeah, definitely.
Starting point is 00:54:18 the memory and the context you can pull from the browser is second to none in my opinion I think people believe in the pendant and like whatever the thing that you can put on the chain necklace largely unproven
Starting point is 00:54:35 yeah glass like record everything you're talking to and fundamentally it's actually like a less efficient way to like store things battery draining compared to like taking advantage of the battery of your phone or your MacBook, which is what the browser does. And also a lot more engineering resources
Starting point is 00:54:57 that have been put into making browsers like consumeless battery and memory. And like well understood code to optimize, the chips are way more powerful. So that's what the pendant lacks. It has to constantly drain your Bluetooth on the phone and keep uploading things to the server, keep using the internet connection of your phone.
Starting point is 00:55:19 So it's not meant for, and maybe you don't even need to record that much. Like it feels an overkill. Whereas on the other hand, every website you've gone to, having access to your email and calendar, what are all meetings you've attended, your flights, your dinner plans.
Starting point is 00:55:37 I already know so much to be able to help you just through the browser context. And it also feels less creepy to me in terms of like constantly having to go around this device and recording people without their permission, whereas the browser only gets your own personal context and only with your own permission, by the way, and like you can choose to do things on incognito.
Starting point is 00:55:57 So that's another like advantage I feel the browser has. And if you want your phone to record a particular meeting, you can always choose to do that. It's pretty easy. They're recorder apps. You can have like existing apps, have a record button and then, you know, log all the context, dump it into your local drive there on the app itself.
Starting point is 00:56:19 He can be stored locally on the client, doesn't have to be pushed the server, context can be pulled from there. The browser can do all these things. Like, it's pretty easy to do all these things. So that's why I'm not a big believer in the hardware. Like, I think hardware is very interesting when you go to the AirPods level. Like, when I have an AirPods and I can just talk to it while walking, and it has cameras.
Starting point is 00:56:44 And I can actually ask questions about restaurants, menus. It gives me a completely new way to go shop online. There are a lot of these advantages you have with the glass or the AirPods, where glass can help you render things. The airport can help you just see and talk. So I believe in those. But I'm not a big believer in devices that need to record every single thing you're talking to or speaking about.
Starting point is 00:57:10 and then taking all that as context and pushing it into a chat on the server. I don't think that's needed. That's funny. Josh, we've spoken about different form factors back and forth before, and he kind of guessed the AirPods with a camera that was able to kind of see and sense everything.
Starting point is 00:57:29 I mean, like, Poblexity is like the first major AI company to come out with the AI browser, right? And it's no secret now that the likes of OpenAI and Google are going to be releasing new browsers or enhanced browsers. And you mentioned on a previous podcast, I think it was with Y Combinator,
Starting point is 00:57:50 that the reason why Google kind of like didn't like created a separate kind of search engine and didn't integrate AI directly into their search engine was because like it didn't function or work the same way. And my question to you is, if OpenAI releases a browser tomorrow, what do you think is like the main moat that Poplexy Comet has over everyone else?
Starting point is 00:58:14 Is it these kind of natural, intuitive, human flows that you describe? Is that where you're going to play the best in? Or is it these agentic flows? Like, can you help us understand what that looks like? I mean, like, I think they are going to work on a browser. It's been communicated by the press already. So what would the mode? I think mode is obviously going to be around like having a better
Starting point is 00:58:39 product, moving faster, shipping new things that are not just whatever we shipped already, but things that have to do with like long running processes, like kind of like the clod code equivalent for day-to-day browsing tasks. Some people like to think about the browser as the IDE for your life. And then the coding agent could be the thing that's fundamentally missing. Like right now you have synchronous agents that do things for you in place in real time. But asynchronous agents are doing things on. the background or like taking for much longer but can can take on harder tasks that needs to be stitched together much more long context management stateful memory all that stuff is still missing
Starting point is 00:59:21 so we need to build that they they'll want to work on all that too so i think like the modes really just going to come from whoever executes better um and and unlike a chatbot where users ship features a browser is a massive commitment to be multi-platform and constant upgrades and tons of bug fixes having to deal with so many different versions of the operating systems and both on mobile and desktop and a lot of like architectural decisions between what stays on the client and the server, security, privacy guarantees, enterprise versions for people to use it safely at work, lots of like context handling bugs and errors, constantly having to deal with new models, having the ability to use like,
Starting point is 01:00:09 multiple models, not just one, so that agendic capabilities on different models are going to always, like, you know, never be the same. So we have a lot of advantages by just being ultra product-focused company versus being a model company that's doing a compute cluster buildouts and Stargate and like SORA, video generation,
Starting point is 01:00:28 like chatbot, companionship, like image generation, search. Like, let's like 20, 30 different projects, they do and browsers is just one of them. For us, it's everything. So we're waiting the house on it. And if we are a very tiny startup that has very little funding, we're obviously
Starting point is 01:00:48 still going to lose. But fortunately, we're not that. We have a reasonable distribution. And we have a lot of funding. So a lot of great talent here. So I think like it's a very natural bet to take, even if an established company like Open Eye wants to work on the same thing.
Starting point is 01:01:07 It only validates our thesis even more. And we are also betting on the fact that open source models are going to catch up to the capabilities of the frontier ones. And we'll be able to migrate off the close models for whatever we do today. And we'll still be using the closed models for things we won't be able to do today for like new cutting edge things. Like I said earlier, like you guys were the first to launch an AI or major AI browser. How like if you would have think about form factors going into the future, you mentioned that, you know, you're not really a fan of hardware devices. If you were to augment your browser in the future,
Starting point is 01:01:44 what would you build next? Yeah, I've said this before. I think the only next step after the browser is the OS. That's the final frontier. Because the only one who has, like the only reason you even build a browser for doing a lot of the agents is you cannot control iOS or Android.
Starting point is 01:02:05 It's interesting, you might think Android's open source so you can control it. No, you can fork it and you can, make Android whatever you want, but you really cannot get a phone maker to ship your version of Android without getting approval from Google for that. And if they're not the default search,
Starting point is 01:02:21 they're not going to let you ship a version of Android without the Play Store and with the core Google apps, so like Google Maps and YouTube and Gmail and calendar and so on. And if they don't ship their apps and let other people ship their apps on your version of Android, no phone makers even incentivized to sell those phones on like in any market. So you basically have to build a super app that can call every other app.
Starting point is 01:02:50 So you don't even need the app store. But that's kind of why you need the browser because the browser essentially once it becomes everything app and you can call Uber's and buy stuff on Amazon and generative UI's are all so fast and nimble and like doesn't feel like you're missing out on the app. apps, you would still need things like X and Instagram and WhatsApp to message people. Like, like, it's very hard to get around the lack of having apps. So I think this is a much bigger vision than, like, you know, even shipping the browser, where you have to actually convince social media companies and, like, other people to actually, like, you know, ignore the Play Store and ship apps along with you to do a new version of Android.
Starting point is 01:03:34 and then convince like a massive phone maker like Samsung or Motorola or someone like the largest OEMs to actually ship this phone in the market. That's the ultimate end game. And I don't think we have graduated to work on that. The best step to get there, to deserve a right to try that is to ship a really amazing mobile browser and get a lot of distribution on this and really improve the reliability and latency of the product to the extent that people feel like the browser. and get a lot of distribution on this and really improve the reliability and latency of the product to the extent that people feel like the browser, it's the everything app, and it feels like an OS by itself,
Starting point is 01:04:07 and they're willing to actually try out a new phone that can have like a new version of Android. And I think that's the, in my opinion, once you complete that last step in the trajectory, that will be the true end of like the Google monopoly, in my opinion. Because that's when, like, they cannot control anything here.
Starting point is 01:04:30 Like, on Android, they control was a default search. And 68% of their revenues, mobile advertising. And so if I remove Google search to the default and let you just use an assistant for all your search needs and you can navigate the web
Starting point is 01:04:47 and information, everything, all done in a seamless way, most of the revenue on search ads, tanks as a result of that. So you actually need to get market share like through distribution on the phones and that needs like massive phone maker like Samsung to back you. So this is like this is basically the end game and you also have to build a good business model around the agents and you know a subscription revenue for people who want to like experience the internet and like services through this new form factor. So the world has to change quite a lot for these things
Starting point is 01:05:23 happen, but we're not like working on perplexity as a as a short term like project. Like it's a, it's going to take a decade to realize all this. And a baby steps along the way and comment is the first step towards that. I think if there's one big takeaway I learned from this episode, it's the notion, the reasoning why a AI native iOS software must be AI first. Like that's ultimately where we ultimately end up. You could also consider building like Windows or as not exactly Windows, but, you know, arrival to Windows or Mac OS.
Starting point is 01:06:01 But again, you're going to end up the same problems. Like Microsoft might not want to ship their apps to your OS because they just don't want to encourage arrival. And that's why like all the Microsoft apps, like the Office 365 apps suck on Linux. And that's one of the reasons why Linux has failed to get distribution. It does kind of beg the question. If the end game is an AI-native operating system, what's more likely?
Starting point is 01:06:30 Apple finally figures their shit out and converts iOS to being AI-native. Microsoft has Windows, and they figure out how to make Windows AI-Native or maybe ChatGBTGBT and OpenEI try and get into this game or like a younger startup-like perplexity. Are these the players of the game? Or how might- Google is still relevant to it?
Starting point is 01:06:52 Google's Google relevant. If we do see an AI native operating system, it's going to come from one of these players. Apple, Microsoft, chat GPT, Google, and perplexity. I think so. Our meta, you never know. But I think so. These are the main players.
Starting point is 01:07:12 And I mean, I'm even fortunate to be considered in this list. Everybody else has 10 to 100 X more capital, maybe 1,000X also. So, and so it's definitely, but I would say the main advantage in terms of structural restrictions exists with Apple. They basically, yes, like they are going to lose the Google ad revenue share if they change the way search and Safari works. But that might be a thing they might lose any way if the judge rules that to that effect in the DOJ case. So if they are going to lose it anyway, they might go all in on this vision and change the iPhone to be more AI native. Google, on the other hand, will not be able to do it as fast on Android phones.
Starting point is 01:08:05 They might try it on the pixel phones where the distribution is much smaller, so they don't actually lose much ads. And they might sense the market and then try to go deeper on the other OEMs. But they have more constraints and restrictions here. and Open AI doesn't have the ability to go build out its own device. It has the same problems we have in terms of convincing Samsung to go do this with them. And similarly, like, Meta has the same problems.
Starting point is 01:08:34 It doesn't have search. It doesn't have a browser. It doesn't have, like, great models. And Microsoft, you know, Windows doesn't have the phone, like abstraction. So it's not going to be multi-platform like Google or Apple can do. Well, Arvin, I want to say thank you and congratulations for having a seat at the table.
Starting point is 01:08:52 That is no easy feat. I mean, you've gone from what half of a billion dollars to 18 billion in 18 months or some outrageous growth like that. So congratulations and all the success. For the people who are listening who are curious about what we're talking about today, how would you say is the best way to reach perplexity? How would you want to intro people to use your product? Where should they go? Perplexity.
Starting point is 01:09:09 That's the web landing on mobile apps, iPhone, and I'm App Store and PlayStore. type perplexity on the play stores or app store. Ignore the ads at the top, like Gemini and Claude advertising and stuff. Got a love it. Go directly to our app. Yeah. Amazing.
Starting point is 01:09:28 Well, Arvin, thank you so much for taking the time to joining us today. We really appreciate it.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.