This Week in Startups - Creating the future of search and competing vs Google with Perplexity AI’s Aravind Srinivas | E1770

Starting point is 00:00:00 Your interviews with Brian Chesky, I learned a lot from those episodes, actually. Me too. Also, I liked his idea that, like, they were going to only ship the number of features he could keep in his brain. And that his brain would be the maximum, you know, size of the canvas. So if one person can't keep all these changes in their brain, let's put those changes into the next six-month cycle. I thought that was pretty awesome as well. I actually borrowed a heuristic from there, adapted it for our company, which was, if the person, in building the feature doesn't know how to write the code for it.

Starting point is 00:00:33 They're very good programmers, but if they're finding it a hard time to break it down and actually implement it, then it's not worth shipping. This week in startups is brought to you by CrowdBotics. Great ideas can change the world. And CrowdBotics is the fastest way to turn those ideas into code. Get a free scoping session for your next big app idea at CrowdBotics.com slash twist, Vanta. Compliance and security shouldn't be a deal breaker for social. startups to win new business. Vanta makes it easy for companies to get a SOC2 report fast.

Starting point is 00:01:06 Twist listeners can get $1,000 off for limited time at Vanta.com slash twist. And Open Phone brings your team's business calls, texts, and contacts into one delightful app that works anywhere. Get 20% off your first six months at openphone.com slash twist. All right, everybody. Welcome back to the program. We have been having an amazing array of founders who are taking on the challenges of implementing AI in the real world. And today will be no different. We have R. Unvind Shrin Ivas on the program.

Starting point is 00:01:43 He's the founder and CEO of Perplexity AI. Welcome to the program. Arnvind. Do you have a nickname or you go by Arravind? Arvind's good. Thank you for having me, or Jason. Great to have you. And listen, you, this is a name.

Starting point is 00:02:00 There's a big battle going on between chat, GPT, and Bard. And you're right in the middle of that. You are doing Perplexity AI. And you are trying to compete with these two giant software developers. Tell us a little bit about how that's going and how you see Perplexity. That AI, which you can go check out right now. The interface will look familiar. How do you plan on competing with them and how is that going?

Starting point is 00:02:26 Yeah. So firstly, we started off. A week after Chat ChaptiPT came out, we put it out. And there was a lot of difference at that time, which is we were just a search bar and we gave direct answers with citations. Whereas Chat Chapti was this entertaining hallucinatory bot that was not just, you know, like correct many times, but it was also equally wrong many times and its mistakes were also entertaining, right? So we focused a lot more on revamping search, realizing that 10 years from now, no one's going to be asking for 10 blue links.

Starting point is 00:03:04 You're going to ask for answers. So we might as well start it today. And the technology for that was ready. Both Chad GPT and us were basically being powered by GPD 3.5. That was the fundamental breakthrough. And then after that, GPT4, even better than that. So that's kind of how it started And we were seen pretty different

Starting point is 00:03:26 It's like oh, you know If Chachapiti lies or makes up Things There's this other side called perplexity You go there and like It's going to be this boring Educated Uncle kind of product But useful

Starting point is 00:03:39 And you can trust it And that's kind of how we grew And then BART came out I believe BART still hasn't solved The fake news problem completely it does hallucinate and it doesn't actually have like real citations sometimes. So we are better than barred in that in the context of search.

Starting point is 00:04:02 But Google's also rolling out this thing called Magi or they call it search generative experience to the public, but the Wall Street Journal called it Magi. So they're trying to do something pretty similar to us. And so far, the experience, at least from what I've seen and myself used that and other people who have used that is, it's not very different from the way they used to extract text from the top link and put it in space at the top. It's not very different from what they've already done before.

Starting point is 00:04:33 And they cannot afford to use really powerful models. So like the search traffic that they have, and if they actually want to really get it right, in the actual search bar itself, outside of bar, they're going to lose a ton of revenue. Yeah. have two challenges there. And so have you done a

Starting point is 00:04:54 crawl of the web because you are giving citations and you do have a language model behind this? So tell us what is underneath the hood here because I have been saying, hey, if you're going to get a bunch of information and present it to me in a beautiful

Starting point is 00:05:09 answer with bullet points and numbers like perplexity just did for me. I asked it, hey, what are activities I should do with my seven-year-olds and they like cities. and the outdoors, and it gave me four popular destinations for cities and for popular destinations outside. Really good suggestions, really tightly summarized.

Starting point is 00:05:29 And then at the bottom, it said, hey, and then here are three citations, trip advisor, U.S. News, family vacationist, and today, the Today Show. So what's underneath the hood here? Yeah. How is it generating the answer? Yeah. So L-LMs are these great reasoning engines. You throw a lot of texts at them and tell them what to do with it,

Starting point is 00:05:50 and they'll do it for you. And then there's the other part that's great, which is having a good index and a ranked version of the index, which is a traditional search engine. And what we do, where we come in is we combine the two together. We say, hey, like, LMs are great.

Starting point is 00:06:06 We'll figure out what content to throw at them for a given query, and we'll instruct them on how to actually take all the text that's thrown at them in the context of the query and get the needles from the haystack and present it in the right format of the user. So they're doing more of the reasoning job, they're not actually doing, pulling up actual facts that's been stored in the LLM itself,

Starting point is 00:06:25 because some of them could be right, some of them could be wrong, the real, actual facts are in your web pages. So that's the content that we want to take. And we have, like, our own index and also, like, we rely on other index providers. And we collate from multiple different indexes, multiple different crawls of the web, and pull up the relevant links. And then we asked the LLM to do all the reasoning on top. and then we give you the answer.

Starting point is 00:06:51 Now, the magic is that all this happens so fast. We've put out the product in December, and back then the latency used to be like five to six seconds per query. In fact, one of our investors, Daniel Gross, he used to joke to me saying, you should call it submit a job and not a submitted query. It's that slow. And now it's like almost as fast as Google.

Starting point is 00:07:10 Like you're hardly waiting. The summary is like really generated really quickly. And we still have like, you know, so much more room to improve there. And I think at some point, you're just going to take answers as the de facto search experience. That's kind of what we want to bring together.

Starting point is 00:07:28 And our primary like superiority over the existing products is the speed at which we deliver the really accurate, well-collated answers from so many different sources. And so, but you are built off of today, chat GPD4, correct? Yes, we heavily use

Starting point is 00:07:46 chat GPD 3.5 and 4. And we also use a little bit of our own LLMs for many other things. Every question you ask on our site, you see a few related questions that are being popped up, right? That's actually one of the favorite parts of the product from many of our users because they like asking more. And that is sort of generated with our own LLM, for example. So there are some parts of the product that we use our LLMs, but I would say like most of the heavy lifting is being done with opening as LLM's right now. And so you added right now, does that mean your plan on building your own? Because it does seem like you're directly in competition with Bing.

Starting point is 00:08:27 Bing has the partnership with ChatGP4. So it's almost like you're both using the same underlying technology. They already have some scale. So that would be a difficult race there. So how do you look at ChatGPT's 4's relationship or OpenAI's relationship and Microsoft's access to it? I think we just need to win by building a superior product. There's just no other way. And I believe so far we have done that.

Starting point is 00:08:53 We have not won against them, but they still have a lot of distribution through Windows devices. So a lot of people just go to Edge and they can start using BingChat. But people have, despite that, won against Microsoft in the past. They Google, everyone went in search for Chrome as the first search query on, like, Internet Explorer, to install it. We all did that despite the friction they added. So there's only one way to win against the person who has much more distribution than you, which is a superior product. Now, about using the same underlying technology, it is the case today.

Starting point is 00:09:31 The reason is they have the best models and there's still a lot of differentiation you can have and how you harness the power of these models. These are so general purpose machines. It's almost like you buy the engine from somewhere, but you're building a whole car with a lot of different parts, and you can still build a better car. And if it is the case that Open AI is just going to be the number one place by far, and you want to give the best product to your users,

Starting point is 00:10:01 you don't need to use their model. Like, there is no... Like, you can say, yeah, I'm going to use my own model because I don't want to use someone else's. But then if the search experience is pretty shit there compared to what you have with Open AI, you're not going to get users. And then you build your own modes of differentiation in other ways that just the person

Starting point is 00:10:21 owning the LLM cannot build as good competitive product as yours. So if just the LLM is the only reason this is working, we don't have a chance. But that's not the case here. There's so many other things needed to be done to give you this experience where there's real-time facts being pulled up and presented in the right manner, super fast, reliable, and make the product engaging. So all that also matters. So for example, people have done comparisons between us and Bing.

Starting point is 00:10:47 And, you know, we have like much better accuracy in terms of how correctly we cite things. A lot of academic research has been done there. People spend on an average like two minutes more on our site than Bing. So that engagement is much better there. So our bounce rates are much lower than Bing. So basically we only lack in one thing, which is number of views on the site. But that can only be addressed if we're given sufficient. time to grow and make people aware of us.

Starting point is 00:11:17 All right, we all know the one thing that separates great startups from the good ones is product velocity. What does it mean? Product velocity. Fancy term, right? You've got your product and you have velocity. Speed. The speed in which your product improves.

Starting point is 00:11:31 So can you ship updates? Can you release new features? Can you do bug fixes? Can you iterate on the interface? Can you solve problems for your customers? And can you do it quickly? because you're not alone. You have competitors and your customers have choices.

Starting point is 00:11:46 They may solve their problems by writing their own custom code or they might use your solution. This is what startups are about. How fast can you get that product velocity going? And so, you know, how do you supercharge it? Everybody says, okay, yeah, we want to go faster, but you got to go faster intelligently. And crowdbotics is going to help you do that. They're your CTO as a service. Basically, they provide you with the most optimal architecture to get your product to market as fast

Starting point is 00:12:10 as possible. You'll have access to an on-demand product manager and developer talent, and they will help get your app into production 10 times faster than conventional development. CrowdBotics can work with your in-house dev team, or you can just have them work independently. And you own all the IP, you own all the source code. Let the folks at CrowdBotics supercharge your product velocity today. No more waiting. Get a free build plan at CrowdBotics.com slash Twist. That's a $499 value just for the Twist listeners. You get that for free. That's C-R-O-W-D, B-O-T-I-C-E. dot com slash twist for a free build plan. How do you get the citations?

Starting point is 00:12:48 If you were asking this query I just did about like, hey, what cities, should I take my seven years old to and then what outdoor locations? How do you actually get the citations? Because chat GPT4, they don't provide citations? Yeah. Or do they? They have this thing called a browser plugin,

Starting point is 00:13:06 which is basically powered by Bing. But people hate that experience in the sense it's really slow and clunky. Yeah, it is slow and funky, yeah. And so how do we do the citations? We basically pull up the relevant links to your query from a search index. And then we combine that and tell the LLMs to write the answer. We basically ask the LLMs to go read all those links.

Starting point is 00:13:29 And then pull up the relevant paragraphs from each of those links and then make an answer out of whatever you thought was relevant. But write down the answer as if an academic or a journalist would write it, where each part of the answer has the corresponding citation like Wikipedia. You basically say, hey, like, I want you to do the job of what a human does on Wikipedia

Starting point is 00:13:50 where when they're writing something about a new person or a new phenomenon or a new city, this is basically going and like picking up a lot of web links about that, sifting through them and reading them and then coming back and writing an essay on it, right? So that whole human labor, intelligence needed to do that,

Starting point is 00:14:09 is being automated now. All of this happening in like seconds, right? That's worth like powers of human labor. And that's the value we are actually adding to everybody. Got it. And so you collect all those links, give all those articles, and then give the summary of them.

Starting point is 00:14:27 They basically instruct the LLM to like, hey, behave like a Wikipedia person. Just write it like this. So the core of this is prompt engineering and knowing how to prompt engineer for different types of query. because different queries might require a Wikipedia editor, other ones might need a more of a sensibility of a journalist.

Starting point is 00:14:46 And the LLM knows the difference between those things? You need to make it know. That's the skill there. And you're right, prompt engineering is a big part of it. But prompt, just because somebody might have your prom doesn't change much, actually. Like, prompts can leak. So it's all about orchestrating the back end, making it work with the right sources to.

Starting point is 00:15:08 So there's a Steve Jobs movie with Kate Winslet in it where there's a scene between Wozniak and Jobs where Wozniak's like, I'm the guy writing all the code and I'm the code, you don't write code, you don't do design. Why do you, why does everybody know you and not me? And he says, I play the orchestra. So that's basically where anyone who aims to build a long-lasting company on top of LLMs,

Starting point is 00:15:36 the thing you need to be really good at is, playing the orchestra. Like having so many things work together reliably and efficiently and correctly and super fast. So one of the pieces is searching the webbing, finding the right articles, the next piece is knowing how to write the answer. Right. What are the other pieces here?

Starting point is 00:15:58 Thinking the relevant parts from each article too. Like article has a ton of content in it. You only need a few for the query you ask. making sure that you write the answer in the most accessible way. Initially, we just started off with just putting text with citations. Then people were like, hey, I want neatly formatted answers. I want markdown in it. I want, like, code to be rendered in a specific way.

Starting point is 00:16:21 I want images in it. I want, like, videos in it. I might want to customize it for kind of the domain I'm searching in. And then people keep asking for more, and you learn more about the second part of Google's mission, right, making it universally accessible and useful. So the first part is organized those information. The second part is basically

Starting point is 00:16:42 where LLMs are adding tremendous value. And how do you deal with specific verticals of data that are more siloed? I see one of your co-founders or one of your founding team members was from Quora. You of course have the Reddit dataset. Great for conversations.

Starting point is 00:17:01 You have Twitter, great for debates and funny one-liners and breaking news. You have Yelp, you have Google local. You've got all these silos of data. I asked, hey, what are some great Greek restaurants? Did a pretty good job of telling me Greek restaurants in the Bay Area. And so how do you think about those silos of data and are you intercepting searches and saying,

Starting point is 00:17:23 hey, this search is about local businesses and restaurants. This search is about something that the Reddit data set would do better with. How do you think about that? Yeah. So the part of our data, like, you know, the access and things like that, it's, it's, It's an ongoing debate and I don't have like, you know, very strong opinions on what each person should do. Ideally, if there's a need for us to pay any party for their data access, we'll do it. As for how we do it, like, what links we know to use for which query, we do, like, take your query and figure out, like, which category it is and, like, try to use that information to give you the right sources.

Starting point is 00:18:00 It's pretty hard, actually. Google does a tremendous job at this. And we are also doing some things. called focus searches where in the search bar, instead of using all of the internet, you can go and pick academic, or you can pick YouTube, you can pick Reddit, Wikipedia,

Starting point is 00:18:17 and you can just, yeah. Yeah, so you can, there is a drop down called all. Yeah. And I could just pick YouTube. And then YouTube, you have access to the corpus of all the transcripts or just the metadata, I guess, and titles. For now, we use metadata

Starting point is 00:18:30 and titles, but that's already amazing. Sometimes I can't find some videos on YouTube directly, but these LLMs are so good at, like, doing the relevance ranking, that's much better than the YouTube search algorithm. The language models do better than Google's native search algorithm. Wow. Sometimes. Not always. Got it. Most of the times it's equal, but sometimes it's just really good at, like, these fine grain. I was trying to find a video of, like, oh, so there's a scene in this movie I want to find, for watching for

Starting point is 00:19:00 inspiration or something. And then I couldn't find out YouTube, and I come here and I get it. It's very useful for Reddit. Like I want to like learn about like, you know, the nothing phone. Like, you know, who's even using it? Who are those million people? And then I don't have time to go to the subreddit nothing phone and like score over all these like links. That's very useful there. People use it a lot for Wikipedia.

Starting point is 00:19:22 Like if they just want to focus on one thing. Like I was talking to the founder of Wikipedia, Jimmy Wales. And he literally just asked for this feature. Like, hey, I just want to do search over Wikipedia with an LLM. I was a great idea Yeah I think they're building it now within Wikipedia itself

Starting point is 00:19:39 Hmm Interestingly I did a search For interviews With the CEO of Airbnb Mine didn't come up But other ones did

Starting point is 00:19:50 But then it came up with I did once from the past year And man That was kind of a bingo It kind of nailed it Which is a kind of a nice feeling I really think that's a creative idea And I can see how

Starting point is 00:20:02 what you're talking about is got some there is some point to this which is if you narrow the scope or you build some interesting prompt engineering or narrowing and thoughtfulness

Starting point is 00:20:18 you can get to a better answer so what's going to be your business model here you talked before about how Google is not going to be able to make it work with advertising there's a group of people who believe that the chat interface

Starting point is 00:20:33 will cannibalize their existing business. So do you agree that this chat GPT style interface or just the chat interface, let's leave the GPD out of it? Nobody owns a chat interface. But is the chat interface anti-advertising,

Starting point is 00:20:50 or could advertising be integrated into it? Because on All In, a lot of, I think three out of four besties thought, hey, advertising's not going to work. And I thought, I think advertising is going to work great inside of this. You have your citations, but you could put right in, embedded in the discussion, you know, all kinds of

Starting point is 00:21:07 interesting things. So if you were asking about places to travel with your kids and I'm Disneyland and you didn't make it, I could put in there, hey, and if you're thinking about outdoor stuff, Disneyland also has this adventure park and they do the safari and I could have like a really AI generated answer at the bottom. So it gives me the correct answer or what it thinks is the correct answer, but then it also gives an ad engine's answer to it. So am I right or are my other three besties right?

Starting point is 00:21:37 You decide. I'm more of a two here. Oh, you are? Okay. So firstly, I think relevance can be even more targeted now than ever before. What is the purpose of Google? It's just bringing two parties together, the advertiser and the consumer. And they help you connect these two parties together,

Starting point is 00:21:59 where their query and link matching, right? At the end of the day, the advertiser wants to get their content to the consumer or the content. And LLM can give you that needles in the haystack even better. Like, it's even more targeted, honestly. That if I were an advertiser,

Starting point is 00:22:21 I would just kind of focus on selling myself really well, writing even better marketing copies with LLMs, cater to the person. I'm trying to sell to. And we introduced this thing called AI profile on perplexity, where you can just write about yourself. Ah, yes, I saw that. And that way you, the results are even more catered to you.

Starting point is 00:22:44 And then if you're an advertiser, you can say, I want to target people who are of, like, having all these attributes in their profiles. And then the ranking will automatically take care of that. So in some sense, you're creating, way more relevant and targeted ads than ever before. I don't know if you use Instagram, but my experience in Instagram

Starting point is 00:23:05 is that the ads on Instagram are even more relevant than than Google. Often is that is the case. Here, take a look at this. Can you see my screen? Here's the query I did based on our little back and forth here. Will LLMs, will the chat

Starting point is 00:23:21 interface be accommodating to advertising. Well, I put in here, you're the CEO of Disney Parks, pitch me on why I should take my seven-year-olds to one of your parks. And so imagine this got appended to my previous search, which, hey, what should I do with my seven-year-olds? In a city or outdoors. And I says, oh, thank you for considering one of our parks

Starting point is 00:23:41 for seven-year-olds. Here are reasons we believe you'll have an unforgettable experience. Number one, a place where everyone is welcome. Two, more value and flexibility. Three, disability access service. That's kind of weird. Number four, new attractions and experiences. That's really good. Memorable music, that's good too, actually. Park reservation system, that's great.

Starting point is 00:23:59 We hope you'll consider this. And then here it could have bookings and would you like to talk to an agent? Do you have further questions? And you could just hijack somebody's chat stream for your own purposes. They could be thinking they want to go to Europe for the summer, and then you could sell them on going on a European Disney cruise or something. And I think that this kind of style of advertising where a company CEO

Starting point is 00:24:25 starts a discussion with you in chat GPT and in a chat interface is going to be magical. Yeah. And like you said, you know, I can give you an answer that's sort of neutral and unbiased and it's not targeted at you. And I can also say, by the way, in case you were actually looking for something

Starting point is 00:24:48 very much to you. And if you already share that information with us, fully transparent and you're in control, we're not going to do it in a creepy way like Facebook. Then we should be able to give you the answer. We should be able to help the advertiser sell to you even better, right? So I think basically I'm going even more abstract first principle in thinking that it's not clear how you do it in the product

Starting point is 00:25:15 and how you build a business model, but at an abstract level, the point of advertising is to reach the right person to sell to, and this can help you do that even better than the current system. So therefore, you should be able to figure out something at a level below this. If you're a SaaS or services company that stores customer data in the cloud, then you need to be, uh, SAC2 compliant. You knew that from a third party,

Starting point is 00:25:41 and you need that third party to close big deals. And if you want to get compliant, easier and faster you need to use. V-A-N-T-A. V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-A-V-T-A-V-T-C-A-V-T-C-T-C-T-Compin in just two to four weeks. Prepare that to three to five months without Vanta. And Vanta can save you hundreds of hours of manual work and up to 85% of compliance costs. This is a total no-brainer. And Vanta does more than just SOC2 compliance. They also automate up to 90% compliance for GDPR, PIPA, and more. You can't afford to lose out our major customers.

Starting point is 00:26:17 We all know that. Listen, it's a hard year. Last year was hard. You can't lose those major customers because you don't have your compliance dialed in. Just work with Vanta. Get your compliance automated and tight and tight is right. Lock down those big deals. Here's the best part.

Starting point is 00:26:29 Vanta's going to give you $1,000 off. That's $1,000 off at vanta.com slash twist. That's vanta.com. Slice for $1,000 off your sock, too. So you've raised some money and you're currently trying to grow the company. Tell me a little bit about what it's like to try to compete in this area for talent. People are raising, you've raised a lot of money, but people have raised even more. And there's a massive talent battle going on right now.

Starting point is 00:27:01 Is it better to just hire great developers and have them learn? Because you're not building the fundamental model. You're building something on top of it. What's your strategy for talent here? Yeah. So we don't waste time trying to hire people that, say, on when they'll be hiring anyway. It's very,

Starting point is 00:27:20 it's, you cannot compete. They are way more cash, way more, like, and they can give way less percent of the company because they have a way bigger valuation. So what we do is go for these people who are still trying to get into AI, very talented engineers who haven't done AI before.

Starting point is 00:27:38 And want to be part of an amazing product that's growing and they want to feel the dopamine from shipping every week and want to see their stuff actually being. put up. And there is quite a lot of people, there are quite a lot of people who are like that, like who haven't done AI before, very talented generalist programmers. That's another thing that I look for, which is, are they generalists?

Starting point is 00:27:58 Can they do back and can they do front and can they strategize for the product? Can they do prompt engineering? Because all these are new skills. Like prompt engineering is not like a, you know, you don't need to, there's, but you cannot ask ask for years of experience there. It's like a few month old skill.

Starting point is 00:28:14 So you just need to be somebody who's like pretty logical and like pretty good at like getting things done. You were at Open AI for a while. I was at Open AI, yeah. Yeah, how long were you there and what did you work on? So I yeah, I worked on like diffusion models and like conversational models for like chat bot. Like not exactly like chat tributy, but more like trying to get another modality into like conversations. So that's kind of what I was focusing on.

Starting point is 00:28:42 But the reason I started this company was because ever since I came to U.S. for grad school in Berkeley. I was always interested in starting a company. And I was trying to look for people who were like me before, who were like PhD students who started a company. And I could only find one example from the past that I really resonated with was Larry and Sergey. So Larry is my entrepreneurial hero.

Starting point is 00:29:07 Like he's the only reason I kind of wanted to do a company. And in fact, in a book he's written like he'd either do a professor or he would do a company. he would never work for anybody else. I had more constraints in my, like, you know, immigration and other stuff like that to have to, like, sort of work for a bit, get some money and, like, learn more skills. But that was sort of always there. And it's not planned, but it's just more like a coincidence, happy coincidence that I'm working on search too. But, yeah, being at OpenEI, it was really helpful. Back then, there was no chat GPT, so I didn't foresee the future where Open AI is so successful.

Starting point is 00:29:44 but there was GPD 3.5 and was pretty good and like, you know, we knew like a lot of things were happening. Nobody knew that, like, if you put out these models in the chat UI, the world would go crazy. That was very unknown. So the fact that people are so used to the modality of chat, because they live in it all day long, this was the breakout moment for AI.

Starting point is 00:30:10 Because AI stuff had existed. People were using it in the backends to serve you up your for you page on TikTok or filling your search query or giving you a couple of words ahead in Gmail and finish your sentence. All that stuff, it was happening. Yeah. But it needed the interface to make it work. Yeah.

Starting point is 00:30:28 Fascinating. The generality of the models was also amazing, but it was all, if you remember, opening, I had a playground where you could go and enter a prompt. And in green text, you would see the completions. But nobody cared about the average person in the world did not care about it. And then you put it. into a chat UI and then the world goes crazy, right? Makes you wonder if there's another thing that you could do that make the world go even

Starting point is 00:30:51 crazier. And I got to think, and I'm interested in your thoughts on this, were Siri and Alexa, just far too early. They had the ability to understand what you were saying. Yeah. They just didn't have the ability to give you the right answer or any answer. Exactly. I mean, you could barely call, you know, you'd be like, oh, okay, call my mom.

Starting point is 00:31:11 And it would be like, calling Mother Teresa. saying you're like, no, no, no, no, no. It's not what I want. And just even getting it to play the right song took three tries. Now with chat GPT and all these language models and Bard and Poe and what you're doing at perplexity, it feels like talking to the computer would work. And I don't know why this doesn't exist yet. It's going to happen. It's going to happen.

Starting point is 00:31:35 Yeah, like, if I had perplexity as running in the background on my phone in my ear pieces, and I could just whisper to it and say, hey, hey, perplexity. what are some Greek restaurants near me that have a lamb and that are over four stars and it just gave me the answer back and started talking to me and I could take out my phone that would be so magical

Starting point is 00:31:56 and just using the language models as your interface but using voice and having to talk back to you would be incredible. Yeah. Still doesn't exist. In five years I think

Starting point is 00:32:07 what's going to happen is we'll talk. We'll all wear glasses, we'll talk and then we'll see the answer render in our glasses and then or it can speak back to us and we we can listen via the glasses or whatever okay why doesn't it exist today like as we speak uh i think you you can stitch together a demo with a speech recognition model and lLM and then a text speech model right yeah um the latency wouldn't be enjoyable like um that it's mostly on the lLM side not not even on the speed side uh you can make these

Starting point is 00:32:42 ASR and DTS work pretty fast. But if you had to wait for two to three seconds, it's a bit like talking to a socially awkward person. Like they would be like staring at you for like two seconds and then giving you back the answer, right? Yeah, yeah. So that's the experience you would get. It might not be very enjoyable, like how you and I are talking right now.

Starting point is 00:33:02 I think for that you need even smaller or even faster LLMs. And... Ah, so it's not... It wouldn't have the response time that people would find not annoying. It would quickly become annoying to have it giving those pauses.

Starting point is 00:33:17 Yeah, I find it quite charming now when my chat GPT interface takes a second or Bard is kind of skipping around and it stutters and then it plays and I'm like, wait a second. And then Bard now just gives you the answer straight away. Boom. It doesn't do the typing.

Starting point is 00:33:32 But I've got, I think the OpenAI Apple app, iOS app, has like kind of haptics in it where it's like typing. I think it's kind of a gimmick, right? It's not. We chose not to do it. But there is this thing where you stream the output tokens, token by token.

Starting point is 00:33:49 The reason we did that is because you perceive the latency as lower. Like, if I waited in my backend to generate the full answer and then displayed it like in the bard style, you might just be like, oh, what the hell? Like, I don't want to wait, you know? And then you just, I just bombard you with a huge paragraph. It may not be as fun as like anticipating, like, you're, like, you're. reading along with the model generating the tokens, that's a different kind of UX. I like opening eyes choice here,

Starting point is 00:34:17 but we didn't do the haptic thing because I found it pretty annoying to use when we were beta testing it and so did the others in our company. So that said, you know, like, here's the thing with TTS. Like, you have to generate the full answer before feeding it into the Texas speed system. If it's just going to read it word by word

Starting point is 00:34:36 as the LLMD goes the answer, it's not going to get the tone of the sentence completely about the same thing. right. So if there's an exclamation mark at the end you started reading the sentence. Yeah, that's a fair point.

Starting point is 00:34:46 It's not going to know that. I'm curious, you also were a researcher at Google in Deep Mind. Before going to Open AI and before launching your own. A lot of these language models were based on

Starting point is 00:34:57 seminal papers on tensors and whatever. And a lot of the code base was open source or open source is. I guess in Facebook's it was leaked in the case of opening eye the original models were open source.

Starting point is 00:35:13 How much overlap is there in the fundamental technology at this point and how much is different? If we were to take, you know, the top five language models, how much shared DNA do they actually have? How different

Starting point is 00:35:28 are they at their cores? So, everything is a transformer, which is the architecture built by Google in 2017. and everything is generatively pre-trained with language models. So all of that is the same. The difference comes to what data is being trained on,

Starting point is 00:35:48 where Open AI puts in a lot of effort there compared to other organizations. The reason methods, Lama models were actually really good, despite not being as-because Open AI's models, is because the research is there put a lot of effort into curating the right data. Ah. Well, explain what? what that means to lay people here who are wondering what you're talking about. Yeah.

Starting point is 00:36:13 So how these intelligent language models are built is you have this giant neural network and you download a lot of data from the internet, terabytes of data. And you make these neural networks predict the next word given the previous words. You basically train them to be great auto-complete machines. And by virtue of doing that, they become really good at reasoning and things like that. Now, that doesn't mean that if you just keep scrolling the web and scraping every page and then creating the dataset, you're going to keep getting smarter and smarter. In fact, you get smarter by not training on junk and actually training on good quality data. And now, like, it also turns out that if you train a lot on coding, like GitHub and other datasets, you develop these reasoning capabilities to an even higher level.

Starting point is 00:37:07 than not training on coding. It's kind of like thinking about, like let's see you have a kid, you send the kid to coding or math competitions, even if they may not become the, you know, the I am a medalist, they might end up being great analytical

Starting point is 00:37:22 and logical thinkers in their life and that might help them in their life. So that's sort of what happens with these LMs. And so if you pay a lot of attention to what data they are trained on, that helps you a lot in terms of what you can achieve with them later. So the base core IQ of these models will be much higher if you put a lot more effort into like curating the training data more carefully. And Open AI was ahead of everybody else there.

Starting point is 00:37:48 Google has all the data in the world, but they didn't pay enough attention to this. And now like people have caught up, they've understood, you know, this is where they need to pay attention on. As for like who's really ahead right now, I think it's Open AI like with GPD4. Yeah, much far ahead. Who can likely catch up? There's one more organization called Anthropic. Sure. And they are the closest number two.

Starting point is 00:38:16 And both these organizations were more or less the same people. Like the people who trained GPD3 were the guys who had then started Anthropic later. Are you still using your personal phone number at work at your startup in 2023? Stop! Such a common mistake founders make. But open phone has totally rethought every detail of what a business phone should look like in 2023. Open phone makes it so easy to do this and so affordable that you have no excuse. And you really don't want your team using their personal phones for business. Why?

Starting point is 00:38:48 Well, it could get creepy. People start texting people on your team. It could be that they leave your company and the salesperson has all of these text threads going with all your clients and they bring them to your competitor. Do you want to deal with this nonsense? You don't. I can tell you open phone is amazing because we use it. Our sales team, our ops teams. We use it daily. We also started using open phone for Angel Summit communications. It's rated number one on G2 for customer satisfaction. And let me tell you, those G2 rankings, those are dogged battles. If you win that, you really have to be the best.

Starting point is 00:39:19 Twist listeners love open phone. My sales team uses it. Ops team uses it. Customer support uses it. And you know what's great about it? You can create a shared phone number like we did for the Angel Summit. With multiple employees being able to field those calls and text and keep it all sorted. It's affordable at just $13 per user per month.

Starting point is 00:39:37 But Twist users are going to get 20% off that already ridiculously affordable price for six months at openphone.com slash twist. And if you got an existing number, Openphone will port it over at no extra cost. Head to openphone.com slash twist to start your free trial and get 20% off. So when you look at the open source community, they seem to be really moving fast now. Correct. Meta's Lama models were leaked, maybe, or maybe leaked on purpose. Yeah. You think that you think that story is true, that it was leaked on purpose to jumpstart the open source community?

Starting point is 00:40:14 I wouldn't be surprised, but, you know. Yeah. It was accidentally leaked. Accidentally on purpose. There's some parallels with the COVID leaks there. I don't know. Yeah. It was, yeah, it was an accidental leak, but they might have leaked it because, yeah.

Starting point is 00:40:31 But this was actually good. Like, it was good for the world. That this anthropic got leaked. I'm sorry. that Lama got leaked? Yeah, Lama leaking was actually really good for the world. You know, I think, I think,

Starting point is 00:40:41 I think it gave more power to the rest of the world in terms of what they can do with LLM's outside of Open AI or Google or Anthropic. So that's my question. These open source models, you've got a lot of people working on them.

Starting point is 00:40:55 Yeah. And a lot of people are not happy with how closed open AI has become. Even I've started referring to it as closed AI. So if they're super closed, and Open tends to win, if we're sitting here in five years, who do you think wins?

Starting point is 00:41:14 Open source or, you know, Google and OpenAI with closed models? Who do you think is going to win? Yeah, it's, if you pattern match, Open tends to win, that's kind of correct. But there's like a catch here, which is the next big wins

Starting point is 00:41:34 are not necessarily going to come from whoever is going to continue. continue to train more. You need some algorithmic efficiencies to make use of compute even better. And you need really good researchers for that. And the best researchers are sort of like NBA players and like they're taken by these organizations who pay them millions of dollars a year. And then if these guys who are building the tricks for making these models even better

Starting point is 00:42:00 are in the closed organizations, then they'll always stay ahead of the open. Right. So then, and if these organizations stop publishing these techniques, and these guys to stay in these organizations are paid to stay there forever, it's kind of like closing the walls. So the only way in which the open source world can catch up is, like, there are amazing researchers who kind of work in organizations that are actively open source models.

Starting point is 00:42:29 And I think right now there's only one big org that wants to do that, which is meta. And so as long as meta is in the game, I think there's a chance for open source to sort of stay there and like, you know, win in the long run. Every other organization doesn't want to publish anymore. That's a problem.

Starting point is 00:42:51 Nobody publishing. Except for meta. Except for meta. And I guess that Google feels like they made a mistake publishing all this stuff and giving a to some moment. I'm sure they do. Like, they missed out on the whole revolution.

Starting point is 00:43:07 It's fascinating. And I didn't ask you about the paid version. If I, if I choose to pay, what do I get? So there is this thing called co-pilot. That's more like an interactive search companion. Ah. That it does the equivalent of hundreds of search queries for you, not just one. So you can ask it really complex queries, like,

Starting point is 00:43:27 go pull me all of Jason's investments and all startups such is done. And like, you know, at what valuation he's done. like prepare a table for me and get it back to me. If the information is there, put in public, for example, I could only find the valuation you invested in Uber,

Starting point is 00:43:41 but not on Robin Hood. Ah. So then it'll come back to me and give me that information. Or you can say, like, give me the year-by-year revenue of AWS ever since its inception.

Starting point is 00:43:51 I want to track it and growth percentage year over year, and it's going to come back to you with information. So it's almost like you're having a researcher at your disposal. Oh, wow. That's wild.

Starting point is 00:44:00 And when you say it's a co-pilot, is it something that lives in my system? try and Mac or Windows or at Chrome? No, it's on the browser. The co-pilot is just meant to be like a companion, like the word, the user of the word is just a companion for search. And it's going to help you plan, travel, buy products, prepare meal plans according to your preferences.

Starting point is 00:44:21 And if you integrate your AI profile with it, it's going to give you, like, much more detail, recommendations, travel itineries, web research. Like, I wanted to know a lot about, like, when did read off and start making money in LinkedIn? like you know, they took a while to like start making revenue. What was the hypothesis in lit scaling? All these kind of things that you're like not coding as well.

Starting point is 00:44:42 Like write me a piece of code for pulling up all Elon Musk tweets to where he tagged Jeff Vezos in it. And like you get the Twitter API, we do code. You can copy paste that and like go and execute it. So it's, it can read documentation pages. So that way it's more factual than what code you get from chat chagip. So all these kind of things, it's very powerful. So what we offer in this.

Starting point is 00:45:03 the paid version is unlimited usage of that. Not fully, like technically unlimited. It's more like 300 queries a day, which is practically unlimited for most people. And then everything else is free. So the way we're thinking about it is the free version grows enough that we can do advertising there.

Starting point is 00:45:20 And the paid version is for power users who want to like use it for work or very complex queries that they seek. But the free users get like 25 queries a day even on the co-pilot version. So you don't have to pay if you don't want to. We just want regular daily users

Starting point is 00:45:35 to stop using Google and use our product. I will be one of them. I'm just signing up for the paid version as we wrap up the episode here. You're hiring, so where can people learn more

Starting point is 00:45:46 about what you're hiring for? We're hiring for iOS and Android, mainly right now. So iOS engineers, if you want to come and help build our mobile experiences, please join us. That's the most important time here.

Starting point is 00:45:59 And I think you can go to perplexity. slash about and you'll learn more. All right, we'll see you all next time on this week and start. Bye bye. On behalf of the producers and the partnership team, thank you for listening to episode 1770. We'd like to take one more time to thank our partners,

Starting point is 00:46:18 Crowbotics. Get a free scoping session for your next big app idea at crowbotics.com slash twist. Vanta. Get a thousand dollars off your sock too at vanta.com slash twist. and Openphone. Get 20% off your first six months at openphone.com slash twist.

Starting point is 00:46:41 If you're looking to become a partner of this week in startups, you can email Hanna at Hannaatlaunch.co. That's Hanna atlaunch.com. Thanks for listening.

This Week in Startups - Creating the future of search and competing vs Google with Perplexity AI’s Aravind Srinivas | E1770

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.