a16z Podcast - Building Search for AI Agents with Exa CEO Will Bryk

Starting point is 00:00:00 Search is the gateway to the world's information. If you can make it perfect, then that has so many downstream positive implications for the world. You can kind of think of Google as being synonymous with search, right? It's one of the greatest technoplies of the last few decades. If you want to go really deep into some topic, Google fails. Most people want to understand the world. But they're getting fed information that's just like, you know, misleading in some way or straight up wrong. And if everyone had like information that was accurate, most reasonable people would be reasonable.

Starting point is 00:00:25 We have a family open club, Michael Claudeberg, and we wanted to give it web access. he was like, I recommend EXA. The world of agents' searching is just completely different from human searching. An agent doesn't just want 10 pieces of information. It wants everything. The XA, like, you could search something and then get not just like 10 results or 100 results, but 1,000 results or 10,000. How have we a team that, you know, has always been below 100 people,

Starting point is 00:00:45 have been able to build a search engine that's better than Google in all sorts of ways? Well, it's because for most of the Internet era, search was built for humans. But AI agents search differently. They need deeper context, more complete information, and the ability to navigate far more complex questions than a traditional search box was designed to answer. That shift is creating an entirely new set of challenges

Starting point is 00:01:08 around retrieval, knowledge discovery, and how information is organized online. Sarah Wang speaks with Exa co-founder and CEO Will Brick about search, AI agents, and the future of information retrieval. Welcome, Will, thank you for being here. Well, excited to be here. So I want to start with the origin. story, you've been interested in search for a long time. In fact, you and your co-founder, Jeff,

Starting point is 00:01:36 actually started building a mini search engine in college, which is not what I was doing in college. Can you say more about when you started getting interested in search and why you wanted to solve this problem? Yeah, yeah, sure. So I would say it's a life mission. So since I was a kid, I've cared about finding the highest quality knowledge, right? I was obsessed. And then in high school, I wanted to start a new type of news organization because I thought we're a civilization that got to the moon. We split the atom, and yet we can't understand what's going on at the border or in science news. Like, why can't we fully understand any topic? And then in college, I was roommates with Jeff,

Starting point is 00:02:06 and we were like, we could just build a better search using crowdsourcing the highest quality links. And we did build a pretty solid search. But then five years ago, so in 2021, that's when Transformer started to get really good. And it suddenly became possible to build a better search than Google. And that was a really important opportunity because search is the gateway to the world's information.

Starting point is 00:02:24 If you could improve search, if you can make it perfect, then that has so many downstream positive implications for the world across every industry, across every part of human life. So it just felt like this huge opportunity and no one was pursuing. And I was like, I'm going to devote my life to this because everything I care about is about information. And so, yeah, start XA and now it's gone. We actually made a lot of progress.

Starting point is 00:02:45 And we're a lot closer to that mission. There's still a huge amount of things, a huge amount to go. But yeah, it's been crazy to see how far we've come to achieving that mission that I've been thinking about for years. Maybe just to probe a little bit deeper, you can kind of think of Google as being synonymous with search, right? It's one of the greatest technoplies of the last few decades. And the idea that a startup could be better than Google at search is quite amazing. But how do you define perfect search?

Starting point is 00:03:11 And what do you see as the limitations of Google? And I'll throw in more recent events because obviously, you know, I.O. just took place last week. And they're very focused on AI mode and how they talk about the idea of information. agents and things like that. How do you think about beating old Google, if you will? And then there's, of course, new Google that's evolving. Yeah, I mean, Google was amazing and is amazing for what it's meant to do, which is like get quick answers to consumers. And now it's like increasingly longer answers. But really it's focusing on like, what do most of the billions of people in the world search for care about and like making sure they're really happy. And they do a great job of that. That's

Starting point is 00:03:47 what they're optimized for. That's why they're optimized for human clicks. It's like, you're really tired. You type in a few keywords that make no sense. And Google just magically understands what you're saying, that's magical because it has like billions of other people searching similar things. I'm excited by Google too, but like there are certain times when you want something deeper. And I was actually before starting XA writing a history book, I just got obsessed with history and I wanted to get to the bottom. What did it feel like to live in every period in history going back 5,000 years? I don't know if you ever talked about it.

Starting point is 00:04:11 I want to read this book. It's probably on hold right now. I guess it'd be really good. I would have finished it around now. But at some point I was like, okay, maybe I can build a search engine and then I could automate the building of writing of books. And I feel like that turned out to be true. But anyway, in writing that book, it became extremely obvious that if you want to go really deep into some topic, Google fails.

Starting point is 00:04:29 Like Google is great at service level information, which is great for most of the billions of consumers. But if you want to really understand what it was like to live in the Roman Empire, you know, in like 100 AD, it's actually quite hard. And like that information is scattered. It's everywhere. But it's like you need really, really good search, like really deep search to understand it. And so that was like the first realization that, or one of the first realizations that way, like, what if you could have like true perfect understanding of any topic? And so, yeah, okay, so Google has been changing their search engine a little bit. I would say I've seen many Google IOs now at this point at Exa.

Starting point is 00:04:58 Every Google I'm like, okay, they say they're changing search. And they do, but they change it more to be valuable for the consumer-type use cases, which for me, it's like there are so many different use cases that go beyond that. There's like really deeply understanding the Roman Empire, but there's also like finding every competitor to your company. And right now, Google is just not going to do that. No matter how many changes they make, like you don't trust Google to find you literally every competitor of your company, whether it's in Europe or Asia. you don't use Google for recruiting, and this announcement doesn't change that. Like, you're not going to go, you say, hey, Google, I'm looking for machine learning engineers in San Francisco who have a background at startups because it's not built for that kind of thing.

Starting point is 00:05:31 So there is an opportunity to build like a new type of search engine that's meant for extremely like deep, complex queries that businesses really care about and agents really care about. Yeah, absolutely. And I mean, to the extent you talked about starting to build EGSA five years ago and then it feels like in the last five years, the world has completely changed, which is, actually, in my opinion, a great thing to happen to you and to Exa while you're building, because you can bring in this new technology versus trying to either fight it or have some sort of innovator's dilemma. Maybe share a little bit more on why you decided to build Exa from the ground up and what parts were the hardest or have been the hardest? And then how has it changed as the world of LLMs have changed? Building it from the ground up, basically we were like in 2021,

Starting point is 00:06:15 we could build a better searcher than Google. I don't care how long it takes. I guess we were young, high energy just ready to do anything, devote our lives to this. There was a thought experiment that really excited me, which is that, wait, I could totally build a better search than Google right now, and here's how I would do it. For every query, I would take all the trillion documents

Starting point is 00:06:29 on the web, and I would run GB3 over it. I would say, does this document match the query? Is this document match query? And then it would filter it down to the top 10 documents, and that would be better than Google. The problem is that would cost, like, you know, $10 billion per query. And so then it became an optimization problem.

Starting point is 00:06:43 But at least there was like an existence proof that's possible to build a better search on Google, and that was very inspiring for me. So then it was like, okay, like how do we optimize the hell out of that? And Transformers had gotten really good at the time, and Google wasn't really leading into it. And we just had this deep belief in the bitter lesson,

Starting point is 00:06:55 maybe more than Google, for search, which is that, like, if we could develop systems, like neural systems where you pour more data into it, it just gets better and better for the thing we're optimizing for, then you could actually just totally be better than Google. When we release this to the world in November 2022, like it was actually shocking. And Andre Carbethy retweeted it.

Starting point is 00:07:12 That's pretty popular on Twitter. It was like this new way to find information. It was the first time people were like, holy cow, it's possible. to find things beyond Google. And then, by the way, two weeks later, Chatsby Tea came out. So then people realized,

Starting point is 00:07:24 okay, there's another new way of finding information outside Google with LLMs. And that was very critical to us. So, like, we released our first search engine into the world in a year and a half after starting XA, November 22. Two weeks later, Chatsbyte came out.

Starting point is 00:07:35 Actually, to me, I was at NERPS and I saw the announcement. I played with it. I was like, this feels like J.B3, but a little bit better UI, and I went back looking at research favors. But to the world, I think it was the first time

Starting point is 00:07:44 they met this new creature, and it was just easy to use, which was a very big learning, which was like, okay, you make something easy to use. It's very important, obviously. Anyway, so then AI started really taking off, and then early 2023, people started asking us for API access to our search engine

Starting point is 00:07:57 that they had used based on that Twitter announcement in November 222. And that's when we realized, oh, wait, we could start serving this search engine to these, not quite agents because that wasn't the term of the time, just these AI products, these AI workflows. And they're going to want comprehensiveness. They're going to want to search in these more complex ways.

Starting point is 00:08:13 All the ways that we as nerds in 2025 one wanted to search, agents were very similar. So that was another interesting realization was like, I'm not an normal consumer. I want to get really deep to any topic, and so do agents. And so it's cool that we were building a search engine for ourselves.

Starting point is 00:08:26 It ended up being the exact same search engine for agents. They were very similar. This is a big paradigm shift. Even as investors, we're thinking, like, hey, it's not humans who are deciding the DevTil that wins. It's actually agents. Personal anecdote, I think I told you this one,

Starting point is 00:08:38 but we have a family open claw, Michael Claudeberg, and we wanted to give it web access, and he was like, I recommend EGSA. Before we invested, I was like, Sure. I'll go with whatever you recommend, right? And so it sounds like there's this nice dovetailing of how you were intending to build eggs in the first place to what agents want. But how do you think about what agents want, right? That's sort of the holy grail right now of, hey, I don't care

Starting point is 00:09:02 about database I'm using. My agent's going to select convex or super base. These are entire tailwinds that are making some of these companies. How do you optimize for that? Think about that. I've been thinking about this for a long time since there were the first agents, right? Like, we were the first search engine. Like, we were an early search engine, and the first AI products came to us because they were like, okay, they could be a search API.

Starting point is 00:09:22 And so I've been thinking about this for a long time. And yeah, I think the world of agents searching is just completely different from human searching. I guess you make the analogy of like agents to humans like humans to sloths. Like imagine we had a search engine. I'm picturing that Zootopia, D.C. It's like, imagine we had a great searcher for sloths

Starting point is 00:09:41 and then humans came around. They're not going to want to use that same searcher. And so you should think of agents has these, like, crazy creatures that have, like, infinite, like, time is meaningless for them. They just want to, like, make complex queries very fast and, like, analyze it really fast, and they want perfect output for their human users, right? So you want to build a searching for that. So how do you, what matters for that type of creature?

Starting point is 00:10:03 Well, lots of everything. So first of all, you need a search engine, like, handle complex queries, right? Like, you do not want that feature to have to simplify its complex need for its user into simple keyword phrases because you're just losing information. So you want somebody that could actually semantically handle complex queries, but also handle keywords because sometimes you just literally want, hey, like I have this complex chemical formula. Like I want that to be part of the document, right? So you want a tool that can handle both semantic queries,

Starting point is 00:10:28 keyword queries, really just like expose all the fundamental toggles to the agent. Because the agent, by the way, it has the patience to like, you know, make a domain filter here and a keyword filter there or like, searching this way, search in that way. So you want to like have a very controllable search engine. You know, like, with Google, like, you search something, and then you're like, no, way, that's not what I want. And then you try to change some keywords. And it's like, it's just missing it.

Starting point is 00:10:50 It's not like, doesn't feel like very controllable, toggleable. You want the opposite for an agent because the agent is just going to keep searching until it gets to its outcome. And you don't want it to have to make like a thousand, like 10,000 keyword queries and still never get to its comprehensive information. You want it to like make a few queries and get comprehensive information. Anyway, so complex queries, toggleable. Also, like, comprehensive results. So this is a thing where it's like, you don't, an agent doesn't just want 10 results. or 10 pieces of information, it wants everything.

Starting point is 00:11:15 Because imagine you're an investor. You don't have to imagine that. If you're an investor and you're looking at biotech companies, you want complete information because you're making very important monetary decisions and you don't want to miss anything. You don't want to have a phomo. Like you're missing some critical startup that exists that actually reflects well or badly on the current one you're thinking about.

Starting point is 00:11:39 And so you want your agent to have complete information about every topic. So like with XA, like you could search something and then get not just like 10 results or 100 results, but a thousand results or 10,000 and increasingly agents are wanting this. You also want like lower latency because like agents search faster than humans. But at the same time you want higher latency

Starting point is 00:11:56 because certain applications don't care about latency at all. So I think another big thing with serving agents is like extreme customizability because like we're serving businesses, we're serving agents that are very different. Some want super low latency, somewhat super high latency or latency doesn't matter to them. And so it's just a whole,

Starting point is 00:12:12 It's hard to express how different. I have like a list of like 20 different ways, like humans and agents are different. And when you just build it from scratch for agents, you just make fundamentally different architectural decisions. So maybe just to go back to this point you made on model intelligence improving and how that's kind of changed the game in search as well. And I want to pose this thought to you that I'm sure you've heard before.

Starting point is 00:12:35 But given the fact that model intelligence is, you know, getting better, it sort of can almost make up for or do some of the heavy lifting in this user signal, right, that Google has collected over 20 years for page rank, et cetera. It can actually help get over that hump and do a pretty good job. And so my question to you, I guess, in that, is how do you think about this tradeoff of compute, latency, cost, right? There's all these tradeoffs that have to happen in terms of what you're actually using to, And to your point, you said it's like a big optimization exercise.

Starting point is 00:13:11 Like, how do you think about what to optimize? And I know you have different products, right? And so maybe your answer is like, well, depends on the product offering we're doing. But bring that into it as well. It's easier and harder to build a search engine for AI agents. Oh, interesting. Yeah, please, yes. I know that spark your interest.

Starting point is 00:13:27 Okay, so why is it easier? Well, first of all, like, this whole click, you know, Google has an insane amount of human click data. It just doesn't matter, like, that much for serving agents. I remember saying this, like, years ago, people thought that. was crazy. But it turns out to be right. Like human click data is great for humans, when you want to find results that humans click on, which is obvious, right? Like, so if you get a huge amount of human click data, you could train on that and now, you know, Google can understand what you mean even when you

Starting point is 00:13:53 don't even know what you mean. However, agents like just don't, they don't benefit that much from click. I mean, maybe a little bit in terms of like the ranking signal. It's also, it's valuable for agents to know what humans think is valuable. That's a very minor thing. So it's interesting that like all that click data that Google is accumulated, it just doesn't really match. matter for agents. And so it's a whole new ballgame. So there's not much of an advantage there. There's also other things like, yeah, like Google probably had, you know, hundreds of people working on re-ranking. Whereas, you know, because that was a complex thing before LLMs. But now with LMs, you could have a Reranker that you just call an LLM and like, you have one engineer work on it.

Starting point is 00:14:26 Now, obviously, we have more than one engineer working on rerankers at this point because you don't want to just call an LM. You want to train your own models and make them, you know, faster, higher quality. But you don't need a team of hundreds. You could do it with like a couple people. So like, like, how are, how have we a team that, you know, has always been below 100 people have been able to build a search engine that's better than Google in all sorts of ways? Well, it's because like LLMs unlock, the technology unlocks like new types of techniques. And then also like, because serving agents, like you don't need all the click data and like, yeah, data that Google has been collecting. So those are some ways why it's easier to build a

Starting point is 00:14:57 search engine and why we've been able to build a fantastic search engine with a small team. I think a small crack team. But it's also harder. Why is it harder? Well, it's because like the, the, requirements for a search engine now are getting more and more intense. And so, like, you know, I wake up at the morning and we have like, you know, customers love us, but we still have customers being like, wait, why can't you be perfect at this search? Or what about this search? And like, they're constantly pushing us towards the edge because we're starting business use cases

Starting point is 00:15:23 that have like deeper and deeper needs, like, you know, billion dollar investments around the line. Like, this has to be perfect. And that's great because it's pushing us towards perfect search. And so, like, basically, like, if traditional surgeons had like 99.9.9% quality or a liability. Like these these new search ends for agents need 99.99.99.99.99.99.99.99. And this is a great thing because it's just pushing us towards that dream of perfect search I've always been dreaming about. It's very similar to like LLMs. Like, you know, Opus 4.6 comes out.

Starting point is 00:15:50 Everyone's really happy. This is working out 99.9% of use cases. And then when the next one version comes out, everyone wants that thing, right? Because the extra nines of quality are so important in this new agentic economy. And so you have a similar thing for search. So it's both easier to build a search engine fast, but really hard to build a perfect search engine. Yeah. You know, it's interesting. When we were doing our diligence in the space, which, you know, has taken place over the last few years now, some people came to us with the opinion that search is just getting commoditized. And I think we look at that and, you know, if you go out there and search for information we know is out there public, et cetera,

Starting point is 00:16:24 you can't find the answers to everything still. So the fact that it's commoditized, you know, you'd have to kind of divide up, oh, what type of search is commoditized? So maybe I'll ask you that question. And then what is the type of search that's still really hard? and what is the key to unlocking that? Is it data partnerships? Is it a technical breakthrough? How do you think about the edges, as you mentioned,

Starting point is 00:16:47 in terms of perfecting or pushing forward the frontier of search? Yeah, I guess what does commoditize mean? It means like over time will the thing, will it just not matter which tool you use? Like, they're all kind of the same. I would argue that the LLMs are going to get commoditized or are getting commoditized faster than searches.

Starting point is 00:17:04 And the reason is because you don't need to run like mythos over every cell on your Excel sheet when you're trying to find competitors or something. Like there's a lot, like most of knowledge work does not require the smartest model. You act like, you know, like just an open source model that's big enough. You know, and now

Starting point is 00:17:20 the infrastructure for running them is very good. Like, it's pretty cheap. Like you can just reuse open source models for most of knowledge work. Not that the crazy smart LMs don't, they do have a super amount of value in terms of like inventing new science and math and and certain

Starting point is 00:17:36 cases you want to find any bugs in your code or something like that. But like increasingly, like I would say like if you think of knowledge work, like all the different tasks you might do as like concentric circles of difficulty, a huge amount of that service area is covered by like off-the-shelf models you get right now. Yeah, very fair. Yeah, right. So, but like on the other hand, like search, like when you are trying to, you know, enrich every cell in your Excel sheet with competitors or people you're trying to recruit, then like every extra nine of quality and search really matters. Basically, I'd argue that a lot of knowledge work is actually search problem, not only an intelligent problem. And so, yeah, I mean, what are some examples where search is not

Starting point is 00:18:16 good? I do think company and people search is the most, like, easy to see and just most value to people. Like, every company in the world has to search over companies to sell to, or almost every company in the world has to search for companies to sell to and people to hire. You could ask yourself if, like, finding companies to sell to or finding people to hire is a solved problem. I think every company would say, no, it's not. That's why people are constantly switching tools, like trying out new tools, is because we just don't have comprehensive information over all the people or companies we want. So that's a really good example.

Starting point is 00:18:49 That's something X is leaning very deeply into, like go to market intelligence. Because we care a lot about it. It's very exciting. It's also very useful to use internally. We have companies to sell to and we have people to hire. It's been great to dog to their own thing. It gives us some advantage. But yeah, that's an example.

Starting point is 00:19:02 I think you just want comprehensive. like you just want all the people that could be connected to you that are relevant. And by the way, it's a very beautiful thing. Yeah. You didn't ask this, but I think one beautiful thing about search is that a lot of important problems in the world are actually search problems, like, dressed up in a different way. Say more. What's the example? Okay, political polarization.

Starting point is 00:19:25 I would argue that's a search problem because there are people out there who want to, everyone wants to understand the world, or most people want to understand the world, but they're getting fed information that's just, like, you know, misleading in some way or straight up wrong. And if everyone had like information that was accurate and like controllable and like comprehensive, I think most reasonable people would be reasonable. And I think because our information environment is so chaotic, it's so polarized, it's causing reasonable people to be unreasonable. By way, I'm part of it's like I'm sure I have incorrect beliefs on all sorts of political things because my information is not perfect. It's something I really want to solve. Loneliness is a search problem. We,

Starting point is 00:20:04 No one thinks it's lonely. Same more, yeah. A lot of people are feeling lonely in modern society. Well, it's because they're not finding, not finding people to hang out with or to be in a relationship with, right? And so, like, yeah, lonely is a search problem, and especially in a city life,

Starting point is 00:20:18 like it's hard to find other people with similar interests or you might bond with. And this is like, you know, with a perfect search engine, with whatever information people are willing to share, it would help to find other people. So, for example, I have a lot of crazy ideas about flying cars.

Starting point is 00:20:31 I really want flying cars. I hate cars on the road. I would love to go to a group of people talking about flying cars. I'm sure I'm great friends of those people. I can't just be like, find me all the flying car enthusiasts in San Francisco. I would love to at some people to do that. Okay. And this gets in your earlier part of your question is like, what allows for this differentiation?

Starting point is 00:20:51 So it's a long answer to how to search not a commodity. You start to see that search is like a bigger thing that people think of search as like in 1990s. You see a text box. You type in a few keywords. You get a few things. Like that was search. No. Like search is way broader.

Starting point is 00:21:05 Search is coordinating the human species around anything we're trying to do. How does search become less and less commodity? Well, it's always about really good retrieval and really good data. So if you do perfectly on both those things, you have all the data and you have all the best possible retrieval, that is perfect search. And so it is like both accumulating all the web's data, you know, accumulating data that's not on the web, and then training extremely powerful models to search over it. Those have always been the two, like index and retrieval have always been the two. to pushes at Exa on the engineering side since five years ago. Can you say more about the data element?

Starting point is 00:21:40 And, you know, obviously, no need to share any secret sauce there. But it does feel like the web is getting increasingly closed, to some extent, parts of the web, right? But there's a lot of fear out there from data providers on, oh, we don't want to be stack overflowed, right? Especially if your business model is around impressions you serve to humans visiting your site. I think the fear is the greatest among those businesses. business models. How do you think about just sort of that interplay with data providers and making sure you can get to the path of perfect search, but work with these data providers that are maybe becoming more closed? Yeah. So I want to get to the ideal world where you have perfect

Starting point is 00:22:18 search and then everyone's intensivized to create amazing content, even more so than before. I actually think there's an opportunity here and I'll talk about how to create a system where like content providers are making more revenue because they are participating in this massive agented economy. Yeah. Right? So, like, I think from first principles, you know,

Starting point is 00:22:35 the first principle's idea here is that the agendasic economy is going to be huge. Like, basically, like, we're all going to have agents. So those agents are going to be participating. You can imagine it's like agented economy and cyberspace. We're basically they're doing like commerce. They're like reading information. It's like everything we do on the internet,

Starting point is 00:22:49 they're doing, but like a thousand times bigger, right? So there's a massive amount of value in this agentic economy, meaning money. And so if there's so much value, like, Instead of, like, you know, hundreds of billions of dollars going into one company, what if, you know, $50 billion a year went to one company and the other $150 billion went to all the content providers, right?

Starting point is 00:23:09 Like, there's ways to distribute the value in this new agenda economy that are more favorable towards providers of content. And, like, this is kind of how, and by the way, it won't be $200, but it'll be a trillion dollars a year of value. So there's, if we could figure this out well, like, there's an opportunity for everyone to just, like, do amazingly. I loved all the use cases you talked about. There's go to market, right?

Starting point is 00:23:32 There's also the, you know, I'll put it in the loneliness bucket, but finding people you can connect with. One that you didn't talk about that I just want to pause on really quickly is coding. Can you share more about why WebSearch makes coding agents more powerful for this use case in particular and why EGSA is such a perfect fit for the coding use case? Yeah, for sure. So every agent at some point, agents are like humans. Like at any point when you're coding,

Starting point is 00:23:59 or when humans used to code, you would have to look up information, right? Because you want, like, the most recent technical documentation or you might look at a blog for inspiration. And so agents are very similar. In particular, they really want the freshest information so that every line of code they write does not have some critical error.

Starting point is 00:24:15 And, you know, especially with coding agents, like the stakes are so high that, again, every extra nine of quality matters. So these coding agents are very intelligent right now. But in terms of retrieval quality, they've been in the dark ages or the dark ages of like the early 2000s in terms of search quality,

Starting point is 00:24:29 but like search can be way better over coding it. You know, we're talking technical documentation, we're talking SDKs, just like perfect, perfect retrieval over the sinks. That was our goal. And so we're not yet perfect, but we're extremely good at search over any sort of coding material. And so, yeah, so, you know, when, you know,

Starting point is 00:24:47 a bunch of coding agents try us, like cognition, for example, we've talked about, tried us, like we powered Devin now, and they've just found when they tested it that it just makes Devin way better, way more accurate, make way fewer mistakes, which really matters. And this will just continue. By the way, you can also think about coding agents as just like, like every agent is going to want to do everything.

Starting point is 00:25:05 And so it's not just searching over code. It's also at some point searching over the world's information, just being up to date with the news because like if the coding agent will become your agent. It's like I think coding agents and just general agents are going to like merge. Yeah. And it's an interesting trend. But yeah. Right.

Starting point is 00:25:19 No, I mean, it's clearly codex is not for developers, right? It's sort of like the Everything app to that point. I want to bring in mostly because it's in the, it's all over Twitter right now. And I think it might be an interesting tied eggs. And that's sort of this topic of token maxing, tokenomics, just the fact that, you know, Uber is talking about how they're spending too much. Service now hit their budget for the year already. You know, I think Microsoft talked about pulling their cloud code licenses. Tokenpocalypse.

Starting point is 00:25:52 Tokenpocalypse. Yeah, okay, there we go. Yeah, exactly. A better word for it. But we've talked about this before, but in terms of how you think about search actually making token consumption more effective and efficient, can you share more perspective on that and to the extent you can, like, what are results that you're seeing on that front? Yeah, sure. So, like, retrieval can help solve the token apocalypse because, like, we should not be using gigantic models for every test. We should be using, like, and people are starting to, realize this. Like you should use a family of models of different sizes. The big model decides what to do and it dishes out of commands. So the small models. And those small models can be way more accurate and reliable if they're using retrieval. So retrieval helps small models act like big models. It's kind of in a cheap way. And so we do save our customers a huge amount of tokens because they can use smaller models and use retrieval. They could also, we have all, we care a lot about this. So we have like, we've put a lot of research effort into how to, you know, extract only the most

Starting point is 00:26:52 relevant information from documents so that these models can just, like, not have to consume too much tokens because, like, a lot of, you know, any sort of input tokens can dramatically increase spend. So we could, like, we could save, like, 20x on cost for customers compared to other providers by, like, being very efficient in, like, what information does the, from the web, does the agent actually see? But yeah, in general, smaller models using retrieval is much more efficient. And, like, Andre Carpathy had a tweet about, I keep mentioning Andre Carverthley on Twitter. I know, we all do. Yeah, he's great.

Starting point is 00:27:21 he had a tweet about this a couple, I think a couple years ago where it was like the trend is towards like smaller raw intelligence modules using tools. And that trend will, like that's an important trend. Because like you have,

Starting point is 00:27:34 you know, you have a limited, like the cost of the model is determined by the number of weights. That, you know, determines the cost of inference. And, and if those weights are,

Starting point is 00:27:43 if you're wasting those weights on like all sorts of information about the world, like the capital of France or, you know, this random blog that you read, like you're just wasting tokens. Sorry,

Starting point is 00:27:51 you're wasting a weights. Those weights should be focused only on, like, intelligent processing. And you could probably get to models that are like one billion, even less than a billion parameters that are extremely hyper-intelligent and completely unknowledgeable. It's like Einstein, like, who never saw the world. That's kind of like the way to think about it. And then it uses tools that are very cheap and efficient.

Starting point is 00:28:12 And that's a much more efficient world that will help solve this, like, this compute shortage that is affecting everybody. Yeah. Do you think this is more of like a hot take moment, but do you think that reality will start to take place second half of 2026? Because right now we're in this phase where everyone's playing with the biggest best model that just arrived. And to your point, probably overspending. So when do you think this reality kind of sets in? I mean, the reality is definitely starting.

Starting point is 00:28:37 I don't have the exact. It's like trends are everywhere. Yeah, yeah. When does it become noticeable? Yeah, I would say by end of 2026, it's very noticeable. Wow. Okay. Kind of a hot take, actually.

Starting point is 00:28:47 So you talked about doing just the research that you're doing. And I think EGSA, you sort of famously structured as more of a research lab, honestly, than what people are calling application layer companies, infrastructure companies, right? You know, this is sort of a research lab focused on search. And coming out of that, one of the things that we were most excited about, frankly, is just the exciting cutting-edge work that you guys are doing. One of those things was actually search as it pertains to RL, and there's a lot of efficiencies there, et cetera.

Starting point is 00:29:21 I wanted to just flag that because it was an interesting finding. I think you used Tinker, so shout out to thinking machines. But say more about what you guys are finding there and also just sort of what research directions you guys think about as being important. Yeah, at a high level, like a lot of the big ideas in training LLMs apply equally well to trading the search models. So, for example, like, we do pre-trading of embedding models. We do post-training of embedding models.

Starting point is 00:29:46 We do RL on, like, search tools, right? So, like, a lot of these things that are working in. LMs work in retrieval, too, which is kind of interesting. And you don't hear a lot of people talking about. So, yeah, in that RL blog post, we just basically try. Like, a lot of people RL on a search tool, but we haven't seen as much many studies, like, testing different search tools that you are on. Right.

Starting point is 00:30:04 And so we simply RLed on SER, so like, Google Rapping versus XA, and found that, you know, RLing on XA does way better. better. Like it both like uses fewer calls, so it's more efficient, and then it's like higher performance. And this makes sense because again, like Exa was designed for agents to use. And so like it just it just allows agents to make more complex queries. Like it's really like capture what they actually want as opposed to having like to compress what they want into like shorter phrases that are more for traditional search engines. So that that was a cool blockpost to explore and think it was really helpful for that. In general, our research is like the bitter lesson. So just like

Starting point is 00:30:40 scaling laws in lots of different directions, some of the ones I mentioned. Post-training, pre-training, RL. We've been pretty under the radar. I don't think people don't realize how much research we're doing. We don't publish all of it, obviously. We don't publish much of it.

Starting point is 00:30:54 But there is a lot to go in search, and I don't think people realize that. And I think we realize it because we've just been obsessed with it, one, crazy enough. Like, that's just what required. But then also, I think the biggest thing is actually because of our business model and who we serve.

Starting point is 00:31:10 We've just been pulled. pushed in all these crazy directions. Because we're not serving two billion consumers who are kind of all the same, like two billion humans. We're serving, you know,

Starting point is 00:31:17 now, you know, over 5,000 businesses that are pushing us in all these crazy directions. Like, just like every day, it's like, why can't this be higher quality over companies or people?

Starting point is 00:31:27 Like, why can't this be faster? Like, why can't, why can't the information extraction be even higher quality? And so, like, just we're being pushed in all these crazy ways. And that's why we're exploring

Starting point is 00:31:38 all these research directions. Like, research always follows need. So we have insane amounts of needs at Exa to do better. And so that's why we do all this for crazy research. Yeah. No, that makes sense. So I guess kind of tied to this is I wanted to ask you about how you think about benchmarks and hill climbing. And I'll say this sort of tongue and cheek, but we've noticed that, especially maybe among folks in your space, but also in other spaces, right, there seems to be like benchmark maxing or whatever you want to call it. And, you know, of course, to no surprise, everyone is always at the top of their own benchmarks.

Starting point is 00:32:16 Yeah, yeah. That's how you know. It's not something's wrong. That's how you know. Exactly. So, and obviously you have this relentless pursuit of ground truth. And also, you know, self-improvement, right? I think, like, you're the first to admit, hey, here are the areas we could be better on.

Starting point is 00:32:32 We're trying to improve on a, like, continuous basis. But how do you internally think about what benchmarks matter? What is ground truth for you guys in terms of like, hey, we're actually better on this front, but worse on this front? Yeah, yeah. No, 100% the evals have been benchmaxed in retrieval. There aren't too many evals in retrievals. So that's one problem. There's aren't that many standard third-party retrieval evals, and they've been like benchmarks.

Starting point is 00:32:59 And they're not really actually good representations of agentic search, like what agents actually need. And so it is a problem in the industry where, like, customers can't really know what is true, which is sad. It also demonstrates the need for a really good search engine to distinguish what is true. It's just another example. But, yeah, I mean, the ground truth for customers is their own AB tests. And so when we do a, like, right, like literally they are testing us versus other providers on their use case. And if they have enough data to do AP test, that's the best. If they have, often they make their own e-vow.

Starting point is 00:33:31 So sophisticated customers will make their own evils. super-sophisticated customers will run A-B tests, and then, like, customers who just want something might just, like, look at evals that are published online. But certainly for the sophisticated customers, when they test us, it becomes a lot clearer who's, like, on the top. But, yeah, we want to improve this ecosystem. We want to, like, we want to be, like, the research lab

Starting point is 00:33:52 that, like, helps improve the ecosystem and, like, publishes things. And even if we're not at the top, we want to show. And so you'll see more coming out there. So you predicted that agenic search will be a bigger business than Google search by the 2030s. Say more about that. What trends are you seeing that we do believe this?

Starting point is 00:34:12 Yeah, I mean, just you could get this from basically, like, estimating the number of searches. So the number of LLM calls and the percentage of those LM calls that require search, then the cost of the search, and then you just like play it out. And the trend has actually been pretty clear. And we've have, you know, pitch decks from like years ago where we kind of predict where things we go, you know,

Starting point is 00:34:30 maybe we're off by a quarter here or there, but it's like pretty... you know, it follows a trend. And so, yeah, if you follow that trend, even conservatively, you get to a massive TAM for agentic search in the 20s. I mean, even before, like, late 2020s and then early 2030s. Basically, like, the number, it's hard to express, like, how many searches will come from agents, right?

Starting point is 00:34:52 Like, humans on average make, you know, a couple searches a day. But agents, when everyone has a personal assistant and every single software tool you use is going to be checking its work with retrieval. Totally. Like, the number of searches is going to, to be, we say thousands because that is like understandable and grokbold people, but really it's going to be millions.

Starting point is 00:35:08 Yeah. It's just going to be like the world to be filled with search in a way that the world is filled with electricity. It's like it's a fundamental infrastructure that powers everything. Like information, I think last stuff, search is like perfect information. It's a world be filled with like the highest quality information. And so yeah, I mean, there is a lot of, when the world is filled with something, it's usually a large tam.

Starting point is 00:35:28 And yeah, if you just play out the numbers, we think it will be bigger than Google ads in 2030. Not that ads won't be a huge part of the world, too. Like, ads are important for commerce, and that might be a percentage of the agenic search economy, too. But, yeah, it's just the numbers here are insane. And it really, it comes down to a belief, like, do you think LLMs will eat the world? We'll eat all software.

Starting point is 00:35:48 And, like, we have always believed that. Yeah. It seems like very true. It seems even more true every month. So, yeah. What do you think, and there's some debate on this, on the LLM side, on the training side, too, but what do you think is the bottleneck?

Starting point is 00:36:02 today versus, let's say, three years from now. I feel like five years is too far to predict. Is it, I think you've said in the past, it's no longer intelligence in terms of bottlenecking search. Like, is it data, accessible data? How do you think about how that evolves? Yeah, I mean, well, initially the bottleneck is going to be actually the infrastructure,

Starting point is 00:36:22 which is kind of interesting. No one realizes, but like if you actually get, for example, 10x, 100x, X, 1,000 X, more surges than Google, the infrastructure to handle that is insanely large. It just hasn't been built yet. So we're really excited to explore all sorts of cool new vector databases that have like super high throughput, for example, things like that. Yeah.

Starting point is 00:36:39 So that's like an interesting bottle. Like in the same way there's compute bottlenecks, there'll be like infrastructure bottlenecks. I mean, it'll be solved at some point. Yes. And then like other bottlenecks are like data bottlenecks. Like so agents are increasingly going to want to ask questions about the world and that data might not be on the web. It might not even be recorded anywhere. And so like, you know, there will be this trend towards how can we like accumulate all the world's data.

Starting point is 00:37:02 Like literally unearthed the world's data. I'm really excited by this because the world is filled with information and it's not all recorded. You know, the history of humanity, the history of humanity is the world's been, you know, when we're hunters and gatherers, there's also information. Then they started writing things down. And like the amount of information the world has been like skyrocketing. There's still so much information that's not recorded. You know, you go from all the way from the first clay tablet.

Starting point is 00:37:23 Oh, really the first like paintings on on caves would be arguably the first time things were written down. And then like, and then, you know, clay tablets and, you know, now I've in newspapers. and obviously now we have like the digital age, but there's so much information in your head, like satellite images, like that are not just like in the world's soup of information that we could search over. Yeah.

Starting point is 00:37:41 And like to fully understand the world, fully understand how props, crop yields are going to, you know, affect, you know, some company, like, or like, what are people thinking about the world or how do we unite the world? Like that requires like understanding the world at a deeper level.

Starting point is 00:37:54 So I think the bottleneck will be data in a lot of ways. And then once you have, the problem is once you have all this data, now the bottleneck will be retrieval. Right. Right, right, right. Imagine, it's just a crazy idea. Like, imagine, you know, we're, as we expand as a species,

Starting point is 00:38:07 like, we're thinking, like, very far in the future. Like, think about, like, or in the solar system, like, there's so many things going on in the world. There's so much data being accumulated. The retrieval over that is very expensive. And so, like, they're actually really fundamental, they're interesting fundamental questions here. Like, right now the web is, like,

Starting point is 00:38:23 what's called a trillion pages that matter? What happens if the web were a thousand times bigger, like a quadrillion pages, meaning, like, everyone started uploading data. Well, then, like, any search algorithm that works over a trillion pages no longer, like it might work over a quadrillion pages, but it might be a thousand times more expensive.

Starting point is 00:38:35 And that's not practical, because if anything, we want search to get cheaper, not more expensive. So what kind of search algorithms to work over a quadrillion pages? These are fascinating questions that, like, I don't know if I've ever heard anyone else talk about. Well, actually, I was going to say, at least the,

Starting point is 00:38:52 thinking about when we're living in, you know, on Mars or whatnot, there's probably one other founder that has thought about that extensively, Elon. And, you know, I listened to your, when you went on the latent space podcast last year, one of my favorite podcasts. And you talked about actually working at SpaceX. How was that experience? And like, are there elements of Elon's leadership style that you've taken as, you know, CEO of your own company? Yeah. Well, first of all, the internship was magical. So, like, for example, I saw the first landing on the barge. Wow. And like, like, just outside mission control. Like, I'm getting, uh, tingles just thinking about it. But yeah, like, just, just, just, seeing that, like, people coming together to do something magical. Like, that was very inspiring and made a big impact on me. And, yeah, in terms of my leadership style, yeah, I like to think that I've incorporated some of what I think are the best aspects of Elon. So, you know, for example, like, he's very detail-oriented.

Starting point is 00:39:46 Like, he gets into details of everything. And, like, for better or worse, like, I do that too at XA, like, everything from, like, the algorithms, like, the vector DB algorithm to, like, the office space and making sure, like, every part of the office is, like, really just, like, inspires and excites. Yeah. And like, like,

Starting point is 00:40:02 shows our passion. And that goes across everything, from our marketing to the engineering to, uh, to, to go to market. And, um,

Starting point is 00:40:10 so that, that's exciting. Obviously it doesn't scale. So one thing I've been learning as we scale the company is like, what details can I choose to go into? And so it's, I like the ego metaphor of like, uh,

Starting point is 00:40:19 you know, I'm an eagle flying above the company. And then when I see some detail that I think it should be important to, to, to fix, like I go dive down and go into it and then come back up. Um, so I think Elon has,

Starting point is 00:40:29 some of those properties. Obviously, Elon is like all in every day for, what is it, like, decades. Yeah. I've only been doing for five years, but I intend to do it for decades. One last thing that he does really well, which is like he's very good at memetic, like names and like inspiring through like memetic things. Like, you know, like you realize that SpaceX's mission is not like improve rockets to get to or, yeah, like, like, oh, make rocket travel really fast, like good. It's like, it's like make humanity interplanetary. Like that's a really good memetic thing. So I think a lot the names of like projects and like when I when I you know I do a team stand up every Monday in front of the whole company now we have like this like double floor office where people now are

Starting point is 00:41:08 surrounding on the top floor it's really cool it's like a stadium and I give like a speech and I and I like to like simplify what we're doing into like what is the core what is the memetic core and like come up with a cool name and that inspires by the way a name is really important because it's when you have a company of a certain size people are constantly communicating in ways you're not you're not part of those conversations and like the name like really like uh grounds the mission of that project. Yeah, absolutely. This is really important.

Starting point is 00:41:33 Like, I could think for like a day of just about a name. So I just totally divergent, but on the topic of names, one, I love goat. Talk a little bit about the symbolism of that. And then, I mean, besides, you know, greatest of all time, obviously. But, and then also EGSA. You shared before what EGSA means, but talk more about why you name the company EGSA. Yeah, sure. Okay, so the goat thing is basically.

Starting point is 00:41:58 there was a coffee shop that opened up next door, and so many people, so like the mayor of SF kept posting about it, and so a lot of people go, it's really cool coffee shop, hedge coffee. And so many people were walking by, we're like, we got to, like, have them stop and look at what XA is.

Starting point is 00:42:12 So we have some posters about XA on the entrance to our office, but we were like, how do we get them to stop? And so I was like, just put a goat. I don't know why I thought this, but just put a goat there. And I actually, some part of me wanted a real goat. I don't know how we'd maintain the goat.

Starting point is 00:42:24 I think the ROI would still be valuable. But instead, we bought, like, a nice, like, fake goat. Yeah, I love that goat. And we put like the swag on him. So it's like it's a beautiful. Yeah. And and people stop by. So like we're actually like tons of people stop by. Like wow. Even on Saturday and Sunday, which is like when the coffee shop is very popular, like people come by and I go and I say hi to them. I talk to them. It's actually very beautiful. We're actually like an important stop for for kids on their walk to school. Like you know when you're a kid you had like to stop. So you like move the ice cream store. Like the goat is a stop and they're always taking pictures. Very cute. Starting the recruiting funnel early.

Starting point is 00:42:57 but it's interesting how much those things matter. Yeah, absolutely. We've hired people because of the goat. Anyway, XA, I think is a great name. I love XA. One value of XA is like, it's a great prefix. So like XA anything, XA data, XA this, XA that. But XA means 10 to 18th,

Starting point is 00:43:16 which is in contrast to Google, which is 10 to the 100th. And the idea there was like, you know, we're kind of overwhelmed with information. Even though you could technically find lots of things on the web on Google, It doesn't mean it's like organized and you get the highest quality knowledge. So like in here it's like 10 to 18th is like you're extracting the most important information.

Starting point is 00:43:34 It's still very large amount of information, but it's not like overwhelming. That was one of the ideas. Great name. We love the name as well. And the goats. My kids love the goats. So I wanted to, this topic feels a little bit over talked about, but it's been in the news more recently. So I'm going to bring it up, which is I don't know if you saw Will Minetus wrote a

Starting point is 00:43:57 an article recently called Grind Slop. And it's almost this backlash of, and in fact, I pulled a quote from it. He said something about, we're witnessing a phenomenon that masquerades is disciplined, but represents perhaps the most extravagant squandering of economic surplus in our civilization's history. So extreme words, right?

Starting point is 00:44:18 And he talked about sort of all of these articles that are glorifying the grind over what they're doing, you know, all of these things that, you know, maybe are more relevant to work-life balance, et cetera. So I guess I'll start with saying one of the things that I was actually most impressed and excited by when I visited Exa at, God, I think it was like 8 p.m. was that there were a ton of people in the office. And it wasn't just that they were there for FaceTime,

Starting point is 00:44:50 because obviously that's not what you and Jeff care about. But they were excited about what they were working on. And I think really importantly, they were excited by who they were doing it with. And so just say more about, like, how you think about that culture of going all in and going after what you guys are, you know, sort of the mission that you just talked about. And, like, what do you think the important things to balance are in culture? Yeah, yeah. I mean, this is critical to me.

Starting point is 00:45:16 And it should be critical to any company that wants to do great things. Like, people need to be super excited about what they're doing. And I have a ton of fun, right? So, like, you might have also heard at APM, like, people are just laughing. Yeah. Oh, yeah. We laugh a lot at Exa. And it's almost too loud sometimes

Starting point is 00:45:27 and it's like, distracting. So we have to have like headphones for everybody. But, but yeah, like people are, it's very important people are having fun and that's necessary for doing the best work of your life. And I would say like having fun just like, just good vibes. But also it's very important that people work on the projects

Starting point is 00:45:43 that are most exciting to them at any point. And so like people, especially now with like, you know, all these AI tools, like anyone, especially on the engineering side or even on the market side or off side, everyone can work on whatever they want. So for example, like we had someone who built the vector database. who was like, hey, I want to go train models. I was like, okay, just go do it.

Starting point is 00:45:59 And like, he didn't really have experience in training models, but I was like, you're really smart. You'll learn it in like two weeks. And then you'll just do amazing work. And he's done amazing work. So, right? So I make sure everyone's working on exactly what they want to work on. And luckily in search, it's like, at least in our space,

Starting point is 00:46:12 like in any direction we improve things, it will be good for the business. So it's like, okay. And it actually happens to turn out somehow that like what people want to work on and what we need, like perfectly aligns. It's like a magical thing. I don't know how that's possible. But yeah, so everyone's working on exactly what they want to work on.

Starting point is 00:46:26 also working on multiple projects because like AI tools will enable you be like productive in parallel. And so it's just and then what we're doing is just super cool. Like, you know, there aren't, like it's hard to find maybe these days like really exciting like humongous projects and missions that other people aren't doing. And this one happens to be like organizing world's information, making perfect search. That's very exciting and important for the world, but also like really hard problem. And engineers are really excited about that. And on the go to market side, it's like, we're then selling this search to the whole world, and we're selling to some of the coolest companies,

Starting point is 00:46:57 and it's very exciting to them. They get to talk to all these hot companies and give them search that really helps their products. It's just an exciting space, but it's very important to me. So I hope we maintain that forever. Yeah. No, absolutely. Well, maybe just to end on one last question,

Starting point is 00:47:15 and I'll say, first of all, Exa is hiring. And you told me once that you actually still interview every single person at it. Exa or who comes to, you know, interview for Exa. And so I'm curious what, and you've, you've attracted some of the most incredible talent, junior, senior, everything in between, very high slope, very experienced. And I'm curious what it is that you look for when you do that last final interview and, you know, how you've been able to get these incredible people to come to Exa. Yeah. I mean, people might not like this word, but I look for passion. It's like someone

Starting point is 00:47:53 just like a fire in their eye. Like, do they really care about what we're doing? Or some aspect of doing. I really want to scale this thing or I really want to sell this to everyone. Because why is that important? It was important five years ago, but I think it's especially important now because like the world goes to who is most passionate and who's most agentic because like you can now do anything. Like you could be whatever you want. Like, you know, like with agentic tools, you could literally like do anything. And what matters most is like how much do you care about the end result? And then like your judgment and everything. And so like I I look for the fire on the eye because if you have that fire, you could literally do anything.

Starting point is 00:48:26 It's actually crazy how meritocratic the world is becoming because of these agentic tools filling in the gaps. Yeah, absolutely. Yeah. Well, thank you so much, Will. This was great. Such a fun conversation. Appreciate you and very excited to be partners. Awesome.

Starting point is 00:48:40 Thanks for listening to this episode of the A16Z podcast. If you like this episode, be sure to like, comment, subscribe, leave us a rating or review and share it with your friends and family. For more episodes, go to YouTube, Apple Podcast. podcast and Spotify. Follow us on X, A16Z, and subscribe to our substack at A16Z.com. Thanks again for listening, and I'll see you in the next episode. As a reminder, the content here is for informational purposes only. Should not be taken as legal business, tax, or investment advice, or be used to evaluate any investment or security and is not directed at any investors or potential investors in any A16Z fund.

Starting point is 00:49:20 Please note that A16Z and its affiliates may also maintain investments in the companies discussed in this podcast. For more details, including a link to our investments, please see a16Z.com forward slash disclosures.

a16z Podcast - Building Search for AI Agents with Exa CEO Will Bryk

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.