The a16z Show - Sovereign AI: Why Nations Are Building Their Own Models

Episode Date: May 24, 2025

What happens when AI stops being just infrastructure—and becomes a matter of national identity and global power?In this episode, a16z’s Anjney Midha and Guido Appenzeller explore the rise of sover...eign AI—the idea that countries must own their own AI models, data centers, and value systems.From Saudi Arabia’s $100B+ AI ambitions to the cultural stakes of model alignment, we examine:Why nations are building local “AI factories” instead of relying on U.S. cloud providersHow foundation models are becoming instruments of soft powerWhat the DeepSeek release tells us about China’s AI strategyWhether the world needs a “Marshall Plan for AI”And how open-source models could reshape the balance of powerAI isn’t just a technology anymore - it’s geopolitical infrastructure. This conversation maps the new battleground.Resources:Find Anj on X: https://x.com/AnjneyMidhaFind Guido on X: https://x.com/appenzStay Updated: Let us know what you think: https://ratethispodcast.com/a16zFind a16z on Twitter: https://twitter.com/a16zFind a16z on LinkedIn: https://www.linkedin.com/company/a16zSubscribe on your favorite podcast app: https://a16z.simplecast.com/Follow our host: Follow our host: https://x.com/eriktorenbergPlease note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.  Stay Updated:Find a16z on YouTube: YouTubeFind a16z on XFind a16z on LinkedInListen to the a16z Show on SpotifyListen to the a16z Show on Apple PodcastsFollow our host: https://twitter.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Transcript
Discussion (0)
Starting point is 00:00:00 They're not being called AI data centers. They're being called AI factories. The Industrial Revolution, having oil was important. And now having data centers is important. These models aren't just compute infrastructure. They're cultural infrastructure. It's not just self-defining the culture, but self-controlling the information space.
Starting point is 00:00:20 So if a model is trained by a country that's adversarial to you, that's actually very hard to eval or your benchmark when the models are released. This is a massive vulnerability. Is that the new age of LLM diplomacy that we're entering here? Do we build? Do we partner? What do we do? Today we're diving into a conversation that's just as much about geopolitics as it is about technology. This week, the Kingdom of Saudi Arabia announced plans to build its own AI hyperscaler called Humane.
Starting point is 00:00:51 But they're not calling it a cloud provider. They're calling it an AI factory. And that language alone suggests a shift. For decades, cloud infrastructure has been concentrated in two places. the U.S. and China. But with the rise of AI, that model is breaking down. Nations no longer want to outsource their most strategic compute. They are building sovereign AI infrastructure, factories for cultural and computational independence. To unpack what this means for the global AI stack, national sovereignty, and the new digital power dynamics,
Starting point is 00:01:23 I'm joined by Angine Mehta in Guido Appenzeller. We talk about what it takes to become an AI hypercenter, why governments are spending billions to control inference pipelines, and whether we're entering a new Marshall Plan moment for AI. Let's get into it. As a reminder, the content here is for informational purposes only. Should not be taken as legal business, tax, or investment advice, or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any A16Z fund. Please note that A16Z and its affiliates may also maintain investments in the companies discussed in this podcast. For more detail, including a link to our investments.
Starting point is 00:02:04 Please see A16Z.com forward slash disclosures. Andge Guido, want to talk about sovereign AI, AI in geopolitics, and let's start with the news. Our partner, Ben, is in the Middle East right now to participate in his own way. What happened, and why is it his own point? What happened is the kingdom announced
Starting point is 00:02:26 that they're going to build their own local hyperscalor or AI platform called Humane. And I think why it's notable is that, as opposed to the status quo of the cloud era, they're viewing the AI era as one where they'd like the vast majority of AI workloads to run locally. If you think about the last 20 years,
Starting point is 00:02:50 the way the cloud evolved was that the vast majority of cloud infrastructure basically existed in two places, right? China and the US. And the US ended up being the home for the vast majority of cloud providers to the rest of the world. That doesn't seem to be the way
Starting point is 00:03:04 AI is playing out. Because we have a number of frontier nations who are basically raising their hands and saying, we'd like infrastructure independence. The idea being that we'd like our own infrastructure that runs our own models that can decide where we have the autonomy
Starting point is 00:03:21 to build the future of AI independent of any other nation, which is quite a big shift. And I think the headline numbers are somewhere in the range of 100 to 250 billion, worth of cluster buildout that they've announced, of which about 500 megawatt
Starting point is 00:03:40 seems to be the atomic unit of these clusters that they're building. So a number of countries, with the kingdom being the one that's most recent, have been announcing what we could think of as sovereign AI clusters, and that's a pretty dramatic shift from the pre-AI era.
Starting point is 00:03:56 I don't know if you'd agree with that. I think it's spot on. I think many sort of geopolitical regions are reflecting back what happened in previous big tech cycle. And wherever the technology is built, and whoever controls the underlying assets, has a tremendous amount of power of shaping, regulation,
Starting point is 00:04:11 shaping how this technology is being used, and also puts themselves in a position then for the next wave that comes out of that. And, you know, was the Industrial Revolution, having oil was important, and now having data centers. Right. It's important.
Starting point is 00:04:24 And so I think it's a very exciting development. Yeah, in fact, you can often tell why something is important to somebody by the semantics that folks used communicate a new infrastructure project. In this case, if you look at how the cluster buildouts are being referenced, they're being called AI factories. They're not being called AI data centers.
Starting point is 00:04:47 They're being called AI factories. And I think there's two ways to respond to that. One train of thought would be, hey, that's just branding. That's just marketing people doing their thing. And under the hood, this is really just data centers with slightly different components. but everybody in the world, in every industry, is looking for a way to be relevant in the age of AI,
Starting point is 00:05:08 and this is the compute infrastructure world's way of doing that. An opposing view would be, actually, no, this is not just marketing. If you look under the hood, and if you x-ray the data center itself, very little of it is the same as was the case 20 years ago. The big difference in active components being GPUs, right? About 20 years ago, what would you say the average number of GPUs were in a... What percentage of...
Starting point is 00:05:30 Right. Pretty much. Yeah, it's a very recent phenomenon. And today, I think if you look at the average 500 megawatt data center and you looked at what percentage of the CAPEX that was required to build that data center or operated rather went to GPUs. Massive, yeah. That's a huge shift.
Starting point is 00:05:46 I think we're also seeing a specialization, right? The kind of data center you built for a classic CPU-centric workload and what you built for high-density AI data center look very different. You need liquid cooling to the rack. You need very different energy supply. Right. Close to a power plant. You want to lock in that energy supply early on.
Starting point is 00:06:03 And then we also see that change, I think, in the consumer behavior. Where classically, you want a very full stack that has lots of services that helps enterprise build all these things. We're seeing more of my enterprise that are actually comfortable with just building on top of a simple Kubernetes abstraction or something. And basically, you know, cherry pick a couple of snowflake or database-type services on the side that help them complement that. So I think there's a new world.
Starting point is 00:06:27 And so that's certainly true that you could kind of look at the technical components in an AI factory are completely different from a traditional data center. And then there's, what does it do? And historically, a lot of the workloads that traditional data centers were doing
Starting point is 00:06:42 were running one cloud-hosted workloads for enterprises or developers, whoever it might be, where most of that, the data sets and the workloads were actually not particularly opinionated. And when I say opinionated, I mean, they're not necessarily subject
Starting point is 00:07:00 to a ton of cultural oversight. You could argue that was not the case with China, where China wanted full sort of oversight over those workloads. But for the better part of the 2000s, until the rise of GDPR, CCP, and so on, we lived in an era of centralization where having most of your cloud infrastructure in northern Virginia
Starting point is 00:07:22 was preferable for most of the world's developers' enterprises because it gave them economies of scale. That started to change, of course, with GDPR, CCPA, the rise of data privacy laws because then you had region by region compliance. And that made the rise of something like Cloudflare critical, right? Where Cloudflare has this idea of distributed infrastructure where you can tie the workload policies to wherever the user is.
Starting point is 00:07:46 But by and large, that was critical for, especially for the rise of social media workloads, but the vast majority of enterprise workloads didn't need decentralized serving. What's different about AI seems to be that these models, models aren't just compute infrastructure. They're cultural infrastructure. They're trained on data
Starting point is 00:08:04 that has a ton of embedded values and cultural norms in them. And then more importantly, that's the training step. And then when you have inference, which is when the models are running, you have all these post-training steps you add that steer the models to say something or not,
Starting point is 00:08:19 to refuse the user or not. And that last mile is where things over the last, I would say, year. They have made it more and more clear that countries want the ability to control what the factories produce or not within their jurisdiction. Whereas that urgency didn't quite exist as much.
Starting point is 00:08:36 Because of the cultural factors or because of certain independence or resilience? It's a good question. My sense is there's two things going on, but you should chime in if you think I'm being incomplete. One, I think, would be the rise of the capabilities in these models being now well beyond what we'd consider sort of early to a stage of a technology.
Starting point is 00:08:56 I think our partner, Chris Dixon, has a great line, which is that many of the world's most important technologies start out looking like toys, right? And four years ago, when the scaling loss paper was published, and the GPT3 was published, most people looked at it and said, okay, it's cool, sure, it can produce the next word. It's a nice party trick. It's a nice party trick, right?
Starting point is 00:09:14 It's a stochastic parrot. And now you have foundation models literally running in defense, in health care, in financial services industries. Chad GPD has about 500 million monthly active users, making real decisions in their daily lives. I think using these AI models, there was a paper that was recently published by Google
Starting point is 00:09:34 that showed the efficacy of Gemini, their foundation model, at solving medical questions. And one of the most interesting things you can see when you look at the usage, the types of prompts that people are using models for relative to two years ago, three years ago,
Starting point is 00:09:52 where it was a lot of helping write my essay. It's turned into coding, and help me solve a whole host of medical problems or personal life-related questions and so on, where it's clear now that these capabilities can be used, want to drive mission-critical industries like defense, health care, and so on, and also then influence a number of your citizens' lives. And so I think that makes a lot of governments go,
Starting point is 00:10:17 wait a minute, if we are dependent on some other country for the underlying technology that our military, our defense, our health care, our financial services, and our daily citizens' lives are driven on, that seems like a critical point of failure in our sovereignty. So that's one. It's just that models have gotten good,
Starting point is 00:10:35 and they seem to be good at a bunch of important things. The second is, I think, an increasing belief that if you don't have control over the model's production pipeline, then you're doomed or destined to use models that reflect other people's cultural values. We had a pretty in-depth debate about this with Deep Seek, where the question was, is Deepseek fundamentally more biased or not
Starting point is 00:11:00 than open source models trained in the U.S.? And I think there's early evidence to say that you can actually see, certainly in the post-trained Deep Seek, that there's just a number of topics and types of tasks that it's been told to avoid and answer differently from a model like Lama. So that's the cultural piece. I think there's a critical sort of national capability piece
Starting point is 00:11:21 and then there's the cultural piece. And I think both are combining to create this sort of huge rise in the demand for, you could call it sovereign AI, which is the idea that you want control over what the models can, can't do, or you could call it infrastructure, independence. I think everyone's got a different word for it.
Starting point is 00:11:35 You could call it our local AI factory ecosystem, but I think that all these terms are trying to get at the same thing, which is we've got to control our own stack. Yeah. I think I would make it even stronger. I think it's not just self-defining the culture, but self-controlling the information space. Right.
Starting point is 00:11:50 I mean, today we're starting to see how, in many cases, models are about, placing search. I don't longer go to Google, I'd go to chat GPT, and that comes back with an answer. If there's a historical fact and say, in the Chinese model
Starting point is 00:12:03 does not show up, and a US model it does show up, right, that is the reality that people grow up with. And if you write an essay in school, in the future, many of us essays will be graded by an LLM. Right.
Starting point is 00:12:15 So in fact, in school, something that may be truthful, right, maybe graded us wrong because whoever controlled the model decided that should not part the trading policy. So it has a very profound effect on public opinion and stuff, you know, on values. Right. The downstream use is an interesting one because it's very hard to measure for. And certainly relative to two years ago when the vast majority of products and applications,
Starting point is 00:12:40 like the ones Gito's talking about, were basically pretty simple models, right? Well, at the time they were considered pretty complex, but the frontier change is so fast. Today we'd look back at a model like GPD4 that was largely just a next word prediction model and say, That's pretty rudimentary, right? Because if you x-rayed an app like chat GPT, sure, on the surface, it looks like nothing much has changed, right? It's still a chat box you type in what you need
Starting point is 00:13:01 and it outspits an answer relative two years ago. But under the hood, there's been this insane evolution where there's four or five different systems interacting with each other. Right, you've got a reasoning model that can produce a chain of thought to think through what it should do next, including then doing what we call tool usage, right? Calling out to third-party tools.
Starting point is 00:13:21 And then you have the idea that, these models can start to self-learn, to go through a loop of taking your input and reasoning about what it needs to do, calling an action, and then evaluating its output, and then updating that loop. That starts to look more. People use the word agent, right, to call it that.
Starting point is 00:13:39 But the idea is that it's going from being a pretty simple model to being a system. And it's very hard to measure where the adversarial cracks are in this system. So if a model is trained by a country that's adversarial to you, to when you're writing code, open up a port or what we'd call a call home attack, right, where it's transmitting telemetry.
Starting point is 00:14:00 That's actually very hard to eval or benchmark when the models are released. Right? Because these models are often tested in very academic or static settings. And so when Deep Sea came out, it was just such a great model. It was a phenomenal piece of engineering that suddenly everybody was using it everywhere.
Starting point is 00:14:19 And a number of CIOs and CTOs got pretty nice, nervous because they were like, wait a minute, if the model is being used in this agentic fashion, and I don't have visibility on what it's doing adversarily until it's too late, this is a massive vulnerability. And so I think the adversarial threat, as the systems go from being models to agents, is causing a lot of governments to go, well, we'd rather have the whole thing running locally in a way that we can lock down. Again, it comes back to a sort of independence and a supply chain question. And is your expectation that this is going to play out and to what extent is it going to play out?
Starting point is 00:14:55 On the cloud, as we mentioned, there's a Chinese internet and the sort of Western rest of the world internet. How widespread is this sovereign AI thing? Yeah. I'm going to borrow an analogy that Gito used, which is like in the Industrial Revolution, you could look at where resources flowed, right?
Starting point is 00:15:09 I think you should talk about how viewing it from the lens of oil reserves can kind of dictate which countries can and can participate in the Industrial Revolution. Go ahead. So if you look at the Industrial Revolution, of oil was the foundation of a lot of the technologies, right? You needed all reserves in order to participate.
Starting point is 00:15:26 And I think it'll be a little bit the same thing, right? If you want to build industry in a particular country, if you want to be able to export things, if you want to be able to drive development, and if you want to harness the power that comes with that, you need the corresponding reserves. And I mean, I think AI data centers are a little bit like these oil reserves, but the big difference being you can actually construct them themselves
Starting point is 00:15:44 if you have the necessary investment dollars and the willpower to do it. But I think they will be the foundations for building all the layers on top that ultimately, I think, determine who wins this race. And in my mind, the countries that invest in building out the AI factories or in this sense the oil reserves to borrow Gito's analogy, I think of them as one body of countries. Let's call them hypercenters, right? The idea is they're centers that have enough compute capabilities
Starting point is 00:16:10 to compete at the frontier and run their own sovereign model, sovereign infrastructure. And then there's everybody else who just doesn't have the resources to do that. And if you look at after the Industrial Revolution, you could argue the next major technology revolution was the advent of modern finance, the Bretton Woods and IMF regime, where modern finance said,
Starting point is 00:16:30 we're going to all use this one measure of value called the dollar. And you were either in a country that produced the dollars, like America, or you were in a country that produced a lot of goods that acquired dollars like China. And then if you weren't in one of those two, you really had to figure out whether you aligned with one of these trade blocks or not. And what happened is you had countries like Singapore, Luxembourg, Ireland, and Switzerland
Starting point is 00:16:53 who'd realize, well, we just don't have the resources to build out our own reserve system. And there's not that much by way of local production that we can do to acquire dollars. We can't really trade. So we've got to find a way to insert ourselves in the flow. And so Singapore, of course, famously became the entry point from dollar flows into Asia because they invested a ton in rule of law and a great, tax regime and sort of stable government and low corruption and all of that. Switzerland did something similar for European investments and in European capital flows.
Starting point is 00:17:25 So I think what we're watching right now is that buildout where there's U.S. and China, which clearly have enough compute to be hypercenters. And then you've got folks like the Kingdom of Saudi Arabia saying, we want to be a hypercenter. And if that means we've got to trade our oil to acquire large numbers of NVIDIA chips, we'll do that right now. And I think in that bucket there's probably the kingdom of Saudi Arabia. Arabia, there's Qatar or there's Kuwait, there's Japan, Europe, clearly. And then I think the question is everybody else, what do they do? And it's not clear to me what you have to do to become the
Starting point is 00:17:57 Singapore-V-I. And maybe the Singapore-V-I ends up being Singapore, because actually now they have an enormous sovereign wealth fund as a result of participating in modern capital flows. But I think a bunch of other countries are sitting around wondering, is this the time where we actually buy? Do we build? Do we partner? What do we do? Yeah. And talk more about the implications behind what this means. Is this something that the U.S. should be excited about? What does this mean as we think about foreign policies? Are there now winners across the board and all these local environments? Why don't you talk about some of the big implications here? Go on take a step? I think every big structural revolution is both a threat and no opportunity. I think the United States and AI right now has the world leadership. Yeah. That's an opportunity. Hanging on to it won't be easy. It isn't every tech revolution? Don't we want people to be dependent on us in the same way that they were in the cloud revolution,
Starting point is 00:18:45 or do we benefit somehow from it being more decentralized? The world is not one place. So I think complete centralization won't happen. I think the leader is good. Having strong allies that also have, that technology is also very valuable. So it's probably a balance of those that we're looking for. Yeah. To put a finer point on your last note there is that you could think about a balance.
Starting point is 00:19:05 Like we're clearly in an unstable equilibrium right now. Yeah. And so Gito is right that the arc of, of humanities and history such that things will shake out until there's a stable equilibrium. And so what is the stable equilibrium? And I think one way to reason about it is you could look at historical analogy,
Starting point is 00:19:21 so post-World War II when Europe was completely decimated, there was a group of really enterprising folks in the private sector and the public sector who got together and said, hey, we can either choose to turn our backs on Europe and adopt a posture of isolationism where we mostly focus on a post-war American-only agenda. or we can try to adopt a policy where we know that if we don't help out our allies, somebody else will. And so they came up with this idea called the Marshall Plan, right,
Starting point is 00:19:53 where a number of leading enterprises in the U.S. got together like GE and General Motors and literally subsidized the massive reconstruction of Europe that helped a lot of European economies quickly get back on their feet. And at the time, there was a ton of criticism of the Marshall Plan because it was viewed almost as a net export of capital and resources. But what it did end up doing is then solidified this unbelievable trade corridor between the U.S. and Europe for the next 50 years, which really kept China out of that equation.
Starting point is 00:20:24 70 years, yeah. 70 years, really. And so I think we have a choice, either approach it the way we would, the Marshall Plan for AI, right, and say, well, a stable equilibrium is certainly not one where we just turn our back on a bunch of allies because China definitely has enough of the compute resources
Starting point is 00:20:39 to try to export great models like Deep Seek to the rest of the world. So what do we want our allies on? Deep Seek or Lama? That's what it comes down to, at the model level of the stack, right? And I think that the realities that a number of countries are not waiting around to find out. That's why you have efforts like Mistral in the EU,
Starting point is 00:21:00 where they are being approached by a ton of, not just European nations, but a ton of other allies of Europe to say, hey, can you help us figure out how to build out our own sovereign AI? And so I think we're about to see basically the single biggest buildout of AI infrastructure ever because most of the purchase orders and the capital is being provided by governments. Yeah. Because they've realized this is a critical national need.
Starting point is 00:21:28 And their stable equilibrium is certainly not to depend on somebody else or depend on an uncertain ally. And so the ones that certainly have the ability to fund their own sovereign infrastructure are rushing to do it right now. And what does that mean for the sort of nationalization debate or how you see that playing out? Leopold, Dash and Brenner, formerly of Open AI, and his famous sort of report talked about how, hey, if this thing becomes as critical to national security as we think it will be, at some point, the governments aren't just going to let private companies run it. They're going to want to have a much more integrated approach with it. Where do you stand with the likelihood of that? And what does that mean just in terms of the feasibility of regulation in a world where it's much more decentralized?
Starting point is 00:22:05 And we already had this with Deep Seek. and that everything change the game in terms of we're in an arms race and you can't control everything. We're in the open source
Starting point is 00:22:10 conversation as well. We're backing some of these players. Where are your thoughts on where there's all the nets out? I think I have probably a strong opinion on that. I mean, I grew up in Germany, right, so benefiting from the Marshall Plan
Starting point is 00:22:20 and also seeing how that pulled away Western Germany towards the United States and eventually East and Germany also towards the United States when everybody realized the impact of that. One lesson I took away from that
Starting point is 00:22:31 is that I think any kind of centralized planned approach does not work. Eastern Germany is Western Germany, a nice AB test, you know, central planning versus a free market economy, what works better, right? And I think the results speak for themselves. So I think, basically, having the government drive all of AI strategy, you know Manhattan style project or Apollo project, pick your favorite successful project there. I can't see that working. You probably need a highly dynamic ecosystem of a large number of companies competing. There's some areas I think where the government
Starting point is 00:22:59 can have a hugely positive effect, right? On the research side, we've seen that again and again, funding fundamental research, which is not quite applied enough yet for enterprises to pick up, right, is very valuable. I think it can help in terms of setting good regulation, bad regulation, can easily torpedo AI, as we've seen. And so I think there's a strong role for government to lead this and to direct this.
Starting point is 00:23:22 There's no master plan at the end of the day that you can make that basically has all the details that has to come from the market. I don't agree with the Aschenbrenner point of view. I agree strongly with Gito that the history of central centralized planning at the frontier of technology is not great, barring a few situations that were essentially brief sprints of war, right? And arguably even the Manhattan Project, which is the analogy I think he uses in his piece, we now know that there were leaks. It was literally a cordoned off facility in Los Alamos or whatever, and they were still spies.
Starting point is 00:23:53 And so if you're approaching this from the lens of the models are what are the equivalent of nukes. And we've got to regulate the development of these by locking up our smartest researchers in some facility in Los Alamos, and that's what's going to prevent the best models from getting exported. I think that's great fiction, a very interesting novel. But for anyone who has ever
Starting point is 00:24:14 had both pleasure and displeasure working in any large government system, it's a pipe dream. The good and the bad news is that, in a sense, it doesn't really matter where the model weights are. It matters where the infrastructure that runs the models are.
Starting point is 00:24:27 In a sense, inference is almost more important. And I think a year ago, we were in a pretty rough spot, I would say with the arc of regulation where there were a number of proposals in the United States to try to regulate the research and development of models versus the misuse of the models. I think that, luckily, we have moved on from that.
Starting point is 00:24:44 Where we are now is, unfortunately, still a state level like batch work on whack-a-mole of regulation. It's not consistent. Hopefully, I think we've got a number of positive signals from early administration executive orders that they put out that hopefully means there will be unified regulation around AI. But I don't think that the answer is,
Starting point is 00:25:03 going to be one single lab that has one God model that then the country protects as if it's a nuclear bomb. I think we are now in a state where partially because of the build out of AI factories that we've discussed, a number of countries have the capabilities to train frontier models. And a number of them are quite willing to export them openly. China being a leading one. Deepseek has forced people to update their priors where just a year before Deepseek's, came out, you have a number of tech leaders in Washington testifying that China was like five to six years behind the U.S. with confidence on the record.
Starting point is 00:25:42 And then Deep Sea comes out 26 days after opening I puts out the frontier. I mean, just shattered all of those arguments. So the calculus has changed. I think it means that the only way to win is build the best technology and out-export anybody else. Then if the question is, whose math is the world using?
Starting point is 00:25:58 We'd love for it to be American math. Right. My view is that we are much better off embracing the ability for other countries to serve their own models and ideally the best product wins, which is the best models just come from the US and our allies. Is that the new age of LLM diplomacy that we're entering here? Actually, Ben had a great talking point to this
Starting point is 00:26:19 at FII Riyadh last year and he said something to the effect of because these models like we discussed earlier our cultural infrastructure, you don't want to be colonized in the digital era in cyberspace. and I think that's pretty spot on. Yeah.
Starting point is 00:26:34 Instead of colonization, what we have is now, I think, foundation model diplomacy. That's a good way to put it. I think it's right. It suits the U.S.'s relative skill sets, which is innovation and working with our allies. To China, which has been a bit more closed off as a country. I want to talk about the bull case for open source companies like Mistrawl
Starting point is 00:26:51 in a world where some of these bigger players are open sourcing more, becoming more interested in that. So there's a couple, and we've talked about this increasingly in a world where two years, ago, I think when we led the investment of Mistral, we had a fairly clear hypothesis for how open source wins in an arc where foundation models end up looking more and more like traditional compute infrastructure, storage, networking, etc. Because close source usually, if you look at databases or operating systems, Windows, close source usually leads the way in terms of opening up new use cases, often captures a ton of value, certainly from consumers. But
Starting point is 00:27:31 But when the enterprise starts really adopting that technology, they usually want cheaper, faster, and more control. And in the world of AI, you can't get the kind of control most enterprises want without having access to the weights. And at the time, the only real comparable model to the frontier close source was Lama. And then the creators of Lama left to start Mistral.
Starting point is 00:27:53 So it was a pretty natural decision. I think since then there's a different thing that's turned up, which is the idea of sovereign AI infrastructure that's not just models, it's everything else down and up. And I think something we've been debating as well, does that mean the ideal provider of cloud infrastructure is also the provider of the best open source models? Traditionally, cloud infrastructure is pretty well-dominated category
Starting point is 00:28:14 owned by incumbents whose core business was either in telecom or in commerce, like Amazon. And it seems like now that's changing. I think you put it more eloquently than I did, which is if you ask the wrong guys to design the data center, They're going to design the wrong data center. But I'm paraphrasing here. I think it's exactly right.
Starting point is 00:28:35 I mean, each of the last big technological ways, if you look at the PC revolution or the internet boom, we developed essentially a new building block for systems, right, the CPU or the database or the network. I think now with the process of building yet another building block, which is the model or AI, whatever may be called in the end. Right. So it's a fourth pillar in a sense.
Starting point is 00:28:54 Computer network storage has become a compute network storage model. And in that kind of world, a cloud needs to provide all four. Right. And so I think you're exactly right. This is just part of the infrastructure layer that in the future you build all the software systems. Right. I think one way to think about that is
Starting point is 00:29:09 there's two frontiers. There's the capabilities frontier. And then there's the Paretoefficiency frontier. The capability frontiers usually dominated by close source. And then the Paretoefficiency frontier, because of all the goodness of open source ecosystem flywheel effects, right? Where in this case you put out your model
Starting point is 00:29:26 and the entire ecosystem of developers can distill it, fine-tune it, ship better runtime improvements to the model, quantize it and so on, that makes that family of technology much more efficient to run than the close source version. The second is more secure because you have the whole world red teaming your model versus just this limited group of people inside your company if you're a closed source provider. So the business case is basically cheaper, faster, more efficient, more controllable.
Starting point is 00:29:54 It's pretty strong for the raw model, distraction. Then if you ask, okay, well, does the model provider have the right to win? Is there a business case below the model stack at the data center, at the chip level, at the cluster level? And is there a right to win above? Let's start with the topmost part of the stack, which increasingly people would call agents, a less sexy version would be to call it a fully end-to-end automated workflow. Right? Where today you have, if you take the world's largest shipping company, the Merks of the world or the CMACGMs, right? These are massive logistics and transportation companies that have fairly complex workflows. And if you think about the power of these models being turned into an AI agent, the work required to customize that agent for one of these mission-critical industries is quite hard today. An area where we're seeing a ton of progress is reinforcement learning, where if you craft the right reward model, the agent gets much better at accomplishing that task. Well, it turns out crafting the right reward model is really hard.
Starting point is 00:30:57 Even for sophisticated teams like Open AI, I mean, they literally rolled back an update to chat GPT, I think three days ago. They called it the sick-offancy update where they crafted the wrong reward model. And so a traditional legacy industry company has no clue how to do this. And the question is, would they rather invest that energy
Starting point is 00:31:15 to customize a closed source model or an open source model where if the close source provider, for whatever reason, goes down, shuts shop, which happens, raises prices and so on. Seuse their customers. Yeah, see their customers. We're essentially host.
Starting point is 00:31:30 And the natural arc of that as well for the agent layer seems to be to go to a deployment partner who has an underlying open source base. I think the cloud infrastructure of the sovereign AI layer is a bit up for grabs. And that might be a good topic
Starting point is 00:31:44 for our next pod. Yeah, absolutely. Well, let's wrap on that. Anj Guido, thank you so much. It's been great. Thank you. Thanks for listening to the A16Z podcast. If you enjoy the episode,
Starting point is 00:31:55 episode, let us know by leaving a review at rate thispodcast.com slash a 16Z. We've got more great conversations coming your way. See you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.