Tech Brew Ride Home - (Bonus) Nat Friedman Interview

Episode Date: April 1, 2024

What prolific AI investor Nat Friedman expects from GPT-5, Microsoft's general strategy in AI, how he invests in startups, and his background an philosophy when it comes to investing. Learn mo...re about your ad choices. Visit megaphone.fm/adchoices

Transcript
Discussion (0)
Starting point is 00:00:00 On April 4th, 2023, around 2 in the morning, a man was found stabbed multiple times on a sidewalk in downtown San Francisco. Hey, who did this to you? What happened next turned the story into a political firestorm. Reports have identified the victim as Bob Lee, the founder of Cash App. From Bloomberg Podcasts, this is Foundering, the Killing of Bob Lee, beginning April 16. Welcome to another bonus episode of the TechMeme Ride Home. I'm Brian McCullough. And I'm Chris Messina.
Starting point is 00:00:41 And our guest today is long overdue. The great Nat Friedman. Hi, Nat, how are you? Brian, I don't know if I'm great, but I'm fine. Thank you for having me. Well, you're great, especially now in this AI era, you're, you know, the investor that I hear about in almost every deal that we see. Let's start off with some high-level stuff because the last week has been, I mean, it's always crazy in the AI universe right now. But let's do some high-level stuff on like, you know,
Starting point is 00:01:11 Suleiman going to Microsoft, everyone fleeing stability AI, even the CEO. Both companies cited difficulties in finding a business model for why they are doing what they're doing. Does this suggest that there could be trouble for, you know, especially the consumer-facing AI model companies that maybe they're largely undifferentiated, maybe they're, peddling commodities at this point? What do you think? Well, there's a lot of variance in how well these companies are doing. I mean, there are consumer-facing AI companies that are some of the fastest growing companies I've ever seen and that are just doing spectacularly well. I mean, Mid-Journey is a great example of a bootstrapped consumer AI company that has just absolutely
Starting point is 00:01:58 spectacular usage and revenue and product and so forth. So I think it's very uneven. I think in the case of inflection, look, there's two sides to every transaction. And I think if I'm Mustafa, and I'm not obviously, but if I'm Mustafa, I think there's a couple things I'm looking at. One is, you know, I don't have a hit product. I have a decent product, but it has relatively low usage. And I believe in scaling laws. And therefore, you know, what I really want to do is build AGI. and compete with Demis and Sam and everyone else. And no matter how good my product gets, I don't actually see a path to the compute scale that I need to do that. So I think Mustafa is probably quite excited about going to a place where he can do that.
Starting point is 00:02:50 And then on the other side of the transaction, you have Satya. And so you have to say, okay, well, what's Satya thinking? And obviously, I don't know exactly what Satya is thinking, But you can sort of imagine looking at what's happening that, you know, he's had this very important early prescient bet on AI, put a billion dollars into open AI before GPT3, 13 billion into it now. His stock is way up. He's at the sort of forefront of this revolution. And he's got all his eggs in one basket. And, you know, November's board event highlighted potentially the fragility of that structure.
Starting point is 00:03:30 So I think he wants to have a backup plan. And my read is that he acquired inflection without acquiring it in order to have that backup plan. And now what does he have? I mean, I think the steelman interpretation is that he's got a team. He's got a model that claims to be GPT4-ish, roughly. He's got a cluster that's working and a place to collect talent and data and compute. And so he's got an internal open AI. It's, you know, there's a lot of gap there, too.
Starting point is 00:04:06 Opening eye is still the best. But I think he's got the makings of a hedge. And so that's how I read that one. You mentioned, obviously, you know, it's like an acquisition and name, all but name. Like, I heard you and Daniel Gross talking about this idea of like a priesthood of, like, training these models is such a sort of bespoke thing. like it's an artisanal thing, like almost like you have to have taste, I think is the term that you all use. Are we in a moment where it's almost like the IP doesn't matter so much as getting the artisans is the most important thing.
Starting point is 00:04:44 So like you almost don't need to acquire the company if you can get the talent. Yeah, I think that's true. The set of people who, the amount of tacit knowledge that's involved in successfully training a high quality large model is still quite high. So you can read the papers, you can look at the open source, but getting these things to train and converge and doing it over these large clusters and managing all of that. There's still quite a lot of that knowledge that's sort of not published, not written down. Many of the individual items are probably small, but they really add up. And the field is new. And so the set of people who really know how to do that is small.
Starting point is 00:05:23 And then if you believe in pushing the research frontier and kind of pushing out, you know, the capabilities, then, you know, the set of people who are kind of at that frontier is it's grown, but it takes time for that to grow. And so that's also still relatively small. In fact, it's roughly the same size it was before the entire sort of AI, you know, industry revolution that's attracted so much attention and capital and so forth. And so when I look at like what stands in the way of people training good foundation models, you know, the three key inputs are compute and data and talent. And I think if you're a mega cap tech company like Microsoft or a Google or a meta, you can get the compute. Actually, in compute, you're not limited by capital.
Starting point is 00:06:14 You're limited by logistics. I think this is actually true for everyone right now. you can have $50 billion to spend on compute, but actually converting that into useful compute is a logistics problem that many people, like, I don't know anyone who's super happy with their cluster that it's like, by logistics, do you mean like physical space or like cooling or like, what is the limiting resource there, everything? It's everything. It's the, there's a lot of pieces that go into it.
Starting point is 00:06:43 And so you have to have your supply chains lined up. If you're short, you know, 10% of your infiniband transceivers, it doesn't matter that you have everything else in the other 90% of the transceivers. Your cluster's not done and it doesn't work. And, yeah, sometimes it's space and power cooling, as you said. Sometimes it's just infrastructure. Microsoft has historically virtualized GPUs for OpenAI training on HyperV. And I don't know if they're doing that anymore, but they used to. to. And so does Hyper V virtualize all the GPU features that you want to use that,
Starting point is 00:07:20 you know, that kind of thing. So there's just, it's a big stack. Lots of stuff can go wrong. We've seen clusters not come up because of missing transceiver cables or transceivers or bands for a while or an issue. You know, that, you know, there's a, we have our clusters in Nevada and it's actually at the end of outside of Reno. And it's at the end of a long road. And that road, I think the locals call it Fury Road because there are nice like Furiosa like bad it's just like so many accidents on the road and when there's an accident the road can be blocked for like a couple hours so like your smart hands can't get to the data center for a few you know it's like all these sort of things can go wrong so I say real world problems yeah and
Starting point is 00:08:04 then yeah and then it's also getting into the supply chain is Jensen sending you the chips in wave zero and wave one you know that you like that sort of thing plus invidia stuff when it ships isn't quite done. So you know, you've got a... Yeah, what does that mean? Like just there's a lot of config to do or... Get the... There's firmware bugs. I see. There's software, there's Kuda kernels that haven't been implemented yet to take advantage of the instructions that the chip exposes. Sometimes there's hardware bugs that give you NANs and you've got a RMA, a bunch of GPUs that... So anyway, there's a logistics side, the data side, and the talent side. And yeah, the talent,
Starting point is 00:08:43 so scarce and critical resource right now. And I think that's reflected in like rapidly escalating ML engineer salaries very quickly. I kind of want to lead into asking you what you're expecting from GPT5. But a way to do this to come back to Mustafa going to Microsoft is, you know, one of the more out there theories that I heard was that, you know, if, if OpenAI achieves AGII. you know, by their bylaws, Microsoft can no longer access the technology. So is Microsoft making bets like this to sort of, you know, get out ahead of the fact that maybe AGI is right around the corner? What do you think of that as a theory? And then that can lead into what are you
Starting point is 00:09:31 expecting GPT-5 to be? Yeah. Yeah, I don't, that's probably not, that's probably not the dominant force driving SOTIA to want to have Microsoft AI. My guess is that people at Microsoft mostly don't believe AI is right around the corner. I think it's just like a single, absolutely critical vendor relationship that they have with OpenAI, which has strange governance, and that was demonstrated conspicuously a few months ago.
Starting point is 00:10:04 And yeah, I mean, it's just tough if you have this critical input to the future of every product in your company to have it have a wall there. But you know, I built GitHub co-pilot with Open AI and it was a great success and one of the proudest moments in my career honestly and incredibly grateful to them. They were so obviously so important to that. But it was also tough to have to like work across these organizational boundaries. Inside big companies, it could be tough to work across just internal boundaries and external
Starting point is 00:10:33 ones. Yeah, like throw that in and it gets even harder. So I think that's more the driving force. Microsoft still gets as, well, I don't know what they have in the agreements now, but what I've heard is that they still get the pre-AGI technology as part of their deal. So, and then I think like AGI is in the eye of the beholder. And ultimately it's something that, you know, it's just someone's opinion. Someone told me recently that Alex Tabrock, I think he tweeted that AI is when it takes
Starting point is 00:11:04 someone else's job, AGI is when it takes your job. I think I thought that was pretty good. Yeah, what do I expect from GPD 5? Well, you know, I was really personally impressed by the leap from GPD 35 to GPT4. I thought we saw kind of emergent reasoning abilities that were like noticeably much better. And less hallucination and to me, to me it was a noticeable leap. Obviously depends on what you're doing. But in the realm of coding for sure, I, I appreciate.
Starting point is 00:11:36 prefer to reach for the smarter model. And so I guess I expect more of that. And the weird thing about emergence is it's actually like this is not something even the labs can do is to predict which new capabilities has the threshold of your reliability. And I think that's how you should think about it. It's like all these capabilities are there. They're just not reliable.
Starting point is 00:11:57 And then as you get the model bigger and bigger, they kind of go from 20% reliable to 70%, 80%, 90%, and then whatever your threshold is where it becomes useful, once it crosses that, it's almost a binary event where you're like, oh, now I'm really willing to use this. And so it feels binary, even though maybe the emergence is actually kind of gradual. And so, yeah, I think the big thing is, yeah, like just dramatically improved reasoning and coherence and the ability to therefore do agent-like tasks reliably. So, you know, mini-stap operations.
Starting point is 00:12:33 I think people are finding ways to do this on GPD4 now. I mean, we saw cognition and Devon. I thought that was like a absolutely killer demo. And people have discovered that if you just repeatedly in a smart, very smart way, invoke, you know, inference GPD4, you can squeeze a lot more intelligence out of it. And so yeah, like, gosh, imagine when the models are bigger and better. So I think it'll be scale. You know, there'll be like another multiple of scale.
Starting point is 00:13:01 But what do you think about multi-modality? Do you think that'll be part of it too? Yeah, I would expect it would be natively multimodal. And, you know. What about like, what about active reasoning? Like, are we, like, within months of that happening? Like, this year, you feel like someone's going to make a breakthrough there? I've seen some things.
Starting point is 00:13:21 So I do think this idea of kind of looping in latent space and, like, pondering, you know, pondering a topic is an exciting idea. There's probably ways to do that without looping in latent space. Like chain of thought reasoning does work. And so you can just like internal monologue to do that. You sort of spread the compute that you need to figure out the hard token over lots and lots of intermediate tokens. And then, you know, the big breakthrough everyone's waiting for is sort of this RL on text, where you can use RL on text to like learn reasoning tokens to get better at reasoning, basically,
Starting point is 00:14:01 in a way that makes sense to the model. And maybe some of that is what's in Claude. I'm just speculating. I don't know. But yeah, I think actually we're going to learn a lot in the next year. Like, what if GPT-5 is not that impressive? That would be a huge deal. If it doesn't feel like a 3.5 to 4 leap, that would be a sign of some kind of asymptote,
Starting point is 00:14:23 at least temporarily, you know, while we find the next sigmoid to climb, essentially. So I think that a year from now will know a lot more about the future of humanity, I think, when we see GPD5 and many other things that are in progress. Speaking about a year from now or beyond, you know, given your background and experience at GitHub, you know, building co-pilot and at Microsoft, I'm very interested in what the future of open source kind of looks like, especially as relates to LMs, but also software development and to engineering sort of writ broadly. So if we're to sort of unpack those things,
Starting point is 00:15:01 I guess I'd just love to understand how you see software development evolving or co-evolving with the presence of a product like co-pilots. And what does it mean for the next generation of developers in terms of what skills they need to actually be gathering or gaining in learning COPSI? Yeah. Well, I'm sort of glad AI came around. I sort of felt like software development was starting to stagnate a little bit. Yeah, sure.
Starting point is 00:15:29 Prior to AI. So I think, you know, the productivity of an individual developer today compared to 20 years ago is just incredible from my point of view. Like the, and there's a few reasons for that one is, you know, we, maybe compared to 40 years ago, we added these important high level productivity tools like compilers and garbage collection and then, you know, object orientations. certain things that were just sort of somewhat helpful patterns. And then open source created this, you know, reusable library of components that people could draw from. And then the introduction of software registries like stuff like NPM really accelerated that reuse, sometimes far too much. But so those were like quite radical changes in how software was written. I think the web was a big changed to because people got View Source, which for a while was a big deal. It kind of oriented.
Starting point is 00:16:27 It created a new generation of open source people who were not ideological, but just highly practical. And so I think that's sort of where we were pre-A.I. And then AI comes along and it is this compression engine that's able to distill the collective learnings of all the open source code that's out there into a helper and maybe soon a colleague who, you know, can help do some of the work with you. I personally still think it's incredibly useful to learn to code. You know, I take, I guess I take the different side of what Jensen said last week. The reason I think it's useful for the audience. What did he say last week?
Starting point is 00:17:12 He said like you don't need to learn to code or something. I'm going to misquote him. I have immense respect for tens of these. Something like you don't even learn to code and there will be like little coding agents, you know, thousands of them and then they'll be like super agents supervising the coding agents. And he may be right, but to the extent that we will live in a human dominated economy for a long time, you know, human reasoning will be valued and- Which is by the way a wild statement to say, but yeah, continue.
Starting point is 00:17:36 Yeah, I think we will for a while. So during that period, which no one knows how long it is, I think human reasoning will still be very valuable. And learning to code is a great way to refine your reasoning abilities and your problem solving abilities. Not to mention when the AI writes buggy code, suddenly like it all comes into focus. You need to debug something throughout the stack. And so I think definitely we see like with GitHub co-pilot that it really raises the productivity of not as good programmers.
Starting point is 00:18:09 And initially at least senior, very senior developers were just got much less value out of it. Now I think we're reaching the point where senior developers are getting value, but they have an advantage in that they can like debug things go wrong. But I agree with John Carmack, you know, who said the core skill is problem solving. And there will until we are fully surpassed, you know, human problem solving will be incredibly valuable. And I think you can learn and refine that skill through coding. But I use copilot and all the stuff all the time now, too.
Starting point is 00:18:45 It's fantastic. You feel sort of naked without it. If you're a little bit rusty or you're an episodic programmer, you know, you can like dive right back in and you don't feel totally lost, especially given how sprawling APIs and SDKs and so for libraries, you know, have become. It's so sprawling that. I mean, it sort of feels like an exoskeleton in a way where essentially you're sort of going into, you know, these, I don't know, like forests of code and trying to like figure something out. like I've been writing some extensions for Raycast and essentially, you know, I have a co-pilot there sort of helping me out with TypeScript because it's not something I've written before.
Starting point is 00:19:19 And of course, it's just sort of like sort of auto completes and suggest things that's like, you know, you probably should know this, but whatever, we'll let it go. And, you know, here you go and you can move on. Like you said, I'm simply trying to like solve problems, right? Like this is not like, you know, mission critical code. So it, I guess like the other thing that I'm wondering about, given your experience, is like what it looks like for a younger generation that's coming up with the ability to learn to code.
Starting point is 00:19:40 And what, like, I do feel like it's kind of like, you know, I studied Latin in high school. And it's like helped me a great deal in terms of understanding just the root of language and, you know, word etymology and things like that. Is there going to be a similar type of benefit, you know, to learning basic, you know, skills and coding based on like, you know, the structure of language or that coding itself is a type of structured way of expressing ideas and concepts that is very terse and very efficient relative to spoken language. which evolved over thousands of years, such that now this becomes a way for humans to communicate better with machines. And I guess I'm just trying to understand, GitHub is such an interesting place. If you think about it from a different type of creator economy lens,
Starting point is 00:20:23 we think about development and engineering as being something that only very technical people do. But increasingly, it just feels like in whether it's one year or five years, there's going to be a generation that grows up with the ability to code as a second language, like coding as a second language. So I guess that's kind of what I'm trying to Think about based on, you know, just the presence, the growth, the popularity of GitHub, but also a set of tools that, you know, bootstrap you into the ability to solve problems, as you said, as opposed to having to go deep with any particular language. Yeah, I think the bootstrapping is really valuable because the activation energy to get started on something is often the barrier to get anywhere. And if you can kind of get some momentum and some dopamine quickly, you're just, I mean, like, I definitely, even, you know, me, I mean, I've been coding for so.
Starting point is 00:21:10 along and and thanks to GPT4 and co-pilot, the likelihood that I start a little project is so much higher because I just know I'm not alone and I'm in there with, you know, a little agent that knows the terrain and knows the APIs. It's getting over the blank slate problem. You just like gets fun. Yeah, I think of it like an e-bike where it sort of flattens the hills a little bit. You know, biking is still fun and, you know, but like I can bike around. Which can go faster.
Starting point is 00:21:37 Yeah. And you just don't have the, the schleppy parts aren't. quite as bad. Yeah. So yeah, I think that's right. I mean, some people are worried that, you know, we're going to sort of outsource our cognition to these prostheses and we'll become soft. And I think there's some risk of that.
Starting point is 00:21:53 But mostly that's like an eternal conversation. You know, when I was learning to code, it was like, oh, if you didn't know assembly, you weren't a real programmer and these kids today using compilers or, you know, they don't really get it. They're so confused. And so I just, I just. I think, you know, mostly we figure this out and smart people figure out what they need to know to get something done. And I think AI just increases the chance that a smart person will try to get something done.
Starting point is 00:22:23 And I think that's a good, that's basically really good thing. So people seem to love it. Like they demonstrate that by using it. Totally. Yeah. It's been amazing to see. On that point, one more question on this topic. Like, you know, one of the, I guess, memes or tropes that was going on a couple years ago was that data is the new oil.
Starting point is 00:22:44 And, you know, with the fund that Brian and I run to invest in AI companies, we sort of thought about that a little bit differently in that we kind of think about what we call AI varietals. Now, obviously being in California, sort of understand where that might come from. But the thought is kind of like data is more like the new terroir, if you will, and that the region and the culture and sort of like the ground for, which the grapes grow is sort of like data that is particular to a certain vertical or to a certain industry and that understanding the nuances of that data and information is actually quite important to building valid or useful LOM driven products. In other words, you can't just, I know I understand that we're trying to move towards this AGI kind of future where you know, you have one bot that does all the things. But it just seems like where, as they say, the, the
Starting point is 00:23:35 rubber meets the road is actually quite important to the effectiveness of these different AI products. So I guess I just wanted to hear you talk about kind of like the idea of like data as the new oil being the general purpose resource versus the pursuit of or the creation of specific data sets that come from different areas and realms and the utility of one or the other. Yeah, I mean, first of all, I definitely agree that data is really important. And in fact, like sort of more time I've spent in this field, the more I feel that the models in a way they are the data. Interesting.
Starting point is 00:24:10 You know, Mistral has demonstrated very impressive results with their small models. And one of the ways they've done that is just by very, very carefully curating their training data and not just kind of throwing the raw internet at it, but, you know, by really cleaning, cleaning their training data with a cascade of models that sort of, you know, remove duplicates and, like, you know, take out the garbage basically and turns out if you're raising your little baby AI and you're sort of screaming nonsense at it half the time like as well as if it has like really clean training data that's very high quality and high IQ and so so I do think that's true and you know by the way this is also a reason this actually sort of very good news for labs because they in many cases people publish papers but they don't publish their training data set and so to the extent that you really clean up your data you have this durable
Starting point is 00:25:04 advantage, you know, that you don't have to release. And so, and they can't walk out the door in quite the same way. So, yeah, I think data is really, really important. There's domains like robotics right now that are fundamentally limited, I think, by the training data. And, you know, we should see foundation models and robotics do unbelievably impressive things in the near future. And the gap between here and there is not hardware and it is not models or compute.
Starting point is 00:25:31 It is purely the training data. And I think, you know, my bet would be in a year or so. You've got, you know, robots, two arms sprout out of desk and they assemble a Lego and they wash dishes and they open an Amazon box. But they assemble themselves and then they, yeah. Yeah, maybe they do that. They tie shoes. What are they training for robotics on right now, essentially? Well, there are some sort of academically constructed data sets that are quite limited.
Starting point is 00:26:01 Is this based on the real world or based on simulation and synthetic data? Yeah, good question. So there are all of those things exist. In the world of kind of locomotion navigation simulation seems pretty helpful. In the world of dexterous manipulation, it's hard to simulate, you know, tying shoes and stuff like that quite as well. And so there you may need to like just record stuff happening. And the ways that people do that are teleoperations. teleoperation so you get a robot, you know, you have a warehouse and you have a bunch of
Starting point is 00:26:33 tele operators come in and they just like use the robot to do it. Super expensive. People are also just like taking egocentric YouTube videos and using those in pre-training. So you don't really learn robotic stuff from that, but you do learn some of the- Like there's a lot of like TikTok papers that have come out that basically like use the dances as a way to like train the models. Yeah, I guess you could do that. But you can take like just unboxing videos from YouTube for example.
Starting point is 00:26:59 I see that in your pre-training and you start to learn some basic manipulation physics and stuff like that. That's helpful for pre-training. And then there's these devices called data hands that are floating around just a few weeks. Physical devices or digital devices? Yeah. And so they have like cameras in them and they have like I see. And you can just grab them and use them to do stuff. And then they record that.
Starting point is 00:27:24 And so that you're the robot. That's so cool. You can use that in training data. So there's lots of stuff like that happening. So yeah, so basically I think data is just like this critical thing and the quality of it really matters. As to this question of like general models versus sort of specific models, the results that we see are the general really does well basically. And so like just these general models, you know, GPT4 properly prompted can in certain domains
Starting point is 00:27:53 like do as well as a $10 million specially trained model in that domain. And so, you know, it's hard to be sure about what the reason is for that. Maybe that $10 million model was not properly trained, for example. But that does seem to happen again and again. And so yeah, whatever world we want to live in, nature seems to be telling us that generalization is very powerful. And I think the other thing is a little bit about like the person who is either controlling, programming, or prompting it.
Starting point is 00:28:27 to the degree that they have acquired the vernacular of a different industry or vertical and or they have expertise. That's true. So having someone who's actually an architect or someone who's actually a doctor working with an AI engineer, whether it's necessarily a general model or a specifically trained model, I think having that person that has that domain knowledge and has experience is probably necessary to evaluate the outcomes. Yeah, I think that's right.
Starting point is 00:28:52 Yeah, I mean, the other sort of data matters argument is like, I think, I think, I think GPT 4 got a lot better at chess for turbo. And why was that? It was because somebody added like a chess eval and so they added a test training data on that eval. And so yeah, I just think they're, the other example I look at is GPT4 can do ROT 13 quite well,
Starting point is 00:29:19 but it can't do ROT 12 as well. And they're like cognitively equally difficult exercise. matter of the evals or what is that? It's because there's a lot of ROT 13 training data on the internet. I see. Okay. There's not a lot of people doing ROT 9 or ROT 9 or stuff like that. So someone should fact check me. Maybe they fix that.
Starting point is 00:29:38 Maybe it's good at that now too. But that was certainly true for a while. So that's sort of like an interesting observation about, you know, how these models work and how important the data actually is. Could I, could we talk a bit about you really briefly? Like the story that I've heard was that, you know, you were clued into the fact that AI was on the brink of a transformational moment by doing copilot's GitHub. Is that true or does your interest and activity in the AI space go back further than that even? Well, yeah, that is true.
Starting point is 00:30:13 I mean, definitely GitHub copilot woke me up to the fact that language models are working and they're going to get much better. And so large deep learning models are the future. I mean, that was super clear to me. when we did that. But no, my interest goes way back. I mean, I actually trained my first neural network in 1992. And it was not very good, but it was a neural network. And I went to school, I went to MIT, and I really was excited about my artificial intelligence classes.
Starting point is 00:30:40 And then I found out, you know, I didn't know this at the time, but we were in like this AI winter. And in fact, like all the techniques and classes were pretty, it was not that, it was not that, you know, it was not that impressive and very interesting to me. to me. And then I didn't really do anything with AI. In 2017, I started this thing called AI grant that was at the time conceptualized as like grants to open source AI research projects. And we gave out a bunch of money to open source AI research projects. And then some of those people went on to found AI companies like Cohere and Cresta were both founded by people who received those original open source AI grants. And then we sort of repurposed. that as like an AI startup accelerator because I already had the domain.
Starting point is 00:31:27 So we did like we repurposed that a couple years ago. But yeah, it was really GPT3 came out and I had somehow not really noticed GPT2. I'd sort of had passed by without me paying a lot of attention to it. But GPT3 hit me in the face and I was like, oh my god, this is incredible. We have to build some kind of product with this. We started working on that, working with open AI and prototyping things and like the models got better. the models got better every couple weeks. And I was the most fun product I've ever built.
Starting point is 00:31:59 It was so much fun. And we had this internal Slack channel where people were just like freaking out. It was passing everyone's interview questions and this kind of thing. So I knew it worked and I knew it was going to just get better. It was not going to get worse. It was going to get better. So I left GitHub believing that we get hub, this is very egocentric. few, but that we had sort of shown the world what opening eyes models could do and that AI was real
Starting point is 00:32:27 and it was coming. And because our customers were developers and developers are the people who build products that they would go off and build a whole bunch of AI products. And then like there was an 18 month period where that just did not happen. There were just very few startups started and very few products built. I thought there'd be co-pilots for everything for law and for medicine and for architects. And it didn't happen. And then like come August 22, I was starting to go crazy because, because stable diffusion was out and mid-journey was out. So that was neat. But like in this realm of text and reasoning and nothing was occurring.
Starting point is 00:33:01 And so that's why we reanimated AI grant to like say, hey, you guys should build AI products. You know, products not papers, apps not archive. And then chat GPT came out a few months later. And that was the actual starting gun for people, which shocked us. I didn't, yeah, I didn't expect it would take that. How did you hook up with Daniel in terms of investing together? Yeah, I mean, Daniel and I ran in sort of similar Silicon Valley circles, and so we knew each other.
Starting point is 00:33:34 I actually was at his YC demo day in 2010. I met him there briefly. And then we had friends in common, and we would meet at dinners, and we liked each other. And we started working on some projects together. We'd angel invest together. we co-led with some other friends, the first big round in Retool, and that was really fun experience. And we did AI grant together back in 2017 and some other things. And so we just liked working together.
Starting point is 00:34:04 And bit by bit, our amateur investing activities together became more serious and larger scale. Really what happened is we invested like small amounts of money in companies that got very big and then like couldn't didn't have enough money to buy more in the subsequent rounds and so we went off and fixed that problem and found ourselves in the position of like full-time investors eventually um do you do you have do you have a thesis that you're working from that like um or are you just like we we want to see everything and we want to go with whatever the new is Well, we're pretty entrepreneur focused, I think. If there's any thesis, it's that the entrepreneur builds the company.
Starting point is 00:34:53 And this is a mistake that if you're someone like me, you can easily make is that, you know, you've done things in the past. You've built products. And so you fall in love with ideas, but you project yourself into the company. And so you think, oh, if I were running this, I would do this and this and this. And you get kind of, you build this intergalactic plan to take over, you know, take of the world. and then like you forget to evaluate whether the founder will do any of those things or have better. Founder product fit is like so important.
Starting point is 00:35:23 Yeah. And just like how good is this founder going to be at recruiting people and how good is this founder going to be at making hard decisions and how much do they want to win? Like there's a lot in the psychology of the founder that I think is really important. So that's been something I've updated on a lot personally is, yeah, like I tend to really fall in love with ideas. And so this is something I have to work on. What are some characteristics of like AI founders that you think are like more promising that you've learned recently?
Starting point is 00:35:55 Because I feel like this era is just, it's different in terms of building software and products that people will use and that working with AI requires a slightly different sort of attitude. Well, there are a lot of evergreen things. Like you have to care about building great products and you have to care about winning. you have to have, you know, so those things are not different. The biggest difference I'd say with AI founders is whether they've deeply internalized scaling laws and whether they actually believe them or not.
Starting point is 00:36:25 And the founders who have the courage to scale and believe that high quality data, lots of high quality data plus lots of compute plus good talent combining those things will lead to like a better product. those founders will, yeah, they will outpace the ones who don't. I mean, like, just to make this clear, can you, what is the inverse of that? Like a founder who doesn't believe in scaling laws, what are, what choices would they make now as they're building their company? Well, they would think that our model is worse, but our product is better.
Starting point is 00:36:59 I see. Yeah. And the other version of this mistake occurs a lot in the AI field right now where people are like just into the ML and they don't spend any. Yes. The product and this is the most common issue, but we have also encountered the founder who thinks, well, our model is good enough, but our products better. And the problem with that is if the next competitor's model suddenly becomes 10 times better.
Starting point is 00:37:21 The switching cost seems so low now too. Well, and it just doesn't matter how much better your product is if their models 10 times better. So I learned a lot about that from mid journey in my proximity to mid journey because he was, he's been on Discord this whole time, but he's winning because his model is the best. And model actually is the best in ways along dimensions that users care about like controllability and aesthetics and stuff like that. You know, when Dolly 2 came out, it was very clear that Open AI did not think of it as a product.
Starting point is 00:37:50 They thought of it as a technical capability. They thought of it as text to image. And David thought I'm building a tool for imagination. And so, of course, this is a product that must have aesthetics and, you know, be pleasing and, you know, it should have certain types of control ability. It's not a technical capability demo, a research artifact. It's something that people use for their imagination. The other thing that's really, sorry, just to build on that, like, I've been thinking
Starting point is 00:38:15 more and more that good products nowadays are kind of the artifacts of healthy, useful conversations. And the fact that Mid Journey was first built in Discord means that instantly he's getting feedback from thousands of people who are using the product, complaining about the product, he's able to see it in real time and then make adjustments and changes as opposed to the conventional method of building a software. that ends up on a website or in an app store where you really don't have that bi-directional kind of meaning searching process with your constituency.
Starting point is 00:38:46 Yeah, I think the discord thing has been incredibly beneficial for them. I think you're right. He's like incredible at using polls and listening to his office hours with all of his users and it's bidirectional. So I think that's spectacular. And then it also makes the product self-documenting and more social and fun. So those are valuable things. It's time for him to get that web app out there, I think.
Starting point is 00:39:11 Agreed, agreed. Yeah. Yeah, just final one for you. I'm going to bring back that period when you said, you know, there was 18 months where not a lot was happening. You were expecting this Cambrian explosion and it didn't happen. What would you fear would be the thing that would lead to another sort of roadblock period where things slow down?
Starting point is 00:39:36 We're expecting GPD 5 and next week something else and now video and robotics and stuff. But like what would be the fear that you have from anything that would put a pause on sort of the revolution that we're experiencing right now? Well, I think there is just baseline friction of product people doing great things. There's just not that many people who were. I mean, I thought the, you know, cognition demo was very telling in that way and that, you know, GPT4 had been out for a year. and they were the first ones who figured out how to repeatedly inference it and get agentic behavior. And so, like, why couldn't someone have done that six months earlier? Is there a good reason? Maybe there is. I don't know.
Starting point is 00:40:14 But I would be surprised if it wasn't possible to do six months earlier. So I think that's the baseline thing. And the world's just incredibly contingent. The frontier is extremely inefficient. It feels like there's millions of people at the frontier. There's really hundreds. And so it's just, like, extremely, extremely inefficient. Yeah, what could slow things down?
Starting point is 00:40:36 Well, I think the biomarkers to watch are probably, yeah, like, is GPT-5 noticeably and conspicuously much, much better? I think that's obviously a big one. I think probably Microsoft's AI revenue would be another one. If Microsoft co-pilot revenue were shockingly low somehow or sort of missed expectations, or if Microsoft starts to attenuate the guidance there, they're extremely good at guiding the street, that would be very interesting. I think that would take the heat off a little bit. If scaling, basically the entire future is determined by how much AI capabilities improve
Starting point is 00:41:16 over the next 10 years and what the shape of that curve is. The one thing I feel incredibly confident about is that in 2034 AI capabilities will be dramatically better than they are. Today, I feel a little less confident at drawing the shape of the climb from here to there, you know, how much of it happens soon, how much of it happens later, you know, how, how gradual it is. Is it a bunch of stacked sigmoids or is it just this like exponential curve? I'm not really sure. But I think that's the key question about the future. And it's like if this asymptote soon, it's a revolutionary new tech platform.
Starting point is 00:41:52 And, you know, it's as big a deal probably as the web in some ways. But it's just something we can grok. And if it doesn't, ascentote for a while, then it's civilization altering. And I think that seems somewhat more likely to me than the phlegm. Nat, thanks for coming on and painting the picture of all that. My pleasure. Thanks for having me.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.