a16z Podcast - a16z Podcast: AMPLab, the Power of Open Source, and the Future of Systems Software

Starting point is 00:00:00 The content here is for informational purposes only, should not be taken as legal business, tax, or investment advice, or be used to evaluate any investment or security and is not directed at any investors or potential investors in any A16Z fund. For more details, please see A16Z.com slash disclosures. Welcome to the A16Z podcast. I'm Michael Copeland. The place where Apache Spark was born, UC Berkeley's AMP lab, has not just created a major open-source software, platform, it spun out more than its share of groundbreaking companies. Full disclosure, A16Z has invested in three of them. So how did they get there? How is open source and the Amplab approach reduced the friction between student and faculty ideas and launching them into the real world? Co-founder and director of the Amplab, Michael Franklin, joins A16Z's Peter Levine to discuss the AMP Lab model and their own relationship as an academic and an investor. How Yuan Li also joins the discussion, which was part of our 2015 academic roundtable,

Starting point is 00:01:07 to offer another perspective. Lee's company, Takyan Nexus, came out of work he did as a student in the Amplab and the resulting open source project. Lee describes his struggles and victories in making the transition from student to founder and leader of a company. The Amplab example and what every academic and entrepreneur can learn from it on this segment of the A16Z podcast. So let's start with you, Michael. You were an entrepreneur in between being an academic for a long time, but Amplab came out of the desire to do what? And how did you sort of make that happen? Yeah. So, boy, where do you start? So, yeah, so it was interesting because I was off, I took two years off. from teaching at Berkeley to get a company started and then, you know, kind of came back after that.

Starting point is 00:02:01 Jan Stoica, who's also one of the founders of the Amplab, at the same exact time, went off and was away for two years doing a company. And when we came back, I think there were a couple things going on. One is, after you've done a startup, academia can seem a little, a little slow, a little quiet. And so, you know, you get used to being in this environment where kind of every day is kind of a life or death situation. Right. And you walk in, and, you know, mostly in academia there's definitely life and death situations, but they're sort of not every day. Death is slower. Death is slower.

Starting point is 00:02:41 Right. And so I think, you know, I won't speak for Jan, but I know from my perspective I was kind of craving a little bit more excitement, a little bit more interaction. You know, a lot of traditional academic environments are a little solitary. And so there was a project going on at Berkeley called the Rad Lab, which was a project of systems people and machine learning people who had gotten together to do what at the time was called autonomic computing, where you use machine learning algorithms to help manage large computer systems. And so I ended up joining that thing, not because I was so interested in not autonomic computing, but just because I like the environment of having all those people around. The problem was that Rad Lab was getting ready to wind down. It had been started with a five-year timeline. And so they were thinking of splitting up.

Starting point is 00:03:32 They were talking about how to split up the lab. You know, we had this big open, collaborative space. And, you know, when I had been out in my company, I saw what now is what's called the big data revolution or whatever. But every company I talked to to try to sell them stuff, it was just clear they were getting more and more data. And just that seemed to me. to be kind of the perfect problem for this group of people

Starting point is 00:03:57 who on one hand did large-scale computing, on the other hand did machine learning. And started talking to Jan, who was out seeing similar things and we realized that was the next thing to do. So it was a technology trend that you really... I mean, it's sort of bigger than that in a lot of ways, but it goes out from there. But big data was it.

Starting point is 00:04:16 Yeah, and I think just about everyone here as a professor. I mean, the message I gave, So when we have a new chancellor at Berkeley, and when he showed up, he had heard about the AMP Lab, and he kind of wanted to understand more about it. So he invited me to tell him about it. And the thing I said to him was, you know, if you remember one thing from this conversation,

Starting point is 00:04:35 here's what I'd like it to be, is that AMP Lab happened because there were two faculty members who got to go off away from the university for, you know, a couple of years, and engage with what was going on out of the world. And that's where we saw it coming. And we saw it before a lot of other people in academia because of that. So Amplab, you know, it's working. I mean, it's from our perspective.

Starting point is 00:04:58 And Peter, I want you to jump in here. It's like it's working incredibly well, both from an academic standpoint, you know, to get this kind of critical mass of people and ideas together. And then boom, out the other end comes, you know, all these great companies and ideas and projects. But so, and I want you both to answer this. Peter, you first, what are the ingredients sort of the AMP lab has mixed up or that you can identify that makes Amplab work so well from your perspective? From my perspective, I think the most interesting aspect of the Amplab is sort of this macro trend in system software now being done by a whole new generation of folks. Enterprise and infrastructure software, system software was

Starting point is 00:05:49 traditionally done by folks, let's say, leaving Cisco or leaving Oracle. And it always was a derivative, like you work at Cisco and then you go do a startup or you work at Oracle and go do a startup. And it was always kind of, to me, there was a bit of incrementalism. And it was sort of an older generation of people actually doing the work on some of those new companies. And what's happened in many universities, but Amplab in specific, as we find that there's now the new generation working on the most interesting computer science problems applied to system software. And yes, big data, but it's databases, operating system software, kind of all the infrastructure pieces. And I got to tell you, that has fundamentally moved the needle in the industry to where

Starting point is 00:06:41 you know, the companies coming out of there are way, way different than what I found, you know, from people coming out of, let's say, Oracle or Cisco. It's just a revolutionary new thinking and kind of it just opens your eyes to really new ways of doing computing. And that's been, that to me, that's been amazing. If you were to distill that to an ingredient, for lack of a better word, I mean, is it focused then? Is it like that you guys don't try and boil the ocean and solve every problem, but you're focused on, look, if I want to do research and do my graduate work in infrastructure

Starting point is 00:07:14 and, you know, big data, I go to Amplab. No, we were trying to boil the ocean. Are you telling us we didn't know the ocean? I'm not saying. No, I don't know. I mean, I think from my perspective, I think another way to say what Peter was

Starting point is 00:07:34 saying was, first of all, open source has made a lot of this possible. And so, you know, I always tell people when I started my career as a database systems researcher, you know, you'd come up with an idea. If you thought it was a pretty good idea, you know, you'd go and you'd tell, you'd give a talk at IBM, and you'd give a talk at Oracle, and you'd give a talk at Microsoft. You know, and, you know, either they would take your idea and steal it. Steal it.

Starting point is 00:08:02 Well, they'd hire you. They would put it in the system. Right. And then, you know, maybe you'd hear about it, maybe you wouldn't. Or they'd say, oh, no, we don't need that. Whereas now, you know, the way our students do work is, you know, they work on a piece of software. If it looks useful, they blog about it or somebody else does, or they put it out on the Amplab repo and flip a switch, and all of a sudden people start trying it.

Starting point is 00:08:24 So the friction from, you know, student idea to actually people using it. I think that's great. It was a great friction has been way reduced. And so then that friction and that relationship between the corporate world, then, how do they arrive at the Amplab and then how do you leverage them? So, yeah. So, I mean, I don't know if everyone here knows a lot about the Amplab. But one of the things that's interesting from an academic perspective in the Amplab is our funding comes about half from, you know, the government from NSF and DARPA and places like that and half from companies. So we currently have 30 companies that help sponsor the work.

Starting point is 00:09:01 And we engaged with them very early, telling them not only what we've done, but what we're thinking of doing. And that has a couple nice features. One is we get kind of instant feedback on what sounds right to them and what doesn't ring true. And we also get feedback from them on problems that they're having that we're not really looking at. And then the other advantage it has is that by getting them involved early, they're more inclined to try to, you know, use what we do and give us feedback. Right. So who gets IP rights among all those contributing companies?

Starting point is 00:09:41 Brilliant question. So, and this was done way before Amplab. You know, Berkeley has a long history of open source. And so there's been a bunch of projects that work this way. And the way it works is we don't give the companies any IP rights. and we It works for you at least It seems to work for the companies too

Starting point is 00:10:06 We could talk about who it might not For maybe But the idea is we do everything in open source And so then the next question is Well then given that the companies could get at the stuff anyway Why do they give us money And that's a more complicated question Different companies have different reasons

Starting point is 00:10:26 Well Peter you have a good answer for us You invest in multiple open source companies, so open-service projects that become companies in a different sort. And we'll talk, H.I, with you about it more in a minute. But how does it work for you? And when you're approaching an academic institution, where do you start? And, you know, let's take Amplab. Like, how did that relationship even blossom? Well, we had a number of connections into Amplab.

Starting point is 00:10:52 Mike and I actually worked together in a past generation, so we knew each other, Ben, of course, new folks there. And so we kind of, look, it's always been a matter of finding a couple of people inside a university and then building on that relationship. I think what's been unique about Amplab, it's actually one relationship with multiple projects. Typically, it's one relationship with one project. So it has become a very effective sort of mechanism for us to go in and actually look at all the projects going on. It's just a centralized place at Berkeley,

Starting point is 00:11:27 which has been fantastic. And then look, I mean, once we see the projects and we look at them and evaluate them as potential businesses, we invest as we would any other company, right? And so then it starts as, you know, commercial enterprise and we go from there. Maybe you can respond to this question of IP and the corporate world because, you know, again, in the open source project that become companies, you have a very specific kind of view of the world and why open source work. as a business? And that might sort of answer to why the corporate world is fine with, you know,

Starting point is 00:12:03 investing the way they do. Well, one of my beliefs about open source, which gets to sort of this point, the IP ownership, first of all, with open source, it's out there. IP ownership is, to me, less important than who actually wrote the code. And so what I look at fundamentally for an open source project is I'd like to see that the inventors of, you know, the inventors of the code are part of the company. And whenever you have a fork, which is, you know, a, let's say you have two projects that use the same code base after all it's open source, the fork typically is not as thoughtful as the folks who have invented it.

Starting point is 00:12:45 And all the projects coming out of Amplap, in fact, one of the rules, I mean, I do most of our open source investing, and that's one of my guidepost is to say, look, it's got to be a real exception if we're going to invest in something that's a fork of, you know, the real folks doing it. And so what we have found out of Amplab, it is the inventors of the particular project who are starting the company. And that gets you great competitive differentiation

Starting point is 00:13:11 because if anyone forks it and tries to come up with another team, it's like, look, we're the inventors of the project. We run the roadmap. If you want support or add-on products or whatever it might be, the fork company is never going to be as strong as the one who has invented it.

Starting point is 00:13:29 So the projects coming out of Amplab certainly fill that first requirement for me. You know, it's an interesting question that we get periodically whenever somebody new shows up in the administration at Berkeley of, you know, why are we doing this open source stuff? Why don't we patent these things?

Starting point is 00:13:51 And then there's this conversation that you have to go through about, well, okay, you know, how many software patents you know, we brought in, you know, eight figures worth of, you know, industrial donations for Amplab. How many software patents has the University of California ever had that brought in that much money? And the answer is maybe one. They had a patent on the browser, I believe, right? The first browser, where's Mark?

Starting point is 00:14:18 Yeah, Mark's out here. One of the very early browsers was done at Berkeley. Let's put it that way. And they had some patents on that. I think that was the one that they made money on, but I'm not even 100% sure. And the good news is it'd be hard to find an administrator at Berkeley who could give you the answer either. And so that's the thing. Once you have to make this argument about the importance of building the brand, building the level of activity.

Starting point is 00:14:44 And then, you know, it's a little indirect. But yeah, these, you know, the opportunities for philanthropy start coming in. And in fact, Berkeley now started this thing they call the Founders Pledge, where they ask people who are We're doing things in the university to pledge some portion of their future revenues or profits or whatever. And it's non-binding, but whatever. So they're working on it. Michael, how do you view projects that come out of Amplab and research? Is there sort of a sense that you have like, oh, my God, here's another one.

Starting point is 00:15:19 Like this one's going to be off to the races and let's go? or how does I just wonder like the sort of when research becomes commercialized or commercializable what are the clues there? I don't know about commercial or commercializable geez that's a

Starting point is 00:15:41 tougher one. I mean just I thought where you were going with that was sort of what's a good idea and what's not a good idea whether it's from a research perspective or do you guys even care whether it's commercialized or not. Yeah I think I think that's I hesitated because that's not really high on our list when we start thinking about what to do. Yeah, that's what I would imagine. So what's a good idea?

Starting point is 00:16:02 And then I'll ask you the commercializable part because you probably do care. Well, wait a minute. By the way, you know, I don't want to say we don't care. I'm just saying that's not the first thing we worry about. But I think, you know, a lot of it just has to do with somebody in the lab being, you know, being, I mean, it's going to sound cliche, I guess, but being passionate or being excited about working on something. Because, you know, that's what happens. You know, a lot of our most successful projects now

Starting point is 00:16:30 are things where, you know, somebody goes off and works on some piece of the system, you know, puts it out on GitHub, and somebody finds it, and all of a sudden there's a community starting to form around it. And so, you know, we're in this environment now where ideas go out there quickly, and then very quickly you sort of see what gets traction and what doesn't.

Starting point is 00:16:52 So it's traction, it's sort of passive. It's, you know, and it's got to be a good idea. I mean, that's what a good idea is, traction and passion? Absolutely. Well, and also, I mean, we are a university. We're a research lab. Our main output is great students. And, you know, one thing I want to mention to all the professors here,

Starting point is 00:17:12 since, you know, you might think, well, okay, but, you know, I have this problem. I have to decide. Do I want to do, you know, good research, or do I want to do things that are, you know, maybe useful? and I think most of the people since you're sitting here I'm probably preaching to the choir but that's a false dichotomy I mean we were very fortunate in this current year

Starting point is 00:17:35 you all know the ACM dissertation awards they give an award to sort of the best PhD work in computer science around the world so every school gets to nominate two people I'm chair of the department it just so happened last year the two people got nominated were from Amplap. Just a coincidence. Matei Zaharia won the award.

Starting point is 00:18:03 He's a guy who did Spark. John Ducci, who's the other guy who nominated, got one of the two honorable mentions. So of the three awards, ACM gave worldwide, last year, two of them came from Amplett. What does that say about technology and kind of its place in just broader society then and culture and research, et cetera, in academia?

Starting point is 00:18:21 I think there's probably a better appreciation now for impact than maybe there was in academia, you know, even five years ago. I agree with you. I think that's right. If you look at, you know, a lot of innovation up till, well, to me, up until recently, it came from like Bell Labs and IBM and Microsoft Research. And that's where a lot of the new, I would say, the new commercial ideas came from folks leaving those or pushing the envelope. So now I see a lot of it coming out of universities, which is, that's fantastic. I have to tell this great story, though, if you don't mind. Please. So we had a delegation from China visiting this weekend, and we showed them, you know, the AMP Lab and all the stuff we were doing.

Starting point is 00:19:07 We showed them a bunch of other data science things going on around Berkeley. And at the end, the president of this university, who's very prestigious university in China, said to us, well, whose plan was this? you know who came up this is you know the work is wonderful who came up with this plan and you know we all sort of just looked each other and laughed because there really wasn't a plan it was just it was really just a bunch of you know different faculty who were doing what they were interested in and decided to collaborate let's

Starting point is 00:19:38 let's shift some gears a little bit to advances in computer science what are the most important things you're starting to see and what are getting your students excited, and Peter, you know, answer this question as well. I think, so the whole idea of big data has multiple, I think it's a generational kind of, there's multiple aspects of big data. Now that we're able to collect data and now we're able to do real-time big data a la spark, I think machine learning and deep learning are probably the next. I mean, to me, I see a lot of those projects, you know, people coming either out of

Starting point is 00:20:12 university out of Google, out of a variety of places to think about how we process and become much more predictive about the information that we have. And I'm actually pretty excited about that. I mean, there's a lot of noise in the space right now. There's a lot of people sort of interested in looking at that, but I think there's a relatively new frontier that's occurring at a lot of universities around sort of machine learning and deep learning. Yeah, I mean, so machine learning, deep learning, I also want to know, like, with your databases would be another thing, like that, anyway. With your colleagues outside of the CES department, is that all part of their, you know,

Starting point is 00:20:54 language now? Like, oh, I'm working on a big, you know, AMP lab, we're all about big data, machine learning, you know, deep learning. Do they get that? Yeah, I think a lot of people do. I mean, one of the big buzzwords around our campus these days, I think most campuses is data science. And that one tends to be a little more inclusive than big data. Big data is looked a little bit as kind of a computer science thing. Whereas data science, really all around campus,

Starting point is 00:21:21 I mean, even astronomers go around saying that they're data scientists, for example. So data science is a really hot topic and one, you know, about how do you solve real analytical problems in a bunch of domains. I've been getting interested in things around trying to, I've been doing databases my whole career, so I'm sort of interested in reaching out

Starting point is 00:21:53 from the database to actually go out and affect the world. So, you know, if you look at things like cloud robotics or you know, think about things you can do with drones or anything like that, I think a lot of people are getting excited about

Starting point is 00:22:08 Internet of Things, but in the new version, you know, in the old version of Internet of Things, it was just you've got sensors out there and you're collecting the data and you're doing something with it. In the new version of Internet of things, you're actually going out and interacting with the world. And that involves also the people that are in the world that have to coexist with all these machines. One of the areas that we're seeing, you know, both from universities and new projects coming up, this whole idea of moving the compute out to the endpoint. So like, it's really interesting to see this, the ebb and flow of computing from sort of centralized to distributed. We've been in this centralized world, cloud computing,

Starting point is 00:22:46 and then you have the endpoint, which shows pixels. And now we're starting to see a number of really interesting projects where the endpoint, our supercomputer in our hand, is actually being used as a computer, not a display device. And I think the internet of things and drones and however you want to define this sort of range of computers that exist, not so much in our pocket, but out there in the world, I believe that there's going to be computing that will be fundamental to how we start to think about data and real-time analytics that occur not so much in the cloud, but at the end point.

Starting point is 00:23:20 And so we're starting to see that, in fact, out of universities and all of that. So there is a trend now back towards, I predicted, distributed computing, we'll be back. and it's going to be with all these endpoint devices and drones. Michael, I've been asked you one last question, and we'll bring H.Y. App, the P is for people. You just mentioned people, you know, and maybe you guys were a little bit ahead of yourselves five years ago,

Starting point is 00:23:45 but how do people fit into sort of the Amplab view of the world? And again, when you think about these sort of database-backed Internet of things and how you bring in all the people and all the data, how do people fit into it all? It's interesting because the view of people in the lab is, or the role of people in the lab has evolved over the, since we started. When we initially started, the idea that we had was that algorithms, in terms of, you know, machine learning, you know, statistical processing and machines, the idea was cloud computing and large clusters. And then people, what we were

Starting point is 00:24:25 thinking about was, well, these are the three resources that you have available, three types of resources you have available to make sense of data, make sense of big data in particular. So our view of people, when we originally started, was to think about human computation and crowdsourcing and things like that. In fact, Tim's sitting back there, Tim was one of the early people in the AMP lab, and we worked on a project where it was called CrowdDB, where, you know, I like to call it the world's stupidest database system, where, you know, if you asked it a question and you weren't attached to the network, it would just say,

Starting point is 00:25:00 gee, I don't know the answer to that question. What do you think? But if you were attached to the internet, then it would create a bunch of jobs on Mechanical Turk or something and send them out and let the crowd answering. And then all of a sudden it wasn't such a dumb database system anymore. And so we still have a big part of what we're doing

Starting point is 00:25:18 in terms of bringing in human-in-the-loop analytics, looking at how do you get people either as individual experts or analysts or as crowds to do? data cleaning. The idea was you wanted to be able to bring in people to solve those parts of these, you know, machine learning problems that the machine learning wasn't quite up the speed for. And how things have evolved kind of over the five years is now we're also

Starting point is 00:25:41 being a little more concerned with the fact that, you know, ultimately the results of the analysis is, in many cases, going to end up in front of a person. So how do you support that person in their job? But the original vision was really about human computation and crowdsourcing. Great. Well, Michael, we're going to kick you off now. Thank you so much. All right. Thanks, guys. So, H. Weatherly, he was the co-creator of Takyan, right?

Starting point is 00:26:09 And so he is, as you say, Peter, the original OG open source guy on this project. And you can describe it. It's a memory-centric distributed storage system, which you've now formed Takian Nexus, which is the company that is built around that. Maybe it just gives us a brief discussion of, Takayan and what it is, the problem that you've gone after. And actually, why you wanted to address that problem to begin with? Oh, okay.

Starting point is 00:26:41 So Taekan is a memory-centry distributed storage. And I think from the high level, the vision is that memory-centric computing is the future, and you have a computer layer of a storage layer, and we want to be the storage layer. And why I wanted to do this problem, that's interesting. So personally, I'm very interesting in storage, but when I joined like Berkeley Amplap, like four years ago. So at Amplab, we have other two projects like Mesos, Apache Mesos, and Apache Spark. Both are very successful. And actually, the company behind those projects also, like, funded by Andres.

Starting point is 00:27:23 and towers. But at the time, the storage piece is still missing. So I was fortunate that my advisors and the lab give me the opportunity to work on this piece. And it's just very fascinating to work on something you think is very important, interesting. Yeah. So Peter, let's get back to this good idea, bad idea question. You know, clearly what H.I. was working on you thought was a good idea. So what were the attributes? of that idea that you thought then, and or evidence, that you thought this was something that, you know, you wanted to put your time and effort and the firm's money behind? When I, when we as a firm look at, you know, what's a good idea versus maybe not such a good

Starting point is 00:28:09 idea, and time will tell, you never know, but we go into it, all our investments we obviously go into it, think it's a freaking great idea. They don't all turn out that way. So that's just one caveat. The view here, I have been in the storage space for a long time, and when we met, one of the concepts that we've been talking about internally, imagine if memory in a computer flattened down to where there was only one memory hierarchy, or let's say two, okay? I know you may not think so. But like, well, hang on, though. Spinning disks are going away. Okay? So if you take that to the next lot, you know, we have SSDs and we have RAMs. and then we have, you know, CPU memory. And part of my belief was that if memory gets cheaper and cheaper, why don't we have a file system and why don't we have an architecture that actually supports

Starting point is 00:29:04 this new type of computing? There's lots of computers and operating system code and the way we write files and the way we treat structures and all this stuff has all been written under the assumption that there's this memory hire. that goes from very fast and expensive to cheap and slow. And everything we do bounces through that hierarchy. And my belief, whether you believe,

Starting point is 00:29:29 look, I love that people disagree, for the most part, when more people disagree, that's the project we want to go invest in. Because like, you know, it's too crazy for, you know, hell, that'll never work. But when enough people disagree that there will never be flat memory structures, then it's a great idea. You know, at least to get started, right?

Starting point is 00:29:49 It may never happen, but that was the thesis, that memory was going to get much, I mean, look, you look at cell phones and you look at the memory in there, it may not be fault tolerant and may, you know, fall apart all the time, but part of the systems we want to go solve for that, but it's very inexpensive. And so to the extent that the mobile supply chain starts to eat the back-end data center was kind of our thinking on how this memory, you know, in-core memory starts to flatten itself. So, With that, if you make that assumption, and if you believe it, which I, you know, believe to some degree, then this project exactly maps into that future. And even, I would say, even if it doesn't quite map exactly like there, it's still a great project. So, like, the grand view would be memory flattens, and this becomes basically the in-memory file system for all computing. And if we don't quite get there, it's still a great file system for big, data and other applications. So that's sort of the, you know, we sort of look at the grand view and then say, okay, well, if that doesn't happen, because we were wrong about that, what else

Starting point is 00:30:59 can this become? So that, I mean, like when we look at ideas, I try to look at ideas on that dimension to say, you know, what transformations might occur in the future to where there really is a big need for a whole new design of something. And this was one of those ideas. Now, I do build taxonomies in my brain and probably the simplest thing that I do, and I'll leave you with this, like, how do I actually, like when you see something, how do you know or how do you choose to invest in it? So what I have, I've been here now five years, and my whole philosophy around this is I take something that's very popular right now, cloud computing, and I say, what happens when cloud computing doesn't exist anymore? What fills that place? And all of a sudden, like, and I put in my head the most ludicrous things, because 15 years ago, if you assume Microsoft might not be at the top of the compute chain and, you know, everyone would look at you like you were crazy or digital equipment or, you know, you say Google now, oh, Google may not be at the top of the food chain in the future and everyone would look at you like, well, you're crazy. They'll be there forever.

Starting point is 00:32:11 So I think by take, for me, taking away something that's out there has been a really interesting exercise, and then you find projects that actually, you know, I don't go out and look to say what happens, let's say, when VMware goes away or whatever. You sort of weigh, I have these views in my mind, and then all of a sudden you start to see things that fill the spaces. And so that's kind of my, you know, some of my models. And a lot of, actually, a lot of our Amplab investments have been. based on, look, phenomenal entrepreneurs, great ideas in their own right, but then me also thinking, okay, well, like, if something isn't there, maybe this is the thing that fills it. Is that part of the sort of ecosystem, Amplab Do Tube? Because it's kind of, it's hitting all those slots? Well, Amplab, certainly if you look at Spark, Mesos, and Takion, I should say Takion Spark Mesos in that sort of architectural order, it does create a full stack of, you know, kind of the next generation of big data kind of infrastructure. And that's really interesting. I mean, it's in memory. It's got like, you know, scale out, all this stuff that currently

Starting point is 00:33:24 doesn't really exist commercially. And so, yeah, we, you know, kind of, it's part of that. And H.W. Do you think Amplab is a place like that because there's this kind of, you know, ecosystem of smart people working on these different parts of it. So look, somebody else is already here, so I'm going to go here. I mean, is it this sort of ecosystem that exists there and you need to figure out where you fit within that ecosystem? Or is it just, look, here's my interest, and I'm going to go after it? But from my perspective, I think the lab gives us a very great environment.

Starting point is 00:33:59 And from the individual perspective, we don't have that much pressure, like to, you know, in terms of paper publishing, those type of thing. Right. And in the meantime, in the meantime, we can focus a thing for a pretty long time. And, you know, different people, they have their own interest. We have a pretty large lab, and we have a lot of people to bounce the idea. We can talk with professors and talk to students. Actually, the professor don't have an office, like they just sit in the same cube as we students there.

Starting point is 00:34:32 So it's a very good environment. I think we can leverage. In the meantime, the lab also talk with the industry very regularly. Like we have the retreat thing like twice a year, and we have like maybe 20 industry sponsors that we talk, communicate. I think those are very helpful. Peter, in your experience, what are some of the traits that you see in H.Y.

Starting point is 00:34:58 And other entrepreneurs that correlate with success. One is just, look, a deep understanding of the space that they're in, that's, and the passion to really go after it. We always want to see that. I think also it's the willingness to learn all the things that the entrepreneur doesn't yet know. And so, look, I mean, you know, H.Y is a great example and a very positive example. He's very passionate, and he understands his area better than anyone on the planet. But there's a lot of other things that you need to do to learn how to run a company. And all the little things that you all might think are easy and stupid actually trip you up.

Starting point is 00:35:42 And so what we want to look for is somebody who is going to be coachable because there is sort of a pattern. There's a blueprint on how to go do this. I mean, now that I've done this multiple times and pretty much take somebody through, here's what you do this week, next week, next month, in terms of building out of sales organization, in terms of hiring a product manager, in terms of hiring a CFO, like there's all marketing, how do you do that? I mean, there's all these other pieces to building a company besides the technology. And so one of the attributes that we want to try to assess up front is, is the entrepreneur and, you know, PhD student grad capable of being coached into these new areas? Because let's face it, I mean, a lot of folks don't

Starting point is 00:36:29 want to be coach. It's like, look, I'm just going to go do my shit and, like, leave me alone. I don't want to learn about this other stuff. And you probably can't build a company if the only thing you do is focus on your technology. And you can, I mean, we're in it all the time. So that's what I would say. That you thought you never would have to deal with and that you either love or wish somebody else would do it. I mean, I just, I think I just want to, I have a goal and I just want to achieve the goal. And along the way, there are different things. For example, like Peter mentioned,

Starting point is 00:37:05 in terms of hiring, like process, there's many, many things in the hiring. Like, at the beginning, we don't have a process. And then you have a process. And you modify the process, improve the process. In the meantime, you have this and that condition. And we talk about this type of stuff. And some cases very, it's very new to me, right?

Starting point is 00:37:24 But it's also very interesting to me. Is there a class or something in school you wished you had learned that would have helped you? Do you wish you had taken more marketing classes or any for that matter? You know, I think school taught me really a lot, and that's the reason, you know, like, I can still do some stuff. But in the meantime, I also think a lot of things you should, like, you can learn a long away as you go. If I have to put something, I'm an engineer background, I'm a CS background. So back to the school, maybe from the technical perspective, it's like, you know, be great if more students know how to build a production level like system, if they want to do a system level thing. Right.

Starting point is 00:38:11 For other stuff, I think, I mean, you can learn along the way if you're willing to learn. Yeah. Right, I agree. Yeah. Maybe you're not quite there yet, but your views on the other parts of the business. Have they changed, you know, so sales, marketing, you know, all that stuff? Are you just learning as you go? And are you a bottleneck still, or are you so in tune to not being a bottleneck that that's not happening?

Starting point is 00:38:36 I try, I try not to be the bottleneck. Yeah, but like the real, I mean, you got a lot of traffic. Right. So, I mean, Peter taught me a lot, like, you have to prioritize. So in some sense, you're always about. tonight since you can you have infinite things to do but in the meantime like we got a great team so I think our people are great and they can do things like like move things forward fast and and without like my my my interaction there right so I'm pretty happy about that part I think

Starting point is 00:39:14 one of the most difficult transitions a person makes in their career whether it's from academia or from just an individual is from individual contributor to being a manager and leader. And when you're an individual contributor, you do everything and you're the best at it and you hack your code and you do whatever.

Starting point is 00:39:34 And when you become a manager, you have to do that through other people. And that transition is non-intuitive and very, very difficult for, I mean, I had to go through it too. I'm not, I mean, yes, of course, I went to college and all that. I'm not at your level. But

Starting point is 00:39:49 it didn't matter. I went from individual computer programmer to a manager, and it's incredibly difficult because most of your time, look, as an individual contributor, your time is spent writing code. And when you become a leader, which is the nicer way of saying manager, a leader, most of your time really ought to be spent in hiring and coaching new people to come on. And that's completely counterintuitive. It's like, look, if I write a more code this week, the project's going to move ahead. But if you don't hire anybody, you're going to continue to be the bottleneck. And I'm like, that transition, and, you know, you're still going through it. That transition is a real, it's difficult for everyone. And I think particularly difficult for

Starting point is 00:40:35 people who have written the code and like, you know, every line and every comment and all that stuff. It's particularly difficult to go through that. H.I. What hurdle did you recently put behind you? And what hurdle is next? I mean, so as an early startup, like one challenge is you always want to get the best people and takes time. And especially our project is growing pretty fast. So we all have a lot of inbound traffic. We always have trouble like in a fully exploit that. So. And what's next? What's the hurdle that you're looking ahead? Is it still hiring? I mean, we just keep hiring.

Starting point is 00:41:16 Hiring. You know, I think, I mean, we spend a lot of time on this. I think the next hurdle for these guys is to figure out how do you commercialize the open source project that you have? It gets to what exactly is the business model? What's paid for? What's free? Is anything free? Like, coming up with that business model, such, because if you don't have that early, what starts, and you don't know what that is,

Starting point is 00:41:42 then potential customers and partners don't know what you're going to charge for in the future and all that. And it makes it, the longer you go on without having a model, the harder it is to roll back some of those things. Like in a year from now, so I'll go, oh, well, we're going to charge for that. And everyone's like, well, we've been using it for free for a year. Like, you can't charge for that anymore. And so being able to lead with the crumbs along the way and saying, okay, here's what we're going to be doing, get ready for this, kind of gets the, community, you know, preconditioned to kind of understand what the model is going to be. So I think right now that's what we're kind of talking about.

Starting point is 00:42:22 H. My last question for you, you were back at that point where you were the bottleneck. You knew it. You had a choice to not go ahead and start Techian Nexus. Do you still think you made the right choice? Absolutely. Yeah. I think I made the right choice. Yeah.

Starting point is 00:42:38 Well, you know, in five years. Michael Rankland H. W.A. Lee, Peter Levine. Thank you guys so much. Thanks, everyone.

a16z Podcast - a16z Podcast: AMPLab, the Power of Open Source, and the Future of Systems Software

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.