The Infra Pod - Modal is upping the game for serverless! Chat with Erik

Episode Date: December 4, 2023

Ian and Tim sat down with Erik (CEO of Modal Labs) that's gaining wide adoption of the serverless platform built for data teams. Come sit down and listen with us how the journey started, and wha...t's Erik hot spicy future take on the serverless infra.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome back to yet another Infra Deep Dive podcast. This is Tim from Essence VC. Let's go, Ian. This is Ian, doing a little angel investing, helping turn some security companies into platforms, namely Snyk. And I'm super excited today to be joined by Eric of Modal Labs. Eric, why don't you tell us a little bit about yourself and a little bit what you're working on at Modal?
Starting point is 00:00:27 It's great to be here. I am working on a company called Modal, which I started about two and a half years ago. And basically the genesis was that I've been spending most of my life working with data for at least most of my professional career. And really anything related to data, everything from business intelligence to like large-scale distributed training of large models and everything in between and so a couple years ago i started thinking about how do we make it easier to work with data that's like an enormous problem the amount of like data startups and different components of the data stack is
Starting point is 00:00:59 is like thousands now but what i realized was that there's a core layer at the bottom around just running code that stood out to me. That's hard. And data teams are wasting a lot of time doing that. And if I build that layer, then maybe I can build all these other layers on top of it later. So I started thinking about what does it look like to run code in the cloud and not have to think about everything around resource management and provisioning and infrastructure and stuff, specifically for the type of things that data teams need. So that's why I started working on modal. Now we have a few hundred paying users, a few thousand users running a lot of Gen AI
Starting point is 00:01:34 applications. That's been sort of where we found a lot of initial attractions is we have some people running very large scale serverless GPU models. It's like one big use case. But we also have other types of use cases like biotech, 3D rendering, and all kinds of other stuff. And that's a great introduction. You have this insight that one of the biggest inhibitors to a data team being successful, especially as new stacks emerge, new things come and you have to be successful,
Starting point is 00:01:59 is deploying code and just getting stuff to run in the cloud. What was your insight? Yeah, I think it's almost like easier to look at it backwards. I think one of the sort of trends that I've seen in the last few years is, first of all, data teams have grown a lot. I spent seven years at Spotify and I built the music recommendation system there. So that was one of my core experiences working with data. I was the data team at Spotify at some point.
Starting point is 00:02:22 It was like a one-person data team. Now there's probably hundreds of people doing data. Data used to be the weird side thing that no one was really doing. Now it's pretty commonplace. I think that's something that created a lot of demand for people working with data and a lot of demand for infrastructure. And now this Gen AI stuff is obviously throwing fuel on the fire. So that's one of the tailwinds we've seen.
Starting point is 00:02:44 I think in conjunction with that, like you look at like, you know, backend developers and frontend developers, and you think about like how their workflow has changed, right? Like, you know, especially like frontend development. Occasionally I do it, I'm like, it's like so nice. Like you're just like, you know,
Starting point is 00:02:56 you load up the code in one monitor and like you put, you know, a web browser in the other one and it just like reloads. And so I started thinking about like, how do you have these like fast feedback loops? And I started thinking a lot about like developer productivity and like what's like inhibiting people from like building things quickly.
Starting point is 00:03:10 And a lot of it came back to like, I want to have this like super fast feedback loop. And I looked at like what data teams are doing and realized like, oh, like data teams, like in order to run something in the cloud, they have to like, you know, build a Docker container and then like push the Docker container to the cloud and then like go and trigger it in AWS console
Starting point is 00:03:26 and then download logs. So you have this feedback loop that's several minutes long. It's this super janky developer experience. And there's many reasons why data teams often can't run things locally. They need GPUs or they need to run things on production data. So there's a lot of weird things about how data teams are operating with the fact that data teams didn't really have a stack that's built for them, with the fact that data teams are new and they're coming at this and need tools that don't really exist. Those are all the trends that I think are why we started this company and why I think there's such an enormous opportunity in the market to build infrastructure for data teams.
Starting point is 00:04:03 Many, many founders and people have been trying to reduce complexity or build a for data teams. Many, many founders and people have been trying to reduce complexity or build a better developer experience. We heard this probably thousands or millions of times from someone out there. But as you know, everyone's approach, what steps they take are drastically different. They have a different idea in their mind.
Starting point is 00:04:19 What does that actually mean? I found it most intriguing about Modo is it looks different. I don't think it looks the same as any other better, easier developer experience. You can argue there's so many of these in the past. They all kind of trying to make things faster.
Starting point is 00:04:36 Reading your back in 2021 software infra 2.0 blog post where it talks about feedback. Can we have no more YAMLs at all? There's a particular set of thoughts process you have going to this space. I think it's just different. So maybe talk about
Starting point is 00:04:50 what are the most important things you have in mind when it comes to building Modo that you think is just different than other developer, better experienced products out there? And why do you think this is so important? And maybe also why people
Starting point is 00:05:02 don't focus on those things? Because I don't think it's like common knowledge yeah exactly i think like a couple things like i strongly believe in is that in order to understand developer productivity you have to look at the developer feedback loop and i think of it as almost like a nested set of four loops when you're writing code like innermost code you're like writing some code and you get a syntax error but like slightly larger code would be like you run a unit test and it fails. For data teams, often that feedback loop has been very janky. Running things in production has been very hard.
Starting point is 00:05:32 And so the feedback loop has been very long. And one of the things why it's so complex for data teams is that because it's so hard to work with the cloud, they sort of leave that to the end and then iterate locally. And then you have this problem like getting into the cloud. So one of the things I started with is like, what if we put the cloud inside the feedback loop? What if we just run everything in the cloud? And so one of the things I believe in is that data teams often have to run everything in the cloud anyway, because that's where the data is.
Starting point is 00:05:55 Why don't we just do all the development in the cloud, essentially? So that's one principle I think made a lot of sense for Modo. I just wanted to build a hosted platform that does everything for you, that seems to get started, that runs everything. Like you don't have to install anything except the Modo client library. The other thing that is more like why no one else has done this is that you have to be a little crazy, I think, to like go down and like do all the infrastructure. I think in the past, like there's been a lot of like people trying to like wrap Kubernetes
Starting point is 00:06:23 or wrap Docker and like build something build something on top of existing infrastructure that does this. And that's often when you go and look at a lot of startups or a lot of bigger companies, you look at what they actually do for the data team. It's often some sort of wrapper around Kubernetes and a bunch of other stuff. And I just realized very early, this is not going to work. I wanted to have the ability to take code and run it in the cloud in less than a couple of seconds. In order to get that speed, I realized like I can't use existing primitives. And so we realized very early, like we're going to have to build our own container around that.
Starting point is 00:06:53 We're going to have to build our own file system. We're going to have to build our own, you know, container builder and a whole bunch of other stuff. I think you have to be a little like crazy to go down and like, you know, like build something like more like a foundational level. And I think a lot of people shy away from that. I mean, every startup is a wrapper to some extent. But I think there's a lot of thin wrapper startups that aren't going to be able to deliver the user experience that I wanted. So we realized we're going to have to go down and rebuild all of that stuff.
Starting point is 00:07:17 And that's why it took a couple of years. We have to spend a lot of time just building a lot of foundational infrastructure in order to get there. So when you talk to your customers today, traditionally, just building a lot of foundational infrastructure in order to get there. So when you talk to your customers today, traditionally, having started companies multiple times, I've thought of problems in this space. I'm like, well, in order to get a customer to say yes,
Starting point is 00:07:34 I'm going to have to apply it in their side-to-side infrastructure, which immediately, this is the chasm problem, right? This is your opportunity to do the craziest thing to get the alpha. I'm curious, how do you think about talking to people about the fact that, hey, it's not going to run on your cube, it's going to be inside my little world. We're building for sell for the data teams. I'm curious, how do you think about talking to people about the fact that, hey, it's not going to run on your kube, it's going to be inside my little world. We're building for sell for the data teams. I'm curious how your thought process in having that discussion with your customers or users are. The answer here might be, we're not there yet, and that's totally cool,
Starting point is 00:07:56 but I'm sure you have a plan to deal with this because it's a fundamental problem in this space. Yeah, I think for us, foundationally, in order to build the infrastructure we wanted, like, we kind of have to control it. And so, like, just structurally, it's, like, easier to own it end-to-end. I do think, though, that being said, today, in the era of the cloud, like, there is more and more of, like, an embrace of, like, oh, actually, like, you can run infrastructure in the cloud. Like, Snowflake has done that for many years, right? And I think there's, like, more and more of more of like an acceptance that that's the way to go. When I look back at like when the cloud came in 2007
Starting point is 00:08:29 or whatever, like that was my first encounter with the cloud. I remember like my first thought was like, that's nuts. Like, why would I want to take like my code and put it in someone else's computer? You're crazy. Five, 10 years later, that's like so normal. And like, we didn't even think about it. Like that's just like a normal part of how we operate.
Starting point is 00:08:45 And now like, I think there's like the exact same situation again, but other vendors in the cloud that are not the hyperscalers and having seen the cloud, like, I just kind of feel like, you know, it will take some time. We're an early stage startup. Like we may not be able to go out there and tell bank of America to move their like, you know, core data pipelines to model. But I do think if we keep pushing for people to move their core data pipelines to Moodle. But I do think if we keep pushing for people to move their stuff and the fact that we're actually running things for
Starting point is 00:09:09 them in our own multi-tenant environment, I think we're on the right side of that trend. The other thing is, of course, the monetization opportunity is much bigger. It's like running people's compute. It's more of a precedent for charging for it. So I think that also helps. The other thing is also, when I look at many startups like i think they often take it for granted because they go out and like they do sort of customer discovery and they talk to a lot of bigger companies and of course like bigger companies they're always going to say that they absolutely need to run things in their own environment but i don't know if that's actually true when people tell me that like i don't actually hear what they say. What I hear is that we don't need the product desperately enough.
Starting point is 00:09:46 So then I'm like, okay, well, how do I make them need the product so much that they're willing to waive this condition? Because I already did that for Snowflake at some point already, probably in many cases. So maybe if I make the product even better, eventually they'll just give in and use modal, even though it's not running in their own custom environment. That's where things are going. And I think a lot of startup founders actually would benefit from pushing harder for that. So I think you used the term data teams. We all have a different assumption of what that means.
Starting point is 00:10:15 What does data teams really mean to you? Is it every developer building any backend? Or there's actually some particular kind of devs you group into as a data team audience as the strongest audience you're designed for? There's a reason I use the term data team and it's because it's sufficiently nebulous. And also the fact that what people actually do with data
Starting point is 00:10:35 and the titles that people use and the roles keeps changing. For a while it was data science and data engineer and then it was analytics engineer and now it's prompt engineer, ML engineer, AI engineer. Like, I don't know. There's so many different things. What they all have in common is like they're working with large data set or like doing large amounts of compute.
Starting point is 00:10:52 They often do things like, you know, more like on a batch way. They often need like to scale things up and down. They need GPUs sometimes. There's like a set of like common characteristics. They're often not super latency sensitive when you're building infrastructure that's actually a very useful thing to slice by because if you say actually we can live with like a second latency for a lot of stuff now you actually like have a freedom to build it in like a very different way than if you say i need millisecond latency for
Starting point is 00:11:18 everything so i think there's like a set of characteristics that they all have in common but i'm a little i'm a little nebulous when it comes to actually defining the customer user base. But there's clearly a lot of people working with data out there. And I think eventually they'll hopefully consider using Modal. Modal is really interesting. I looked at it, played around with it. Something that's very different is
Starting point is 00:11:37 very opinionated. And that opinion, I think, is what you baked into it so that you could create the experience you wanted to create, which is make it really fast and easy to set up, and get going and test the production. You don't have to configure cloud stuff because you define how your code is supposed to work, and then under the hood, it configures itself. Can you describe a little bit how you came to that conclusion? Was that naturally as a part of pulling on this thread? Or is this insight that came from getting rid of the YAML? Where did that come from? And how have
Starting point is 00:12:03 you seen people respond to it? When I look at it, I'm like, this is fantastic. I hate writing Kubernetes YAML. I have managed data teams. We're finding data engineers that can do both the low-level cloud stuff, kubes, Terraform, but also deal with the complexity of actually building at-scale data pipelines. Not often a common skill sets the end of this divergence in terms of people.
Starting point is 00:12:24 So it's very interesting. So can you talk through the thought process behind all of it? Yeah, I don't know if it's super opinionated. I think it's different. One of the things I think is maybe different with modal is that we start to look at what data teams are actually doing, like 95% of the time or 99% of the time, it's all Python. So that kind of presents a natural opportunity. You actually don't have to build for like arbitrary programming languages and i think the approach has often been traditionally with the cloud is like because you don't want
Starting point is 00:12:53 to build for like every language at the same time you just like take a docker container and then you know run that container but i think if you actually like look at data teams since they're all using python you can actually build a much tighter integration directly with the Python code. And the benefit of doing that is like now you can think about like actually Python functions, calling other Python function as a foundational unit of work, not like necessarily like containers calling containers and like pushing to the user to deal with all the like marshalling and serialization, that kind of stuff. I think that's maybe one weird difference between how modal operates and other.
Starting point is 00:13:27 I'm not necessarily long-term hung up on that. I think it's just the right place to start right now because you can actually deliver an end-to-end self-provisioning runtime to borrow Swix's terminology using a single language. In the future, we might add other languages and have very different SDKs for those. I think the other thing, you're talking about the
Starting point is 00:13:45 code defining the infrastructure. I think that comes from more around I just want to have a fast feedback loop. And it turns out that having multiple layers, you have to first build a container and then push that. It's easier if the code just defines it itself. And that's also my experience working
Starting point is 00:14:01 with DSLs and other languages like configuration is kind of annoying like modal has like zero configurations potentially there's like nothing you can configure like it's all code that was someone inspired actually by palumi seeing like kind of infrastructure as code for the first time instead of using terraform you like define your stack in like python or like whatever they actually support a lot of different languages i think we borrowed maybe from that to some extent, but also just like my general observation.
Starting point is 00:14:28 I built a workflow engine in the past called Luigi. Nobody uses it today, but like, that's also my like sort of experience is like configuration often ends up being very limiting. And like at the end of the day, why not just like make an SDK instead? And then like people can just like, you know, knock themselves out like programming things.
Starting point is 00:14:42 And so like making the cloud programmable, I don't know if I consider that opinionated. To me, it's like the right way to programming things. And so making the cloud programmable, I don't know if I consider that opinionated. To me, it's the right way to do things. I think people want a programmable interface to use the cloud. I say opinionated, and then in two years to five years' time, I might say the way that you should do it, in the same way
Starting point is 00:14:57 that Kubernetes was initially relatively esoteric, and now it's the accepted way. It's sort of like, this is the next step. And many of what I see, and Modal is a great example. And there's some others that are kind of like infrastructure from code companies that are out there, like AMP is one that comes to my mind that I've seen. It's sort of the next stage, which is really taking us back in many ways to the good old days of the year 2000 with PHP, fast CGI, and MySQL and PHP iBadmin. It's like, you didn't have configuration because everything lived in a box. And thenadmin. It's like, you didn't have configuration
Starting point is 00:15:25 because everything lived in a box. And then we had to unplug the box, and we opened up this huge wide world of configurability, this huge wide world of Lego blocks you can bring together in the cloud. And now we're kind of putting those Lego blocks back in, but different, but better. I think so too.
Starting point is 00:15:40 And the way I think about it is like code as infrastructure, but then you actually put the app code into the code as well. So now you have the app code also doing the infrastructure at the same time. If you think about it, like that's like the programming model, like I think makes sense for cloud development. Maybe you're retouching it a little bit, but I actually really just curious. I feel like all of these new paradigms or new way of doing things, let's say Kubernetes or Docker, they're all like
Starting point is 00:16:05 a new religion, right? Somebody has to go and be so zealous about it, right? And then go spread the love. Because we see many, many approaches and many, many things never got wide adoption, but Kubernetes did. There's definitely certain maybe key moments or key elements and key things that need to happen to kind of push forward where like everybody's finally accepted to be almost like the standard. I feel like Modo is probably in many other relative comparisons, more opinionated. To adopt it, you have to program it with the SDK. You can't just take any random code without any changes. What do you think is the necessary things for people to fully accept it to be the wide adoption? Is it just reliability of the system? Everybody can do everything? What do you think needs to happen
Starting point is 00:16:50 so that, wow, this is like Kubernetes now? Kind of a side note, by the way, but I feel like Kubernetes in a way, the approach of Kubernetes was kind of accepted when Kubernetes came out. If you look at a lot of previous attempts like Docker Swarm or Mesosphere, like you worked on. I think they were just like very hard to use. And then Kubernetes just like made it slightly easier to use. But I don't know like what it would take to make modal accepted. Like to me, it's just pushing for adoption
Starting point is 00:17:14 and like getting people to use it, like proves that it's like viable. And to be clear, like, I don't know, like I think it's the right way to do things, but like maybe I'm wrong. Hopefully I won't be wrong. So let's jump into what we always love. It's called a spicy future.
Starting point is 00:17:30 Spicy futures. So spicy future, as it sounds like, is Mr. Eric and we all chime in as well. We kind of talk about what you believe in the next three to five years. What should happen in infrastructure? What do you see the state will be? I think you already kind of stated it, but maybe we can even state it more clearly. Whatever paradigm shift or state change you want to see happen,
Starting point is 00:17:53 or you believe what should happen, and what necessary things it needs to do to get there. I think we're still very early with the adoption of the cloud. And what I mean with that is like, we're still sort of like taking pre-cloud technology and kind of putting it in the cloud. In a lot of cases, like, you know, the whole notion
Starting point is 00:18:09 of like a cluster. Why does that make sense in the cloud? Like, why do you need to think about what's a cluster? Like, it should just be like things, you know, scaling up and down, like magically, right?
Starting point is 00:18:17 Like you shouldn't have to think about that. So that's like one kind of like anachronism that I think clearly shows that like we have a long way of going. I think a lot about developer productivity and like one thing I believe in is like we have a long way of going. I think a lot about developer productivity.
Starting point is 00:18:25 And like one thing I believe in is like, we shouldn't even write code locally. Like why are we doing that? And I think part of that is just like the cloud is like annoying to work with because there's a lot of things missing and like moving the developer workflow into the cloud. But it's like kind of absurd when you think about it, like almost every profession has moved their entire workflow to the cloud.
Starting point is 00:18:43 Right. If you're like a journalist, you're probably writing articles in like a cloud editor. Or if you're like a event manager, you're probably like using some like, you know, cloud-based CRM and like emailing stuff. But then you go to like developer, they're actually like probably one of the few professions that are actually doing everything locally almost. And to me, that's like absurd. One of the things I definitely see is that obviously we've moved a lot of app code to the cloud.
Starting point is 00:19:06 I think we're also going to move the entire developer environment to the cloud. And it's going to look very different, I think, too. So in a large organization, you kind of have your local dev, and your CICD, and your pull request merge, and you have your deployment step to the Kubernetes pod. How does that change in your vision of Cloud 2.0, that end-to-end workflow? And you kind of have that feedback loop, right?
Starting point is 00:19:26 Which is like the developer sitting there watching the code migrate along this pipeline to play a game of Mario, where you're trying to get to the end of the level and hit the flag, right? How does this change over time for the application? I mean, you're very focused on the nebulous idea of the data team, but I'm kind of curious.
Starting point is 00:19:41 I mean, all developers kind of work in that workflow. How does that workflow change in your purview? I think initially, and we've seen this a lot, it's going to be kind of lift and shift. People are just going to stick things into a VM and put it in the cloud. And so I think we're going to see that with like cloud development too.
Starting point is 00:19:58 Like next five years, a lot of it's just going to be like, oh, it's not your laptop. It's a virtualized environment in the cloud. It's extremely identical. But I think if you actually like kind of think about like what is the cloud like there's so many things like we should you know clearly like you can do like completely different things like one of the things like for instance like one example i think a lot of it is like the cloud can scale almost infinitely so like why am i running one unit test at a time why can i take like every fucking unit test in my entire code base and just run in a Lambda and spin up 10,000 Lambdas
Starting point is 00:20:27 and run all of them in parallel? What if you can do those things? And suddenly, when you think through that, I think there's so many things that we can do to dramatically change the entire development workflow. We could run enormously massive test suites in parallel or CI-CD pipelines. I think we can dramatically rethink how those work.
Starting point is 00:20:47 We don't have the like programming paradigm to do that quite yet. That's like, I think like something that we're going to have to invent over the next few years. I think the other thing that doesn't quite exist is like the sort of idea of like ephemeral resources in the cloud. Like it's like annoying to like test your code against a database in the cloud because like now you have to like create a database in the cloud and Because now you have to create a database in the cloud and then run tests against it and then tear down that database. So that's why I think a lot of people, instead of using RDS, they end up running Postgres in a Docker container locally.
Starting point is 00:21:15 But what if you had an ability to create ephemeral databases or whether it's Postgres or Redis or whatever? I think once you rethink what the cloud offers and once you start playing around with the idea of creating ephemeral resources that are anonymous and having the ability to start containers in a couple of seconds or less, there's so many different things you can think about the development workflow that would look very, very different. I'm curious to get your perspective on this. Today we have three hyperscaler clouds.
Starting point is 00:21:44 You've got Azure, you've got GCP, you've got AWS. In the world where the cloud is the IDE, the cloud is your CICD, your cloud is your deployment, do you think we end up in a world where it's like the hyperscalers own everything end-to-end? Like you pick your AWS and you're there to go? Or do you think we're going to end up with a world
Starting point is 00:21:59 that's maybe more of a mosaic of services? Which is true for some organizations today, where they have these mosaic of third parties and they have their primary cloud provider. And you have other organizations that have multiple different cloud providers. And then you have other organizations that are just all in on AWS. If you're not an AWS service, pound sand. I'm kind of curious to think, how do you think the ecosystem evolves and changes to this new workflow? Because it will change buying behavior.
Starting point is 00:22:22 Totally. To me, I think the direction things are going is more along the lines of Snowflake. And what I mean with that is Snowflake runs in the cloud, right? But it runs typically on AWS, GCP, or Azure, or something else. But that's sort of abstracted
Starting point is 00:22:37 in a way. So I think a lot of people will use all these cloud services that may, in turn, run on AWS and the hyperscalers, but they may not realize it or they may not have to think about it in the same way. So I think to a large extent, you won't interact with cloud consoles as much as today. And look at, for instance, what Vercel has done for a lot of front-end development. If you're using Vercel, you don't really think about the cloud provider. You think about Vercel as like where you host your front end, similar to like what Snowflake is doing for a data warehouse.
Starting point is 00:23:08 And so I think there's a sort of layerization where I think there's like the tier one, like there's a sort of bottom tier of like, you know, not in a bad way, like AWS, GCP, and Azure, like the big hyperscalers, they benefit from enormous like economies of scale. And I think there's a very stable oligopoly. And actually, I think they make a lot of money doing that because people think of that as a commodity, but margins are pretty good. It's like 60% of these two or something like that. And then I think there's going to be a layer above
Starting point is 00:23:33 of vendors that own more end-to-end workflows and user experiences. And they maybe cater to different user groups, just like Modal focuses on building data apps and abstract the way they underline clouds. Like I think there's going to be other ones too. I already mentioned
Starting point is 00:23:48 like Vercel or Snowflake. I think there's many more examples like Fly or Railway or whatever. I think that's like the sort of world we're slowly moving towards. I'm curious, like what are the trends
Starting point is 00:23:57 you think that will accelerate the shift to the, I use the word model, but sort of the thought process of infrastructure from code and a lot of the things that you've bundled in with this like really tight development loop.
Starting point is 00:24:06 And I see it at Cloud IDE. If you've done Investor Decks, a lot of our listeners have done Investor Decks, looked at Investor Decks. There's always the why now slide. Why is this a good idea now versus all the other times? Are there key trends you see as this accelerator through drivers for why this will happen? Moving away from local to other areas and all this other stuff?
Starting point is 00:24:25 I think all the trends are there. The clouds are here. Fast networks are here. Relatively cheap internet bandwidth is here. Those things weren't here 10 years ago. Those things would have been a lot harder to be done. I think to me now it's more like a software and a workflow problem. And we've seen this. Changing how developers work takes a lot of time.
Starting point is 00:24:48 Developer workflow is not dramatically different today than it was like 10 years ago. We do a little bit more CICD, you know, there's a little bit more like unit testing, better IDEs, there's better, I don't know, you know, syntax highlight, like whatever, right? But like fundamentally, it's like these things like change very slowly. And I think there's also because it's sort of like a whole ecosystem of sort of vendors and products that kind of move in lockstep with each other right like you know you have to build a in order to build b and then like once b is built then you can move from a to c and then once c is built then you can move from b to d like there's like often like sort of like a lot of interdependencies between a lot of different products and that's why these things tend to move pretty slowly but i think all the things are there To me, it's just a matter of time before developer environments move to the cloud.
Starting point is 00:25:28 So I want to maybe ask a spicy question and don't have to name particular company names. But to me, one good example, when I see different philosophies that are kind of like in contrast is like the Vercel versus Netlify kind of play, where Vercel starts with the Next.js. Everybody adopts Next.js and that becomes the cloud where Netlify is like, okay, I'm not going to change your code. Take anyone's approach. And the next few years or so, I think you're going to see everything exists, right? To me, like when you talk about the next generation of clouds, I think there's actually opposing or maybe like different philosophies. There's a philosophy of I'm going to build like a universal control plane on top of all the clouds
Starting point is 00:26:09 and whatever you do behind me, do the same. But there's like some middleware here. You know, we can look at databases, layers and stuff like that. You're much more at the forefront, right? Change your code right into modal and everything behind the scenes work beautifully well but that does require like a lift and shift that you mentioned obviously that's your belief right this is the place we should go this is where the best experience people will be going for do you think maybe in the next five to ten years some other kind of vendors shouldn't exist anymore or shouldn't that those layers doesn't make any sense to you? It might be Kubernetes,
Starting point is 00:26:46 but you know, what is it in the middle you think will not be there anymore? Like people are not even talking about it anymore. It's like a time and space kind of thing. Yeah, I think Kubernetes
Starting point is 00:26:55 makes a lot of sense today, but I don't know if it makes sense in like 10 years. I think Kubernetes is not cloud native in a certain sense. You know, so the question is like,
Starting point is 00:27:04 what is a cloud native Kubernetes? And I would love to see some sort of open source project doing that. But my feeling is the first version of that is not going to be open source. It's going to be some sort of proprietary. And similar to Modal in that sense, we're building an end-to-end thing. We're building a proprietary. It might be very opinionated. But I think in the long run, to me, the spirit of Modal
Starting point is 00:27:24 is very much where I think data development will go. And hopefully by the time that data development moves there, we're also abstracted away modal in a certain sense that we can also offer other programming paradigms that at least spiritually align with how modal works. I do think there's a lot of startup vendors out there that are too thin wrappers around kubernetes and i think that's going to be difficult if there's something newer and better that makes everything easier on the other hand i do think kubernetes is realistically going to be around for another 10 years right but i think like betting on kubernetes right now is like betting on like hadoop in like 2008 like that was the right. Hadoop was the least bad way to do big data in 2008.
Starting point is 00:28:08 But by year 2016, I already felt like Hadoop was kind of legacy software. So you focus a lot on generative AI. You've got tools for fine-tuning. You've got tools for deploying generative AI models,
Starting point is 00:28:23 open-source generative models. You've got this whole suite. So clearly you're making a bet because you've spent a lot of product time investing in that part of the stack and enabling these workflows. Help us understand your thought process on if you're making a decent-sized bet on generative AI as a business
Starting point is 00:28:39 and how you think it will impact the type of products people build and why you made that bet. There's clearly a yin and a yang to your thought process. And we'd love to understand where you think this will be from three years from now in terms of the impact on the type of products we build in the five years. And then what the developer experience is
Starting point is 00:28:55 to enable people to actually create those products. Yeah, totally. First of all, we actually started the company before all this Gen AI stuff started happening. And I wouldn't call it as bad as much as just like, suddenly like a lot of people started asking how do i run jnai apps on modal and we just like realized like actually it kind of makes sense for us to do this uh so we started leaning into that and just like doubling down and you know a year later you know it's a very significant amount of our revenue coming from that sector but it was actually
Starting point is 00:29:21 a little bit backwards like we didn't necessarily bet on it i think jnai right now like it's like weird but like i didn't come up with this analogy but i think a lot of it reminds me of like when iphone came in like the app store the first like year or so it was like kind of like gizmos and like remember i don't remember but it's like ibeer app you drink like a beer like like with your phone and there's like a lot of like funny like consumery stuff and similarly i think back then i remember there was a lot of like funny like consumery stuff and similarly i think back then i remember there was a lot of companies that like we need to have a mobile strategy and so they would like pay people a lot of money to make a mobile app that in many cases sucked because it was like kind of like a tool looking for a solution people were like oh the mobile like mobile is the
Starting point is 00:29:59 future we need to have a mobile app but people didn't really like think about like okay like what problem does that actually solve and it wasn wasn't until a few years in, you started seeing more of a mobile and native type companies. And those were things like Uber or Spotify, I guess, to some extent, or I don't know, Foursquare or whatever, actually taking advantage of this platform and rethinking the workflow and building a very different type of product that actually truly needed this thing. And that that's where i think we're not quite there yet but i think we're starting to see some of it right so like some of the customers we have in the jnai space the initial customers it was like somewhat like kind of gimmicky stuff like which was cool but now we're starting to see sort of a second wave of customers and those tend to be people building more like end-to-end
Starting point is 00:30:41 video processing pipelines or like audio processing pipelines in many cases like you know maybe building tools for creators like podcasts you know make it easy to edit podcasts you know do like post-production right using gen ai because like it turns out like a lot of like speech synthesis or transcription or like video editing or lip syncing or translation like those are actually really valuable things but you have to put them into like an end-to-end workflow in order for it to like really make sense. So to me, that's like the next set of exciting things is going to be like, well, it's like more like end-to-end things. And then I think the third way is going to be like completely new, like business models that were like fundamentally like depend on Gen AI to exist. And I don't know if we quite yet see that.
Starting point is 00:31:24 I don't think there's like, maybe OpenAI is like the only one, but like that to me, it's like going to take a little longer to sort of shake up. So, you know, there's like a lot of also LLM, infra, middleware, like discussions right now, right?
Starting point is 00:31:37 There's so many kind of primitives people are kind of proposing. And I think Modo definitely, I think in my mind, fits into like the inference side, you know, calling GPU, able to actually call models. Where do you see Modo fits maybe in the more midterm-ish? Where do you think the primitives you want to provide? Because maybe you've seen applications moving towards, I don't know, providing something even more upstream than just inference only.
Starting point is 00:32:02 You can also go into training. There's a lot of different potential things you can use with data and GPUs. And so I just don't know what you think aligns with the platform and fits well to what you're almost like roadmap or vision-wise. I think as it pertains to modal,
Starting point is 00:32:19 I always wanted to build a very general-purpose compute platform. My end goal is to take over all of compute and then all this stuff up the stack too. And so to me, that general purpose compute platform. My end goal is to take over all of compute and then like, you know, all this stuff up to stack two. And so to me, that's not just inference. Like we do really well with inference right now because it turns out there's a lot of gap for that. And Model makes that particularly easy.
Starting point is 00:32:34 But we're starting to see a lot more people using Model for fine tuning, to some extent, a little bit training. Like we're very interested in long-term also doing things like pre-processing data pipelines, like dealing with, you know, ML observability, feature stores, workflows, scheduling, building query engines, real-time streaming. There's just so many different things, many of them decomposed data stack that we want to take over long-term. But right now, we're very focused on just the compute, and in particular, inference is where like we have kind of a wedge into the market. Looking at the larger market as a whole, it's still very early.
Starting point is 00:33:08 And you mentioned LLMs. I actually don't know in the long run how it's going to shake out. Clearly, there's a lot of startups building higher level services. There are a lot of companies right now building, for instance, LLM APIs. And I've talked to a lot of VCs and everyone's like, yeah, but that's going to be a super thin margin. Very few people are actually going to make money. I struggled myself to understand long-term what the layers and the
Starting point is 00:33:31 boxes are and what the supply chain is going to look like and who's actually creating value. To me, that's very early to understand. I do think it's relatively clear that at least focusing on compute and building a lot of different things, supporting a lot of different types of use cases
Starting point is 00:33:48 that people need within compute. To me, that seems clear. People are going to need compute. So that's where we want to focus. It's relatively far down in the stack, actually. It's far down the stack in terms of pieces of fundamental building blocks, primitives, right? But the way that you've built it is you've removed
Starting point is 00:34:04 so much of the middle part so that you're actually presenting this very high-level concept, which is very unique to what you're doing. I think a lot of the thought process around LLM and compute for LLM is, while we're looking at what are other comparable markets that we can look at to say,
Starting point is 00:34:20 well, this is how this market unfolded, so this market will unfold here. And we know that at the end of the day, that GPUs, while being tightly constrained today, won't be tightly constrained forever. And just like hosted compute was in the 1990s and the early 1000s and such, is that it's how computers drastically
Starting point is 00:34:35 commoditized. It's still a good margin, but you have to build all this ancillary stuff around it to capture it. Yeah, which is like, you know, the AWS market, right, like no one comprises in IAM for AWS, but like so much of their margin is driven by the fact that IAM is actually pretty good, right? Like, you think IAM is pretty good?
Starting point is 00:34:54 Well, in comparison to what we had before, you know what I mean? In terms of what you get out of the box, it's definitely better than like IP firewall management and all the other nonsense we were doing before all that. So yeah, yeah, that's absolutely true. The idea of security moving away from the network layer to the app layer, 100% agree that that's an obvious trend.
Starting point is 00:35:14 Yeah, I think you were probably inventing Layer 7-11. I've never heard that term before, but it sticks in my mind now. Isn't it Layer 7? That's usually the highest stack, right? But you could come to 11. You had to come up with 9, 10, 11. No, 8, whatever, it doesn't matter. That's usually the highest stack, right? But you could come to 11. You had to come up with 9, 10, 11. No, 8, 9, 10 yourself too. So it doesn't matter.
Starting point is 00:35:31 Yeah, yeah. I think we really touched on what we wanted to. Anything you want to maybe talk about? Like something to highlight about Modo, maybe it's an upcoming launch or some kind of like recent milestone you think is worth kind of plotting here? Well, we're generally available since last week. So that's exciting. We announced our A round too.
Starting point is 00:35:51 We're out there. We definitely encourage everyone to try it out and looking forward, like to me, next few months, like we're going to focus just a lot on like scale and performance. But we have some really exciting features on the roadmap. We're thinking a lot about beyond the pure compute layer, like where do we go? And there's a lot of really exciting stuff that we're working on that we hope to gradually release to the public over the next, I don't know, six months or so. Cool. And so where should
Starting point is 00:36:16 people go and learn more or start to use Modo? Modo.com. M-O-D-A-L.com. Cool. Well, I think this is a blast. We got all the spicy takes we wanted to. So thanks for being on the pod. And obviously we'll continue to use Moodal
Starting point is 00:36:33 and see all the successes coming from there. Awesome. Thanks for having me. Thanks for joining. It was great fun.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.