Orchestrate all the Things - Another globally distributed cloud native SQL database unicorn: Yugabyte raises $188M Series C funding at $1.3B valuation. Featuring CEO Bill Cook, Co-founder Karthik Ranganathan

Starting point is 00:00:00 Welcome to the Orchestrate All the Things podcast. I'm George Amadiotis and we'll be connecting the dots together. A number of successive funding rounds have given Ukebyte unicorn status while positioning the company to aim for a big piece of a growing pie in the shape-shifting database market. I hope you will enjoy the podcast. If you like my work, you can follow Link Data Orchestration on Twitter, LinkedIn, and Facebook. Thank you both for making the time for the conversation

Starting point is 00:00:35 today. And it's been a while since we last talked. And actually, I had to go back and check my records a little bit when was the last time. And I have to confess, I was a little bit confused because, well, you know, with all the funding and the different rounds that you've raised, I thought, so the last time was in 2020, and I thought that was your previous funding as well. But it turns out there was another one in between.

Starting point is 00:01:04 So, yeah, let's start by that. Yeah, I can help clarify that George. We did a, you know, we call it a B1 round at the beginning of this year, really based on the market opportunities. There were some investments we wanted to make that we weren't fully funded for with the previous B round. Namely, we wanted to invest more

Starting point is 00:01:31 in the fully managed service public cloud offering, right? If you think about that kind of the mass market play for you for you go by. And so we wanted to increase the focus in engineering on that effort. And then we also wanted to increase the focus in engineering on that effort. And then we also wanted to increase our go-to-market coverage. We were primarily in the U.S. and had just started in Europe and wanted to establish a presence in Asia, given the market opportunity in front of us and just being able to have global coverage.

Starting point is 00:02:10 So we did raise an additional $47 million at the beginning of the year, which candidly we had thought would be adequate and it was for another year and a half or so before we would start to contemplate our C funding round, which is what this announcement's about. However, kind of the same story. We were ahead of plan on all of our metrics. We were seeing good adoption by large enterprises. And given the market opportunity,

Starting point is 00:02:45 as you know, given the market opportunity, as you know, covering the database market, when a shift starts to happen, you know, it accelerates, you know, and you want to be known as the market leader. And so in our board meeting in, I guess it was towards the end of the summer, we started to talk about,

Starting point is 00:03:11 you know, potentially raising the C round and really just accelerating our growth plans across the company. You know, all aspects go to market, engineering, the rest of it. And we started that process, I guess it's maybe eight weeks ago now, approximately. And Karthik can be honest here, but I think it was about eight weeks ago now, approximately. And Karthik can be honest here, but I think it was about eight weeks ago or so. It was a quick process, yeah. Pretty quick. And, you know, the receptivity, I mean, you know how it goes. You want to, you know, we have a view about the opportunity

Starting point is 00:03:37 and, you know, our position, but, you know, the investors, you know, obviously validate that with writing checks and investing in the company. And, you know, obviously validate that with writing checks and investing in the company. And, you know, it went pretty fast and we were, you know, we got a very strong reception to kind of our position and market opportunity and the investment thesis and, you know, ultimately led to this C round, which you have the details on that Sapphire's leading and you see the raise and the rest of it. So does that fill in the gaps for you, George? Yes, yes, it does.

Starting point is 00:04:07 It does. Thank you. Yeah, I was wondering because I thought, okay, I must have missed something. Because that's quite an amount, I mean, in total that you have raised in a relatively short time. So I guess that speaks to validation that you just mentioned. It seems that investors must like something about what you do. Yeah, and candidly, we could have run the company more conservatively and waited longer to raise and the rest of it. But again, given the market opportunity, given the positioning and the validation we're seeing, you know, for the technology, we felt it was the right thing to do.

Starting point is 00:04:50 And obviously, you want to get the right valuation, the right investors around the table. So that's why we're quite pleased with, you know, if you look at the, you know, the key investors here being Sapphire, who, you know, both Sapphire and Meritech I had worked with before. They were both investors in Greenplum back, you know, a decade or so ago. So, you know, that familiarity helps, right, that they know the people, they know the company, they know they were like our leadership team and what Conant and Karthik and Mikhail had built. And then the two new investors outside of Sapphire and Meritech are really Alkion, who I didn't really know,

Starting point is 00:05:33 but Avi and the team had been tracking UgaByte for a few years. And, you know, they're a big crossover fund, even, you know, funding companies for the long term, you know, that have the potential to go public. And then one of our strategic customers, Wells Fargo, expressed interest in investing as well. So, I mean, from my perspective, it's a fantastic group of investors we have that are in it for the long haul, believe in our company, believe in us as the organization,

Starting point is 00:06:06 the people here, and want to fund us to win and lead the market. So it couldn't be a better scenario from my perspective. Karthik, anything you want to add to that? No, I think you got everything, Bill. I think it really comes down to the trends that are big shifts and really the opportunity in front. As organizations and comes down to the trends that are like, you know, big shifts and, you know, really the opportunity in front. As organizations and enterprises move to the cloud or start with the cloud, it's really that type of an opportunity.

Starting point is 00:06:33 Yeah, yeah, indeed. And, yeah, I guess there's not too many offerings around. If you want something that's, well, both SQL,, SQL Compact, let's say, and globally distributed. You're down to pointing fingers on one hand, basically, I think. Yeah, and it's interesting. I think our competitive positioning versus, you know, Cockroach is, you know, fairly validated in the market now.

Starting point is 00:07:02 If you think about our investment in Postgres and the SQL richness that we have, which large enterprises are looking for as they migrate off of traditional databases, Oracle, DB2, SQL Server, et cetera. So I think that's one positive thing. And then maybe to comment on, similar to what we talked about previously, George, the trends that we're seeing seeing the microservices, right?

Starting point is 00:07:27 Cloud native microservices is certainly driving the need for a distributed transactional database like you go by. And you know, that's a major trend, obviously new apps being born or built in the cloud. And then the third one that we talked about, which is more emerging, but actually we're very excited about, is the whole story around edge, right? If you think about edge and 5G and IoT more broadly,

Starting point is 00:07:56 it's driving a need for data consistency in different tiers, meaning it's not all going to reside in a single public cloud. By definition, it can't. And so that, and if you want a consistent transactional database across all of those tiers and not locked into any particular hyperscaler public cloud offering, to your point, George, there's only a few options and arguably we believe Uga Byte's

Starting point is 00:08:28 probably the strongest offering given our background in history. And maybe Karthik, you could comment, we've made this comment as well, that there was a lot of buzz around the NoSQL players, obviously Karthik being one of the initiators of that when they built Cassandra and HBase when they were at Facebook. But when you get to scale and you get to some of the attributes that people are looking

Starting point is 00:08:55 for in their applications as they grow and scale and they want the transactional consistency, there's a bit of a rejuvenation, if you will, of why SQL really matters and their need for something like UgoByte. Maybe Karthik, you can expound on that a bit. That's another trend that's happening. Yeah, absolutely, yeah. So I think we noticed this back at Facebook when we started building,

Starting point is 00:09:27 like I joined along with my co-founders. We met at Facebook in about 2007. And back then there were no SQL databases. And, you know, the biggest ask of the need of the hour at that time was to both federate queries as well as data. So query processing should be done on multiple nodes and your data should be distributed across nodes. There's really no database that was really available on commodity hardware that could do it. So as Bill said, we ended up building Apache Cassandra. We ended up building a lot of features into HBase

Starting point is 00:09:55 and putting it into production. We were also the team running this as a service for our internal Facebook customers at a pretty massive scale. It was like the scale is pretty comparable to public cloud scale. But from developers, what we heard again and again was they needed features like secondary indexes and more richer database features and joins and other types of queries and so

Starting point is 00:10:17 on and so forth. And what that made us realize was that, yes, we had solved the problem of distributing data and distributing query processing, but the ask one by one, if you kept layering that one after the other, it started going back to SQL, right? So we said at that point, look, we really need a database that supports SQL in order to be generally usable, and it needs to have all the underpinnings of NoSQL. So we really need to hybridize the two sides in order to build a database that can become

Starting point is 00:10:46 something like a default database for cloud native applications. And so fast forward to YugoByte, we made the decision that Postgres was absolutely on fire growing really fast. So we would pick that as the upper half and leave borrowed from the page of Amazon Aurora, which absolutely has been fantastic in its growth and the ability for end users to adopt it because it's really compatible with Postgres, not just at the protocol or wire compatibility level, but also at the features and the semantic level, how it functions, right?

Starting point is 00:11:15 So to that effect, we took the upper half of Postgres and said, let's make all of this feature set available in a cloud native shared nothing distributed world, right? Which is more like Google Spanner, and really hybridized that design into a single offering and made the offering as open source as Postgres itself. And on top of that, made sure that when people want to consume it for serious applications,

Starting point is 00:11:37 they have the ability to consume it as a managed service using Ugo Byte Cloud. So that really, I mean, it's a simple story at the end, but that's really the trajectory that we saw the world headed at and still heading in. And the hypothesis turns out to be true, which is what I guess this round and all of the growth is about.

Starting point is 00:11:54 Yeah, yeah, thank you Karthik. So actually you touched on a couple of points that I'd like to elaborate on. So cloud first and your managed offering, market managed service offering first of all. But before we go there, one last question on the, well, on financials and valuation and all that. So I saw in the press release that your valuation is now something like 1.3 billion. So I guess you're officially a unicorn if that means anything by now, because it's kind of over years maybe. But I was just curious, so how is that valuation derived?

Starting point is 00:12:36 Well, the valuation on, I guess, like if you look at the overall market and the investment, et cetera, right? Like the valuation comes from, you could say the three different buckets, the first bucket being enterprises that are adopting us and the expansions that come thereafter. The second bucket comes from our open source adoption and awareness. So the future potential for people to start running on us.

Starting point is 00:13:00 And then the third bucket comes from the ability to consume YugoByte really easily using a service, et cetera. So I think these are the three in terms of how people consume us and they think about. And then there's three separate trends that are fueling this as Bill was alluding to. The first one being microservices from the large enterprises. Second one being applications that are born in the cloud,

Starting point is 00:13:23 and this could be large enterprises or small companies. The third one being new generation of applications that are like, you know, being built at the edge that require geographic distribution of data built into the database as a first class citizen even to start, right? And the third bucket's early, but it's more common than we often think, right?

Starting point is 00:13:40 If you really think about a shopping experience today, you have storefronts and you have a central e-commerce app in the cloud and they have to work together, right? Similarly, vehicles are becoming smart vehicles, there's wearables, there's like so many other places where this type of distributed substrate starts to live. So to put that all together, right? Like I think this is really the thesis,

Starting point is 00:13:59 but I think Bill, go ahead and- Well, George, are you saying specifically how did the investors pick that valuation? Or are you asking the underpinning that Karthik described? Well, yeah, I mean, the underpinning was indeed elaborated quite eloquently by Karthik. I was wondering, yeah, who came up with that valuation, basically? Well, Sapphire. I mean, Sapphire did.

Starting point is 00:14:23 Sapphire led the round. So they're the ones that put a, you know, a term sheet in and, uh, you know, they set the terms on valuation and, you know, candidly as I think it was in the press release, we were oversubscribed as far as investor interest. So, you know, that's just a, then you're, you know, discussing with each of the investors, how much they want to invest. And we talked to some of our existing investors about taking a little less than their pro rata to make all the numbers work to get the new investors in at the levels they wanted to get in. And

Starting point is 00:14:56 it worked out beautifully. So it's really, and maybe, you know, if you were to ask Sapphire, Jay Das, who's joining our board, you know, I think they would say a couple things, you know, if you were to ask Sapphire, Jay Das, who's joining our board, you know, I think they would say a couple things. You know, one is they believe in the thesis of the company. Then you look at the TAM, you know, of the market opportunity. It's about, what did we determine, $60 billion or so, $65 billion validated from, you know, the normal sources out there of where we play for the transactional database. And so from an investor perspective, to be able to invest in a company that has a large enterprise market opportunity, has enough proof points that there's a thesis that they're going to be the winner. And then you look at the valuations that are in the market of the analytics companies like

Starting point is 00:15:50 Snowflake and others, it's a very prudent investment from their perspective, given the opportunity versus the risk. I'm speaking on behalf of Sapphire and the investors, but I think that's what they would say. Okay. Okay. And if you look at the analysts, like IDC and William Blair and all of these analysts, I mean, you can see clearly from their report that the size for a transactional relational market is actually about twice the size of the analytics SQL market, relational and analytics market. So companies like Snowflake, et cetera, are playing in a market whose TAM is like about half

Starting point is 00:16:26 the total TAM available to transactional relational databases. And I don't know, George, if you saw Jason's from William Blair's report on the database market. They did a report a few weeks ago that was talking about the database market more broadly. You guys mentioned in there. But that gives you a sense of what's happening in the scope. And then there'll be actually a spotlight coming out

Starting point is 00:16:51 specifically on UgoByte later this week as well. Okay, okay. Yeah, that will be interesting. I'll keep an eye out for that. Okay. Okay, nice. So let's move to what I think is probably leading your growth. So the cloud offering. Last time we spoke, it was still, well, I'm not sure if it was actually in your drawing board or it was already in progress, but either way, by now it's already generally available. And you're going to tell me if my assumption is true. Is it indeed the one that's probably driving your growth?

Starting point is 00:17:31 In part. And Karthik to answer that, I think just to ground George, talk about the self-managed and fully managed cloud. Cause we have two flavors that I think are really critical to driving our growth here. Yeah. So I think the last time we spoke,

Starting point is 00:17:50 maybe we'll just start with the, one of the founding thesis of the company, just like we said, we needed a transactional database that had relational features built for the cloud. Similarly, another of the founding thesis was that the world was going to go database as a service, but unlike the analytics space, the OL OLAP space the transactional space people would not be comfortable on day one putting their data into a managed service by a third

Starting point is 00:18:12 party vendor right whoever that is and obviously we were born in 2016 so this thesis is as of 2016 that over the next 10 years or so there'd be like that want to run databases as a service but in in their own account, public clouds, or in a fully managed account. So it could just vary across the two sides. So accordingly, what we did was we built our fully managed DBaaS control plane software as a package software that can ship into, you know, that we can run, or our customers can run inside their VPC and cloud accounts. So that means like so and we started this about six months into the starting the company so this has been our commercial product, the main flagship

Starting point is 00:18:54 commercial product that we've been selling to enterprises and it's become a mainstay of enterprises for them to run the database and the database tier inside their in their environment. So it's something that we can remotely manage if required, but they fully take care of it on their side and run. So they get the control plane software, which takes care of the database, and they just have to take care of the control plane

Starting point is 00:19:15 software, which is a much easier task than dealing with a lot of different things that the database needs to do. So to Bill's point, we built this in such a way that we can use the same thing for our fully managed cloud. And the first offering where customers run it by themselves, it's a self-managed cloud, like YugoByte Cloud, self-managed version. And the one that we run for them in a multi-tenant fashion

Starting point is 00:19:38 is the fully managed YugoByte Cloud. And any feature we build for one site, one group, you automatically get on the other side. So it's absolute reuse of software and features and et cetera, which is a good place for us to be. Because we think people will move back and forth between whether they want to run it as a fully managed service or self-manage it themselves. But with the ease of use of a fully managed service over the next many years still right so from that perspective all of our revenue almost all actually like maybe we have one or two support type contracts but everything else comes from self-managed cloud consumption and it is a

Starting point is 00:20:14 subscription model it is like you know pay for consumption by the number of course so on so forth so it's exactly the way you would expect a cloud offering to work where all of the support is increasingly built into software making it easy for people to manage and operate at scale so on and so forth what we just announced is the fully managed version and as you correctly like you know laid out George like it's it's like the first you know year or so we had it in beta and we were mostly learning about the market appetite and the use case and the access patterns and so on. And just like last month we released that in

Starting point is 00:20:52 GA generally, it's generally available now. So in other words, another way of saying this is we have had like our managed cloud software being used in mission critical use cases at massive scale for like you know a year plus now year two years now almost right and this latest version is the one where everything all the servers reside on the yugo byte accounts and we just expose endpoints for end users to consume so it takes the usability and ease of use to the next level but you know case in point the software and the paradigm has existed for a while. Okay, okay, thank you.

Starting point is 00:21:28 And a related question that I had again, picking up some info bits from the press release, it was mentioned that you had something like 1,200 organizations signing up for your cloud. And I was wondering that sign up equal user in that case, or do some of these organizations have test accounts? Are they all paying customers basically? No, they're not paid customers. There are a lot of test accounts, people building software developers at various stages.

Starting point is 00:22:03 Obviously, a lot of that number comes from the fully managed cloud, which we just opened as generally available, like just a few weeks ago. So these were things, these were organizations that were using our beta tier on our cloud itself for development, for testing development and even exploration

Starting point is 00:22:24 and so on and so forth for a while. And now they're starting to get to a point where they're looking for some features in order to go production. We have a handful of paying customers on the cloud side, but we have a lot more in the pipeline, given we just went generally available and there's still some feature asks from these organizations

Starting point is 00:22:41 in order to go GA. Now, some of that number comes from the self-managed version of the cloud, which is equally easy to use. And when specifically when people need, or when the users have a lot of compliance needs or are going to deploy in like, none of the big three public clouds, which is initially the only places we'll have presence,

Starting point is 00:23:01 like our fully managed cloud today operates on AWS and GCP with support for Azure coming pretty soon, right? But a number of these users want to operate in different clouds where we actually don't have support currently. And so the self-managed cloud is of interest to them as well. And maybe to tie it to the funding round, George, the numbers we've talked about are we're approaching 50 commercial paid customers. And size-wise of talked about are we're approaching 50 commercial paid customers

Starting point is 00:23:26 and size wise of the company, we're about approaching 250 employees. That gives you a sense of where we're at on the journey and the opportunity in front of us. Okay. Yeah, so the other thing I was wondering about, again, with relation to the cloud, you mentioned feature requests from some of your customers, and I was wondering what those features may be, and actually how are they prioritized in your overall queue, let's say?

Starting point is 00:24:01 Yeah, that's a good question right now, because we're right in the middle of that planning process and prioritizing. We had a roadmap, obviously, on the database itself and on the cloud offering. But I'll let Karthik maybe hit the top four or five things that we're focused on. Yeah, absolutely. I think, you know, like Bill said, great, insightful questions, really good questions. So the first set of features has to do with, you know, just catch up on what you would think are, you know, table stakes features for a cloud. But, you know, it's just a prioritization. Like, for example, we have VPC peering, but people want to use private link, right, which is a more advanced form of VPC peering. Or the ability to do multi-region cluster support without having to contact our support right like

Starting point is 00:24:46 they just want to self-service it but it's a harder problem because we don't want like unlike our self-managed offering we don't want this to be super complex but flexible we want this to be very opinionated but simple right so it's really about taking apart the user interface on how that's going to be what are the majority of the users looking for so a little bit of that type of really make it easy to self-serve kind of model that we're working with uh support for azure for example and like you know just some of those like what i would call uh basic features but it's just a long list right so we have to catch up on those uh the second big bucket uh revolves around security and enterprise features like uh they're independent but i mean bucket revolves around security and enterprise features. They're independent, but as in security features and enterprise features independently, we're just

Starting point is 00:25:30 clubbing them into one bucket. So things like we're already signed up for external audits and pen testing has been done. So we've done a bunch of those, but we're working on our SOC 2 compliance, which we're pretty close to getting here. And then PCI compliance is another request. So on the compliance bucket, there's a bunch of stuff. And then enterprises ask for features like BYOK, bring your own encryption keys in order to encrypt data. So even we can't make any sense of the data. We have transparent encryption with key rotation,

Starting point is 00:25:59 but they want to supply their own keys. So you can think of a number of these type of advanced features that enterprises want in order to make the deployment even more secure than it already is. And then the third bucket is a little bit more internal, both in terms of operational improvements, as well as to instrument the metrics on the cloud. Because of the number of users and the sheer volume,

Starting point is 00:26:24 it's very difficult for us to actually go user by user to see if we can help them if they're like there's something we can do better for them is the user experience better so we're working on inverting this to you know have the metrics instrumented where the system tells us how we can help people better and you know give them an even better experience and so on because pretty soon you know a sample may not be as meaningful as like the broad learnings across the market relics so what are the features that people are most looking for or what are the so we want them to tell us by being able to communicate with us through the application so it makes it easier for

Starting point is 00:26:57 them and for us to help them but through the app right so there's that a class of features and then last but not least there's the whole huge ask for data migration services. Our Postgres compatibility naturally lends itself to people being able to migrate from a number of databases with relative ease because the migration is very similar to migrating to Postgres. So data migration service is a huge ask. A lot of people asking about it, so on and so forth. So these are some of the simple ones, like, I mean, there's obviously internal automation and making things more, giving APIs for our clouds,

Starting point is 00:27:32 just like a whole bunch of other stuff that I'm not mentioning here, and not to mention on serverless, like to figure out how developers can consume it. So there's a number of angles here, but yeah, I just giving you a list, so. Yeah, and I would add, George, and if you pull up a level from a broader perspective,

Starting point is 00:27:48 if you think about the database landscape for the last 40 years or so, and how transactional applications have been built and leveraging traditional databases, when you think about the trends that are happening around microservices, cloud native, how do you rethink, and a distributed database like you the byte? How do you think about, you know, what features make sense in this new world? What do you build in the app? What do you put in the database here?

Starting point is 00:28:15 How do you secure across clouds and in any clusters? How do you provide the data privacy and the rest of it you need from security? So from a thought leadership perspective, we take that very seriously. We want to be known as an opinionated leader about how to build apps and build databases and across that whole spectrum that we've described.

Starting point is 00:28:37 And so that's really what's driving our roadmap and our features and being very thoughtful about it and being very collaborative with our customers and our features and being very thoughtful about it and being very collaborative with our customers and our partners on the journey. Thanks. I had the suspicion that what you would be working on would probably be the kind of operational improvements that basically refer to dividing them into those different buckets, which I guess in a way is a sign of maturity, so you're not adding anything spectacular, but just making improvements to what you already have and already works. That's right.

Starting point is 00:29:15 That's right. Okay, so final question, and I think we should be wrapping up soon. So final question. The other thing that is probably noteworthy about, to me at least, about your offering is the fact that you have a number of onboarding paths, I would call them, for developers. So SQL is the obvious one, and then you also have Cassandra query language compatibility and the GraphQL interface as well. So I was wondering if you have any insights as to how those are all working. So, you know, which one is the most popular

Starting point is 00:29:51 and whether the other two, let's say, are also worth keeping around, so to speak. Yeah, I think it's still early days, but between the three that you mentioned, the most popular is SQL. So it's just overwhelmingly popular, I think, followed by GraphQL as number two, and followed by the SQL, because center query language is number three. One of the things that we see is people come in for the SQL support, which is just terrific in our case. But once they understand the trade-offs between SQL and CQL and really the type of application

Starting point is 00:30:26 that they're building, they try to see the best of both worlds that they can get. And different microservices end up going to different sites. So CQL, the way it works for us, may not be the leading acquisition API, but it definitely is a very valuable API. So specifically when it comes to very high performance and very high geographic distributed data, CQL is able to perform really well. And one of the things we're doing strategically is connecting the two sides.

Starting point is 00:30:55 So you can use CQL as a foreign table to our YSQL offering, like Postgres foreign table in some sense. So you will be able to write data on one side and actually join it with data on the other side, right? And so that's been a very heavily asked for feature. So I think we are also tightening the different offerings to make it more coherently appear as a single thing rather than multiple different acquisition points. So I think the point is, where we stand, I think it's becoming clear that we cannot ignore

Starting point is 00:31:26 any of these paths, like they're all super critical to end users for different reasons, right? And yeah, and there's other asks too, there's asks for, can you support other things in addition to GraphQL, right? Can you support an off the top REST API service, for example, or can you like, so there are more asks, so we are, those are some of the things

Starting point is 00:31:44 that we are also thinking about and figuring out how we want to go about it. Okay, thank you. And yeah, it seems like you have a lot of work lined up ahead of you and you're now quite well funded to be able to execute on that. So congratulations again for the funding round and best of luck in your future plans. Thanks, thanks, George.

Starting point is 00:32:11 Thank you, George. I appreciate the coverage. Yeah, thank you. Thanks a lot. I hope you enjoyed the podcast. If you like my work, you can follow Link Data Orchestration on Twitter, LinkedIn, and Facebook.

Orchestrate all the Things - Another globally distributed cloud native SQL database unicorn: Yugabyte raises $188M Series C funding at $1.3B valuation. Featuring CEO Bill Cook, Co-founder Karthik Ranganathan

A number of successive funding rounds have given Yugabyte unicorn status, while positioning the company to aim for a big piece of a growing pie in the shape-shifting database market Article publ...ished on ZDNet

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.