Screaming in the Cloud - The Future Is Time Series Data with Russ Savage

Starting point is 00:00:00 Hello and welcome to Screaming in the Cloud with your host, cloud economist Corey Quinn. This weekly show features conversations with people doing interesting work in the world of cloud, thoughtful commentary on the state of the technical world, and ridiculous titles for which Corey refuses to apologize. This is Screaming in the Cloud. Welcome to Screaming in the show. Thanks, Corey. Glad to be here. About a month or so ago at this point, we had effectively one of your co-founders, Paul, on the show and talked for about half an hour about some of the ins and outs of Influx and how one might use a time series database, what a time series database might be. And that got a decent enough reception that we decided, hey, who else can we have on? So congratulations, you're in the hot seat. But unlike some software products, people generally don't emerge into the

Starting point is 00:01:10 world fully formed from the forehead of some ancient god. How long have you been at Influx, and where were you before? Yeah, I've been at Influx for over two years now. I've been in the data space for a long time. I came from a small Hadoop startup that got acquired by Google, and then I was previously at Elasticsearch. That sounds like an interesting and winding road. It sounds like you did fundamentally what everyone was trying to do now, and get out of Hadoop.

Starting point is 00:01:40 You were able to do it by changing companies. Some folks, not so lucky. But from my naive understanding, for God's sake, my favorite database is Route 53. So I have other problems that I have to work through. But Hadoop seemed like it was a big thing. And then suddenly it wasn't a thing anymore. And I don't see new Hadoop projects. They all feel somewhat legacy.

Starting point is 00:02:01 Is that match your experience? Or am I just talking about things I don't understand, or quite possibly both? Well, I think so. My personal opinion is I think Hadoop solves Hadoop scale problems really well. I don't know if as many companies have Hadoop scale problems as they think. And, you know, what's happening is computers are becoming more powerful, individual computers are becoming more powerful, and so you can do more with less. And so I think for a ton of the problems that people are facing now, you can run that on much smaller systems. You don't need a 50 or 500 node Hadoop cluster to do that. Also, the complexity of setting up and maintaining Hadoop was very high.

Starting point is 00:02:49 And I think with these smaller systems, it's much easier to get them set up and keep them running a lot less maintenance overhead than you have with a Hadoop cluster. It always was interesting watching people come out with big data problems. And, oh, great, so where's your data? And they pull a thumb drive out of their pocket pocket and you don't have a big data problem. Ultimately, you can create big data problems if you're a terrible enough programmer. I mean, my only debug strategy is print lines. So my logs are enormous because I never go back and fix anything. So every time someone has a request, I get 8,000 lines of logs. So yeah, there's always something

Starting point is 00:03:22 people can do to turn it into a big data problem. Yeah, I think a lot of those problems, they actually turn into larger than Excel problems or larger than what I can run on my local machine problems. But that doesn't mean that they're Hadoop problems. That seems to be the point that the world has reached consensus around as well. So you wound up leaving a Hadoop startup

Starting point is 00:03:44 and finding yourself effectively selling open source time series databases, which step one, selling open source software has always been a bit of an interesting challenge for some folks. Can we talk a little bit about that? And I guess first, what inspired you to come to Influx Data? And secondly, how are you, I guess, engaging with the community when you do have a service or a software that you are attempting, successfully it turns out, to sell? Yeah, great question. My background is as a developer,

Starting point is 00:04:15 and I love to write code. Not production code, but a lot of code regardless. I think my passion for open source, I love the community that evolves around open source projects. And so when I was looking for a new role, I was looking specifically for companies that were strong in the open source communities.

Starting point is 00:04:38 And so Influx Data is a great company for that. So we have a set of products, open source products that we use as the basis for our cloud and enterprise offerings. And so I really love the fact that there's so much capability in our open source tool that the individual developer or the small team can really, it's not just demo ware, it's not just share ware.

Starting point is 00:05:03 You can actually get real workloads done in the open source tool. And then when you're getting more value out of open source, or you need some support, or you want to create a team dedicated to managing it, we can help you do that easier. And I do want to point out, otherwise, Lord knows, I will get letters, that as you're mentioning this, I pull up the InfluxDB open source repository on GitHub or Jithub, depending upon pronunciation choice.

Starting point is 00:05:38 And yes, indeed, it is licensed under the MIT license, an actual open source license. None of these nonsense, oh, you can use the source for anything unless you make money on it, and then we're coming for you. As some companies in this space recently have been doing as a, I guess, a hedge against cloud providers. It's interesting that you folks, A, haven't felt the need to do that,

Starting point is 00:05:58 and B, have continued to engage in good faith with your own user community to work with people who are very clearly passionate fans of what you're doing. Yeah, I agree. And I really applaud our leadership for doing that. Paul Dix, our CTO, is very passionate about open source and very passionate about licensing. And he tends to write a lot of blogs on that topic.

Starting point is 00:06:21 So that was also really important that we're a true open source company, not one in name only. I really think that can't be stressed enough. If I were going to be building something to give to the world or put out there as a product, I have a lot of decisions to make. And on the one hand,

Starting point is 00:06:38 there is some compelling value to having something be open source, but there does come with costs. And I think these ideas of just trying to get the good parts of the open source world, community engagement, free promotion, et cetera, et cetera, but not any of the downsides of,

Starting point is 00:06:53 well, no one can ever offer this as a service except us, it just rings hollow and feels transparent. I mean, I'm sympathetic to business model challenges, but suddenly moving the goalposts on existing communities really breaks the social contract that open source communities have come to expect. Yeah, I agree. The problem that I see, so I've been on the other side

Starting point is 00:07:15 and I've done evaluations of different software and looked at long term, you're thinking you're bringing in third party software into your company. Your company is going to be around for a long time. And you want to make sure that you're not bringing in something that you're going to have to replace in a few years. And I think the real key with open source

Starting point is 00:07:35 is you can take that technology and really embrace it and really take it to the next level, really expand upon it. And when you're bringing open source software into your company, you're thinking about the long-term horizon. If you have companies that are making changes or deciding to suddenly change the way they're licensing or change that, it really calls into jeopardy

Starting point is 00:08:00 your long-term vision, and it's a risk. I think that understanding that open source is a double-edged sword in some ways, and it is an approach to solving certain problems, but it's not a strategy in itself, is something that a number of somewhat naive founders have gotten trapped in in the past. Of course, here we are now where you're very clearly doing well. I've talked to a number of Influx customers that we'll talk about in a bit, but it's very clear that you're doing something right. And believe me, when I have people on this show, one of the first things I do is start Googling, all right, who hates them? And I can't really find anyone saying negative things about Influx. Believe me, I've looked. Am I just looking in the wrong places or does the community actually seem to like you folks? I talk to the community every day and I have a ton of positive interactions with them and we're always working to bring them closer and bring them

Starting point is 00:08:55 into our discussions as we move as a company. I think that community engagement is really important and it also drives a ton of transparency in the organization. Telegraph is one of our projects. It's a data collection agent and all of our discussion happens in the public with our community. I think that's really important and it makes people want to be a part of that community.

Starting point is 00:09:25 We want people to want to be a part of our community, and that's what we strive for. That tends to be a hot button that gets a lot of people riled up. Let's change gears a bit and talk about something else that I've been using to successfully annoy people for a couple years now. I have been a long fan of the idea that multi-cloud as a best practice is stupid. If you're building something and you want to be able to magically deploy it to AWS and GCP and Azure and Oracle Cloud because you lost a bet somewhere, then you're slowing yourself down and effectively trading feature velocity for a level of faux agnosticism that you're not necessarily going to ever take advantage of. And I stand by that statement. But what people then don't listen to is paragraph two after that, which is here's a list of exceptions. And first and foremost among that list of exceptions is who are your customers? If your customers are attempting to service their own

Starting point is 00:10:27 customers in a variety of different locations, they have to meet them where they are. Our service is awesome, but you have to move over to this cloud provider is absolutely not going to happen. You're not going to be able to serve their needs. And in turn, telling customers that they should migrate their own work, their own stuff and become multi-cloud themselves. If they listen to previous episodes of this podcast, they're going to say, oh, wait, we've heard about this. It's stupid. Now, because of you being who you are and having to service a wide variety of different customers, you fundamentally need to have an offering that spans beyond a single provider. No, I completely agree with you. need to have an offering that spans beyond a single provider.

Starting point is 00:11:06 No, I completely agree with you. I think one of the things that you look at is, we are a platform company, and we want our users to build on our platform. When you're running a platform company, and you want users to build on top of it, you need to make that platform available where your users are. The reality is that different companies use different clouds for different reasons. I don't think that one company is going to, you know, it makes sense for one company to

Starting point is 00:11:35 run in multiple clouds in most cases. We have some customers who are running in Azure Cloud for specific reasons that, you know, we're not really, we don't really specific reasons that we don't really care what those reasons are. We just know that they can only run in a specific cloud and that's where we want to be. And so, yeah, we 100% want to run our infrastructure in as many clouds as our customers need. And that's sort of a piece that often gets lost in nuanced conversations about this when customers have needs you've got to be able to meet those needs rather than condescendingly telling them that they're wrong all the time unlike certain cloud companies we can all think

Starting point is 00:12:15 of but not name the fun part where that also expands beyond just the idea of multiple clouds it's easy to also sit here and say, oh, and hybrid is dumb too. You shouldn't have any on-premise physical data centers. Oh, that's adorable. Great theory, but here in practice, we have this thing called legacy, which engineers hear as old and broken, but here in the real world,

Starting point is 00:12:38 I hear as things that make money. So I don't see hybrid going away anytime soon. And unlike a lot of different platform companies we could name, you have viable options for folks who are running on-prem. Was that something you planned from day one? Did that accidentally happen along the way as you were building this stuff out? Yeah, so we have on-prem and we have cloud. And I think the interesting thing there is a lot of companies will tell you that they're all cloud.

Starting point is 00:13:07 But if you actually dig down into their engineering organizations or dig down into the development areas, you'll actually find a ton of quote-unquote on-prem development. You've got people writing applications against on-prem versions of software. And they want to make sure

Starting point is 00:13:23 that those applications run, whether it's on-prem, whether it's in the cloud, wherever. And so I think one of the big things that we focused on is, again, the idea of a platform and being able to develop against that platform and having a common set of APIs, whether you're running this stuff on-prem and you're building out development as a proof of concept to make sure that it's working for you, or you're running this stuff on-prem and you're building out development as a proof of concept to make sure that it's working for you, or you're running this in the cloud and you want to service a global customer base, or if you're an IoT shop

Starting point is 00:13:52 and you're running in an oil rig in the middle of the ocean and you don't always have cloud connectivity. So I think, again, it's another question of where are customers running their infrastructure, and since we're an infrastructure company, we want to be where they are. But something you just mentioned is fascinating, the idea of oil rigs, for instance.

Starting point is 00:14:09 People tend to not necessarily have a lot of exposure to that type of environment. But to my understanding, a lot of them tend to live in the middle of the ocean where internet connectivity, shall we say, comes at a premium. And having something cloud-hosted just simply isn't an option there for anything that needs to be even slightly performant.

Starting point is 00:14:27 So there really is no alternative short of asking AWS to build a region in your house, effectively. I've asked. They won't. U.S. Bathtub 1 will not exist this year, but we're optimistic for 2020. There are use cases that the cloud cannot, with current technology, address. And I think that there's this certain willingness of the part of the cloud native sort to turn our noses up at that type of use case. But things like that are important. And customers are there with problems. Making sure that you can address what those customers are up to and meet them where they are is something that I think an awful lot of companies just sort of tiptoe past

Starting point is 00:15:10 and don't really investigate in the name of ideological purity. Yeah, I think you're right. And I think we see, at least in our customer base, basically where the application or where the infrastructure is deployed kind of serves for different use cases. And so when you look at running something on the edge or running something on-prem on an oil rig, for example, you're basically, who's your customer in that regard?

Starting point is 00:15:37 It's the individual operator of the system on the platform looking at real-time data of how different gauges, how different pressures are operating. And that's valuable, and that gives you a lot of capabilities and insights. But in order to get long-term trends, in order to do large-scale analysis across hundreds of platforms all over the world,

Starting point is 00:15:59 you want to send that data into the cloud for processing. And that data that's in the cloud is going to be massive aggregated data across many different platforms. And the audience for that is going to be a data scientist who's building out a machine learning algorithm that can then be pushed down into the oil rig. And so you get this system where you're actually running your software, you're running the same software,

Starting point is 00:16:23 but you're addressing two different needs and two different business needs. I guess I have the advantage on some folks because although I've never worked on an oil rig, I did grow up in rural Maine. And you have about the same level of internet connectivity in some of those places as you would on an oil rig. So running things in my living room was always what I did growing up. There was always the one room that was 20 degrees hotter than anywhere else because that was my makeshift data center with crappy old computers. And it was terrible, but it was also a half step above IBM Cloud, so there's that. And eventually, the world modernized. Internet came to rural Maine, but I remember those days where you're generating more data locally than you're ever going to be able to shove into however large a pipe is to get it to a cloud provider. So especially when you're in the business of doing data collection and aggregation, which fundamentally is what a lot of databases are, whether they're time series or not, it needs to be as close to what's generating those inputs as possible, I'd imagine. Yeah, and especially true for time series data

Starting point is 00:17:26 where time is important and latencies are important and milliseconds matter, right? And so we get into a scenario, our database is one of the few that can do at the nanosecond level. And while we're not recommending that people monitor everything at the nanosecond level, it does help if you can trigger an event

Starting point is 00:17:46 and then you start collecting data a lot more frequently than you normally would so that you can see exactly what's going on during that event. And then you go back to one-second intervals after it's finished. And so I think putting your database close to where you're located and reducing that latency gives you an advantage in your response time to the specific problem that comes up.

Starting point is 00:18:11 Absolutely. Anyone who wants to experience this themselves can wind up trying something in an AWS account and then just start a stopwatch and see how long it takes that event to show up in either CloudWatch logs or, far worse, Amazon CloudTrail. They put the eventual in eventual consistency. There's value to getting rapid response that can inform what's going on. I mean, there are use cases where that simply won't do.

Starting point is 00:18:32 Imagine having significant latency in a self-driving car, for example. At that point, oh, you should have stopped at that light four lights ago. Not a great plan. Right, and yeah, the latencies on on-prem and when you're close to the source are very different than the latencies on-prem and when you're close to the source are very different than the latencies in cloud. And so, again, it's different use cases that you're solving and it's different dimensions where you're analyzing the data. Our biases tend to inform how we wind up looking at different tools

Starting point is 00:18:57 and how they might factor into our own use cases. So rather than continuing to assume that everyone would use a time series database like I would, because I have a sysadmin background, what kind of customers do you have? What are they doing with a time series database that isn't, for example, just monitoring whether a computer is up or not? Yeah, great question. So to be clear, a lot of our customers are monitoring whether a computer is up or not.

Starting point is 00:19:23 That is a huge use case for us. It seems a common pain point. Yeah, it turns out it's hard to keep those things running. No, so we have a ton of use cases around time. And our belief is, if you actually look at the data that's being collected and the data that's out there, most of the data that you're seeing is actually more valuable when you look at it through the lens of time and you look at it how things trend over time. And so even if you think of traditional data stores, if you add a time element to it, you suddenly start getting more insights than you could without. And we've got use cases everywhere. So for example, we were talking about oil platforms earlier.

Starting point is 00:20:08 We have a company named Equinor who is using our platform for monitoring those oil platforms in the Norwegian continental shelf. And it's really exciting and really interesting to hear some of the stuff that comes out of there. But basically the ability to gather that data in real time and make decisions on it was really important for them. When you think about this, we consider this an IoT use case. A lot of people when they think about IoT, they think about you're looking at temperature sensors or you're looking at different air quality measures. But a ton of our industrial IoT customers

Starting point is 00:20:49 are monitoring large, massive oil rigs or drilling platforms or things you wouldn't typically come to the top of your mind when you're talking about IoT, but very, very important use cases. So what use cases do you have that are outside of the IoT space? I mean, as much fun as it is to talk about, A, oil rigs, and B, refrigerators that tell me when the milk is expiring,

Starting point is 00:21:13 what other use cases do you see from customers? Yeah, IoT use cases are really fun to talk about because they're so varied and widely used. But we have people using us for a ton of different things. So as I mentioned before, we have customers that are managing data centers with tens of thousands of machines. And we even work with NASA, who's monitoring their infrastructure for launching satellites. And at least until it launches, it presumably has a better connectivity to the internet

Starting point is 00:21:43 than it does once it's in space, although one starts to wonder. Exactly. You never know what Elon's up to. The problem I run into always is that I tend to think of these things just in the context of what I've used time series databases for before. Now, the challenge, of course, is I was a network admin once upon a time when the closest thing we had to a time series database was rrd tool uh or mrtg that wrapped rrd tool and as a result oh time series databases those are those things that are terrible and cause all matter of problems and its primary use case is to be embedded in cacti whose in turn primary use case is to sit in the corner until someone breaks into it and uses it as an attack platform that That may not be entirely hypothetical. But increasingly,

Starting point is 00:22:27 the capabilities have dramatically changed. And being able to see such, I guess, different capabilities now, I look back at the outages of yesteryear, for lack of a better term, and it would have been so much easier to diagnose some of these outages with the right tooling, as it turns out. Unfortunately, it is a time series database. It is not a time traveling database. So I don't think you can go back and necessarily help me with those problems in the past. But for other people who, I guess, who are stuck in that outmoded view of time series,

Starting point is 00:23:01 what's changed? How would you describe the advancements? Because I'm sort of going to assume just on a lark here that you didn't declare influx DB feature complete in 2013 and the rest has just been maintenance releases. No, definitely not. I think the key fundamental difference between a generic general database and a time series database is you can turn a generic database into a time series database with enough man turn a generic database into a time series database with enough manpower, with a big enough team.

Starting point is 00:23:30 With a dumb enough perspective, you can turn DNS into a database as well. I've done it, but everyone looks at you with this horrified look on their face, and then you get asked to leave the interview immediately. Right, right, exactly. It's basically about what you ultimately want to accomplish and where you want to put your resources. And I'm a product manager, so I'm always thinking about resources and priority.

Starting point is 00:23:55 But if you're a company and you are solving a specific business need, does it make sense to actually spend 50% of your time taking a generic database and turning it into a time series database so then you can solve your problem? Or would it make sense to go out and get a best-in-class time series database so that you can focus on the thing that's actually going to drive capabilities of their business?

Starting point is 00:24:20 And so when I think of the reasons why you would choose a time series database, behind the scenes, we've done a ton of legwork and a ton of the optimizations that you would have to do yourself if you were turning a generic database into time series and so that gives you a great starting point and allows you to focus on the business problem that you want to solve instead of these infrastructure issues with creating a

Starting point is 00:24:45 time series database. What's interesting to me, if I take a look through the Influx suite of tooling, for lack of a better term, you have a hosted version that people can run wherever you have an enterprise offering, you have cloud hosted versions. What is the decision matrix for people who are trying to figure out which version would make sense? Great question. We talk to our customers a lot and basically we break it down into two different paths. Between those paths are different decision points

Starting point is 00:25:16 that you can make depending on what you want to solve to cross the divide. On the on-prem scenario, you've got the open source instance that you can run locally and build your application against. You have an enterprise instance of on-prem that you can host yourself if you need more power or high availability or security. And then we also have the equivalent in cloud. So in cloud, we have a free tier that lets you get started quickly and build your application against. And then if you want to host all your infrastructure in cloud. So in cloud, we have a free tier that lets you get started quickly and build

Starting point is 00:25:45 your application against. And then if you want to host all your infrastructure in cloud, we then offer that pay-as-you-go service in cloud. And so basically, it depends on your use case, it depends on your infrastructure, it depends on how you want to run things. But either way, whether you're on-prem and want to manage all the infrastructure yourself, or you don't want to manage any infrastructure and you want us to do that for you, we have offerings for both. One of the, I guess, surest signs of a fanatic is when they have absolutely nothing negative whatsoever to say in any context about a given topic. You don't strike me as someone who falls into that category. So I have to ask, from a high level, what unsolved problems are there today in the world of time series databases? Ooh, that's a tough one.

Starting point is 00:26:33 Unsolved, unsolved problems. So the thing that I see when I talk to customers is, and you're starting to see this a ton in the marketplace, is I think everybody had this dream and, you know, bringing this back to Hadoop is this like, this data lake, which actually turned out to be a data swamp as people realized. But you've got these specialized databases, and I 100% agree that you should use the right tool for the job. And so I think the specialized databases make a ton of sense. But I think the issue that you're seeing is you're actually starting to create these individual silos across different storage, different storage mechanism, different databases, different tools.

Starting point is 00:27:15 And one of the struggles that people have is connecting all of that stuff together, right? And so you see a ton of companies coming up that are basically just connector companies that bring data from A to B and C to B and all of these things together. And so I think that's a huge struggle in specialized databases and in time series. And so that's something where you'll see a lot of development in our data manipulation language

Starting point is 00:27:49 called Flux that actually will make it much easier to bring that data from those disparate databases, from those disparate locations, and run analysis that you couldn't really do before. We see a ton of people who, in the past, have basically had to build applications on top of multiple platforms to bring data from all of those platforms and combine them. And our goal is

Starting point is 00:28:11 to bring that application logic closer to the data store. And so that you can then leverage the capabilities of the community and not have to build everything yourself. And so we're starting to see areas in flux where people are connecting to different data sources, bringing that information that has typically been siloed in individual tools into the same place and running that analysis. I have some level of contempt for companies

Starting point is 00:28:38 that are entirely built around solving a particular pain point or a problem that are effectively one feature release away from another service or product that puts them completely out of business from a model perspective. I'm not going to necessarily name names, but if your entire company's job is to take data in one format and translate it to another, if the source company releases one day a feature of, oh, by the way, you can now query this and get a JSON result instead, and that destroys your company. Maybe when you're raising giant piles of VC money, thinking through how unassailable of a moat you've built really winds up being worth pursuing.

Starting point is 00:29:18 It definitely seems from conversations I've had with you and with others that this is a serious, it's not a problem that you've built things around. It's an entire field of inquiry where you might have heard of time series databases and are scratching the surface a little bit, but the further you get into it, the more you realize there's an entire sector here. There's an entire ecosystem that is dealing with painful problems. I mean, we've gotten to a point now where I wouldn't believe this was possible. You can commit the cardinal sin of posting a graph on the internet and not label your axes, but you can generally assume with a time series driven graph that X is time and it moves from left

Starting point is 00:29:56 to right. That alone is an industry win. Congratulations. You have finally gotten statisticians and high school math teachers to stop screaming about that one. That alone says that there's a lot more to it than that. I think my rubric for determining whether something's important or not based upon a 10th grade experience I had might be slightly flawed, but I'd also think that this is something that is starting to rise in awareness. People are starting to understand that there's something there. And again, I keep looking for dirt on you people and I'm having a lot of trouble finding it. I appreciate it.

Starting point is 00:30:28 I think the one thing that I like to focus on, again, I really love the community that we've built over the years and the trust that we've built in that community. And so I think if you're honest and you're doing good work and you're being transparent about the work you're doing, then it'll attract good people.

Starting point is 00:30:49 And so I'm, again, going back to our original conversation. That's one of the main reasons I came here in the first place and it's the reason I'm still here now. Well, thank you so much for taking the time to speak with me. If people want to learn more about what you're up to, where can they find you? If you want to learn more about Influx Data, influxdata.com. We've got a free sign up for our cloud cloud to offering. You can you can take it for a spin and see if it meets your needs. And that's a great place to learn more. Terrific. Thanks again for taking

Starting point is 00:31:19 the time to speak with me today. I appreciate it. Thanks, Corey. Thanks for having me. Russ Savage, Director of Product Management at Influx Data. I'm Corey Quinn, and this is Screaming in the Cloud. If you've enjoyed this episode, please leave it a five-star review in Apple Podcasts. If you've hated this episode, please leave it a five-star review in Apple Podcasts, and in the comments, ask Russ to hit me with his belt. This has been this week's episode of Screaming in the Cloud. You can also find more Corey at screaminginthecloud.com

Starting point is 00:31:52 or wherever Fine Snark is sold. This has been a humble pod production stay humble

Screaming in the Cloud - The Future Is Time Series Data with Russ Savage

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.