The Data Stack Show - The PRQL: Making the Data Stack Serverless in the Cloud with Mike Driscoll of Rill Data

Episode Date: March 11, 2024

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Starting point is 00:00:00 Hi, Data Stack Show listeners. I'm Pete Soderling, and I'd like to personally invite you to Data Council Austin this March 26 to 28, where I'll play host to hundreds of attendees, 100 plus top speakers, and dozens of hot startups on the cutting edge of data science, engineering, and AI. If you're sick and tired of salesy data conferences like I was, you'll understand exactly why I started Data Council and how it's become known for being the best vendor-neutral, no BS, technical data conference around. The community that attends Data Council are some of the smartest founders, data engineers, and scientists, CTOs, heads of data, lead engineers, investors, and community organizers. We're all working together to build the future of data and
Starting point is 00:00:40 AI. And as a listener to the Data Stack Show, you can join us at the event at a special price. Get a 20% discount off tickets by using promo code DATASTACK20. That's DATASTACK20. But don't just take my word that it's the best data event out there. Our attendees refer to Data Council as Spring Break for Data Geeks. So come on down to Austin and join us for an amazing time with the data community. I can't wait to see you there. Welcome to the Data Stack Show prequel. This is a short bonus episode where we preview the upcoming show. You'll get to meet our guest and hear about the topics we're going to cover. If they're interesting to you, you can catch the full-length show when it drops on Wednesday. We are here with Michael Driscoll from Rill Data.
Starting point is 00:01:33 Michael, thank you so much for joining us on the show today. Great to be here, Eric. All right, well, give us your brief background. How did you originally get into data and what are you doing at Rill today? Yeah, thanks. My background is actually probably not that dissimilar from a few of your guests you've had over the years. I actually started my career as a software developer working for the Human Genome Project a couple decades back. And naturally, there's a lot of data in the Human Genome Project. And that was really the beginning of a multi-decade love affair, working with data at scale, heterogeneous data.
Starting point is 00:02:17 And since then, I've started a few companies. My first startup was an e-tailer called customink.com. We sell t-shirts on the internet. I later started a consultancy called Dataspora. We did a lot of consulting work for banks and folks in the big data era. I then went on to start a company called Metamarkets, which did analytics for advertising and was acquired by Snap, the makers of Snapchat.
Starting point is 00:02:47 And now I've got Rill Data. We're a few years into that journey and focused on an operational business intelligence product with Rill. All right, that's quite a journey, Michael. And I know that part of this journey also includes some very interesting technologies like Druid, and from the conversation we had earlier, I've learned a few things that I wasn't aware of about Druid and the relationship it had with BI and what the initial ideas behind it were, and I'm super excited to get into that and learn more
Starting point is 00:03:26 about how you started building Druid while you did that and how you ended up today, actually, with Rill Data that has Druid on the backend, but it's more than a query engine, right?
Starting point is 00:03:43 So I'm super excited to get into the details. What about you? What are you excited to talk about today? Yeah, well, I think there's a few big macro trends that we're seeing in the data world today. I would say I would be delighted to talk about some of the emerging data engines that are out there for powering fast analytics at scale, really at any scale.
Starting point is 00:04:11 So Druid and ClickHouse, also DuckDB. What for me is particularly exciting is the trend towards serverless frameworks. There's a lot of new frameworks out there for really taking not just, you know, data technologies to the cloud, but making them serverless in the cloud. And so I look at, yeah, you know, almost any area of the data stack I think is being remade to be truly
Starting point is 00:05:04 serverless at scale in the cloud. And that's a pretty exciting area that's going to take several years to play out. Yeah, 100%. We'll have a lot to talk about that. So, Eric, what do you think? Should we dive in? Let's do it. All right, that's a wrap for the prequel.
Starting point is 00:05:23 The full-length episode will drop Wednesday morning. Subscribe now so you don't miss it.
