The Data Stack Show - 116: Data Democratization & Self Service with Aron Clymer of Data Clymer

Episode Date: December 7, 2022

Highlights from this week’s conversation include:Aron’s background in the world of data (2:18)Recent Clients and major projects (3:30)Helping to spearhead data-driven growth at Salesforce (6:50)St...ories about Marc Benioff, co-founder of Salesforce (16:12)Biggest learnings as a consultant in the data strategy space (17:58)The need for data democratization (23:33)Advice for Aron’s younger self in consulting (28:45)Current trends in data democratization and sales service (35:01)Aron’s favorite tools and platforms to use (42:19)Favorite part of the consulting process (47:45)Final thoughts and takeaways (50:29)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Welcome to the Data Stack Show. Each week we explore the world of data by talking to the people shaping its future. You'll learn about new data technology and trends and how data teams and processes are run at top companies. The Data Stack Show is brought to you by Rudderstack, the CDP for developers. You can learn more at rudderstack.com. Welcome to the Data Stack Show. Costas, we love talking with consultants on the show because they get to see so many things on the front lines and they craft so many technologies, especially the ones that are vendor agnostic, implementing all sorts
Starting point is 00:00:40 of technology. And today we're going to talk with Aaron Clymer. He founded DataClymer and really interesting consultancy focused on warehouse visualization stuff. But he was really instrumental in turning Salesforce into a data-driven company, which is absolutely what I want to ask him about. You think about Salesforce, I mean, they're so successful. And I just want to ask him about. You think about Salesforce. I mean, they're so successful. And I just want to know, what does it mean for them to go from being not data-driven to data-driven? I mean, that sounds weird.
Starting point is 00:01:16 You would think that they would be out of the box. So that's one. And then if Aaron is so kind, I just want to ask him if he has any good Mark Benioff stories because, you know, there are lots of those and he was there for a while. So that's what I'm going to ask him about. How about you? Yeah. What I love about consultants is that they have seen enough out there to summarize use cases, but also edge cases.
Starting point is 00:01:46 So I think it's like, I really, really like to talk about what patterns exist out there, building data infrastructure. And also like share some, you know, weird stories, like some edge cases, some things that like you really can't like to hear about. So yeah, that's what I would love to chat about with him. And of course, like also to listen on the Salesforce stories. All right, well, let's dig in. Let's do it.
Starting point is 00:02:20 Welcome to the Data Sack Show, Aaron. Thanks for giving us some of your time. Oh yeah, absolutely. Thank you for inviting me here. Okay. We'll start where we always do. Give us your background and tell us what led you to starting a data climber. Oh yeah. I've just been so passionate about the data space for so long. I had been in corporate America for 20, 25 years before starting this. And it just kind of dawned on me finally that I'm never going to be a true expert. I don't get into consulting.
Starting point is 00:02:54 I was at Salesforce and PopSugar for 10 years. So two companies, two stacks, two sort of business challenges and data sets in 10 years. And I thought, I got to do 50 of those in a year, you know? So that's how I'm going to be able to walk into the next company or the next client and be able to say, I know exactly what you need to do in this situation for this business problem you have in data. So I started Data Climber. We're a systems integration data warehousing implementation firm and started that six years
Starting point is 00:03:24 ago and just have loved the journey ever since. Very cool. And could you describe just like a couple of recent projects so that we get a sense of, you know, what's a, I know having done consulting, there is no typical project necessarily, but maybe just a couple of examples of like clients and projects that you've done recently. Yeah, sure. We're in a lot of different verticals and industries simply because data warehousing overall is very applicable across the board, right? And we can do this almost everywhere.
Starting point is 00:04:04 We happen to get into the major league sports and actually all of sports early on. I think the San Francisco Giants were our fourth client. And the, there, it's just a very interesting story that I think does expand across industries, but most major league sports teams are on a third party, you know, MSP managed service, something, you know, they're not doing their own stacks and they're all feeling a lot of pain around not being able to customize the way they want or bring in the data they want. So there's just this big need out there to own their own destiny, you know, own a stack, do whatever you need to do. You don't, you know, never get stuck kind of thing. And so that's resonated across the industry. So we've been at a lot of different sports teams we implemented for
Starting point is 00:04:45 Las Vegas Raiders, the Vikings, six or seven other teams. And so we've done a lot, including working with the NFL directly. So that's been just kind of a fun, it's almost a niche.
Starting point is 00:04:57 I don't want to label us as the sports SI because of course they're fun logos and they're fun projects to talk about. Outside of that that a lot of gosh payment providers banks financial services and trying to think of a really interesting one to talk about yeah across the board a lot of high tech too i mean i come from the bay area and high tech and uh quite a few COVID based as well,
Starting point is 00:05:25 sort of, you know, health tech as well. Colors, one that's been really interesting, just they have a whole platform that sets up testing sites for medical testing. COVID tests would be a good one, of course, that they cover. And just being able to really drive all their, again, operate more of their operations through data, which isn't a topic I'm just excited about in general. So yeah, a lot, just a different, a great, Pete's Coffee is a great one too.
Starting point is 00:05:50 That's all inventory operations. More of a West Coast coffee company, if you're from the West Coast, but nationwide technically. Yeah. Okay. So I have to ask on the sports side of it. I mean, I don't know a ton about sports, but are you dealing with, you know, if you think about a franchise, you sort of have like player data, but then you also have, you know, this is not the sexy ball, sexy money ball aspect. Yeah, yeah, yeah. So it's not the player side.
Starting point is 00:06:28 It's all the business side of these teams. It's sales and marketing that it's all about fan engagement at the end of the day, which is why that translates across the board, right? It's essentially customer engagement. Yep. You know, that would be the approaches that were applicable across industries. For sure. It makes total sense. Okay. Well, let's, so a lot of questions about current stuff and what you're seeing on the grounds, you know, because one thing I love about consulting is that you get to see such a wide variety of problems at a wide variety of different
Starting point is 00:07:03 business models and all the tools out there as well. But I'd like to rewind. So you spent a really long time at Salesforce. So I think you said it was seven or eight years at Salesforce. And, you know, as we were talking before the show, the main thing was really helping the company become more data driven in a number of different ways. And we're talking about huge scale here, even though it's an even larger organization now. But you said you worked with, I think it was 450 product managers. And give us just an overview of what was the state when you entered and then what was the state when you left and then I'd love to pick your brain on you know how you actually drove that journey
Starting point is 00:07:48 yeah absolutely in eight years does seem like a lifetime for a lot of people these days one organization yeah I came in there in 2008 and there was about 3,000 people at the time in the company I had a feeling they weren't using data as much as they could be. And sure enough, they weren't using data at all, actually. First thing I did there is actually build a predictive model against their 30 day free trial dataset. I bet I could predict with some, you know, some kind of percentage accuracy, who's going to convert, you know, which customers are going
Starting point is 00:08:24 to convert on 30 day free trial. I bet in the first five days, there's some to convert, you know, which customers are going to convert on their day-to-day free trial. But in the first five days, there's some interesting usage data, right? Sure enough, yeah. There was a signal there in the first five days. You could get a much better idea of who's going to convert. I brought that to the head of sales, presented my findings. And the answer I got was, well, why would we need data? We've got a huge sales force.
Starting point is 00:08:44 We call every prospect on the phone and we talk to them. So I don't need data to help me in. I, that was just, it just, it was a, that was a great example of what I sort of the headwinds I was facing. I'm like, okay, yeah, you know, there's no, no real need for data because we think we're doing everything as well as we can be. So, you know, it was a lot of over eight years of having discussions with mostly Prio is mostly dealing with a product usage data set.
Starting point is 00:09:12 So I was dealing with product managers and product more and talking to them, but that bled into sales, marketing, customer success as well. There's just having a lot of conversations around, let's get out of the vanity metric sort of mentality of always having up into the right number of users using my product because Salesforce always is up into the right. Salesforce grew 30% year over year, and I think it still is, you know, since day one, since 1999. So growth has not been the problem for Salesforce, but finding metrics that actually make a difference and using data that actually
Starting point is 00:09:45 is going to be actionable was the challenge. So it was more about having the right conversation about what is actionable data, you know, and getting people to really think through, if I had the answer right now in front of me, what would I do? And if I could figure out what I would do, then it's worth measuring and worth finding out what is the answer to that, you know, business problem and what data do I need to answer that question? A lot of that, those kinds of conversations. Yeah, for sure. And it's so interesting to hear, you know, you think about a company like Salesforce and you're like, well, surely it's data-driven, right? It's so great to hear that. It's like, we call every customer on the phone. We don't need data. So what did it, how would you,
Starting point is 00:10:23 I'm sure there's an infrastructure side to this, right? So was there any existing infrastructure for BI or, you know, what product managers across that many products, you know, what were they doing to get any intelligence on, you know, what they were trying to accomplish? Yeah, right. I mean, when I got there, and unfortunately, one of the handcuffs of being in Salesforce was it took a long time to get cloud technology internally in there. So, I mean, I started in 2008, and we were on an Oracle data warehouse with business objects, all this legacy stack that was very slow and very, took a big team to maintain and operate.
Starting point is 00:11:14 And yeah. Wasn't cloud Salesforce this whole thing though? Like, that was like their, you know. That's slightly hypocritical. It's super hypocritical. It came down to trust, customer trust being their number one value. And they really wanted to make sure they had tried and true technology internally that was super trusted. That was it.
Starting point is 00:11:35 That was the reason. So that made sense. I could see that, you know, from a business strategy standpoint. But unfortunately, it just meant that we had to use technology that had been around for quite a while, right? No cutting edge. So for me, it was about the patterns and, you know, luckily that is actually the key, I think, to doing it right and doing it well in data warehousing, specifically patterns, design patterns, best practices, you know, that hasn't changed all that much. It's changed a little bit, but but you know a lot of those patterns that we developed on an old legacy stack still apply today in terms of best practices so it's great to really hone those you know get those really down
Starting point is 00:12:14 especially when you have poor performance right that's when you have a lot of efficiency you have to you know build into the system so i enjoyed like coming up with how are we going to use this legacy stack? How are we going to do it well? And when it came down to was when I started, it was this data warehouse with very, you know, no really standards on how you're going to ingest data or get data or build metrics or even model your data very well. So we just had to come up with a lot of standardization around data ingestion specifically. So we had to, you know, with 400 product managers and, you know, my team was only 20, 25 people. I mean, you know, to scale that we had to go to a product manager and say, let's talk about your actionable metrics. Once we
Starting point is 00:12:55 understand what those are now, here's how your team can instrument all of this and the products that we capture all this data and just build some frameworks that made that really easy to do. Super interesting. Okay. And what was it like when you left? Like how, you know, what were, give us a couple examples of, you know, the ways that product managers were operating with data that they, you know, that they weren't when you started. Yeah. Well, first of all, I guess I should say from a technology standpoint, my, probably my biggest
Starting point is 00:13:24 win from a, from the stack itself was to be able to get Tableau in there and replace BusinessObjects as much as possible. Because BusinessObjects, I must have trained 400 people on that tool, and I don't think any of them ever used it very much. I mean, the usability was pretty bad. It's a large SAP product. But Tableau, as most of us in the data professional world know, is much easier to use. So that was great to be able to train in a lot of people to self-service their data using Tableau, at least at the time.
Starting point is 00:13:56 And so, but what was more important, I think, was really just the fact that everybody, including, again, a lot of people outside of product knew how to ad hoc query their data and get answers. You know, how to explore data and slice and dice on all the dimensions they cared about. And then build data products actually out of that. So we built things like an early warning system to detect customer health, essentially, early warning for unhealthy customers, let's get them back healthy. Healthy.
Starting point is 00:14:23 That was really more for customer success, right? But it was all based on product usage data from the product teams. And then, of course, the product roadmaps being data-driven was a big thing and making sure that a lot of product managers were making those data-driven decisions. And then the third thing we did that was actually really had a pretty big impact was just internal advanced analytics. So for instance, we would have a couple folks on the team who could do predictive modeling and we would build a predictive model, not necessarily to predict the future, but just to analyze the system and understand the drivers of success of something.
Starting point is 00:14:59 Regression will tell you, you've identified your target, what success means. It'll give you the top five reasons why that success is being achieved at certain customers, for instance. And so we actually did a bunch of analysis that led to whole new products being built with the results. So pure internal analytics, not operational production models or anything, but it was super useful. Yeah, yeah. Did you do any sort of cross-product stuff, right? Because you're collecting it from all these different products. And I mean, the interesting thing about the Salesforce ecosystem is lots of different products you can, you know,
Starting point is 00:15:37 you have the CRM in the center, but a big ecosystem of products, like very inquisitive. Did you do anything on that front? You know, I think maybe the closest thing was to develop white space analysis where you really were looking at each customer and understanding where the white space was, meaning what are all the products they're not using? You know, what are they using?
Starting point is 00:15:56 What are our opportunities to basically upsell, cross-sell, or at least fill out the picture for customers? But not necessarily like, you know, how one product usage affects the other or any of that thing. Yeah, yeah. Super interesting. Okay, last question for me
Starting point is 00:16:13 because I've been monopolizing. Do you have any good stories about Mark Benioff? Oh, man. I wish I did. You know, when they were smaller, I thought, where is he? He's got to be walking around here somewhere. You know, I have nothing good things to say about Mark. I mean, of all of the successful, you know, mega successful CEOs like that. First of all, he founded it. What kind of a founder can take it to a hundred plus billion, right? Really stand up guy found it and creating the foundation.
Starting point is 00:16:45 He created the one, one, one model to give back 1% of profit and time and, and created this huge foundation. I just, you know, just really. Yeah, just nothing but respect for the guy and what he's done and how he was, he stayed out of the headlines.
Starting point is 00:17:00 He there, you know, he was very stand up, respectable person that just drove the company to success. But nothing, no amazing stories about him. I liked the parties he threw. He threw great parties. That was really interesting.
Starting point is 00:17:13 Awesome. All right. Well, thank you for thank you for doing a Salesforce deep dive Kostas. Yes. Take it away. Thank you. Thank you, Eric. So Aaron, you mentioned that you turned into consultancy because you were hungry
Starting point is 00:17:31 like to learn more and like see more like use cases out there and like become, like, let's say like an expert motivator, right? Like in this space. So I'd love to hear like what you've learned and try to turn this into some kind of like patterns, right? And to do that, like my first question is about like the people who are coming, like, that are coming and they are asking for your help, right? So what are they asking for?
Starting point is 00:18:04 Like what's like the most common, like, let is that are coming and they are asking for your help, right? So what are they asking for? Like, what's like the most common, like, let's say project that you see out there is like people coming and saying, oh, like, we don't have like a data strategy, right, and we won't like to implement a data strategy, we won't like to start like identifying the data that we can use,, build the right infrastructure, all that stuff. Or you see more of a modernization need out there where you have businesses that they already do something, but they feel like, okay, we probably need to update a few things if we want to stay relevant. So, okay. These are like, just like two examples.
Starting point is 00:18:48 Hopefully they are like more and better ones. So I'd love to hear like what you see out there. David Pérez- Yeah, it's really interesting. I, you know, as I think about even what you're saying there and I think about how I'm always surprised at how, and maybe then it's the Salesforce story all over again, how, you know, companies are using data a lot less than you think. And so we are maybe surprisingly more working on the table stakes sort of basic stuff. You know, it's just get a data warehouse running
Starting point is 00:19:19 and self-service, get data into people's hands. Not even advanced analytics, not, you know predictive modeling necessarily let's just get a really great data model that really is a 360 view of the business or the customer to you know curate that really well with the again data modeling and best practices there but let's just self-service this data like that it still remains the number one use case and it hasn't changed in a long time. So we do normally work with kind of mid-sized companies.
Starting point is 00:19:50 So a lot of times they don't have something and we are bringing the entire stack to them for the first time. Another good quick just sports example, there is Big Ten. So a collegial sports conference with 28 sports, they didn't have a data stack at all so we're building a full stack for them but it's the innovation there is that all the schools are
Starting point is 00:20:11 going to use it too so it's going to be this shared environment that actually no other sports organization has anything like that even the major leagues don't have a really like shared environment like that they're still sharing files with sftp But so, you know, unfortunately, I don't have these amazing sexy stories of all of this advanced stuff because it really is just getting data in front of people. You know, I was thinking about what you said about from a pattern and from the business side too. Even that is pretty broad. You know, it depends. Some of our clients really want, you know, they're focused on the financial data and we just need to get financial data in front of people. They just want to actually create a P&L in a BI tool, even, you know, get out of their financial tool.
Starting point is 00:20:54 The pattern really is let's all get out of all of our analytics capabilities of our SaaS applications and let's get into a BI tool where we can do a lot more and do whatever we want. It's a lot more powerful. You know, we have some clients where we're getting their inventory data as a mess. They just need to get on top of their inventory data. For others, it's absolutely marketing is a big case study in general. Like a lot of marketers need a better marketing system and some digital transformation. Some let's get it in the cloud and go full cloud and migrate what we have. But, you know, it's just across the board. There's just a lot of almost every department in a company we work with, right?
Starting point is 00:21:32 Sales, marketing, finance, and on a company. Yeah. And do you see that like this transformation like happens, like stays, let's say in the department or is it is it more of something global that happens in the company? Let me give you an example. Make it simpler. Do you see finance coming and be like, we need BI instead of doing everything inside the SAP for whatever reason?
Starting point is 00:22:02 Or it stays there, right? Maybe it's an opportunity for you to expand in the accounts, obviously. But or you see more, let's say, broad projects where companies are like, we want to democratize our data. So we don't just want to take the data out of SAP, but we also want to make sure that everyone
Starting point is 00:22:23 inside the company has access to this data and they can figure out one way or another, like how to get far in from that. Yeah, we usually do start with a department or two, however, because it's mid-sized companies and not necessarily large enterprises. Good news is that we are able to convince them to make sure this is an enterprise data warehouse solution. It's a central data warehouse for the whole company. We might start with one department, one data set.
Starting point is 00:22:48 But even in my stories about Salesforce there, the product usage data set is the entire company can use that and find value from that. So even if we start with one department, the data is still consumed by a lot of different departments. So that's the story. We were huge fans of, again, keeping a single source of truth, one data warehouse.
Starting point is 00:23:10 And actually, maybe that gets back to your question about patterns again, is that I think is the important thing, right? Is to really centralize all your code as much as possible, govern it, keep it governed and controlled so it doesn't become a mess. Because trust, if you lose trust in your data,
Starting point is 00:23:26 that's the number one killer of data projects. You do have to put a lot of thought and governance into what you're doing. Yeah, 100%. And like, you mentioned the word like data democratization like a couple of times. Like, what does this mean in the context of the company? To me, that means as many people as possible who should have access to the data should have it in a self-service way. They don't have to be technical. They have a tool, cloud tool that they can use to then query the data, hopefully ad hoc and ad nauseum if they want to. So give you a good example. I actually back from just before I started this, when I was at Rand
Starting point is 00:24:02 Data at PopSugar in San Francisco, B2C company, and we democratized the data across every single department. And my favorite story from there is the PR department. PR department was two people who were trying to put out as many stories as possible. And they had a fashion search data set. So they could actually detect fashion trends anywhere in the world with this data set.
Starting point is 00:24:26 It was pretty fun. But the only way they could get the data and get a story was to submit a request to the data team and wait about two or three weeks, get an answer, and write a story. It was very easy to get that data set into a modern cloud stack with a cloud BI tool. In that case, it was Looker. And I was able to train them in about an hour and a half how to query that data to find any fashion trend in any place in the world at any time period, right? And so with just that one training, they were then able to produce a story every day.
Starting point is 00:24:58 You know, so that was like 15x productivity because of self-service with, you know, a pretty simple data set. The idea is to get every employee to be able to do that to then sort of incorporate data into their daily job, their business process, and make it more data-driven, that's how you change culture. Henry Suryawirawan, That's super interesting. Like let's say a little bit like longer, like in sculpture,
Starting point is 00:25:22 just like super interesting. So, okay. Making the data accessible is obviously a very important step. You need to have the data there available for anyone to go and work with it if you want to democratize that, but that's not enough, right? Like you also need the people, know, first of all, that the data is there available and also, let's say, build the kind of, like, thought process of, like, when they come with a new problem, like, to go and reach out for the data and see, like, how the data is being, like, held.
Starting point is 00:25:58 And so there's also, like, some kind of, I would say, like, educational parts in this whole process. How do you see this thing working? Because you know, like many times, like most of the time, maybe all of the time, like especially this show, like we focus like a lot like on the technology side of things. But it's very interesting what you said about like Salesforce and the insights that you tried like to communicate to the sales monitor, right?
Starting point is 00:26:25 Like, at the end, no matter, like, what kind of data system you have there, like, it's people's problem, like, to adopt and, like, use that, right? So how do we make people work with the data? Like, learn how to use the data, find the data. And maybe we don't have to, I don't know, but like, I'd love to hear like from you, like how big of a problem is and like how we can solve that. Yeah, I think that's still actually a huge problem. And it's not like I've cracked that not completely at all, but that's why I always
Starting point is 00:26:58 come back to data is hard. Even if you're an end user, it is difficult, right? Just having the data dictionary that makes sense is a huge challenge, you know, so that an end user can truly understand what they're using. I think there's always a partnership between a data team or more of a data, more of the data technical folks and the end users, a constant education. And there's always a communication there because when you want new data, you can't just use, you know, wave a magic wand and have it available. So you do need some help there. But I think it comes down to, you know, a lot of
Starting point is 00:27:30 iterative training essentially and get, and building people what they need. Once they get the hang of it and they do it a few times, but they can have some small wins quickly, then they're definitely going to keep using it. And there's also just that some people are curious naturally and some people aren't. And if you're not curious, you're probably not going to use data very much. But, you know, I think it's that small wins quickly. And at the same time,
Starting point is 00:27:56 and this is kind of what I most love in data is making it as easy as possible. So taking a lot of effort to curate a data set that is easy to use, right? That almost anybody can use. So it takes a lot of work to make sure you've got exactly the dimensions and measures of, you know, whatever you need. And not too much and not too little, right? To make it consumable and not boil the ocean.
Starting point is 00:28:19 But also low-grain data. I always go back to that. I'm not talking about aggregations of data. I mean, have access to the low-grain data, I always go back to that. I'm not talking about aggregations of data. I mean, have access to the low-grain detailed data, but make it just as clean and easy as it is to understand how to use. And I think a lot of people then finally do start to actually use it. And of course, you need a tool that makes it easy. Yeah, makes sense, makes sense. it easy. what you would say to make that person back there more successful in like this attempt to use data for some like because as you said like there are people that are like more curious like you have
Starting point is 00:29:14 people that like they are like say champions of like using data inside like the company but it's not always easy to do that right and it's very easy to get this care on. So what's your advice, let's say, to these people, how they can make it happen at the end? Yeah, that's a great question. I think if I had been a little more on top of a game plan there, I think I would have said, well, it doesn't hurt to do a little pilot, right? Let's do a little POC. Let's get this in the hands of a sub subgroup of the team. Let's get it, you know, 20, 30 sales reps. And let's see if we can come up with, you know, the top five value and benefits to them
Starting point is 00:29:57 of this data and this data set. And let's see if it makes sense. Let's try it out. So I would, you know, perseverance is one of our values. I think that's true in consulting for sure but it's true everywhere right if you persevere enough you usually can get what you're looking for and with a group that is a little bit of perseverance to again iterate through a something to get and use get this in the hands of end users get their feedback and see if you can really understand
Starting point is 00:30:26 how it's going to be best implemented. Henry Suryawirawan, Which function do you think is like the easiest one in the company, like to sell data related like projects? David Pérez, I think marketers are very data driven by nature, right? And there's so much, so much usage of that data. And then again, I think product managers as well. I mean, if you're building any kind of tech products, understanding usage of that product is pretty key. Those two, I think, are most data-driven.
Starting point is 00:30:52 Yeah. I would actually say like the same thing. Probably like I would say sales are like the less open to that stuff. Like, unless you can convince them that like you can, you know, take the pipeline and make it 10x bigger or something, that's the way to work with it. Yeah. That's what I thought. Maybe if you understood who to talk to, who to focus on in your lead funnel.
Starting point is 00:31:20 Makes sense. Right. But yeah. I don't think, but you know, Eric probably had like also some suggestions there because he hasn't worked like a lot on that stuff. But yeah, I think like sales is, it's a very interesting and silencing function like to go and like sell something like that.
Starting point is 00:31:37 But what did you think, Eric? Well, you know, one thing that's really interesting is the incentive structure is really different for sales you know than those other functions you know if you think about marketing or product you know product is motivated by you know planning the roadmap and then executing against the roadmap and, you know, perhaps even driving like feature adoption, although maybe that's, you know, sort of like a growth function within product.
Starting point is 00:32:13 And then marketing, you know, certainly a little bit closer to the sales side and then like you have numbers to hit. And so you're sort of pursuing those aggressively. But really, even in both of those situations with product and marketing, you're measured on execution and throughput. And of course, there are key metrics there, right? Feature adoption or mitigating churn on the product side or whatever it is, right? Activation. And in the marketing side, you have traffic and leads and all those sorts of things and of course there are like
Starting point is 00:32:46 performance bonuses and stuff but when you get into sales the foundational compensation structure is fundamentally different right like your motivation
Starting point is 00:32:55 is tied directly to you know primarily one vector and so I think that's what
Starting point is 00:33:03 makes it difficult is like the focus on marketing and product. You're rewarded for measuring because you're rewarded on output, right? And so measuring is in your best self-interest. And interestingly enough for sales, I'm not saying like, I'm not, this isn't like a dig on sales, but there's just less inherent self-interest and being heavily data-driven because that's not the way that you make money, you know, or make a lot of money. So I don't know, that's my take. But I would also say, I think that's changing. You know, I talked to more and more salespeople who are coming
Starting point is 00:33:50 from highly technical environments, they need to understand a technical buyer. And the more that once you see the power of using data to help you do your job, even in a sales context, like that, you realize how helpful it can be. So I think that's also changing a lot. And I think that, you know, I see salespeople more and more asking for, and even marketing data, right? Like these opportunities, what channels did they come from? Right? That's a great question for a salesperson to ask. Not all of them do. But those sorts of things are really interesting because it can help them
Starting point is 00:34:28 prioritize or, you know, sort of rank and do other stuff. So I do think it's changing. Yeah, I don't know. Does that answer your question, Kostas? Yes. So, Aaron,
Starting point is 00:34:41 let's go back to, let's get like a little bit more technical, like, and let's talk a little bit more about like the technologies that you see out there being used. Like we have, like we've talked like many times about like the modern data stack, like the clouds that we mentioned before. What are like some trends that you see out there that are really, let's say, transformative? And I would, my guess is that you will probably mention the cloud data warehouse,
Starting point is 00:35:14 but together with that, what else is out there that really enables the data democratization and the self-service around data? Alex Rauchmanis- Yeah, absolutely. Well, yeah, my world is around data warehouses. I always start there. But, you know, in my experience, getting the data, yeah, getting the right tool that is going to expose that data, self-service, make it easy, all the stuff we talked about is so critical.
Starting point is 00:35:40 And I've been, I've actually only used two BI tools that so from the BI standpoint that make that really clean easy and you just use the right approach the first one was Looker that I started actually started this company doing a lot of Looker work because Looker is just a lot executes live SQL against your data warehouse so you get instant query instant result again the most recent data you have. And you can query as much data as you want and add as much details. But I love that. Sigma computing is another one that's taken that same approach.
Starting point is 00:36:15 We work with a lot of them lately because they've just become so successful in the market. Again, because of that full cloud approach where you don't have to be technical but behind the scenes the tool is executing directly queries against your warehouse live but on top of that it has a spreadsheet interface pretty much everybody knows how to navigate a spreadsheet interface so the training in the ux is i'm finding this you know that to be pretty incredible in terms of how easy it is to get people using the tool. So I like Sigma computing specifically for BI. And then they're doing a lot more kind of blending traditional analytics with some really cool stuff. Like they can write back to the data warehouse and, you know,
Starting point is 00:37:05 eventually people will be able to use it even more like a spreadsheet, meaning they can even, you know, enter in some of their own data and mesh it with data that's in the data warehouse.
Starting point is 00:37:12 So really interesting stuff like that. And then this bleeds into like the pattern and the trend that I see that I love
Starting point is 00:37:21 is just more and more applications like that that are running directly on top of the data warehouse. So you have the most recent data you can get. And if there's new data that comes in and instantly available to you and all of that. So in the marketing space, we work with another company, Flywheel Software, that allows marketers to create audiences, run campaigns, do A-B testing, do some AI on top of that, all directly on the data warehouse.
Starting point is 00:37:49 And so I love seeing these full cloud approaches that are right on top of your stack. And the future of all of this is running more and more of your business directly off the data warehouse. But that's the most exciting trend I'm seeing out there. I have a question for you on what that trend, by the way, I agree, super exciting trend. Something that Kostas and I have talked a ton about. What does that mean for, let's call them maybe real-time packaged SaaS analytics tools. So Google Analytics, you have a whole class of product analytics tools,
Starting point is 00:38:28 you know, a la sort of Amplitude Mixpanel, et cetera, which are really useful because it's plug and play, right? I mean, everyone complains about Google Analytics, but the reality is it's ubiquitous in marketing because it automatically produces all the reports you need. And if you think about, if I said, okay, could you go rebuild all of these views that are in Google Analytics and Sigma, that is an unbelievable
Starting point is 00:39:02 undertaking, right? And arguably like not necessarily worth it. Obviously there are severe limitations to Google Analytics, which is why there's a huge movement to the warehouse. So there's this interesting gap. What do you think the future looks like for sort of the package SaaS side of things? So the gap, tell me again, the gap that you're pointing at exactly.
Starting point is 00:39:25 So like, you know, Google Analytics is much more limited in terms of data flexibility, querying data points than being able to do whatever you want in your warehouse. But at the same time, it has all these reports out of the box, right? And it's like, well, it doesn't, there's a point at which it doesn't make sense to pay an analyst to rebuild something that's already a prepackaged interface that people can just use out of the box. And so there's like,
Starting point is 00:39:53 I agree that things are moving towards running your business off the warehouse, but, you know, there still is a really big gap, like starting out of the box with Looker or Sigma or Tableau and building an entire suite of web analytics is like, just Google Analytics.
Starting point is 00:40:09 Yeah, that problem is actually being solved with templates. And both Sigma and Looker have a way for a third party to create all those templates and then just have them plug and play at any client. So I think that's where this is going to go. Like even Google, I mean, I haven't looked lately, but they may have already created this, right?
Starting point is 00:40:26 A Google Analytics suite that if you're using Sigma or using Looker, press a button and it's instantly all those dashboards and reports are actually there because it only took
Starting point is 00:40:36 one person to develop those and then they can deploy that anywhere. So it's basically a migration of a lot of that logic, but it's a one-time migration that everybody can take advantage of.
Starting point is 00:40:47 So that'll be where it goes in the long term is that, you know, that's going to make a lot of sense because the reason, again, you made the good point, is why would you even want to do it in your warehouse in the first place? Well, A, there's a lot more flexibility in the way you do it.
Starting point is 00:40:58 B, you can bring in data from any other data source right into that report from your Google Analytics report, essentially, and tack it on. And you can do that in 10 minutes rather than the five days it would take you to do in Excel. So, yeah.
Starting point is 00:41:10 So I think that's, you know, all of the forces are driving people to the data warehouse because of efficiency and, you know, this is a logical place and all of that work that's done. And so all these problems will be solved through templatization, migration of certain things and so forth. Yeah, I agree with that. I think the challenge that I've seen a lot of companies face in that process is that once you get full configurability, you start
Starting point is 00:41:37 changing. One of the interesting things about packaged SaaS is there are guardrails, right? And so they sort of force you to build meaningful reports because you are limited in what you can do. And when you remove those limitations, right, you go down these customization paths that are like, okay, you know, it's getting way too crazy. But yeah, I agree. That's super interesting.
Starting point is 00:41:57 Sorry, Costas, I jumped in, interrupted. No, no, no, no. You should more often, honestly. My pleasure. Please go on. Yeah, so we were discussing about the technologies. Any favorites? I know that's like a hard thing to ask like from someone like consulting, but let's take like data warehouses, right? Like there's a lot of evolution and innovation happening there from like starting with Redshift and having today like products like Snowflake. So what do you love to work with and what you would have, let's say,
Starting point is 00:42:38 suggestions on how to improve? David Pérez- Yeah. And it's still a really amazing, I mean, one of the exciting things about this whole space is it's kind of fractured. There are a lot of vendors and there's a lot of ways you can do this. And at Data Climber here, we are technology neutral at the end of the day.
Starting point is 00:42:56 So we're trying to use best-in-class solutions. So if another new best-in-class data warehouse comes along, we're definitely going to take advantage of that. Snowflake has just become almost a de facto for us, at least. And it's not necessarily because we made that choice early on and said, we're doing Snowflake. They just took over the market. A lot of our clients, by the time we even talked to them, which was pretty early in their data strategy, had already kicked the tires on Snowflake because it was so ubiquitous. So when I started this, it was a lot of Redshift, Amazon Redshift, and a little bit of BigQuery.
Starting point is 00:43:35 And it really shifted to Snowflake. So now like 90% of our projects plus are Snowflake. As soon as I saw Snowflake and started playing with it, I mean, yeah, it's just made a ton of sense to separate compute from storage, never get stuck with scalability, essentially. Just get rid of all the headaches. The way I described it all the time was you get rid of pretty much every technical headache of data warehousing. You know, you don't need an EPA anymore. You don't need to worry about compression and performance and scaling and, you know, compute even.
Starting point is 00:44:04 You can just keep adding compute really easily. You can do it programmatically even. So you can scale it up and scale it down as you want. So all these features and then data sharing across multiple database and data warehouse implementations so that it looks as if the data is in your data warehouse, but it's actually in somebody else's data warehouse. Stuff like that that only the cloud can do kind of blew my mind. So when I saw this, I thought,
Starting point is 00:44:27 yo, yeah, this is the place to be. And sure enough, I mean, we've never had any issues with, you know, Snowflake implementation or any regrets. So Snowflake is, for us at least, the de facto standard. And then for data, I mean, there's really three pieces
Starting point is 00:44:43 to data warehousing in my mind. There's the data warehouse itself. There's data ingestion. And you could talk about reverse ETL, getting the data back out of the data warehouse into systems. And then there's your visualization or your end user applications, right? So for the data movement, essentially, Fivetran has always been a very solid product for us. So we love 5-Tran. And we, again, I mentioned Sigma Computing for the BI layer.
Starting point is 00:45:14 We did a lot of those kind of tools. DBT as well. I mean, DBT is a really wonderful product to use. I love the fact that they came on the scene and also solved problems that you thought might have been solved by now, but were not necessarily, or not in an elegant way, like dbt sort of solves a lot of data modeling challenges in an elegant way. Makes sense. Sorry, I was going to, I didn't mean to interrupt. I was going to mention back to your
Starting point is 00:45:39 area, our discussion about templates and packages, even dbt is packaged up all these modeling, prepackaged modeling solutions, right? So there's another good example of, oh, you can plug and play a Shopify model or whatever, you know, whatever it is, because these vendors are out there and they're standard. Sure.
Starting point is 00:45:54 One last question from me about like the data warehouses. So you mentioned like Red Sea, Big Rear and Snowflake, and you were going through like the features of like Snowflake. It makes a lot of sense, like when you compare it with something like Red Sea, BigQuery, and Snowflake. And you were going through the features that Snowflake has. It makes a lot of sense when you compare it with something like Redshift, right? But BigQuery always was, and still is, a very self-service kind of data warehouse.
Starting point is 00:46:18 It scales, you don't even have to define warehouses. Someone could say that it's even even more, let's say serverless or easily use that like Snowflake. Why do you think that like Snowflake, sorry, BigQuery hasn't managed like to get more traction that it has? David Pérez- Yeah, that's a really good question. I think for me, at least early on, I would always get nervous when I realized it's still a shared resource and it's still a little bit hard to calculate exactly what your compute will be. Whereas Snowflake, it was just guaranteed. And so there was that. There was also just the pricing model of, you know, based on the scanning, you know, scanned records and how do you predict what that's going to look like for your cost model going forward? That was a little harder to get my head around then.
Starting point is 00:47:09 At least I understand compute minutes in Snowflake and I can probably get a good idea of how long something should run. And therefore I can try to predict my costs a little better. So I'm guessing, I mean, that is literally the non literally the less technical side of things, but it's really the marketing, you know, the go-to-market approach to BigQuery that made it a little more difficult for people to get their head around. That's my guess. Super interesting. Yeah, I totally agree, actually. Eric, all yours. All right. Well, we're really close to the buzzer here, as I like to say. But one question I'd love to know. So you said, OK, two big projects over a decade.
Starting point is 00:47:51 I want to do 50 of these a year, you know, 100 of these a year. Yeah. What's your favorite part? You know, and I would say like of the process, not necessarily the outcome, because of course, it's great when you know the company says yeah this is amazing like everyone's logging into their dashboards every day and we make good decisions but you know that's of course the outcome and that's great and we all love that but in terms of the process that leads there what's your favorite part like you know if you had to get your hands dirty going through that journey which part would you choose to focus on? Oh, I hands down love,
Starting point is 00:48:27 love the delight on an end user's face, you know, when they get it and you've taught them just enough to be dangerous with their data, right? And so to me, it's that last mile of like, I kind of talked about this before, right? Curating that data set to be so
Starting point is 00:48:43 exactly what a group of users need let's say that's just simple enough to use but also complex that it's very meaningful so i love the curation part really and that you step back farther that's really data modeling right you really enjoy the modeling but much a little less low of the low level modeling but the high level modeling that semantic layer sort of final modeling where you're pre-defining all your joins and you're making sure your fan outs are not going to cause problems and all the things that you worry about in terms of making a data set bulletproof. So a user can't shoot themselves in the foot. They're always going to get the right answer. Easy to use. I just love that final step of curating the set and then training them up and showing them how to
Starting point is 00:49:24 use it and then watching their eyes light up when they say, oh my God, I can do this and this and I can slice and dice however I want. And I answer questions ad hoc. That's probably why I focus on data democratization too. But I just, I do like that last mile
Starting point is 00:49:37 of the journey the most. Yeah, that's super cool. I kind of view that as a great example of sort of art and science, right? You can almost think about it as an architect, you know, where it's like, okay, well, you're building a house, but if you're trying to bring someone's vision to life, you know, you have the ability to, you know,
Starting point is 00:49:55 shape it in a way that sort of brings them delight as an outcome, which is, you know, really specific to businesses and users, depending on their use cases. So very cool. Well, Aaron, this has been such a great show. Appreciate the time. Learned a ton. I'll look for an invite from Mark Benioff
Starting point is 00:50:13 for one of his parties because I hear that they're really good. You got it. All right. Well, thanks so much for joining us. Yeah, thank you very much. It's been a pleasure to talk to both of you. It was a great discussion.
Starting point is 00:50:24 Thanks for talking shop. I always love it. You know, one thing I think that is so interesting about Aaron's story that we've heard so much on the show, and this is going to sound so cliche, but you know, the people side of things is always the hardest. And he referred back to that over and over again. You know, just in terms of like, how do you make progress in organization in terms of becoming data-driven? And it's, I guess maybe my big takeaway, this is how I would frame it, Kostas.
Starting point is 00:50:57 I love talking with people like Aaron because they just don't talk about the tools when you ask them about becoming data-driven, right? Like, it's so simple for them, right? It's like, I mean, I need to ingest data. Like, how are you going to do that? Well, it doesn't really matter. There's lots of good tools, right?
Starting point is 00:51:14 You need to warehouse data. And it's like, well, how are you going to do that? And they're like, well, I mean, like Snowflake's great, but like everything's pretty good, right? And then it's like, well, you need visualization. And it's like, well, how do you do that? And they're like, we have a couple of tools we prefer, but like everything's pretty good, right? And then it's like, well, you need visualization. It's like, well, how do you do that? And they're like, we have a couple of tools we prefer, but like everything's pretty good.
Starting point is 00:51:28 I just love it. Like for Aaron, things are kind of simple. You know, you have ingestion, you have, you know, storage, you have modeling and then you visualize it so that people can make decisions. And like they've changed preferred vendors over time, but like he wants to help people make better decisions. And it's really refreshing to just hear the perspective of like,
Starting point is 00:51:51 all the tools are awesome. And maybe we prefer some of them, but really it's doesn't matter what you use. It's actually about like getting a good data model and delivering like really clear visualization and dashboards so that people can make decisions. So I love that. I think it's so great.
Starting point is 00:52:09 Yeah. Yeah. A hundred percent. Like, and I love the focus on like the human factor, to be honest, like, because that's what it is, right? Like you can have the best technology out there, but you need the culture and like to educate the people on how to use this data and like how to think in terms of data and they make decisions
Starting point is 00:52:34 or like make it part of their work. Right. I think like, especially like the example that he gave about his own experience back at Salesforce, he went like to sales and sales were like, okay, yeah, that's cool. But why do we need it in case like. We cost every customer. Yeah.
Starting point is 00:52:53 Like, so I think that was like a really good part of like a conversation that we had. And yeah, it's one of these things that you can, you need a consultant. You need someone with like, you know, implementing, but it's not part of the system itself. Right. So it's an observer and like they can see what's going on and like they can see what the bottlenecks are.
Starting point is 00:53:16 And yeah, looking forward to have him again on the show and like discuss more about this. Yeah. Yeah. You know, if you're a vendor in the modern data stack, every problem is a nail that the modern data stack needs to hammer. And it's really refreshing to hear from someone like Aaron
Starting point is 00:53:35 who brings it back to basics. So thanks for joining us. We'll have many more great guests on the show and we will catch you on the next one. We hope you enjoyed this episode of the Data Stack Show. Be sure to subscribe on your favorite podcast app to get notified about new episodes every week. We'd also love your feedback. You can email me, ericdodds, at eric at datastackshow.com. That's E-R-I-C at datastackshow.com. The show is brought to you by Rudderstack,
Starting point is 00:54:05 the CDP for developers. Learn how to build a CDP on your data warehouse at rudderstack.com.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.