The Data Stack Show - 200: Data Team Struggles: Telling Stakeholders the Truth vs. What They Want to Hear (How to Tell The Truth, Tactfully)

Episode Date: July 31, 2024

Highlights from this week’s conversation include:Lightning Round Discussion (1:21)Data Team's Truthfulness (2:21)Culture as a Blocker (9:10)Misconceptions about Data Jobs (10:32)Cultural and Technol...ogical Influences (11:51)Challenges in Data Science Projects (15:19)Embracing the Process (17:23)Barriers to Entry (19:36)Hiring Data Leaders (22:06)Challenges of Data Leadership (25:38)Evolving Hiring Criteria (27:30)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Hi, I'm Eric Dotz. And I'm John Wessel. Welcome to the Data Stack Show. The Data Stack Show is a podcast where we talk about the technical, business, and human challenges involved in data work. Join our casual conversations with innovators and data professionals to learn about new data technologies and how data teams are run at top companies. Welcome back to The Data Show.
Starting point is 00:00:36 This is our monthly show with the Cynical Data Guy. If you didn't hear the introductory show last month, definitely go back and check it out. For those of you tuning in for the first time or hearing about Cynical Data Guy for the first time, here's a little overview. So every month, I pick out some really nice LinkedIn posts, and we get Matt Kelleher-Gibson, who's on the Rudderstack team, who came to Rudderstack from the bowels of jaded corporate data America, straight to Rudderstack. And he gives us his hot takes on these LinkedIn posts. So cynical data guy,
Starting point is 00:01:14 welcome back to the show. Thanks. So I'm back. Yeah. And of course, John's agency is Agreeable Data. So we do try to pit them against each other. All right, this is lightning rounds. So we'll get cynical data guys take. John, you get a chance to respond. Some of these LinkedIn posts, the authors will remain anonymous for obvious reasons.
Starting point is 00:01:40 Others will share. And I will just- For their safety and security. For their safety and security. For their safety and security. And their emotional well-being. Yeah, right, right. You can't expose everyone to the cynical data guy. That's fair. Okay, are we ready? Let's go. Yep. All right, first post, actually,
Starting point is 00:01:55 Clint Dunn, who we had on the show recently, this is a great post. I'm going to read some excerpts here. It starts out with a zinger. Your data team is lying to you. Data people are supposed to be truth seekers. Why would they lie? The reason is actually very simple. Data teams tell you what you want to hear. That's why your head of data rarely gives you unfiltered bad news. Instead, they'll hint at an issue and try to redirect you towards a better goal. It's a lie and pretty packaging. Yeah, that's very true.
Starting point is 00:02:32 But, so, yeah, they're lying to you. And they're mostly lying to you because you want them to lie to you. This becomes the thing where everyone says, well, I want Data to be a partner. It's like, well, you say that, but you treat them as to be a partner. It's like, well, you say that, but you treat them as a junior service partner.
Starting point is 00:02:50 So that's what they're going to act like. So, okay, give us a scenario. Don't name any names, but give us a scenario where, you know, I mean, lying, I guess, I don't want to put you on the spot and ask you when you've lied. But put him on the stand, ask him when he's lied. Take us to corporate America as a data leader, and you're in a meeting, and you face this situation.
Starting point is 00:03:16 Okay, so there's different types of versions of this I can think of. The one that immediately comes to mind is the one where you've been reporting something you've now gone through and realized oh man there was like duplication or a wrong source or like something has caused it and the number is about to get very off and there's off as in like from what it was before but more accurate right you're going to be now delivering accurate results, but they're not consistent with what you've been saying, right? Or what the company has been saying for a while now. So there's one form of lying, which is we just report it the old way and just don't even, and we come up with like version 2.0 of whatever the metric is and massage around exactly why, well, this is what we want to transition to and things like that.
Starting point is 00:04:09 The other version of it is where you talk about how the numbers are going to be a little different, but it's okay and it's going to be more accurate and it's still directionally the same. Oh, directionally. Directionally the same. Directionally accurate. Anything we've done in the past, we just, you know,
Starting point is 00:04:24 there was a little hiccup and now we fixed that that and you know and it wasn't our fault it was clearly someone else's fault not that they care lots of hand waving happened lots of hand waving yeah so the reaction to this change i mean well the reaction or what everyone fears the reaction is going to be everyone's fears this is like the we're all getting fired type moment like right that's just the way it is without naming anything i i remember working on one team where i had a boss that anytime we got a problem the first thing he said is what did we do wrong and we had to spend the next 30 minutes explaining to him how it was not our fault on any of this stuff. And it was business users or downstream, upstream, all those types of things. Yep.
Starting point is 00:05:09 The reaction, I think, from people who are, if you're not in that fearful state, is kind of the like, oh, all right, here we go. Especially if it's something where you figured out that there was like a previous iteration of this was done incorrectly. Or there was, you know, a lot of the like, oh, there was Gary and he did all of this was done incorrectly, or there was, you know, a lot of the like, oh, there was Gary and he did all of this and there was this mistake in his code. And so like Gary's code mistake, it was a Gary process that's what it is. Gary DB. And and it's you want to fix it, but you also know that you're going to get into this thing of like, well, those aren't my numbers and it's's like, well, I know they're not your numbers. You can't say because your numbers have been wrong and every decision you've made is off of long term numbers. That doesn't go over well. So you have to come up with these very, well, it's not that it was wrong or all these types of things to try to move around that. That's why I say, that's when you end up with just like,
Starting point is 00:06:06 you know, sales 2.0, because we can't change sales 1.0, even though it's wrong. Agreeable data guy? My first thought on this was actually in marketing in Eric's territory. So there's this really popular book in marketing called All Marketers Are Liars. Like, do you know that one?
Starting point is 00:06:24 Yeah. Seth Godin book, yeah. So, and his point in point in the book and he actually there's actually two different covers the originals all marketers are liars and then the new cover is all marketers are liars and there's like a cross through with like all marketers tell stories which is interesting fun fact gosh you know i was winning wow a point for one point for cynical data guy one point for cynical data guy i mean we may just need to move on okay actually which yeah the point the point being that whenever you're taking complicated data and simplifying it into a story a that's really hard right and then b like how do you do that such that you keep the
Starting point is 00:07:09 right level of fidelity or accuracy to communicate what you're trying to communicate and not get lost everybody get lost in the details yep it's like i guess on one level like you could call i mean i wouldn't call it lying but like on one level like when you roll up enough it's like you miss out on so many details that like yeah like you're not covering all of the things it's inherently reductionist when you're doing that stuff right like you're gonna leave things out right really just you know it's taking it from a book to a movie i can't include every plot in there i gotta make decisions of what i'm gonna put in and data people are the book is always better people generally right right? Because there's more context and, you know, whatever. And that's how you end up with the emails to a VP when they say, what was sales last month?
Starting point is 00:07:52 It's like, well, here's a chart with all of our sales products for the last full, you know, every day of the month. And they're like, am I supposed to add up every one of these numbers to get my answer? Well, yeah, it's in the chart. I just wanted one number. Right, right, right. Cynical data guy, we need to move on from this lightning round, but last word. In general, your business
Starting point is 00:08:16 stakeholders want the truth or don't want the truth? If it disagrees with them, they don't want the truth. Your face was unbelievable. The look of like, are you serious? I should have taken a picture. Okay, lightning round one done.
Starting point is 00:08:35 Moving on. The author of this post will remain anonymous. I did a poll last year asking data and AI leaders what the biggest blocker was to delivering successful initiatives. The overwhelming answer was dot culture. No surprise, really. Changing culture in an organization is incredibly hard. But might generative AI be the answer to solving all sorts of cultural adoption issues?
Starting point is 00:09:04 Join me live on the 16th of July to find out more. No. Next question. I'm going to go with the no next question as well. Oh, also hashtag culture, hashtag generative A. Hashtag wishing. Okay. That one was too good not to include.
Starting point is 00:09:21 Okay. This post, the author of this post, we will disclose, it's actually the cynical data guy himself. Okay, and this post went viral. So, I mean, hundreds of thousands of impressions, I don't know, 80 comments, however many. Not that you would be obsessively looking at the stats. No.
Starting point is 00:09:44 Okay, I'll read it. One of the worst things to happen to data was the sexiest job of the 21st century article. It caused two major problems. One, it attracted people to data and data science that were looking for high status jobs.
Starting point is 00:09:59 Two, it gave the impression that businesses had already bought into data-driven culture and had the tech ready for cool work. The reality was and is most of the job is hard, unseen work. There is very little splashy work. And second, data is a support function in most businesses and the infrastructure is nowhere near ready to just walk in and do quote unquote, or quote, cool work. I'll stop there.
Starting point is 00:10:30 It's a really good poster. Wow. Wow. Should I respond? It's his post. So maybe I should respond. You go first. First off, I have to derail us for a second.
Starting point is 00:10:41 I was going to look up an example on company culture and the top SEO spot for company culture examples is WeWork right now. All right. We should do a whole episode. Yeah, that's worth a whole episode. Yeah, that's worth a whole episode.
Starting point is 00:10:56 And that's why we're all going to hell. Yeah. Yeah, so the data science. Do you know around when that article came out? We're talking 15 years ago? It was like 2011-ish, 2010-ish. Yeah, yeah data science. Do you know around when that article came out? We're talking 15 years ago? It was like 2011-ish, 2010-ish. I'm probably off by a year or so in some direction.
Starting point is 00:11:12 I've actually been thinking through this a little bit personally because that is around when I started in data. And I think within a couple of years of when you started in data too. And it pretty much corresponds to um which i think was an influencer not the main influencer but a influencer is moneyball right so that was around that time too so i feel like there's all these like some cultural things and like some technological innovations it's a little downwind from moneyball because moneyball was like kind of 2003 or the movie yeah the movie and the book i mean
Starting point is 00:11:46 who reads book yeah i mean it's not like no you're right moneyball is like 2000 it's not like i read 30 books a year or anything yeah right um anyways all this to say there were like lots of like cultural wins and like data and like like moneyball the movie and sure other things like wow this and then you know i don't even know who wrote the article some you know forbes or whoever from like but coming off of that and then even the reality then was what percentage of the people were actually doing like cool data work there like that was i don't know dreamy then like that's that was what people wanted it to be right right and it was I'll just make up a number maybe 10% maybe less than 10% we're doing that and then yeah let's call it like 1% so then like you're going from that like oh that's so great like
Starting point is 00:12:38 that's going to be the future and then when somebody declares the future enough years go by and then like the obvious reaction to it is like oh okay like we're about in the same spot that we were yeah yeah so and there's a more tangible thing that came out of that because i remember when i first started having a hire and the number of people that if you put out a post that said data scientist on it, they would go to it and had outsized expectations for salary. And like we had one, when I first started as a manager, we were, so we're going to hire a junior data scientist, like someone, you know, out of school or something like that. We got about two weeks in and we were like, this does not work.
Starting point is 00:13:21 And we downgraded the role to a data analyst because we were like, this is just crazy. And this was also at the beginning of when all of these data science master degrees started coming out. Oh, yeah, yeah, yeah. Of varying levels of quality, much of it dubious. And we went to people who were in the pipeline
Starting point is 00:13:38 and we're like, well, we're changing the role. We want to let you know that, give you a chance if you want to go there. And we had a couple people that were like, I just really feel like I'm just a modeler and that's like really what I want to let you know that give you a chance if you want to go there and we had a couple people that were like i just really feel like i'm just a modeler and that's like really what i want to do i and i saw the same thing you know work at other places and there was one they were not they weren't paying market wages right right well we're going to make all these positions a data scientist wrong like don't do that because i'm like you're gonna get people who just want the title and they're gonna leave in a year when they have a year's worth of
Starting point is 00:14:09 experience to go somewhere to make a lot more well i was gonna be fair they want the title to make the money that's why i want the title i mean there's no status thing too but a lot of it's the money because the hr does like the indexes against market and they see the title and you get more money like it's a fairly simple equation at a mid to large company. And they had a lot of turnover issues because people would stay for 12 to 18 months and they'd go work at a bank as a VP because everyone at a bank is a VP and, you know, get a 30, 40% pay increase.
Starting point is 00:14:36 By the way, can I just call out that the agreeable data guy just had such a cynical take. Well, you did have like the number two comment on it all I know yeah really
Starting point is 00:14:47 yeah I didn't even look at that I know I know right and I pulled the post to go back and correct the record
Starting point is 00:14:53 Moneyball 2003 okay for the book and the movie was 2011 okay yeah so it's about right yeah
Starting point is 00:15:00 wow okay how many projects have you worked on how many data let's get specific because you've done tons of types of data works That's about right, yeah. Wow. Okay. How many projects have you worked on? How many data... Let's get specific. Because you've done tons of types of data works. Tons of types of data work.
Starting point is 00:15:12 How many data science projects have you worked on that you would describe as sexy? Oh, I love this question. Well, this is all on a sliding scale because we're all data people here. So, one or two. Wait, out of how many? Hundreds?
Starting point is 00:15:30 Yeah. Or at least dozens, right? Oh, at least dozens. Yeah. Yeah. Well, yeah, because you also have to remember, I've worked at several places where it was like, I spent a year going like, all right, we're going to fix this so we can actually do data.
Starting point is 00:15:44 And then I realized it's probably actually just the stand. That's everywhere. The analogy that I used is at the time when we first started like trying to hire people was I said, it would be like if you went into a law firm, right? And especially if you were like, I don't want to go to law school, I want to, but I want to work in law firm. And so you got a master's in lawyering. And it was all about how to be a trial lawyer and how to pick a jury and how to do a closing statement and all that. And then you walked into a law firm and you're like, I'm ready to go.
Starting point is 00:16:15 And they would look at you and go, that's like 2% of our work. And we avoid that whenever possible. Like that was part of my job as a manager was kind of like crushing data scientists dreams when they're like we could build a model or we could take the average the dream crusher i got i got to decide if i agreeable more agreeable take on this but i think we're pretty aligned on this one the only like i don't even know if it's a caveat is is that some some of it is i don't know that like some of it's a culture
Starting point is 00:16:52 problem sure like we laugh at that which ai obviously is gonna fix obviously so we can consider that a solve right but but i think's, and this maybe is kind of a culture problem, but I do think there's a component we haven't talked about of people really like to understand how things work. Yeah. And models are inherently opaque
Starting point is 00:17:14 unless you are pretty good at math. And then if it's AI, then basically no one knows. Except for Sam Altman. Yeah, just Sam. Well, I will say one thing about this, that like, cause there was some comments in that post too.
Starting point is 00:17:27 And it was like, I used in the post, I compared it to like being like a professional athlete where it's like, you gotta love to practice, right? Like there's a lot in this and there were a decent number of people who have gotten into this
Starting point is 00:17:39 and maybe they got into it because of that. And then they went in and found like, oh, okay, I'm gonna have to do a lot of sequel work. There's a lot of this stuff I'm going to have to do. And they, you know, for lack of a better term, fell in love with the process. So there is that out there.
Starting point is 00:17:54 You know, it wasn't all bad. It was just, you get to sift through a lot of other people to find them. And it makes the job, when I was a manager, harder. Like I put a lot of emphasis on like training culture and specifically on things like how to think because i was like i would much rather take someone i mean one of the best data analysts i ever hired a 50 year old woman who would come back into the workforce after raising her kids and the only reason i knew she was 50
Starting point is 00:18:21 years old was because that was the first thing she told me when i said ask me about yourself she was one of the she's one of the best analysts i ever worked with years old was because that was the first thing she told me when I said, ask me about yourself. She was one of the best analysts I ever worked with, and it was because she really embraced, like, I want to learn how to do this better. I want to learn how to think. She was really great at that. And she had super strong SQL skills, too. You just got to
Starting point is 00:18:40 work through it a lot of times. That sounded almost positive. I know. I think we're i think we're reversing today no my sense yeah my only other take on this though is i don't think this is unique to data like i remember talking to having this conversation with somebody about being successful in general like it were and it was you're going to have to spend an inordinate amount of time like around the things you're actually doing to be able to execute on what you're doing and in any organization whether it's politics whether it's like i need access to this thing
Starting point is 00:19:11 or i need this like you're going to spend a ton of time on that not just it or data marketing is the same way and and there's just a lot of like reps and discipline and like yeah you know relationship building all these things around that to be successful at the one thing and even like a time percentage breakdown like you yeah especially as you progress in an organization you spend even more time on the like auxiliary things and even less time on like the core like what needs to be done yeah i think the one thing you'll notice in that is that a lot of other professions where you they're like let's call them like high prestige or where like a lot of people would like to do them or there's a lot of money there's a lot of barriers to entry yeah and especially 10 years ago there weren't a ton of yeah that was the like you knew a certain amount of like
Starting point is 00:19:57 sequel skill or excel or just skills yeah you could get in there or i mean you know there's people that were just like i just love data which i always those people want to write off not hiring you i don't care about how passionate you say you are about this because this is not an emotional gig i love digging ditches right it's like yeah okay yeah good but like there just wasn't a lot of that there i think we've seen a lot of differences now just in the fact that like, you know, if you want to really work in like algorithms, let's say like,
Starting point is 00:20:28 you better go get a PhD now. There's no jumping into that. Yeah. Yeah, yeah. There's a, is it okay for me to insert a comment here? Yeah.
Starting point is 00:20:37 Is that allowed? I guess I actually make the rules. Yeah, you do. Wow. Yeah. Okay. You think you make the rules. Yeah, we're just guests
Starting point is 00:20:43 on this show. You're the hosts. I'm driving the car and asking you what exit I should get off at. Yeah. You think you may. We're just guests on this show. You're the hoax. I'm driving the car and asking you what exit I should get off at. Okay. One brief comment from the moderator. There's a really good paper for those listeners or for listeners who'd be interested. It's, I think, a college commencement speech called The Inner Ring. And it talks about how people try to.
Starting point is 00:21:04 Oh, that's a C.S. Lewis. Exactly. It's a great one. That's a great one. Yeah, it's really good. And basically the summary is there are people who try to get in the inner ring or chase a title with prestige or whatever
Starting point is 00:21:15 in any context of life, whether it's I want to date a size title or whatever. And the whole point of it is if you just focus on getting extremely good at a craft, you'll realize that you have entered an inner ring of craftspeople who, to your point, love the process, right? I just want to hone my skills in this craft. Okay, we have time for one more.
Starting point is 00:21:37 Since the culture AI one was brutally and swiftly disposed of, We're going to try to squeeze one more in. Excellent. Okay. The author of this post will remain anonymous. What makes a great data leader? This may be controversial, but most of the time I go off gut feeling when I determine if a data professional
Starting point is 00:22:00 is right for the job. Or if you're the cynical data guy, if they say, I just love... I'm just passionate about this data. Obviously, this gut feeling is produced by what the candidate and their references tell me, but I've been doing this job so long, it's a well-rehearsed process
Starting point is 00:22:16 in knowing who's right and who's wrong for a role. When I ask to speak to data professionals, I know that a good leader in data will always hold a mixture of characteristics likely to be a storyteller, strategic thinker, skills in cross-functional collaboration, people management, and be a risk taker. I think those are all true. And I look forward to the day when companies start hiring for those data leaders. That's probably when I'll be a data leader again. What do they... Wait, what did you say?
Starting point is 00:22:47 Wow, that's not what I expected. That's a cool data guy. I know. Okay, so they're not hiring for those. What are they hiring for? Technical skills. When was the last time you coded a neural net? That was literally a question I had
Starting point is 00:23:02 in a director's interview like three, four years ago. And I was explaining to the person how like well i haven't done that because i've been managing but i can talk to you about how i got this you know in a regulated industry we got this model into production and over here it's like yeah we're looking for someone who's coded more recently than you have even though that's not part of the job description? Well, it shouldn't be, but I think for a lot of people, they think of director as super individual contributor.
Starting point is 00:23:35 Yeah. Agreeable data guy? I think it's funny how it starts out. Where basically it's like, I use my intuition to make decisions rather than data so that's fascinating right for me i have other thoughts about that but they're not relevant to this right so we're gonna yeah but yeah okay beyond that i got a little hung up on that honestly yeah beyond that yeah like i understand that's why we picked the post yeah i mean i don't think people like hiring data leaders, period.
Starting point is 00:24:08 I think if you already have, it ends up being first data person in the door. Like, since nobody, like, the IT people don't really know how to manage data people. Like, the business doesn't either. So, they just keep, like, going down the train or up the ladder. And then they're going to self-promote to be whatever leader there is and when they hire somebody even if they're hiring a for director they're looking for like some sort of technical skills and to offload work because i mean if you are hiring like i i just you know like that's pretty much what happens like oh we're gonna like we want
Starting point is 00:24:43 to keep you like you know about the legacy stuff like much what happens. Like, oh, we're going to like, we want to keep you like, you know, about the legacy stuff. Like we'll promote you and hire in a new guy. I mean, that's typically what happens. I also think it's a bit of a, I mean, you know, John, if you think about what are the profiles of most people who move up to being like VPs and stuff. Yeah, we actually talked about this a lot. Yeah. They come from a specific background they've typically come more from the data engineering or architecture side of thing which is a little more of like it's farther away from the end product right yeah but it's also one where it's a very like deliver type thing and
Starting point is 00:25:17 the people who get promoted are the ones that were there like we need this yesterday and they say okay and they get it done right yep right And then as you go up the ladder, like then they're like, well, we want you to be a strategic partner. Well, it's like, well, you have not been rewarding a strategic partner this entire time. Even rewarding technical skills and project management.
Starting point is 00:25:36 And just get me to get it done fast. Right, yeah. Put out the hottest fire. Put out the hottest fire, get it done quickly. And so like, it's not like it's the person who's in that position's fault necessarily that's what you've been hiring for because it's also because you know you view it a little bit as like a junior service team member right like that's what you want them to be and so that's what you've been hiring for
Starting point is 00:25:57 those skills which i think is also a problem because those are those people are farthest away from kind of like the end of the data pipeline. When you then say, how are we going to use this in a strategic way? They're dealing with infrastructure. They haven't dealt with the actual, what the number is in this. Why would they
Starting point is 00:26:19 know what to do with that? I will say that I do think it depends on who is hiring. Is the CEO hiring? Is another executive VP? Is the CFO hiring? Is the CIO hiring? I think it depends. I would say if you have
Starting point is 00:26:33 an analytical CEO, which there's actually not a lot of those, that is a great hire for this to actually like value a data leader. Because it's their job. But if it's the CFO,
Starting point is 00:26:44 like probably pretty mixed bag. And then because it's their job but if it's the cfo like and like probably pretty mixed bag like and then if it's a technical like cto like pick on my old role like odds are and again it's a cto that came from a developer or ops background they're probably least likely to value like a yeah data leader type skills i think it's like as he as john said you know a lot of the first ones either were like a lot of the first data leaders when the first started becoming popular were either like academic PhDs, which were hired because of how smart they were on the topic, which we saw had issues early on with many of them,
Starting point is 00:27:17 not knowing how to run a budget and the team. Whoops. An academic. Is this a, what is this? Is this a tenured position burn or the other one just being like this was the person who had worked here the longest and they had been the person working in excel and that creates an archetype that yeah think about
Starting point is 00:27:38 it they think well what's the data leader oh well it's gary over there he's been here for 20 years he knows so much about the business which yeah is valuable yeah yeah and then when they go and they look for other people they're not looking for it and it's it'll probably evolve over time but right now it is we're not looking for those skills we're looking for your technical ability that's super nice and for all our listeners named gary yeah, sorry. We love you, Gary. Sorry, Gary. All right. Well, that is a wrap on this month's show. If you're out there and you find a LinkedIn post that you want the Cynical Data Guy to comment on, send it to us.
Starting point is 00:28:17 You can go to datasackshow.com, reach out to us there. Tag us on LinkedIn. Tag us on LinkedIn. I think we're on X too, if anyone uses X anymore. And Cynical Data Guy has a blog now, so just look that up. Go find him on Substack. Subscribe if you haven't. Great shows coming up for you, and we'll catch you next time. The Data Stack Show is brought to you by Rudderstack,
Starting point is 00:28:40 the warehouse-native customer data platform. Rudderstack is purpose-built to help data teams turn customer data into competitive advantage. Learn more at RudderStack.com.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.