The Data Stack Show - 223: End-of-Year Product Trends: The Cost of Rushing Features with The Cynical Data Guy

Episode Date: January 8, 2025

Highlights from this week’s conversation include:Christmas and New Year Edition of Cynical Data Guy (0:28)Discussion on AI (0:42)12 Days of Shipments (1:04)Attention-Grabbing Strategies (2:01)Founde...r Mode vs. Manager Mode (3:11)Technical Debt Remediation (5:03)LinkedIn Posts Discussion (6:05)Cultural Impact on Roles (8:03)Investment in Modernization (12:07)Reflection on Company Strategies (15:03)Gratitude for Data Trends (16:18)Future of Data Access (19:14)Looking Forward to 2025 in Data (21:45)Final Thoughts and Takeaways (22:11)The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Transcript
Discussion (0)
Starting point is 00:00:00 Hi, I'm Eric Dotz. And I'm John Wessel. Welcome to the Data Stack Show. The Data Stack Show is a podcast where we talk about the technical, business, and human challenges involved in data work. Join our casual conversations with innovators and data professionals to learn about new data technologies and how data teams are run at top companies. Welcome back to the Data Snack Show.
Starting point is 00:00:35 We're here for a very special show, a Christmas edition of the Cynical Data Guy. So I'm here with Matt. Matt, welcome back to the show. Welcome back. Here to bring the festivus to your Christmas cheer. Excellent. Yeah, unfortunately, Eric can't be with us today. So you're stuck with myself and Matt, but we've got some fun topics.
Starting point is 00:00:54 We are going to start out the show, which is a record for us, by the way. We're going to start out the show talking about AI. It's unusual. It's Matt's favorite topic. Oh, so much. All right. So one of the things that I've seen a lot, Matt, I think you've seen it as well, is we keep seeing these various forms that OpenAI calls it the 12 days of ship mess. And there's basically there's several different software companies doing this,
Starting point is 00:01:19 where they're attempting to brand pushing really hard at the end of the year, essentially, to get product out. What do you think about that, Matt? It sounds like someone made promises for the end of the year and they have not kept up with them at this point. Then some people that are like, crap, we promised investors we were going to do something. We haven't done it.
Starting point is 00:01:41 Yeah. And it's in such a juxtaposition to like the other the what i would be used to i think what you're used to too is like we're gonna do like we're gonna do code freezes so we don't break anything we'll be on vacation and keep and things are very stable at the end of the year so what how do you think this is gonna work out i i mean i think i think it partially just garners attention is one of the big things but the unfortunate thing is once one person does it and they get attention for it yeah then everybody's going to try to do it next year and it's and it just becomes the new standard and nobody really gets any credit for it right it just becomes oh now we have to ship 12 features every December. Why are we
Starting point is 00:02:26 doing this? Here's an article. OpenAI, let's see, their 12 days of shipments, they shipped the O1 reasoning model, which if you'd like to spend $200 a month on chat GPT, you can. I will pass.
Starting point is 00:02:42 What else did they ship? Apple Intelligence was day five with ChatGPT. My iPhone isn't good enough for that. It won't support it. That's too bad. And a couple of other features, day four. Oh, and then Sora was the other big one. That's their, it can create video.
Starting point is 00:03:00 I still don't have a use case for that in my personal life or professionally at this point. So good for them. Not something I'm going to be taking advantage of. Yeah, that's fair. Okay, so we've got the 12 days of Stripmas. And Matt and I were talking before the show about, and we've talked about this topic before, about this founder mode versus, I don't actually have a good word for the opposite. What's the about this founder mode versus I don't
Starting point is 00:03:25 actually have a good word for the ops. What's the opposite of founder mode? I think they called it normal. Okay. Okay. Manager mode. So we're talking about like this, things like the 12 days of shipments.
Starting point is 00:03:37 It's like, well, we don't want people to like slack off at the end of the year, or maybe we're late on some deadlines, but mainly we don't want people to slack off at the end of the year. or maybe we're late on some deadlines. But mainly we don't want people to slack off at the end of the year, so we're going to set a bunch of deadlines. Doesn't it feel a little bit like the dad who hasn't been paying attention and then he sees his kids kind of slacking off or something,
Starting point is 00:03:56 and he goes, that's it. We're all buckled down. You guys are going to be locked in your rooms. Never coming out until you get an A or something. It had that little bit of feeling of like we're detached and then you came back and you're overcorrected. Yeah. Well, it actually reminds me,
Starting point is 00:04:14 I had this guy I worked for years and years ago and he would intentionally, it was one of these things where we, you know, pretty much a 24-7 operation, but he would intentionally come in like Christmas Day, New Year's Eve, New Year's Day, and just to like, you know, check and see who's working type of thing. And, you know, I don't know. This kind of reminds me of that.
Starting point is 00:04:36 Of like, all right, let's make sure. Let's make sure we got everybody putting in the extra hours. Yeah. The only thing with this is it kind of it's it risks eliminating one of these unknown like not unknown but under the radar things that kind of keep a lot of these places going which is when everything kind of slows down suddenly you can sneak in there between christmas and new year's and all that technical debt you can actually make a dent in it nobody was letting you touch right we're going to 12 days of shit mess when is anyone going to secretly do all the tech debt remediation i mean you never get rid of all of it but you're at least going to pay it down a little
Starting point is 00:05:14 bit yeah now we're just throwing everything on the credit card yeah till next year just gonna keep throwing it on there we gotta pay it No. I just let the interest keep rolling. Yeah. No, I mean, that was something that I actually scheduled. I'd never been on a team that did this, but we actually scheduled quarterly clean stuff up, pay tech down stuff. It was like a week quarter or something like that. We'd do security reviews that nobody got around to. We'd clean stuff up. We'd do security reviews that nobody got around to. We'd clean stuff up. We'd actually pay down some debt
Starting point is 00:05:48 or just review stuff, stop, for a minute. And I don't think many teams do that. That's unique, being able to do that. Most of the time, you get very hung up in the, but I have another data request. Why can't I just do that? Right. For sure.
Starting point is 00:06:06 All right. So it wouldn't be a good episode without some LinkedIn posts, right? Of course. That is what I'm very cheerful for this past season. All the LinkedIn posts that we get to then bring on here. All right. I think Matt's got one for us here. Maybe even a couple.
Starting point is 00:06:24 So this one was out there and it's a very short one. It just says, data engineers can do what analysts do. Analysts can't do what data engineers do. Some fighting word. Man. Tell me about some of the comments first. That does not sound like a dangerous place to be in that comment section. That got some fiery comments, though.
Starting point is 00:06:50 My favorite one was a person who, not in the thread, but separately just wrote, a data analyst can't do what a data engineer can do because it's boring. What? A data analyst can't do what a data engineer can't do because it's boring yeah well honestly i think if you are an analyst a lot of times you look at what they're doing as a data engineer oh i never want to do that yeah it's just the it's the plumbing like that's boring yeah i mean it's different personalities for usually sure but i, as a former data analyst, those are fighting words right there.
Starting point is 00:07:27 Same, yeah. I spent a number of years as a data analyst. Well, and also, actually, here's a really interesting thing. Depending on the company I worked for, I'd be curious if this was true for you, the value of a data analyst was drastically different than engineering. So I remember one of the first companies I worked for,
Starting point is 00:07:49 the analyst, I would say kind of in the middle. And then actually the people above analyst tended to be project managers, which is a little unique. Another company that I worked for, analyst was kind of a lower tier thing. And then there was kind of the traditional IT hierarchy, and analysts were kind of a lower tier thing and then there was like kind of a traditional it heart and the analysts were kind of lower and then a third company that i worked for there was it was just a bigger company so it was like more split out and there were kind of like levels of analysts versus business intelligence you know so it really i think it really depends
Starting point is 00:08:20 on the culture because there is that like there's some cultures where it's like well these guys actually like drive the business forward. They have the business knowledge. Like we value them the most. Like we can just replace the IT people. And then other people it's like, well, I actually like the IT, you know, data engineering skill set. It's really hard to find somebody good.
Starting point is 00:08:37 And, you know, we pay them a lot of money. So we value them a lot. I mean, what have you seen? Well, so we're going to go back to the old, the olden times theater world when i first started i think the most common thing you saw was people would hire data analysts yes i want whatever it is i need an excel monkey i want someone to go just find me a nerd to do whatever all those types of things and then there was this brief error where they said, the first data hire shouldn't be an analyst.
Starting point is 00:09:07 It should be a data engineer. The problem with that was you would hire a data engineer and they would come in here and they'd be like, I'm going to go make the plumbing or whatever. And then they would immediately start getting data analyst requests. Yeah, of course. So you'd have this data engineer who's now spending over half of his time trying to do data handling requests which to be honest most of them were not very good at
Starting point is 00:09:33 sure that was not their thing i have i've worked with some very good data engineers i have also worked with data engineers that literally didn't know what we do as a company sure i mean like you would sit in a meeting they'd be like why are we doing this why do we have all of this information on them on these customers because we're giving out loans do you not understand like why do we even need this yeah why is this all secured because it's personal information about so that i felt like was always a problem i think there's always this temptation to say well we got to do it sequentially right data engineering is the first step we got to do that first and we'll like build up but it doesn't really work because people want
Starting point is 00:10:23 something tangible from it that's why anwra typically you were the first hire right so you've got to kind of it's like this cold start problem you have to figure out well and i think we i think it was the the episode we did with the team from zyletic we talked we went way, way deep on housing and plumbing and analogies between that and data. And I think that applies again here where it's like, okay, we hire a data engineer, the first hire. You can end up with a house with seven bathrooms
Starting point is 00:10:56 all plumbed with three sinks each. Yes. Multiple showers and all the bathtubs in weird places. And the hard part with that is the better the data engineer is, the more they're susceptible to this idea when you say, hey, we're doing this data migration. We need to get this data from a place. And we can set up a staging area and we'll do this.
Starting point is 00:11:20 This is going to last four months. Right. I don't need, this is not a full-term thing right i don't you know you're wanting to lay down railroad track as we're just gonna pick it up behind right it's not gonna be over right now i mean that's actually a really interesting topic because again if we're talking homes like there are situations where like hey this is an rv situation right we literally want to park this here for a couple weeks.
Starting point is 00:11:49 It needs to be livable. It doesn't need to be perfect. And then we're going to move it. And then in data, the category is typically either fully overbuilt, like we're going to build an empire and we want this to last a lifetime. Or a tent. There's not
Starting point is 00:12:04 much in between as far as philosophy right there's the one end which is the we have to eat it right at front yeah and it needs to be the taj mahal that we're doing and then there's the other one that's like there's a canvas over there and i can hang a string and we can do it and and it's neither of those because you need to have something that's a little sturdier most of the time i mean there's also there's different situations required from things but you generally need something that's going to be sturdier but that can evolve right and sometimes that means you have to redo things and that pisses off a lot of people sure who built it like i don't want to build this i'm just going to have to redo things. And that pisses off a lot of people who build it.
Starting point is 00:12:45 I don't want to build this. I'm just going to have to rebuild it two years. I get that. Also, that's the best way to go about this. Yeah, I mean, I think, I mean, the rework thing is challenging, right? Because especially when you're pitching projects or talking to your projects, like that, that'll come up like, okay, we have to rework this later. In the honest, if you keep saying no,
Starting point is 00:13:10 and you're going through a complex project, that's a little bit of a red flag over any large time frame. Because the reality is, if you're going to invest in such modularity and flexibility that you will never have to rework anything, then that's not necessarily the right answer. Nor even if you do invest in that, there's always going to be some amount of rework. And the worst one is when you're sitting on that edge between an old system and the possible new system. I remember one place I worked,
Starting point is 00:13:49 there were clear things that we knew would make our ability to track user data and stuff better. We're going to sunset that out. We're going to have a new app. And like, okay, so when is it going to come? Three to five years. Five years later, they were three to five years from it. And all of these problems had piled up to the point Three to five years. Five years later, they were three to five years from it.
Starting point is 00:14:10 And we were in all of these problems and piled up to the point where it was causing actually customer problems. And it was one of those, like, if we had just done the work, not just little bits here and there over that, we wouldn't be in this situation. Right. Where instead they had to fire a whole team just to modernize this app that they were still working on trying to place. Totally. Well, and the interesting part there, too, is it can be a good strategy. There's a company that I was speaking to recently where essentially the company had been around 20 or 30 years. They got acquired by a much larger company. And they made it.
Starting point is 00:14:45 They made it on sticks and stones and on older technology and this, that, and the other. And sold. And whatever owners were part of that company, was it the right decision for them to just make it happen and keep the lights on for 20 years with bare minimum? And maybe now it's also easy to think through, well, could they have done more if they'd invested more here and there? Maybe too. They may have missed out on some things. It's hard to quantify.
Starting point is 00:15:15 It's always hard to quantify, which makes all of these things tough to justify. Because these modernization efforts usually happen after a hard loss not like an opportunity loss yeah that's yeah that's very true it's usually once you there's a perception you get a wall of what you can do yes or that there is some feeling that's holding you back or you get new leadership in and they have that moment of where they're horrified you're doing what no we have to stop this right right but it always has those risks it never goes smoothly and sometimes the right end the right answer is not keep it or get rid of it it's kind of like we talked about a previous episode follow it out right and just use the frame of it and just put everything else together on it.
Starting point is 00:16:05 Yes, definitely. All right, so we've got to talk about the festivus and the cheer and the Christmas. So we've got the 12 days of Shabbos. Yeah. What are 12 things? I'll do that for you. What are 12 things that you're grateful for this year no what are one or two things that you can look back on um that you're as festivus or chair
Starting point is 00:16:33 whichever category you'd like to choose but i'll say one thing that i'm happy for is the final kind of break away from all of these sass walled gardens and putting the warehouse kind of like in the center yeah i always hated working on it when it was like oh we got all this data and it's in a sas application and they won't let it out i hated that yeah so move it to the warehouse i'm very grateful for it comes with a whole bunch of other challenges but i'd rather deal with those held hostage yeah by the vendors that's a good one other thing i am you know we're now a couple years away from the peak insanity of the COVID, tech evaluations, stuff like that.
Starting point is 00:17:33 We're seeing more and more tech companies are having to act more like real companies now that money is free. And I would just be like to show my appreciation for all this, just say, welcome to my world where money doesn't just fall out of the vents every time you want to do something. Well, I mean, if you're an AI, I would argue that it's still kind of falling out of the sky.
Starting point is 00:18:00 But other than that, yes. Yes. But I try to pretend that's not real. No, you pretend that's not real. I know you pretend that's not real alright my number one I would have to agree and even kind of expand on there's an awesome
Starting point is 00:18:14 trend and data of A companies like centering what they're doing around a warehouse but like broader scope than that just seeing all the growth in open source data formats like Iceberg, for example. AWS released some cool things with S3 and Iceberg at the conference this year. And then all the other major vendors are playing with it as well. But I think that's
Starting point is 00:18:40 a really exciting trend where if there's some future where essentially all of a company's data can live in some commodity storage. And then applications that get access to it, there is a flip of like, I have my data and I allow application access to the data versus like I store all of my data with an application. And they're required by law to give me the data if I leave. But it can be one CSV file at a time if they want it to be. Or put them all in an XML. I hope that philosophy keeps trending in that direction where it really is more about we have all of our company data stored together and we're allowing these software vendors to to quote use it or to be part of their product
Starting point is 00:19:32 and then when we're ready to leave we just cut off access we're not like quote migrating necessarily i think that would be a hugely positive thing yeah i think the big key for that will be having the open formats not be kind of co-opted yeah i'm sure you know oh we've got 17 vintages of iceberg depending on your different flavors that are not really compatible with each other right you know well they have it but you can only use their catalog it doesn't play nice with anybody else and those types of things yeah this will all be great as we kind of decouple the warehouse up until we reach the absurdity point of that and then someone sells a completely coupled warehouse right right or or just a coupled you know solution that includes you know x y and z you know other solution that includes, you know, X, Y, and Z, you know, other things with the warehouse.
Starting point is 00:20:25 Right. So which is the trend, right? Like this is the type of thing that will get decoupled, further decoupled than like regrouped together. Which my absolute pet peeve frustration of the example of this is cable TV and streaming. Like there was such a hard sell of like cut the you know cut the cord save money move to like streaming platform and then essentially we're at the point where everybody's
Starting point is 00:20:53 actually paying more than they ever would have and you have to pay one per thing and it's insane we're seeing consolidation and there's attempts at more of it so they will have to pay for a streaming service you will have to pay more money than you want to and it'll come with a hundred channels or content you don't want you don't want or right back to cable we just eliminated the cable box right right right which is wild like it's just not not something i mean it makes sense retrospectively but it's like man we really man, we really failed on that initial promise. You know, pay for what you want to use, et cetera. All in part, it's going to be great.
Starting point is 00:21:31 Oh, wait, no, it doesn't work. It doesn't work. And yeah, my fear is that some version of that very well may happen with a lot of this data space deconsolidation. But I don't know, maybe it'll be different this time. Maybe. I think as we look forward to another year of commentary on the ridiculous things that happen, the fun stuff and the ridiculous things
Starting point is 00:21:55 that we get out of data, I think it'll be an interesting time all around, especially with all the stuff going on in the world. As they say, may you live in interesting times. Well, that's not a problem. That won't be a problem. All right, thanks for joining us. Merry Christmas.
Starting point is 00:22:14 Happy New Year to everybody. Happy Festivus. Stay cynical. See ya. The Data Stack Show is brought to you by Rudderstack, the warehouse-native customer data platform. Rudderstack is purpose-built to help data teams turn customer data into competitive advantage.
Starting point is 00:22:29 Learn more at ruddersack.com.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.